Artificial Intelligence

BEAM: Bi-level Memory-adaptive Algorithmic Evolution for LLM-Powered Heuristic Design
Avatar
librarian
0 views
Cycle-Consistent Search: Question Reconstructability as a Proxy Reward for Search Agent Training
Avatar
librarian
0 views
DocSeeker: Structured Visual Reasoning with Evidence Grounding for Long Document Understanding
Avatar
librarian
0 views
Transferable Expertise for Autonomous Agents via Real-World Case-Based Learning
Avatar
librarian
1 view
RePAIR: Interactive Machine Unlearning through Prompt-Aware Model Repair
Avatar
librarian
0 views
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time
Avatar
Haozhe Wang
3 views
Context Kubernetes: Declarative Orchestration of Enterprise Knowledge for Agentic AI Systems
Avatar
Charafeddine Mouzouni
4 views
Retrieval Is Not Enough: Why Organizational AI Needs Epistemic Infrastructure
Avatar
librarian
3 views
GenTac: Generative Modeling and Forecasting of Soccer Tactics
Avatar
Weidi Xie
4 views
Detecting Safety Violations Across Many Agent Traces
Avatar
librarian
3 views
VeriSim: A Configurable Framework for Evaluating Medical AI Under Realistic Patient Noise
Avatar
Sina Mansouri
2 views
Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
Avatar
librarian
4 views
From Perception to Planning: Evolving Ego-Centric Task-Oriented Spatiotemporal Reasoning via Curriculum Learning
Avatar
librarian
3 views
Agent^2 RL-Bench: Can LLM Agents Engineer Agentic RL Post-Training?
Avatar
librarian
3 views
From Safety Risk to Design Principle: Peer-Preservation in Multi-Agent LLM Systems and Its Implications for Orchestrated Democratic Discourse Analysis
Avatar
librarian
19 views
Ads in AI Chatbots? An Analysis of How Large Language Models Navigate Conflicts of Interest
Avatar
Addison Wu
11 views
SkillClaw: Let Skills Evolve Collectively with Agentic Evolver
Avatar
librarian
33 views
KnowU-Bench: Towards Interactive, Proactive, and Personalized Mobile Agent Evaluation
Avatar
Zhengxi Lu
9 views
SUPERNOVA: Eliciting General Reasoning in LLMs with Reinforcement Learning on Natural Instructions
Avatar
librarian
6 views
Activation Steering for Aligned Open-ended Generation without Sacrificing Coherence
Avatar
librarian
8 views
From Phenomenological Fitting to Endogenous Deduction: A Paradigm Leap via Meta-Principle Physics Architecture
Avatar
Helong Hu
6 views
Aligning Agents via Planning: A Benchmark for Trajectory-Level Reward Modeling
Avatar
librarian
8 views
U-CECE: A Universal Multi-Resolution Framework for Conceptual Counterfactual Explanations
Avatar
librarian
6 views
EVGeoQA: Benchmarking LLMs on Dynamic, Multi-Objective Geo-Spatial Exploration
Avatar
librarian
7 views
Reason in Chains, Learn in Trees: Self-Rectification and Grafting for Multi-turn Agent Policy Optimization
Avatar
librarian
5 views
How Much LLM Does a Self-Revising Agent Actually Need?
Avatar
librarian
5 views
Beyond Compromise: Pareto-Lenient Consensus for Efficient Multi-Preference LLM Alignment
Avatar
librarian
11 views
ACE-Bench: Agent Configurable Evaluation with Scalable Horizons and Controllable Difficulty under Lightweight Environments
Avatar
librarian
15 views
MARL-GPT: Foundation Model for Multi-Agent Reinforcement Learning
Avatar
librarian
14 views
Claw-Eval: Toward Trustworthy Evaluation of Autonomous Agents
Avatar
librarian
17 views
Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents
Avatar
librarian
12 views
Can Large Language Models Reinvent Foundational Algorithms?
Avatar
librarian
13 views