Artificial Intelligence

Unveiling Causal Reasoning in Large Language Models: Reality or Mirage?
Avatar
Haoang Chi
0 views
Active Inference AI Systems for Scientific Discovery
Avatar
librarian
0 views
TableMoE: Neuro-Symbolic Routing for Structured Expert Reasoning in
  Multimodal Table Understanding
Avatar
librarian
2 views
Spatial Mental Modeling from Limited Views
Avatar
Qineng Wang
0 views
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge
Avatar
librarian
1 view
Exploring and Exploiting the Inherent Efficiency within Large Reasoning
  Models for Self-Guided Efficiency Enhancement
Avatar
librarian
8 views
SwarmAgentic: Towards Fully Automated Agentic System Generation via
  Swarm Intelligence
Avatar
Yao Zhang
14 views
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated
  Agent Intelligence
Avatar
librarian
8 views
Doppelgänger Method: Breaking Role Consistency in LLM Agent via
  Prompt-based Transferable Adversarial Attack
Avatar
librarian
8 views
GUI-Robust: A Comprehensive Dataset for Testing GUI Agent Robustness in
  Real-World Anomalies
Avatar
Unknown Unknown
10 views
From Points to Places: Towards Human Mobility-Driven Spatiotemporal
  Foundation Models via Understanding Places
Avatar
Mohammad Hashemi
8 views
AgentDistill: Training-Free Agent Distillation with Generalizable MCP
  Boxes
Avatar
librarian
7 views
Optimizing Length Compression in Large Reasoning Models
Avatar
Tianyi Zhou
8 views
Stream-Omni: Simultaneous Multimodal Interactions with Large
  Language-Vision-Speech Model
Avatar
librarian
9 views
Avoiding Obfuscation with Prover-Estimator Debate
Avatar
librarian
6 views
Weakest Link in the Chain: Security Vulnerabilities in Advanced
  Reasoning Models
Avatar
librarian
9 views
PB$^2$: Preference Space Exploration via Population-Based Methods in
  Preference-Based Reinforcement Learning
Avatar
librarian
18 views
GenPlanX. Generation of Plans and Execution
Avatar
librarian
32 views
A Study on Individual Spatiotemporal Activity Generation Method Using
  MCP-Enhanced Chain-of-Thought Large Language Models
Avatar
librarian
43 views
Breaking Bad Molecules: Are MLLMs Ready for Structure-Level Molecular
  Detoxification?
Avatar
Fei-Yue Wang
45 views
Spurious Rewards: Rethinking Training Signals in RLVR
Avatar
Rulin Shao
42 views
How Do People Revise Inconsistent Beliefs? Examining Belief Revision in
  Humans with User Studies
Avatar
Stylianos Vasileiou
52 views
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction
  and Planning
Avatar
Nicolas Ballas
52 views
VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement
  Learning
Avatar
librarian
62 views
Measuring Data Science Automation: A Survey of Evaluation Tools for AI
  Assistants and Agents
Avatar
Irene Testini
63 views
Reinforcing Multimodal Understanding and Generation with Dual
  Self-rewards
Avatar
librarian
74 views
GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection
  Behavior
Avatar
librarian
71 views
$τ^2$-Bench: Evaluating Conversational Agents in a Dual-Control
  Environment
Avatar
Victor Barres
72 views
Solving Inequality Proofs with Large Language Models
Avatar
librarian
72 views
Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to
  Model Optimisation
Avatar
Christopher Subia-Waud
68 views
Control Tax: The Price of Keeping AI in Check
Avatar
Mikhail Terekhov
114 views
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Avatar
librarian
115 views