Artificial Intelligence

Control Tax: The Price of Keeping AI in Check
Avatar
Mikhail Terekhov
22 views
Truly Self-Improving Agents Require Intrinsic Metacognitive Learning
Avatar
librarian
16 views
LLM-First Search: Self-Guided Exploration of the Solution Space
Avatar
librarian
16 views
Just Enough Thinking: Efficient Reasoning with Adaptive Length Penalties
  Reinforcement Learning
Avatar
librarian
16 views
Interpretability by Design for Efficient Multi-Objective Reinforcement
  Learning
Avatar
Qiyue Xia
20 views
TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management
  in LLM-based Agentic Multi-Agent Systems
Avatar
librarian
20 views
AgentMisalignment: Measuring the Propensity for Misaligned Behaviour in
  LLM-Based Agents
Avatar
Akshat Naik
20 views
macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Avatar
Pei Yang
20 views
Does Thinking More always Help? Understanding Test-Time Scaling in
  Reasoning Models
Avatar
Soumya Suvra Ghosal
20 views
Linear Spatial World Models Emerge in Large Language Models
Avatar
Matthieu Tehenan
21 views
DPO Learning with LLMs-Judge Signal for Computer Use Agents
Avatar
librarian
20 views
The Limits of Predicting Agents from Behaviour
Avatar
Alexis Bellot
26 views
Sample, Predict, then Proceed: Self-Verification Sampling for Tool Use
  of LLMs
Avatar
librarian
26 views
Corrigibility as a Singular Target: A Vision for Inherently Reliable
  Foundation Models
Avatar
librarian
26 views
Data-to-Dashboard: Multi-Agent LLM Framework for Insightful
  Visualization in Enterprise Analytics
Avatar
Ran Zhang
50 views
ROTATE: Regret-driven Open-ended Training for Ad Hoc Teamwork
Avatar
Caroline Wang
48 views
Comparative of Genetic Fuzzy regression techniques for aeroacoustic
  phenomenons
Avatar
librarian
50 views
Fortune: Formula-Driven Reinforcement Learning for Symbolic Table
  Reasoning in Language Models
Avatar
librarian
47 views
Let's Reason Formally: Natural-Formal Hybrid Reasoning Enhances LLM's
  Math Capability
Avatar
librarian
49 views
ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Avatar
librarian
47 views
AI Mathematician: Towards Fully Automated Frontier Mathematical Research
Avatar
librarian
48 views
HDDLGym: A Tool for Studying Multi-Agent Hierarchical Problems Defined
  in HDDL with OpenAI Gym
Avatar
Ngoc La
49 views
Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular
  Chemical Operations
Avatar
librarian
50 views
The Multilingual Divide and Its Impact on Global AI Safety
Avatar
librarian
52 views
MRSD: Multi-Resolution Skill Discovery for HRL Agents
Avatar
librarian
51 views
Policy Induction: Predicting Startup Success via Explainable
  Memory-Augmented In-Context Learning
Avatar
Xianling Mu
54 views
Assured Autonomy with Neuro-Symbolic Perception
Avatar
librarian
49 views
MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs
Avatar
librarian
52 views
Learning Individual Behavior in Agent-Based Models with Graph Diffusion
  Networks
Avatar
Francesco Cozzi
52 views
Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive
  Logic Programming
Avatar
Yang Yang
51 views
Temporal Sampling for Forgotten Reasoning in LLMs
Avatar
librarian
54 views
Agentic AI Process Observability: Discovering Behavioral Variability
Avatar
librarian
52 views