Computation and Language

MeMo: Memory as a Model
Ryan Quek
48 views
The Impossibility Triangle of Long-Context Modeling
librarian
35 views
GiVA: Gradient-Informed Bases for Vector-Based Adaptation
Neeraj Gangwar
53 views
A Multimodal Text- and Graph-Based Approach for Open-Domain Event Extraction from Documents
librarian
59 views
Chat2Workflow: A Benchmark for Generating Executable Visual Workflows with Natural Language
librarian
66 views
CD2CR: Co-reference Resolution Across Documents and Domains
k-m-smit2
57 views
Demystifying OPD: Length Inflation and Stabilization Strategies for Large Language Models
librarian
105 views
ClawBench: Can AI Agents Complete Everyday Online Tasks?
librarian
83 views
Synthetic Sandbox for Training Machine Learning Engineering Agents
Yuhang Zhou
88 views
Grounded Token Initialization for New Vocabulary in LMs for Generative Recommendation
Daiwei Chen
103 views
AstroConcepts: A Large-Scale Multi-Label Classification Corpus for Astrophysics
librarian
75 views
AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents
librarian
79 views
F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
librarian
154 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
librarian
146 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Aditya Chattopadhyay
92 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Khushboo Thaker
97 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
librarian
106 views
Instruction set for the representation of graphs
Ezequiel López-Rubio
92 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
librarian
94 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Xinyu Zhu
110 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
librarian
111 views
Attention Is All You Need
Dr. Murat ALTUN
124 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Nilesh Jain
113 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
librarian
182 views
Attention Is All You Need
Salman
180 views
Memory in the Age of AI Agents
librarian
223 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Kei Saito
194 views
Latent Collaboration in Multi-Agent Systems
librarian
223 views
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
librarian
222 views
Instella: Fully Open Language Models with Stellar Performance
librarian
241 views
Kimi Linear: An Expressive, Efficient Attention Architecture
librarian
378 views
Tongyi DeepResearch Technical Report
librarian
328 views