Computation and Language

F2LLM-v2: Inclusive, Performant, and Efficient Embeddings for a Multilingual World
Avatar
librarian
16 views
Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Avatar
librarian
40 views
Learning When to Attend: Conditional Memory Access for Long-Context LLMs
Avatar
Aditya Chattopadhyay
9 views
Knowledge Distillation with Structured Chain-of-Thought for Text-to-SQL
Avatar
Khushboo Thaker
22 views
SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning
Avatar
librarian
21 views
Instruction set for the representation of graphs
Avatar
Ezequiel López-Rubio
21 views
Monitoring Emergent Reward Hacking During Generation via Internal Activations
Avatar
librarian
22 views
CHIMERA: Compact Synthetic Data for Generalizable LLM Reasoning
Avatar
Xinyu Zhu
20 views
Team of Thoughts: Efficient Test-time Scaling of Agentic Systems through Orchestrated Tool Calling
Avatar
librarian
39 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Dr. Murat ALTUN
50 views
Multi-LLM Thematic Analysis with Dual Reliability Metrics: Combining Cohen's Kappa and Semantic Similarity for Qualitative Research Validation
Avatar
Nilesh Jain
46 views
UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward
Avatar
librarian
110 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Salman
92 views
Memory in the Age of AI Agents

Memory in the Age of AI Agents

Computation and Language
Avatar
librarian
144 views
Non-Resolution Reasoning: A Framework for Preserving Semantic Ambiguity in Language Models
Avatar
Kei Saito
114 views
Latent Collaboration in Multi-Agent Systems
Avatar
librarian
151 views
Generalist Foundation Models Are Not Clinical Enough for Hospital Operations
Avatar
librarian
152 views
Instella: Fully Open Language Models with Stellar Performance
Avatar
librarian
172 views
Kimi Linear: An Expressive, Efficient Attention Architecture
Avatar
librarian
264 views
Tongyi DeepResearch Technical Report

Tongyi DeepResearch Technical Report

Computation and Language
Avatar
librarian
242 views
Agent Data Protocol: Unifying Datasets for Diverse, Effective
  Fine-tuning of LLM Agents
Avatar
librarian
253 views
FlatQuant: Flatness Matters for LLM Quantization
Avatar
丰辰 何
323 views
Reinforcement Learning on Pre-Training Data
Avatar
librarian
503 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
wang tuo
450 views
FlexOlmo: Open Language Models for Flexible Data Use
Avatar
librarian
400 views
Pre-Trained Policy Discriminators are General Reward Models
Avatar
librarian
352 views
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs
Avatar
librarian
357 views
Answer Matching Outperforms Multiple Choice for Language Model
  Evaluation
Avatar
librarian
363 views
SynapseRoute: An Auto-Route Switching Framework on Dual-State Large
  Language Model
Avatar
librarian
407 views
On the Predictive Power of Representation Dispersion in Language Models
Avatar
librarian
427 views
STACK: Adversarial Attacks on LLM Safeguard Pipelines
Avatar
librarian
378 views
The Trilemma of Truth in Large Language Models
Avatar
Germans Savcisens
334 views