Computation and Language

Constrained Entropic Unlearning: A Primal-Dual Framework for Large
  Language Models
Avatar
librarian
2 views
Critique-GRPO: Advancing LLM Reasoning with Natural Language and
  Numerical Feedback
Avatar
librarian
18 views
ATLAS: Learning to Optimally Memorize the Context at Test Time
Avatar
librarian
61 views
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning
  Engineering
Avatar
librarian
46 views
LoLA: Low-Rank Linear Attention With Sparse Caching
Avatar
librarian
44 views
DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural
  Language and Reinforcement Learning
Avatar
Jiahao Xu
44 views
Learning Composable Chains-of-Thought
Avatar
librarian
44 views
"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken
  Language Understanding
Avatar
Alkis Koudounas
50 views
THiNK: Can Large Language Models Think-aloud?
Avatar
Yongan Yu
48 views
Do Large Language Models Excel in Complex Logical Reasoning with Formal
  Language?
Avatar
Jin Jiang
46 views
MASLab: A Unified and Comprehensive Codebase for LLM-based Multi-Agent
  Systems
Avatar
librarian
45 views
R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs
  via Reinforcement Learning
Avatar
librarian
48 views
Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous
  Concept Space
Avatar
librarian
44 views
A Federated Splitting Framework for LLMs: Security, Efficiency, and
  Adaptability
Avatar
librarian
43 views
VerifyBench: Benchmarking Reference-based Reward Systems for Large
  Language Models
Avatar
librarian
42 views
BIM-GPT: a Prompt-Based Virtual Assistant Framework for BIM Information
  Retrieval
Avatar
Hervé Onguéné
59 views
Learning Dynamics in Continual Pre-Training for Large Language Models
Avatar
librarian
51 views
ComPO: Preference Alignment via Comparison Oracles
Avatar
librarian
51 views
Reasoning Models Don't Always Say What They Think
Avatar
Yanda Chen
59 views
Whisper-LM: Improving ASR Models with Language Models for Low-Resource
  Languages
Avatar
Hussein Kedir
64 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
경택 오
119 views
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Avatar
yorba
92 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
Ilya Baimetov
302 views
A Pipeline For Discourse Circuits From CCG
Avatar
ScienceCast Board
246 views
All That's 'Human' Is Not Gold: Evaluating Human Evaluation of Generated
  Text
Avatar
Yael Flax
252 views
Meta-path Augmented Response Generation
Avatar
ScienceCast Board
240 views
CliNER 2.0: Accessible and Accurate Clinical Concept Extraction
Avatar
Sasa Pure
220 views
A Hybrid Architecture for Multi-Party Conversational Systems
Avatar
priaon-flag
229 views
Analyzing the Structure of Attention in a Transformer Language Model
Avatar
levymoshe16
245 views
Direct Neural Machine Translation with Task-level Mixture of Experts
  models
Avatar
Isidora Tourni
245 views
Transformers as Soft Reasoners over Language
Avatar
ScienceCast Board
249 views
Attention Is All You Need

Attention Is All You Need

Computation and Language
Avatar
ScienceCast Board
508 views