Machine Learning

MesaNet: Sequence Modeling by Locally Optimal Test-Time Training
Avatar
Johannes von Oswald
15 views
Kinetics: Rethinking Test-Time Scaling Laws
Avatar
librarian
20 views
Horizon Reduction Makes RL Scalable
Avatar
librarian
19 views
OpenThoughts: Data Recipes for Reasoning Models
Avatar
librarian
18 views
Not All Tokens Are Meant to Be Forgotten
Avatar
librarian
19 views
Global optimization of graph acquisition functions for neural
  architecture search
Avatar
Calvin Tsay
50 views
Distortion of AI Alignment: Does Preference Optimization Optimize for
  Preferences?
Avatar
Paul Go¨lz
48 views
REOrdering Patches Improves Vision Models
Avatar
librarian
46 views
On Learning Verifiers for Chain-of-Thought Reasoning
Avatar
Maria-Florina Balcan
48 views