Layerwise Dynamics for In-Context Classification in Transformers

Authors

Patrick Lutz, Themistoklis Haris, Arjun Chandra, Aditya Gangrade, Venkatesh Saligrama

Abstract

Transformers can perform in-context classification from a few labeled examples, yet the inference-time algorithm remains opaque. We study multi-class linear classification in the hard no-margin regime and make the computation identifiable by enforcing feature- and label-permutation equivariance at every layer. This enables interpretability while maintaining functional equivalence and yields highly structured weights. From these models we extract an explicit depth-indexed recursion: an end-to-end identified, emergent update rule inside a softmax transformer, to our knowledge the first of its kind. Attention matrices formed from mixed feature-label Gram structure drive coupled updates of training points, labels, and the test probe. The resulting dynamics implement a geometry-driven algorithmic motif, which can provably amplify class separation and yields robust expected class alignment.
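
To make the abstract's mechanism concrete, the sketch below implements a toy version of the described motif: attention weights built from a mixed feature-label Gram matrix jointly update the in-context training points, their labels, and the test probe across layers. This is a minimal illustration under loud assumptions; the function layer_update, the specific Gram combination X Xᵀ + Y Yᵀ, and the parameters beta and eta are hypothetical choices for exposition, not the update rule identified in the paper.

```python
import numpy as np

def softmax(z, axis=-1):
    """Numerically stable softmax."""
    z = z - z.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)

def layer_update(X, Y, x_test, beta=1.0, eta=0.1):
    """One toy layer: attention formed from mixed feature-label Gram
    structure drives coupled updates of the training points X (n, d),
    the labels Y (n, c; one-hot or soft), and the test probe x_test (d,).

    beta (inverse temperature) and eta (step size) are illustrative
    parameters, not quantities identified in the paper.
    """
    # Mixed feature-label Gram matrix: feature inner products plus
    # label agreement (this particular combination is an assumption).
    G = X @ X.T + Y @ Y.T                     # (n, n)
    A = softmax(beta * G, axis=-1)            # row-wise attention weights

    # Coupled residual updates of training points and labels.
    X_new = X + eta * (A @ X - X)
    Y_new = Y + eta * (A @ Y - Y)

    # The probe attends to the training points by feature similarity.
    a = softmax(beta * (X @ x_test))          # (n,)
    x_test_new = x_test + eta * (X.T @ a - x_test)
    return X_new, Y_new, x_test_new

# Toy usage: a 3-class problem in d = 5 dimensions, 4 "layers" deep.
rng = np.random.default_rng(0)
X = rng.standard_normal((12, 5))
Y = np.eye(3)[rng.integers(0, 3, size=12)]
x_test = rng.standard_normal(5)

for _ in range(4):
    X, Y, x_test = layer_update(X, Y, x_test)

# Read out a prediction via label-weighted attention from the probe.
pred = softmax(X @ x_test) @ Y
print("class scores:", pred, "-> predicted class", pred.argmax())
```

Under these assumptions, the label-agreement term in G makes same-class points attend more strongly to one another, so repeated layers pull points toward class-conditional means, which mimics, in spirit, the geometry-driven amplification of class separation the abstract describes.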
