Test-Time Training with KV Binding Is Secretly Linear Attention • Paper • 2602.21204 • Published 4 days ago • 28
LoopFormer: Elastic-Depth Looped Transformers for Latent Reasoning via Shortcut Modulation • Paper • 2602.11451 • Published 17 days ago • 15
TokSuite: Measuring the Impact of Tokenizer Choice on Language Model Behavior • Paper • 2512.20757 • Published Dec 23, 2025 • 18