arxiv:2509.24510
Patrik Wolf
patrikwolf
ยท
AI & ML interests
Test-time training, preference learning, alignment, theory
Recent Activity
upvoted a paper 8 days ago
dLLM: Simple Diffusion Language Modeling upvoted a paper 21 days ago
Sanity Checks for Sparse Autoencoders: Do SAEs Beat Random Baselines? upvoted a paper 26 days ago
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation