arxiv:2602.10693
floyed shen
floyed
AI & ML interests
None yet
Recent Activity
upvoted a paper about 15 hours ago: LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?
submitted a paper 21 days ago: Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information
upvoted a paper 28 days ago: From Generic Correlation to Input-Specific Credit in On-Policy Self Distillation