arxiv:2504.00502
zuijiang
AI & ML interests
None yet
Recent Activity
upvoted a paper 20 days ago
CoBA-RL: Capability-Oriented Budget Allocation for Reinforcement Learning in LLMs
upvoted a paper 21 days ago
Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning