Bai Yang
ShacklesLay
AI & ML interests
None yet
Recent Activity
upvoted a paper 28 days ago
TL-GRPO: Turn-Level RL for Reasoning-Guided Iterative Optimization liked
a Space 4 months ago
HuggingFaceTB/smol-training-playbook upvoted a paper 7 months ago
VisionThink: Smart and Efficient Vision Language Model via Reinforcement
Learning