Tan's picture

4 1

Tan

RiccardTo

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 16 hours ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

upvoted a paper 7 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

upvoted a paper 7 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

View all activity

Organizations

None yet

upvoted a paper about 16 hours ago

Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss

Paper • 2512.23447 • Published 1 day ago • 70

upvoted a paper 7 days ago

Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies

Paper • 2512.19673 • Published 8 days ago • 60

upvoted a paper 7 months ago

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

Paper • 2505.22653 • Published May 28 • 43

upvoted a paper 11 months ago

Autonomy-of-Experts Models

Paper • 2501.13074 • Published Jan 22 • 44

liked a model over 1 year ago

Ori/llama-2-13b-peft-strategyqa-with-ret-at-1

Updated Sep 22, 2023 • 5 • 1