Renjie's picture

Renjie

Renjie-Ranger

·

https://renjie-ranger.github.io/

AI & ML interests

LLM Post-Training

Recent Activity

upvoted a paper 3 days ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper 3 days ago

Rethinking the Divergence Regularization in LLM RL

updated a dataset about 1 month ago

Renjie-Ranger/FCP_big_math_pro_SFT

View all activity

Organizations

None yet

Renjie-Ranger 's papers 4

arxiv:2509.22638

arxiv:2506.07712

arxiv:2404.07584

arxiv:2402.14008