laner ten

that113

AI & ML interests

None yet

Organizations

None yet

Collections 2

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 161
Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24, 2025 • 320
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning

Paper • 2507.14111 • Published Jul 18, 2025 • 25
MaPPO: Maximum a Posteriori Preference Optimization with Prior Knowledge

Paper • 2507.21183 • Published Jul 27, 2025 • 15

The Ultra-Scale Playbook

Space • Running • 3.84k • The ultimate guide to training LLM on large GPU Clusters
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme

Paper • 2504.02587 • Published Apr 3, 2025 • 32
RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 71
microsoft/Magma-8B

Robotics • 9B • Updated Dec 10, 2025 • 682 • 415

models 0

None public yet

datasets 0

None public yet
