ARPO - a dongguanting Collection



updated 7 days ago

The official datasets and model checkpoints of ARPO

Agentic Reinforced Policy Optimization
Paper • 2507.19849 • Published Jul 26 • 158

dongguanting/Qwen3-8B-ARPO-DeepSearch
8B • Updated Jul 29 • 16 • 2

dongguanting/QwQ-32B-ARPO-DeepSearch
33B • Updated 7 days ago • 7

dongguanting/Qwen3-14B-ARPO-DeepSearch
Text Generation • 15B • Updated Aug 12 • 15 • 5

dongguanting/ARPO-RL-DeepSearch-1K
Viewer • Updated Oct 17 • 1.07k • 99 • 6

dongguanting/Qwen2.5-7B-ARPO
Text Generation • 8B • Updated Aug 19 • 85 • 2

dongguanting/Llama3.1-8B-ARPO
Text Generation • 8B • Updated Aug 12 • 13 • 1

dongguanting/Qwen2.5-3B-ARPO
Text Generation • 3B • Updated Aug 12 • 12 • 3

dongguanting/ARPO-SFT-54K
Viewer • Updated Oct 17 • 54.6k • 149 • 14

dongguanting/ARPO-RL-Reasoning-10K
Viewer • Updated Oct 17 • 10k • 151 • 4
