SAGE - a baohao Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

baohao 's Collections

SAGE

SAGE

updated 7 days ago

Self-Hinting Language Models Enhance Reinforcement Learning

baohao/aime24

Viewer • Updated 9 days ago • 30 • 8
baohao/aime25

Viewer • Updated 9 days ago • 30 • 5
baohao/amc23

Viewer • Updated 9 days ago • 40 • 8
baohao/olympiadbench

Viewer • Updated 9 days ago • 675 • 9
baohao/minerva_math

Viewer • Updated 9 days ago • 272 • 8
baohao/math500

Viewer • Updated 9 days ago • 500 • 6
baohao/gpqa

Viewer • Updated 9 days ago • 198 • 8
baohao/mmlu_pro

Viewer • Updated 9 days ago • 12k • 24
baohao/sage_train

Viewer • Updated 8 days ago • 15k • 13
baohao/luffy_train

Viewer • Updated 8 days ago • 15k • 9
baohao/scaf-grpo_train

Viewer • Updated 8 days ago • 15k • 8
Self-Hinting Language Models Enhance Reinforcement Learning

Paper • 2602.03143 • Published 13 days ago • 29
baohao/sage_validation

Viewer • Updated 9 days ago • 1.67k • 8
baohao/SAGE_Llama-3.2-3B-Instruct

4B • Updated 4 days ago • 17
baohao/SAGE_Qwen2.5-7B-Instruct

8B • Updated 4 days ago • 15
baohao/SAGE_Qwen3-4B-Instruct-2507

4B • Updated 4 days ago • 9
baohao/SAGE-light_Qwen2.5-7B-Instruct

8B • Updated 4 days ago • 17
baohao/SAGE-light_Llama-3.2-3B-Instruct

4B • Updated 4 days ago • 11
baohao/SAGE-light_Qwen3-4B-Instruct-2507

4B • Updated 4 days ago • 13

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs