https://github.com/dhcode-cpp/X-R1
xiaodongguaAIGC
xiaodongguaAIGC
AI & ML interests
RLHF
Organizations
None yet
models 9
xiaodongguaAIGC/X-R1-3B-CN
Text Generation • 3B • Updated
• 1 • 2
xiaodongguaAIGC/X-R1-3B
Text Generation • 3B • Updated
• 2 • 2
xiaodongguaAIGC/X-R1-1.5B
Text Generation • 2B • Updated
• 2
xiaodongguaAIGC/X-R1-0.5B
Text Generation • 0.5B • Updated
• 3 • 1
xiaodongguaAIGC/xdg-math-step
Text Generation • 8B • Updated
• 5 • 1
xiaodongguaAIGC/xdg-math-step-0118
Text Generation • 8B • Updated
• 1
xiaodongguaAIGC/xdg-math-prm-lora
Updated
xiaodongguaAIGC/xdg-llama-3-8B
Text Generation • 8B • Updated
• 10 • 5
xiaodongguaAIGC/llama-3-debug
Text Generation • 16.5M • Updated
• 328 • 2
datasets 16
xiaodongguaAIGC/X-R1-TAL-SCQ5K
Viewer
• Updated
• 10k • 11 • 3
xiaodongguaAIGC/X-R1-TAL-SCQ2K
Viewer
• Updated
• 3.33k • 5 • 1
xiaodongguaAIGC/X-R1-7500
Viewer
• Updated
• 12.5k • 41 • 2
xiaodongguaAIGC/X-R1-1500
Viewer
• Updated
• 2.5k • 5
xiaodongguaAIGC/X-R1-750
Viewer
• Updated
• 1.25k • 292 • 4
xiaodongguaAIGC/step_sft
Viewer
• Updated
• 84.2k • 11
xiaodongguaAIGC/step_prm
Viewer
• Updated
• 108k • 39
xiaodongguaAIGC/math_step_sft
Viewer
• Updated
• 12.5k • 8
xiaodongguaAIGC/GSM8k_step_sft
Viewer
• Updated
• 8.79k • 4
xiaodongguaAIGC/prm800k_step_sft
Viewer
• Updated
• 121k • 10