lihaoxin2020/qwen3.5-4B-evidence-subagent-gpt54-single-jina-v2-no-urls-64k-no-exact-cutoff-full-sft-v1-lr2e-5 Image-Text-to-Text • 5B • Updated 11 days ago • 255
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step250 Text Generation • 196k • Updated Apr 26 • 8
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step300 Text Generation • 196k • Updated Apr 26 • 379 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gem3-flash-step150 Text Generation • 196k • Updated Apr 26 • 275 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step200 Text Generation • 196k • Updated Apr 26 • 666 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt54-step150 Text Generation • 196k • Updated Apr 26 • 197 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step200 Text Generation • 196k • Updated Apr 24 • 232 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step150 Text Generation • 196k • Updated Apr 24 • 233 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step200 Text Generation • 196k • Updated Apr 24 • 233 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step150 Text Generation • 196k • Updated Apr 24 • 229 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-evolving-rubric-gpt41-step100 Text Generation • 196k • Updated Apr 23 • 284 •
lihaoxin2020/qwen3-4b-sft-gpt54-ep2-instance-rubric-gpt41-step100 Text Generation • 196k • Updated Apr 23 • 258 •
lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step100 Text Generation • 196k • Updated Apr 21 • 5 •
lihaoxin2020/qwen3-4b-refiner-gpt54-rubric-v3-2-rl-lr5e-6-step50 Text Generation • 196k • Updated Apr 21 • 6 •
lihaoxin2020/qwen3-4b-refiner-gpt54-instance-rubric-gpt54-grpo-step50 Text Generation • 196k • Updated Apr 20 • 3 •
lihaoxin2020/qwen3-4B-refiner-sft-rl-balanced-resume-step100 Text Generation • 196k • Updated Apr 14 • 4 •
lihaoxin2020/qwen3-4B-refiner-3201-rl-balanced-step50 Text Generation • 196k • Updated Apr 12 • 1 • 1