arxiv:2603.01162
🤝 Open to Collab
Kai Ye
Kyleyee
models 181
Kyleyee/DrDPO_hh-seed3
Text Generation • 2B • Updated • 24
Kyleyee/DrDPO_hh-seed4
Text Generation • 2B • Updated • 25
Kyleyee/DrDPO_hh-seed5
Text Generation • 2B • Updated • 22
Kyleyee/DrDPO_hh-seed2
Text Generation • 2B • Updated • 22
Kyleyee/cDPO_hh-seed3
Text Generation • 2B • Updated • 22
Kyleyee/CPO_hh-seed3
Text Generation • 2B • Updated • 22
Kyleyee/CPO_hh-seed5
Text Generation • 2B • Updated • 22
Kyleyee/CPO_hh-seed2
Text Generation • 2B • Updated • 21
Kyleyee/ORPO_hh-seed3
Text Generation • 2B • Updated • 22
Kyleyee/CPO_hh-seed4
Text Generation • 2B • Updated • 22
datasets 210
Kyleyee/eval-hh-clean
Viewer • Updated • 11.8k • 226
Kyleyee/eval-hh-seed
Viewer • Updated • 11.8k • 329
Kyleyee/eval-hh-all
Viewer • Updated • 11.8k • 151
Kyleyee/detect_CN_by_scenari_result
Viewer • Updated • 7k • 124
Kyleyee/detect_CN
Viewer • Updated • 7k • 81
Kyleyee/test_with_label_ood_format
Viewer • Updated • 7k • 6
Kyleyee/grpo_v7_2_correct_wrong_400
Viewer • Updated • 400 • 12
Kyleyee/OOD_data
Viewer • Updated • 1.2k • 21
Kyleyee/arithmetic-few-shot
Viewer • Updated • 500 • 6 • 1
Kyleyee/arithmetic-test
Viewer • Updated • 500 • 6 • 1