The ODIN and the policies trained by ODIN
Lichang Chen
Lichang-Chen
AI & ML interests
NLP and ML
models 64
Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-3
Text Generation • 15B • Updated
• 4
Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-2
Text Generation • 15B • Updated
• 6
Lichang-Chen/Qwen2.5-14B-Instruct-star-nl-3Rounds-iter-1
Text Generation • 15B • Updated
• 1
Lichang-Chen/game-play-point25-50
Text Generation • 8B • Updated
• 1
Lichang-Chen/multi-attempts-multi-examples-Jan9
Text Generation • 8B • Updated
• 5
Lichang-Chen/multi-turn-Jan5
Text Generation • 8B • Updated
• 1
Lichang-Chen/multi-turn-Jan4
Text Generation • 8B • Updated
Lichang-Chen/llama3-dpo-single-turn-point2247-dec15
Text Generation • 8B • Updated
• 1
Lichang-Chen/llama-8b-gemini-point60-100-wo-cot
Text Generation • 8B • Updated
• 1
Lichang-Chen/llama-8b-gemini-point21-60-wo-cot
Text Generation • 8B • Updated
• 1
datasets 18
Lichang-Chen/omnixR-data
• Updated
• 1.4k • 5
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter3
• Updated
• 800 • 11
Lichang-Chen/llama_sft_dpo_bold_list_attack_eval_iter2
• Updated
• 800 • 1
Lichang-Chen/llama_sft_dpo_bold_list_attack_iter1
• Updated
• 800 • 3
Lichang-Chen/dpo_it_attack_list_and_bold
• Updated
• 800 • 4
Lichang-Chen/llama3_it_dpo_attack_list_2epoch
• Updated
• 800 • 2
Lichang-Chen/llama3_it_dpo_attack_bold_2epoch
• Updated
• 800 • 4
Lichang-Chen/dpo_it_unbiased_ver3
• Updated
• 800 • 4
Lichang-Chen/list_training_pairs
• Updated
• 1k • 4
Lichang-Chen/bold_training_pairs
• Updated
• 745 • 3