CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-all-vanilla-400k
Updated
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-500k-vanilla-100k
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-ablation-finegrained-200k
Text Generation
• 3B • Updated • 2
CriteriaPO/qwen2.5-3b-dpo-vanilla-in-finegrained
Text Generation
• 3B • Updated • 3
CriteriaPO/qwen2.5-3b-dpo-finegrained
Text Generation
• 3B • Updated • 24
CriteriaPO/qwen2.5-3b-dpo-coarse
Text Generation
• 3B • Updated • 28
CriteriaPO/qwen2.5-3b-dpo-mini
Text Generation
• 3B • Updated • 24
CriteriaPO/qwen2.5-3b-dpo-vanilla
Text Generation
• 3B • Updated • 26
CriteriaPO/llama3.2-3b-dpo-coarse
Text Generation
• 3B • Updated • 27
CriteriaPO/llama3.2-3b-dpo-finegrained
Text Generation
• 3B • Updated • 15
CriteriaPO/llama3.2-3b-dpo-vanilla
Text Generation
• 3B • Updated • 27
CriteriaPO/llama3.2-3b-dpo-mini
Text Generation
• 3B • Updated • 17
CriteriaPO/qwen2.5-3b-sft-10
Text Generation
• 3B • Updated • 18
CriteriaPO/llama3.2-3b-sft-10
Text Generation
• 3B • Updated • 30