AI & ML interests
None yet
Organizations
skyai798/STAR-1_DeepSeek-R1-Distill-Llama-8B_sft-complete-dpo
Text Generation
• 8B • Updated
• 5
8B • Updated
• 1
Text Generation
• 8B • Updated
• 2
skyai798/llama-dpo-r2-new
Updated
skyai798/qwen2-dpo-r1-1v2
8B • Updated
• 1
skyai798/qwen2_safe_40000_helpful_40000_qwen_beta_0.2_lr_1.0e-6_seed_17
Updated
skyai798/qwen2_safe_20000_helpful_40000_qwen_beta_0.2_lr_5.0e-7_seed_120
Updated
Text Generation
• 8B • Updated
• 5
skyai798/saferlhf_ultra_sft
Text Generation
• 8B • Updated
• 9
skyai798/safety_v2_math_v1
Text Generation
• 8B • Updated
• 3
skyai798/safety-math-mix-sft
8B • Updated
skyai798/openmathinstruct2-mix-sft
Text Generation
• 8B • Updated
• 3