AI & ML interests
None defined yet.
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-instruction_prompt-RL
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_prompt_diversity-RL
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_reflections-RL
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_sample_order-RL
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_prompt_diversity-SFT
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_reflections-SFT
SkillFactory/ablation-Qwen2.5-1.5B-Instruct-no_sample_order-SFT
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-QwQ-1k_rows-SFT
8B • Updated • 2
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-SkillFactory-10k_rows-RL
Updated
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-SkillFactory-1k_rows-RL
Updated
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-QwQ-1k_rows-RL
Updated
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-QwQ-10k_rows-RL
Updated
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-RL
Updated
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-SkillFactory-10k_rows-SFT
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-SkillFactory-1k_rows-SFT
SkillFactory/openthoughts-Qwen2.5-7B-Instruct-QwQ-10k_rows-SFT
SkillFactory/cd3arg-Olmo-3-7B-Instruct-SFT-R1-SFT
7B • Updated • 1
SkillFactory/cd3arg-Qwen2.5-7B-Instruct-SkillFactory-RL
8B • Updated • 2
SkillFactory/cd3arg-Qwen2.5-7B-Instruct-R1-RL
SkillFactory/cd3arg-Qwen2.5-7B-Instruct-RL
8B • Updated • 2
SkillFactory/cd3arg-Qwen2.5-7B-Instruct-SkillFactory-SFT
8B • Updated • 3
SkillFactory/cd3arg-Qwen2.5-7B-Instruct-R1-SFT
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-BoLT-RL
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-STaR-RL
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-SkillFactory-RL
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-RL
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-R1-RL
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-R1-SFT
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-STaR-SFT
SkillFactory/cd3arg-Qwen2.5-1.5B-Instruct-BoLT-SFT