mehmetkeremturkcan/SmollerLM-48M-Instruct-ft-sft Text Generation • 47.4M • Updated Jan 29, 2025 • 20 • 2
deepseek-ai/DeepSeek-Coder-V2-Lite-Instruct Text Generation • 16B • Updated Jul 3, 2024 • 202k • • 533
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs Paper • 2601.03559 • Published 25 days ago • 13