Running 3.86k The Ultra-Scale Playbook π 3.86k The ultimate guide to training LLM on large GPU Clusters
Running Featured 1.35k FineWeb: decanting the web for the finest text data at scale π· 1.35k Explore and download the FineWeb webβtext dataset
nvidia/Llama-3_3-Nemotron-Super-49B-v1_5-NVFP4 Text Generation β’ 26B β’ Updated Nov 27, 2025 β’ 3.63k β’ 20
Running on CPU Upgrade Featured 3.18k The Smol Training Playbook π 3.18k The secrets to building world-class LLMs
IlyaGusev/gemma-2-2b-it-abliterated Text Generation β’ 3B β’ Updated Jul 31, 2024 β’ 1.44k β’ β’ 50
Vikhrmodels/Vikhr-Llama-3.2-1B-Instruct Text Generation β’ 1B β’ Updated Sep 27, 2024 β’ 2.24k β’ β’ 46
Vikhrmodels/Vikhr-Qwen-2.5-0.5B-instruct-GGUF Text Generation β’ 0.5B β’ Updated Oct 6, 2024 β’ 282 β’ 9