Running 3.76k The Ultra-Scale Playbook ๐ 3.76k The ultimate guide to training LLM on large GPU Clusters
meta-llama/Llama-3.1-8B-Instruct Text Generation โข 8B โข Updated Sep 25, 2024 โข 8.36M โข โข 5.65k