-
The Ultra-Scale Playbook
๐3.88kThe ultimate guide to training LLM on large GPU Clusters
-
The Smol Training Playbook
๐3.2kThe secrets to building world-class LLMs
-
FineWeb: decanting the web for the finest text data at scale
๐ท1.36kExplore and download the FineWeb webโscale text dataset
-
Unlocking On-Policy Distillation for Any Model Family
๐110Visualize onโpolicy distillation token alignment
Aditya Bhosale
croeasusking
ยท
AI & ML interests
None yet
Recent Activity
updated a collection 9 days ago
HF Books liked a Space 9 days ago
dlouapre/eiffel-tower-llama updated a collection 24 days ago
HF BooksOrganizations
None yet