nvidia/NVIDIA-Nemotron-3-Super-120B-A12B-BF16 Text Generation • 124B • Updated 2 days ago • 27.3k • 225
Nemotron-Pre-Training-Datasets Collection Large scale pre-training datasets used in the Nemotron family of models. • 12 items • Updated about 1 hour ago • 121
Running on CPU Upgrade 187 The Synthetic Data Playbook: Generating Trillions of the Finest Tokens 📝 187 Visualize synthetic data experiments as an interactive bookshelf