Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 8 days ago • 40
NeMo Gym Collection Collection of RL verifiable data for NeMo Gym • 13 items • Updated 8 days ago • 31
Qwen2.5-Coder Collection Code-specific model series based on Qwen2.5 • 40 items • Updated about 9 hours ago • 350
BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions Paper • 2406.15877 • Published Jun 22, 2024 • 48
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12, 2024 • 147
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning Paper • 2311.11077 • Published Nov 18, 2023 • 29