Data
The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale • Paper • 2406.17557 • Published Jun 25, 2024 • 105
facebook/OMol25 • Updated about 19 hours ago • 236
ScienceOne-AI/S1-Base-671B • 684B • Updated Sep 11, 2025 • 32 • 30
ibm-granite/granite-docling-258M-mlx • Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 3.03k • 99