Reward model
Reward modelling
RLHFlow/SHP-standard • Updated May 9, 2024 • 93.3k • 12
transZ/shp • Updated Jan 23 • 10.3k • 10
RLHFlow/HH-RLHF-Helpful-standard • Updated Apr 27, 2024 • 115k • 119 • 4
transZ/anthropic_helpful_test • Updated Jan 23 • 2.33k • 4
Science stuffs
Related to science and stuffs
Nbardy/science-theory-textbooks • Updated Feb 11, 2024 • 73k • 373 • 8
openai/frontierscience • Updated Dec 16, 2025 • 160 • 6.65k • 164
MegaScience/TextbookReasoning • Updated Jul 24, 2025 • 652k • 719 • 32
MegaScience/MegaScience • Updated Jul 24, 2025 • 1.25M • 6.65k • 129
Good data
Collection of cool data
wikimedia/wikipedia • Updated Jan 9, 2024 • 61.6M • 108k • 1.2k
graelo/wikipedia • Updated Sep 10, 2023 • 105M • 1.97k • 71
open-r1/ioi • Updated Mar 12, 2025 • 270 • 62 • 10
fhai50032/GRPO-SFT-0.5k-4096 • Updated Feb 12, 2025 • 532 • 4