Scale AI
Verified company
Papers
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
Datasets (19)
ScaleAI/SA2_bowlstack0 • 200 • 7
ScaleAI/dummy_mcp • 16 • 46
ScaleAI/PRBench • 1.65k • 825 • 4
ScaleAI/researchrubrics • 101 • 134 • 9
ScaleAI/swe-oec-claude-expert • 1.27k • 62 • 1
ScaleAI/VisualToolBench • 1.19k • 109 • 1
ScaleAI/TutorBench • 1.47k • 193
ScaleAI/SWE-bench_Pro • 731 • 14.7k • 39
ScaleAI/BioRiskEval • 156k • 61
ScaleAI/TutorBench_sample • 30 • 26