AI & ML interests
Building breatkthrough AI to solve the world's biggest problems.
Recent Activity
View all activity
Papers
TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs
Organization Card
spaces 13
pinned
Running
20
AstaBench Leaderboard
🥇
View benchmark leaderboards
pinned
Running
422
Reward Bench Leaderboard
📐
Explore RewardBench model rankings and scores
pinned
Sleeping
2
HREF Leaderboard
📐
Browse and search HREF leaderboard data
pinned
Running
91
Zebra Logic Bench
🦓
Show leaderboard and explore model puzzle results
pinned
Running
3
SUPER Leaderboard
🤖
Display a static leaderboard from a JSON file
pinned
Running
53
ZeroEval Leaderboard
📊
Embed ZeroEval for evaluation
models 858
allenai/Olmo-Hybrid-7B
Text Generation • Updated
• 15.4k • 33
allenai/Olmo-Hybrid-Think-SFT-7B
Text Generation • Updated
• 601 • 10
allenai/Olmo-Hybrid-Instruct-DPO-7B
Text Generation • 7B • Updated
• 2.67k • 14
allenai/Olmo-Hybrid-Instruct-SFT-7B
Text Generation • Updated
• 1.48k • 10
allenai/FlexOlmo-7x7B-1T-RT
Text Generation • 33B • Updated
• 119 • 7
allenai/FlexOlmo-7x7B-1T
Text Generation • 33B • Updated
• 237 • 39
allenai/Flex-public-7B-1T
Text Generation • 7B • Updated
• 295 • 5
allenai/Flex-reddit-2x7B-1T
Text Generation • 12B • Updated
• 4.87k • 7
allenai/Flex-pes2o-2x7B-1T
Text Generation • 12B • Updated
• 195 • 2
allenai/Flex-news-2x7B-1T
Text Generation • 12B • Updated
• 198 • 3
datasets 420
allenai/Molmo2-VideoPoint
Viewer
• Updated
• 1.32M • 377 • 5
allenai/Dolci-Think-SFT-Olmo-Hybrid-Tool-Use-SA
Viewer
• Updated
• 1.6k • 65 • 6
allenai/Dolci-Think-SFT-Olmo-Hybrid
Viewer
• Updated
• 2.93M • 203 • 7
allenai/Sera-4.6-Lite-47000
Viewer
• Updated
• 31k • 94 • 1
allenai/molmospaces
Viewer
• Updated
• 772k • 7.21k • 39
allenai/molmo2-single-object-track
Viewer
• Updated
• 368k • 62
allenai/molmo2-reasonvos
Viewer
• Updated
• 458 • 159 • 1
allenai/molmo2-burst
Viewer
• Updated
• 2.86k • 29
allenai/molmo2-mevis-valid
Viewer
• Updated
• 2.24k • 153
allenai/molmo2-ref-davis17
Viewer
• Updated
• 1.38k • 22