Sumuk Shashidhar PRO

sumuks
https://sumuk.org
  • sumukx
  • sumukshashidhar
  • sumuks

AI & ML interests

Evaluations, Reasoning, Long Term Planning

Recent Activity

updated a dataset 15 days ago
sumuks/preference-atlas-rewards
published a dataset 15 days ago
sumuks/preference-atlas-rewards
liked a dataset 16 days ago
sumuks/preference-atlas

Organizations

  • Blog-explorers
  • Verifiers For Code
  • Preference Agents
  • Sumuk's Archived Content
  • UIUC Conversational AI Lab
  • self-planner
  • Nerdy Face
  • Sumuk's Testing Grounds!
  • Spiral Works
  • Your Bench
  • Sumuk's Second Set of Archived Content
  • InfoHunt
  • TextCleanLM
  • Sumuk's First Archival Storage Volume
  • popper
  • Sumuk's Archival Storage 2
  • Sumuk's Archival Storage 3

Articles 1

Getting Started with YourBench

Papers 5

arxiv:2505.01592
arxiv:2504.20090
arxiv:2504.01833
arxiv:2410.03731

Models 0

None public yet

Datasets 28

sumuks/preference-atlas-rewards

Viewer • Updated 15 days ago • 5.03k • 29

sumuks/preference-atlas

Viewer • Updated 15 days ago • 329k • 102 • 1

sumuks/reward-bench-2

Viewer • Updated 16 days ago • 1.87k • 43

sumuks/helpsteer3

Viewer • Updated 17 days ago • 49.1k • 284

sumuks/helpsteer3-easy

Viewer • Updated 23 days ago • 7.93k • 29

sumuks/helpsteer-pairwise-grading

Viewer • Updated 28 days ago • 51.8k • 19

sumuks/rupo-eval-logs-helpsteer3-1

Viewer • Updated 29 days ago • 1.43k • 55

sumuks/helpsteer3-rupo

Viewer • Updated 30 days ago • 38.2k • 170

sumuks/persuasiveness_detection

Viewer • Updated Feb 6 • 3.94k • 11

sumuks/rupo-eval-humanlike-dpo-dataset-lbhr-2

Preview • Updated Feb 6 • 20