Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
44
EvalEval Bot
EvalEvalBot
Follow
evijit's profile picture
1 follower
·
2 following
AI & ML interests
None yet
Recent Activity
new
activity
2 days ago
evaleval/EEE_datastore:
Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
new
activity
2 days ago
evaleval/EEE_datastore:
Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
new
activity
2 days ago
evaleval/EEE_datastore:
Add HELM AIR-Bench v1.16.0 results
View all activity
Organizations
EvalEvalBot
's activity
All
Models
Datasets
Spaces
Buckets
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
New activity in
evaleval/EEE_datastore
2 days ago
Add alphaXiv SOTA evaluations (27,976 records, 1,646 benchmarks)
10
#26 opened about 2 months ago by
simpod
Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
6
#65 opened 4 days ago by
karthikchundi
Add HELM AIR-Bench v1.16.0 results
4
#70 opened 4 days ago by
yifanmai
updated
a dataset
2 days ago
evaleval/EEE_datastore
Viewer
•
Updated
2 days ago
•
11.6k
•
6.94k
•
19
New activity in
evaleval/EEE_datastore
3 days ago
[Submission] Fix win_rate scale (0-1) and merge Fibble variants into composite benchmark
1
#71 opened 3 days ago by
drchangliu
New activity in
evaleval/EEE_datastore
4 days ago
[ACL Shared Task] Add AlpacaEval 1.0 and 2.0 leaderboard data (324 models)
1
#69 opened 4 days ago by
karthikchundi
[ACL Shared Task] Add SWE-bench Verified official leaderboard data
8
#63 opened 5 days ago by
jatinganhotra
[ACL Shared Task] Add BountyBench (DetectWorkflow) evaluation results
1
#67 opened 4 days ago by
mrpfisher
Add HELM Capabilities v1.15.0 results
1
#64 opened 4 days ago by
yifanmai
New activity in
evaleval/EEE_datastore
8 days ago
[ACL Shared Task] Add Artificial Analysis LLM results
2
#62 opened 8 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
9 days ago
[ACL Shared Task] Add Arcadia Impact Inspect evaluation results
🚀
2
5
#57 opened 11 days ago by
mrpfisher
New activity in
evaleval/EEE_datastore
11 days ago
Parquet for dataset viewer
#59 opened 11 days ago by
EvalEvalBot
Generating Parquets
2
#58 opened 11 days ago by
EvalEvalBot
[ACL Shared Task] Add ARC-AGI leaderboard results
11
#55 opened 19 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
12 days ago
[ACL Shared Task] Add SciArena leaderboard results
8
#54 opened 20 days ago by
Cerru02
[ACL Shared Task] Add Wordle Arena & Fibble Arena evaluation results
27
#35 opened about 1 month ago by
drchangliu
New activity in
evaleval/EEE_datastore
13 days ago
[ACL Shared Task] Add BFCL leaderboard results
5
#56 opened 19 days ago by
Cerru02
New activity in
evaleval/EEE_datastore
21 days ago
Upload Theory of Mind
4
#53 opened 21 days ago by
SirGankalot
Upload Theory of Mind
19
#38 opened about 1 month ago by
SirGankalot
New activity in
evaleval/EEE_datastore
22 days ago
Upload 5 files
1
#52 opened 25 days ago by
lmushro
Load more