Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

MixEval

community
https://mixeval.github.io/
NiJinjie
Psycoy
Activity Feed

AI & ML interests

LLM & LMM evaluation

Recent Activity

yuexiang96  authored a paper about 1 month ago
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents
yuexiang96  authored a paper about 1 month ago
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution
yuexiang96  authored a paper about 1 month ago
Simulating Environments with Reasoning Models for Agent Training
View all activity

Jinjie Ni's profile picture Fuzhao Xue's profile picture Xiang Yue's profile picture Deepanway's profile picture Bo Li's profile picture David Junhao ZHANG's profile picture Yifan Song's profile picture

models 0

None public yet

datasets 2

MixEval/MixEval-X

Viewer • Updated Feb 15, 2025 • 7.68k • 246 • 10

MixEval/MixEval

Viewer • Updated Sep 27, 2024 • 5k • 141 • 24
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs