Spaces:

opencompass
/

ATLAS

Sleeping

App Files Files Community

ATLAS / README.md

“pangjh3”

modified: README.md

4126a18 26 days ago

preview code

raw

history blame contribute delete

759 Bytes

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

metadata

title: ATLAS Benchmark
emoji: 🧪
colorFrom: green
colorTo: indigo
sdk: gradio
app_file: app.py
pinned: true
license: apache-2.0
short_description: ATLAS for Frontier Scientific Benchmark
sdk_version: 5.43.1
hf_oauth: true
tags:
  - leaderboard
  - science
  - benchmark
  - evaluation

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

ATLAS is a high-difficulty, multidisciplinary benchmark for frontier scientific reasoning. It is designed to evaluate the capabilities of large language models (LLMs) in scientific reasoning across seven core scientific fields covering the key domains of AI for Science (AI4S):

Mathematics
Physics
Chemistry
Biology
Computer Science
Earth Science
Materials Science