Spaces:

opencompass
/

ATLAS

Sleeping

ATLAS / README.md

“pangjh3”

modified: README.md

4126a18 26 days ago

759 Bytes

	---
	title: ATLAS Benchmark
	emoji: 🧪
	colorFrom: green
	colorTo: indigo
	sdk: gradio
	app_file: app.py
	pinned: true
	license: apache-2.0
	short_description: ATLAS for Frontier Scientific Benchmark
	sdk_version: 5.43.1
	hf_oauth: true
	tags:
	- leaderboard
	- science
	- benchmark
	- evaluation
	---

	# ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

	ATLAS is a high-difficulty, multidisciplinary benchmark for frontier scientific reasoning. It is designed to evaluate the capabilities of large language models (LLMs) in scientific reasoning across seven core scientific fields covering the key domains of AI for Science (AI4S):

	- Mathematics
	- Physics
	- Chemistry
	- Biology
	- Computer Science
	- Earth Science
	- Materials Science