zai-org
/

GLM-5

Text Generation

Model card Files Files and versions

Add evaluation results for GPQA, HLE

#22

by SaylorTwift HF Staff - opened about 13 hours ago

base: refs/heads/main

←

from: refs/pr/22

Discussion Files changed

Add evaluation results for GPQA, HLE25f841a1

about 13 hours ago

•

edited about 13 hours ago

Evaluation Results

This PR adds evaluation results extracted from the Model Card.

Benchmarks:

HLE: 30.5
HLE: 50.4
GPQA: 86.0

Files created:

.eval_results/hle.yaml
.eval_results/hle_with_tools.yaml
.eval_results/gpqa.yaml

Update .eval_results/hle.yaml22c99c0c

Update .eval_results/hle_with_tools.yamlf7d34fbc

Update .eval_results/hle_with_tools.yamla185b9b5

Update .eval_results/hle_with_tools.yamlc3a7b733

Update .eval_results/gpqa.yaml0dbd2351

Update .eval_results/hle_with_tools.yamlced4d530

ZHANGYUXUAN-zR changed pull request status to merged about 12 hours ago

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment