GLM-4.7 / .eval_results /evasionbench.yaml
- dataset:
    id: FutureMa/EvasionBench
    task_id: evasion_bench
  value: 82.91
  date: "2026-02-10"
  source:
    url: https://arxiv.org/abs/2601.09142
    name: EvasionBench Paper