FutureMa committed on
Commit 35591b9 · verified · 1 Parent(s): 9d90cf8

Add EvasionBench evaluation results

## Summary

Add evaluation results from the EvasionBench benchmark, which measures detection of managerial evasion in earnings-call Q&A.

## Details

- **Benchmark**: [EvasionBench](https://huggingface.co/datasets/FutureMa/EvasionBench)
- **Paper**: [arXiv:2601.09142](https://arxiv.org/abs/2601.09142)
- **Metric**: Macro-F1
- **Score**: 78.16%
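
For readers unfamiliar with the metric, here is a minimal sketch of how Macro-F1 is typically computed: F1 is calculated per class and then averaged with equal weight per class, regardless of class frequency. The function name `macro_f1` and the labels `"direct"` and `"evasive"` are illustrative assumptions, not the benchmark's actual label set or evaluation code.

```python
def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores (illustrative sketch only)."""
    # Consider every label that appears in either the gold or predicted labels.
    labels = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = sum(1 for t, p in pairs if t == label and p != label)
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        denom = 2 * tp + fp + fn
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Hypothetical toy labels, not EvasionBench data.
gold = ["direct", "direct", "evasive", "evasive"]
pred = ["direct", "evasive", "evasive", "evasive"]
score = macro_f1(gold, pred)
print(f"Macro-F1: {score:.4f}")
```

Because each class contributes equally to the average, a model cannot reach a high Macro-F1 by performing well only on the most frequent class.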

## Reference

For details on the evaluation methodology, see:
- [EvasionBench Dataset](https://huggingface.co/datasets/FutureMa/EvasionBench)
- [Project Page](https://iiiiqiiii.github.io/EvasionBench)

Files changed (1)
  1. .eval_results/evasionbench.yaml +8 -0
.eval_results/evasionbench.yaml ADDED
@@ -0,0 +1,8 @@
+ - dataset:
+     id: FutureMa/EvasionBench
+     task_id: evasion_bench
+   value: 78.16
+   date: "2026-02-10"
+   source:
+     url: https://arxiv.org/abs/2601.09142
+     name: EvasionBench Paper