WritingBench: A Comprehensive Benchmark for Generative Writing Paper • 2503.05244 • Published Mar 7, 2025 • 20
NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models Paper • 2410.11805 • Published Oct 15, 2024 • 14