Language, Intelligence, and Model Evaluation Lab

non-profit

https://limenlp.github.io/

AI & ML interests

Natural Language Processing

Papers

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Video-Based Reward Modeling for Computer-Use Agents

View all Papers

lime-nlp 's datasets 8

lime-nlp/OS-Blind

Updated Apr 14 • 21 • 6

lime-nlp/ExeVR-53k

Updated Mar 12 • 228 • 4

lime-nlp/Synthetic_Unanswerable_Math

Viewer • Updated May 21, 2025 • 36.8k • 51 • 14

lime-nlp/DeepScaleR_Difficulty

Viewer • Updated Apr 10, 2025 • 5.06M • 78 • 11

lime-nlp/orz_math_difficulty

Viewer • Updated Apr 10, 2025 • 6.18M • 58

lime-nlp/MATH_Difficulty

Viewer • Updated Apr 10, 2025 • 1.61M • 38

lime-nlp/GSM8K_Difficulty

Viewer • Updated Apr 9, 2025 • 1.13M • 48 • 1

lime-nlp/safer-instruct

Viewer • Updated Mar 25, 2025 • 11.2k • 37 • 1