Natural Language Processing
The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents
Video-Based Reward Modeling for Computer-Use Agents