ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning Paper • 2602.02192 • Published 13 days ago • 12
Surprisal Guided Selection Collection Training at test-time for kernel optimization • 2 items • Updated 3 days ago • 1
Surprisal Guided Selection Collection Training at test-time for kernel optimization • 2 items • Updated 3 days ago • 1
OpenSec: Incident Response Agent Calibration Collection OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 3 days ago • 1
OpenSec: Incident Response Agent Calibration Collection OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 3 days ago • 1
OpenSec: Incident Response Agent Calibration Collection OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 3 days ago • 1