Submitted by Patrick Jiang 53 Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses chroma 637 2