Trained ExpRL checkpoints. Paper link: https://arxiv.org/abs/2606.17024
Violet Xiang PRO
violetxi
Recent Activity
updated a model about 1 hour ago
violetxi/opsd-physics-qwen3-4b-forward-kl-psonly
published a model about 1 hour ago
violetxi/opsd-physics-qwen3-4b-forward-kl-psonly
updated a model 3 days ago
violetxi/qwen35-4b-terminal-wm-summary-mixed