Jarrod Barnes's picture

Jarrod Barnes PRO

Jarrodbarnes

·

https://arc.computer

AI & ML interests

Continual Learning, Reinforcement Learning

Recent Activity

upvoted an article 1 day ago

Forge: Scalable Agent RL Framework and Algorithm

upvoted an article 2 days ago

Learn the Hugging Face Kernel Hub in 5 Minutes

liked a model 2 days ago

snap-stanford/humanlm-opinion

View all activity

Organizations

upvoted an article 1 day ago

Article

Forge: Scalable Agent RL Framework and Algorithm

3 days ago

•

113

upvoted an article 2 days ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

+5

Jun 12, 2025

•

156

liked a model 2 days ago

snap-stanford/humanlm-opinion

Text Generation • 8B • Updated 3 days ago • 22 • 7

liked a model 3 days ago

Qwen/Qwen3-Coder-Next

Text Generation • 80B • Updated 13 days ago • 271k • • 870

liked a dataset 3 days ago

facebook/principia-collection

Viewer • Updated Dec 19, 2025 • 554k • 147 • 41

liked 2 models 3 days ago

Qwen/Qwen3-Coder-30B-A3B-Instruct

Text Generation • 31B • Updated Dec 3, 2025 • 652k • • 944

zhuyaoyu/CodeV-R1-RL-Qwen-7B

Text Generation • 8B • Updated Jun 20, 2025 • 185 • 6

upvoted a paper 3 days ago

ECHO-2: A Large-Scale Distributed Rollout Framework for Cost-Efficient Reinforcement Learning

Paper • 2602.02192 • Published 14 days ago • 12

liked a model 3 days ago

Jarrodbarnes/opensec-gdpo-4b

Text Generation • 4B • Updated 4 days ago • 81 • 1

upvoted a collection 4 days ago

Surprisal Guided Selection

Training at test-time for kernel optimization • 2 items • Updated 4 days ago • 1

updated a collection 4 days ago

Surprisal Guided Selection

Training at test-time for kernel optimization • 2 items • Updated 4 days ago • 1

updated a model 4 days ago

Jarrodbarnes/opensec-gdpo-4b

Text Generation • 4B • Updated 4 days ago • 81 • 1

updated a collection 4 days ago

OpenSec: Incident Response Agent Calibration

OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 4 days ago • 1

upvoted a collection 4 days ago

OpenSec: Incident Response Agent Calibration

OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 4 days ago • 1

updated a collection 4 days ago

OpenSec: Incident Response Agent Calibration

OpenSec is a dual-control RL environment, dataset, and evaluation suite that measures agent calibration on incident response tasks. • 4 items • Updated 4 days ago • 1

upvoted a collection 4 days ago

Agent World Model

4 items • Updated 5 days ago • 7

liked a model 4 days ago

zai-org/GLM-5

Text Generation • 754B • Updated 3 days ago • 128k • • 1.2k