GoLongRL
- GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment
  Paper • 2605.19577 • Published • 57
- Kwai-Klear/GoLongRL-30B-A3B
  Text Generation • 31B • Updated • 206 • 8
- Kwai-Klear/GoLongRL-4B
  Text Generation • 4B • Updated • 137 • 4
- Kwai-Klear/GoLongRL
  Viewer • Updated • 23k • 484 • 13