LaSeR - a Keven16 Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Keven16 's Collections

LaSeR

LaSeR

updated Oct 17

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding"

Keven16/ORZ-7B-LaSeR

8B • Updated Oct 15 • 11 • 1
Keven16/Qwen2.5-7B-LaSeR

8B • Updated Oct 15 • 8
Keven16/OctoThinker-3B-Short-LaSeR

4B • Updated Oct 15 • 5
Keven16/LaSeR_training_data

Viewer • Updated Oct 16 • 104k • 57 • 2
LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

Paper • 2510.14943 • Published Oct 16 • 39

Collection guide
Browse collections

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs