ComPO

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

PeterLauLukCh authored a paper 27 days ago

Reward-free Alignment for Conflicting Objectives

PeterLauLukCh submitted a paper 28 days ago

Reward-free Alignment for Conflicting Objectives

PeterLauLukCh authored a paper 2 months ago

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

View all activity

PeterLauLukCh

authored a paper 27 days ago

Reward-free Alignment for Conflicting Objectives

Paper • 2602.02495 • Published Feb 2 • 2

PeterLauLukCh

submitted a paper to Daily Papers 28 days ago

Reward-free Alignment for Conflicting Objectives

Paper • 2602.02495 • Published Feb 2 • 2

PeterLauLukCh

authored 2 papers 2 months ago

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Paper • 2512.16912 • Published Dec 18, 2025 • 13

GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators

Paper • 2512.19682 • Published Dec 22, 2025 • 19

PeterLauLukCh

submitted a paper to Daily Papers 3 months ago

Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward

Paper • 2512.16912 • Published Dec 18, 2025 • 13

PeterLauLukCh

updated a collection 5 months ago

Ablation on Scaling

Collection

2 items • Updated Oct 4, 2025

PeterLauLukCh

updated a Space 5 months ago

README

🦀

Released Models of ComPO

PeterLauLukCh

updated a model 7 months ago

ComparisonPO/Mistral-7B-Instruct-A0.21B-ComPO

7B • Updated Aug 4, 2025 • 2

PeterLauLukCh

published a model 7 months ago

ComparisonPO/Mistral-7B-Instruct-A0.21B-ComPO

7B • Updated Aug 4, 2025 • 2

PeterLauLukCh

updated a model 8 months ago

ComparisonPO/Mistral-7B-Instruct-ComPO-3300pert-300iter-2

7B • Updated Jul 21, 2025

PeterLauLukCh

published a model 8 months ago

ComparisonPO/Mistral-7B-Instruct-ComPO-3300pert-300iter-2

7B • Updated Jul 21, 2025

PeterLauLukCh

updated a collection 8 months ago

Ablation on Scaling

Collection

2 items • Updated Oct 4, 2025

PeterLauLukCh

updated a model 8 months ago

ComparisonPO/Mistral-7B-Instruct-ComPO-3300pert-300iter

7B • Updated Jul 13, 2025 • 1

PeterLauLukCh

published a model 8 months ago

ComparisonPO/Mistral-7B-Instruct-ComPO-3300pert-300iter

7B • Updated Jul 13, 2025 • 1

PeterLauLukCh

authored 4 papers 8 months ago

updated a model 11 months ago

ComparisonPO/Gemma-2-9b-it-SimPO-ComPO

9B • Updated Apr 10, 2025 • 3

PeterLauLukCh

updated a collection 11 months ago

SimPO+ComPO

Collection

3 items • Updated 3 days ago

AI & ML interests

Recent Activity

Team members 2

ComparisonPO's activity

README