蔡正舟

conctsai

AI & ML interests

None yet

Recent Activity

authored a paper 4 days ago

Learning to Self-Verify Makes Language Models Better Reasoners

authored a paper 4 days ago

Look Before You Leap: Autonomous Exploration for LLM Agents

authored a paper 4 days ago

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Organizations

None yet

authored 3 papers 4 days ago

Learning to Self-Verify Makes Language Models Better Reasoners

Paper • 2602.07594 • Published Feb 7 • 3

Look Before You Leap: Autonomous Exploration for LLM Agents

Paper • 2605.16143 • Published 18 days ago • 9

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Paper • 2605.27141 • Published 7 days ago • 16

upvoted a paper 5 days ago

VitaBench 2.0: Evaluating Personalized and Proactive Agents in Long-Term User Interactions

Paper • 2605.27141 • Published 7 days ago • 16

upvoted 6 papers 14 days ago

HodgeCover: Higher-Order Topological Coverage Drives Compression of Sparse Mixture-of-Experts

Paper • 2605.13997 • Published 20 days ago • 5

Look Before You Leap: Autonomous Exploration for LLM Agents

Paper • 2605.16143 • Published 18 days ago • 9

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

Paper • 2605.14539 • Published 19 days ago • 5

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 20 days ago • 59

Distilling Long-CoT Reasoning through Collaborative Step-wise Multi-Teacher Decoding

Paper • 2605.02290 • Published 29 days ago • 40

Nudging Beyond the Comfort Zone: Efficient Strategy-Guided Exploration for RLVR

Paper • 2605.15726 • Published 18 days ago • 34

upvoted a paper about 1 month ago

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Paper • 2604.02268 • Published Apr 2 • 101

published a dataset about 1 year ago

conctsai/video-r1-image

Updated Apr 11, 2025 • 5