1 15 2

Kaiyuan Chen

Lucky2022

AI & ML interests

None yet

Recent Activity

upvoted a paper about 4 hours ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

upvoted a paper 9 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

authored a paper 19 days ago

Virtual Width Networks

View all activity

Organizations

upvoted a paper about 4 hours ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 2 days ago • 23

upvoted a paper 9 days ago

P1: Mastering Physics Olympiads with Reinforcement Learning

Paper • 2511.13612 • Published 23 days ago • 132

authored a paper 19 days ago

Virtual Width Networks

Paper • 2511.11238 • Published 26 days ago • 35

upvoted a paper 23 days ago

Virtual Width Networks

Paper • 2511.11238 • Published 26 days ago • 35

upvoted a paper 26 days ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 28 days ago • 194

upvoted a collection 4 months ago

Seed-OSS

Collection

Seed-OSS Open-Source Models • 3 items • Updated Aug 20 • 58

authored a paper 6 months ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published Jun 16 • 8

upvoted a paper 6 months ago

xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations

Paper • 2506.13651 • Published Jun 16 • 8

liked 2 datasets 6 months ago

xbench/ScienceQA

Viewer • Updated Jun 18 • 100 • 62 • 8

xbench/DeepSearch

Viewer • Updated Jun 18 • 100 • 324 • 12

upvoted 3 papers 7 months ago

upvoted 2 papers 8 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21 • 77

Learning to Reason under Off-Policy Guidance

Paper • 2504.14945 • Published Apr 21 • 88

upvoted 3 papers 10 months ago

RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback

Paper • 2312.00849 • Published Dec 1, 2023 • 12

MiniCPM-V: A GPT-4V Level MLLM on Your Phone

Paper • 2408.01800 • Published Aug 3, 2024 • 89

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 22

commented a paper 10 months ago

Unveiling Downstream Performance Scaling of LLMs: A Clustering-Based Perspective

Paper • 2502.17262 • Published Feb 24 • 22 •

authored a paper 10 months ago

CogDPM: Diffusion Probabilistic Models via Cognitive Predictive Coding

Paper • 2405.02384 • Published May 3, 2024

Kaiyuan Chen

AI & ML interests

Recent Activity

Organizations

Lucky2022's activity