chenzehao's picture

chenzehao

chhao

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

upvoted a paper 2 months ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

upvoted a paper 2 months ago

AI Can Learn Scientific Taste

View all activity

Organizations

None yet

upvoted 7 papers 2 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20, 2025 • 110

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 427

Attention Residuals

Paper • 2603.15031 • Published Mar 16 • 185

Demystifing Video Reasoning

Paper • 2603.16870 • Published Mar 17 • 372

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published Mar 12 • 53

Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training

Paper • 2603.12255 • Published Mar 12 • 91

upvoted 8 papers 3 months ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 197

Qwen3-Coder-Next Technical Report

Paper • 2603.00729 • Published Feb 28 • 65

Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization

Paper • 2602.23008 • Published Feb 26 • 37

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published Feb 23 • 524

VESPO: Variational Sequence-Level Soft Policy Optimization for Stable Off-Policy LLM Training

Paper • 2602.10693 • Published Feb 11 • 221

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Paper • 2602.08354 • Published Feb 9 • 266

TermiGen: High-Fidelity Environment and Robust Trajectory Synthesis for Terminal Agents

Paper • 2602.07274 • Published Feb 6 • 210

Less is Enough: Synthesizing Diverse Data in Feature Space of LLMs

Paper • 2602.10388 • Published Feb 11 • 245

upvoted 3 papers 4 months ago

Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization

Paper • 2506.17252 • Published Jun 8, 2025 • 2

Real-Time Aligned Reward Model beyond Semantics

Paper • 2601.22664 • Published Jan 30 • 15

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Paper • 2602.08222 • Published Feb 9 • 290