
Jiawei Gu

kuvvi

AI & ML interests

None yet

Recent Activity



upvoted a paper about 13 hours ago

Draft-OPD: On-Policy Distillation for Speculative Draft Models

Paper • 2605.29343 • Published 6 days ago • 25

upvoted a paper 11 days ago

π-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows

Paper • 2605.14678 • Published 15 days ago • 102

upvoted a paper 19 days ago

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling

Paper • 2605.13301 • Published 21 days ago • 159

upvoted a paper 20 days ago

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning

Paper • 2605.06326 • Published 27 days ago • 26

upvoted a paper about 1 month ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published Apr 26 • 33

updated a model about 2 months ago

kuvvi/aoao007

15B • Updated Apr 14 • 1

published a model about 2 months ago

kuvvi/aoao007

15B • Updated Apr 14 • 1

upvoted 2 papers 2 months ago

GEMS: Agent-Native Multimodal Generation with Memory and Skills

Paper • 2603.28088 • Published Mar 30 • 86

VGGRPO: Towards World-Consistent Video Generation with 4D Latent Reward

Paper • 2603.26599 • Published Mar 27 • 66

upvoted 2 papers 3 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 429

TOPReward: Token Probabilities as Hidden Zero-Shot Rewards for Robotics

Paper • 2602.19313 • Published Feb 22 • 26

upvoted 3 papers 4 months ago

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning

Paper • 2602.11748 • Published Feb 12 • 38

P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads

Paper • 2602.09443 • Published Feb 10 • 59

Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations

Paper • 2602.05885 • Published Feb 5 • 28

authored a paper 4 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

upvoted a paper 4 months ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published Jan 26 • 48

upvoted a collection 4 months ago

AdaReasoner

Collection

AdaReasoner: Models and Datasets • 12 items • Updated Jan 28 • 3

upvoted a paper 5 months ago

DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models

Paper • 2512.24165 • Published Dec 30, 2025 • 52

liked 2 datasets 6 months ago

ThinkMorph/Visual_Search

Viewer • Updated Nov 3, 2025 • 6.99k • 167 • 1

ThinkMorph/Spatial_Navigation

Viewer • Updated Nov 3, 2025 • 6k • 92 • 2
