1 20 1

Zijian Wu PRO

Jakumetsu

zjwu0522

AI & ML interests

AGI

Recent Activity

upvoted a paper 8 days ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

upvoted a paper 11 days ago

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

upvoted a paper 12 days ago

Kimi K2.5: Visual Agentic Intelligence

View all activity

Organizations

upvoted a paper 8 days ago

Secure Code Generation via Online Reinforcement Learning with Vulnerability Reward Model

Paper • 2602.07422 • Published 16 days ago • 21

upvoted a paper 11 days ago

When to Memorize and When to Stop: Gated Recurrent Memory for Long-Context Reasoning

Paper • 2602.10560 • Published 12 days ago • 28

upvoted a paper 12 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 21 days ago • 238

authored a paper 17 days ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 21 days ago • 238

upvoted a paper 18 days ago

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published 19 days ago • 34

upvoted a paper 26 days ago

AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning

Paper • 2601.18631 • Published 28 days ago • 47

upvoted a paper 3 months ago

Reasoning with Confidence: Efficient Verification of LLM Reasoning Steps via Uncertainty Heads

Paper • 2511.06209 • Published Nov 9, 2025 • 19

upvoted 2 papers 4 months ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published Nov 5, 2025 • 129

Defeating the Training-Inference Mismatch via FP16

Paper • 2510.26788 • Published Oct 30, 2025 • 31

commented 2 papers 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176 •

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176 •

upvoted a paper 5 months ago

GEM: A Gym for Agentic LLMs

Paper • 2510.01051 • Published Oct 1, 2025 • 90

authored a paper 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176

upvoted a paper 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176

commented a paper 5 months ago

MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP Use

Paper • 2509.24002 • Published Sep 28, 2025 • 176 •

upvoted 2 papers 5 months ago

Language Models Can Learn from Verbal Feedback Without Scalar Rewards

Paper • 2509.22638 • Published Sep 26, 2025 • 70

Variational Reasoning for Language Models

Paper • 2509.22637 • Published Sep 26, 2025 • 69

updated a dataset 6 months ago

Jakumetsu/mcpmark-trajectory-log

Updated Sep 9, 2025 • 4

published a dataset 6 months ago

Jakumetsu/mcpmark-trajectory-log

Updated Sep 9, 2025 • 4

upvoted a paper 8 months ago

Balancing Truthfulness and Informativeness with Uncertainty-Aware Instruction Fine-Tuning

Paper • 2502.11962 • Published Feb 17, 2025 • 38

Zijian Wu PRO

AI & ML interests

Recent Activity

Organizations

Jakumetsu's activity