8 143 8

Harold Chen

Harold328

https://haroldchen19.github.io/

HaroldChen19

AI & ML interests

Computer Vision

Recent Activity

upvoted a paper about 5 hours ago

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

upvoted a paper 1 day ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

upvoted a paper 1 day ago

NITP: Next Implicit Token Prediction for LLM Pre-training

View all activity

Organizations

None yet

upvoted a paper about 5 hours ago

Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking

Paper • 2606.03985 • Published 1 day ago • 28

upvoted 2 papers 1 day ago

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Paper • 2605.30351 • Published 7 days ago • 24

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published 11 days ago • 27

upvoted 2 papers 2 days ago

Representation Forcing for Bottleneck-Free Unified Multimodal Models

Paper • 2605.31604 • Published 6 days ago • 51

COLLEAGUE.SKILL: Automated AI Skill Generation via Expert Knowledge Distillation

Paper • 2605.31264 • Published 6 days ago • 100

upvoted 3 papers 5 days ago

AdaState: Self-Evolving Anchors for Streaming Video Generation

Paper • 2605.30349 • Published 7 days ago • 11

GenClaw: Code-Driven Agentic Image Generation

Paper • 2605.30248 • Published 7 days ago • 35

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 7 days ago • 132

upvoted 2 papers 6 days ago

GEM: Generative Supervision Helps Embodied Intelligence

Paper • 2605.28548 • Published 8 days ago • 40

Self-Improving Language Models with Bidirectional Evolutionary Search

Paper • 2605.28814 • Published 8 days ago • 58

upvoted 2 papers 7 days ago

Soap2Soap: Long Cinematic Video Remaking via Multi-Agent Collaboration

Paper • 2605.17423 • Published 18 days ago • 33

On-Policy Adversarial Flow Distillation for Autoregressive Video Generation

Paper • 2605.26105 • Published 10 days ago • 18

upvoted 3 papers 8 days ago

WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation

Paper • 2605.25874 • Published 10 days ago • 101

PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion

Paper • 2605.23902 • Published 13 days ago • 45

PhotoFlow: Agentic 3D Virtual Photography Missions

Paper • 2605.23771 • Published 13 days ago • 26

upvoted a paper 9 days ago

SkillOpt: Executive Strategy for Self-Evolving Agent Skills

Paper • 2605.23904 • Published 13 days ago • 219

upvoted a paper 12 days ago

One Sentence, One Drama: Personalized Short-Form Drama Generation via Multi-Agent Systems

Paper • 2605.22144 • Published 14 days ago • 10

upvoted a paper 14 days ago

AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration

Paper • 2605.20025 • Published 16 days ago • 185

upvoted a paper 15 days ago

Lance: Unified Multimodal Modeling by Multi-Task Synergy

Paper • 2605.18678 • Published 17 days ago • 78

upvoted a paper 16 days ago

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

Paper • 2605.11739 • Published 22 days ago • 59

Harold Chen

AI & ML interests

Recent Activity

Organizations

Harold328's activity