Xiao

Yang1213112131

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs

upvoted a paper 16 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

liked a Space about 2 months ago

camel-ai/Paper2Poster


Organizations

None yet

upvoted a paper 3 days ago

SpatialTree: How Spatial Abilities Branch Out in MLLMs

Paper • 2512.20617 • Published 3 days ago • 41

upvoted a paper 16 days ago

StereoWorld: Geometry-Aware Monocular-to-Stereo Video Generation

Paper • 2512.09363 • Published 17 days ago • 70

liked a Space about 2 months ago

Paper2Poster


updated a dataset about 2 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29 • 1.69M • 45 • 1

authored a paper about 2 months ago

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Paper • 2505.23155 • Published May 29 • 2

upvoted a paper about 2 months ago

PreFM: Online Audio-Visual Event Parsing via Predictive Future Modeling

Paper • 2505.23155 • Published May 29 • 2

upvoted an article about 2 months ago

Cosmos Predict 2.5 & Transfer 2.5: Evolving the World Foundation Models for Physical AI

Article • Oct 28

liked a dataset 2 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29 • 1.69M • 45 • 1

published a dataset 2 months ago

Yang1213112131/PreFM

Viewer • Updated Oct 29 • 1.69M • 45 • 1

upvoted a paper 3 months ago

UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models

Paper • 2509.21760 • Published Sep 26 • 14

upvoted 4 papers 4 months ago

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Paper • 2509.09674 • Published Sep 11 • 80

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Paper • 2509.09174 • Published Sep 11 • 61

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10 • 128

From Editor to Dense Geometry Estimator

Paper • 2509.04338 • Published Sep 4 • 92

upvoted 3 papers 6 months ago

upvoted a paper 7 months ago

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

Paper • 2506.09513 • Published Jun 11 • 101
