2 6 3

Xichen Pan PRO

xcpan

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

submitted a paper 3 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

updated a dataset 7 months ago

xcpan/jdb

View all activity

Organizations

upvoted a paper 3 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Paper • 2606.14700 • Published 6 days ago • 11

submitted a paper to Daily Papers 3 days ago

RepFusion: Leveraging Multimodal Priors for Denoising in Representation Space

Paper • 2606.14700 • Published 6 days ago • 11

updated a dataset 7 months ago

xcpan/jdb

Viewer • Updated Nov 22, 2025 • 4.19M • 4

published a dataset 7 months ago

xcpan/jdb

Viewer • Updated Nov 22, 2025 • 4.19M • 4

updated a model 7 months ago

xcpan/llava7b_ddt6_1152_trainddt_adaln_shift4k_3e4_8

Updated Nov 22, 2025

published a model 7 months ago

xcpan/llava7b_ddt6_1152_trainddt_adaln_shift4k_3e4_8

Updated Nov 22, 2025

published a dataset 7 months ago

xcpan/t2i_training_10m

Updated Nov 16, 2025 • 2

liked a model 8 months ago

Efficient-Large-Model/Sana_1600M_512px_diffusers

Text-to-Image • Updated Jan 10, 2025 • 3

upvoted a paper 9 months ago

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Paper • 2509.26625 • Published Sep 30, 2025 • 44

liked a dataset 12 months ago

xcpan/MetaQuery_Instruct_2.4M_512res

Viewer • Updated Jun 30, 2025 • 2.26M • 2.81k • 8

updated 2 datasets 12 months ago

xcpan/MetaQuery_Instruct_2.4M_512res

Viewer • Updated Jun 30, 2025 • 2.26M • 2.81k • 8

xcpan/MetaQuery_Instruct_2.4M

Viewer • Updated Jun 30, 2025 • 2.28M • 2.38k • 8

updated a collection 12 months ago

MetaQuery Instruction Tuning Data

Collection

We downsample high-resolution images so that the shorter side is 1024 pixels (MetaQuery_Instruct_2.4M) or 512 pixels (MetaQuery_Instruct_2.4M_512res) • 2 items • Updated Jun 24, 2025 • 1

published 2 datasets 12 months ago

xcpan/MetaQuery_Instruct_2.4M_512res

Viewer • Updated Jun 30, 2025 • 2.26M • 2.81k • 8

xcpan/MetaQuery_Instruct_2.4M

Viewer • Updated Jun 30, 2025 • 2.28M • 2.38k • 8

authored 4 papers about 1 year ago

Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis

Paper • 2505.10046 • Published May 15, 2025 • 9

PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop

Paper • 2503.09595 • Published Mar 12, 2025

Transfer between Modalities with MetaQueries

Paper • 2504.06256 • Published Apr 8, 2025 • 2

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14, 2025 • 100

Xichen Pan PRO

AI & ML interests

Recent Activity

Organizations

xcpan's activity