1 5 1

Hejun Dong

fickle1101

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

upvoted a paper about 2 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

commentedon a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

View all activity

Organizations

upvoted a paper 1 day ago

OVO-S-Bench: A Hierarchical Benchmark for Streaming Spatial Intelligence in Multimodal LLMs

Paper • 2606.03890 • Published 4 days ago • 30

upvoted a paper about 2 months ago

MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale

Paper • 2604.04771 • Published Apr 6 • 123

commented a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 137 •

upvoted a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 137

updated a Space 2 months ago

MinerU Diffusion V1 0320 2.5B

🦀

demo of MinerU-Diffusion

published a Space 2 months ago

MinerU Diffusion V1 0320 2.5B

🦀

demo of MinerU-Diffusion

updated a model 4 months ago

fickle1101/nolayout_final_108k

3B • Updated Jan 30 • 1

published a model 4 months ago

fickle1101/nolayout_final_108k

3B • Updated Jan 30 • 1

updated a model 4 months ago

fickle1101/no_merger_pm2x_custom_lr_best_results

Updated Jan 26

published a model 4 months ago

fickle1101/no_merger_pm2x_custom_lr_best_results

Updated Jan 26

updated a model 5 months ago

fickle1101/no_merger_pm2x_s2_4e_best

3B • Updated Jan 20 • 1

published a model 5 months ago

fickle1101/no_merger_pm2x_s2_4e_best

3B • Updated Jan 20 • 1

liked a model 8 months ago

opendatalab/MinerU2.5-2509-1.2B

Image-Text-to-Text • 1B • Updated Apr 9 • 69.4k • 360

upvoted a paper 8 months ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26, 2025 • 165

published a model 12 months ago

fickle1101/native_qwen2_5_vit_ocr

1B • Updated Jun 16, 2025 • 1

updated a model 12 months ago

fickle1101/native_qwen2_5_vit_ocr

1B • Updated Jun 16, 2025 • 1

upvoted a paper about 1 year ago

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

Paper • 2504.09925 • Published Apr 14, 2025 • 39

Hejun Dong

AI & ML interests

Recent Activity

Organizations

fickle1101's activity

MinerU Diffusion V1 0320 2.5B

MinerU Diffusion V1 0320 2.5B