FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring Paper • 2512.04390 • Published 5 days ago • 6
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model Paper • 2512.01030 • Published 8 days ago • 16
MMaDA-Parallel: Multimodal Large Diffusion Language Models for Thinking-Aware Editing and Generation Paper • 2511.09611 • Published 26 days ago • 68
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published Oct 16 • 104
Optimized Table Tokenization for Table Structure Recognition Paper • 2305.03393 • Published May 5, 2023 • 1
PubTables-1M: Towards comprehensive table extraction from unstructured documents Paper • 2110.00061 • Published Sep 30, 2021 • 3