VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published 10 days ago • 120
VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval Paper • 2602.08099 • Published 10 days ago • 120
Find your Needle: Small Object Image Retrieval via Multi-Object Attention Optimization Paper • 2503.07038 • Published Mar 10, 2025
EffoVPR: Effective Foundation Model Utilization for Visual Place Recognition Paper • 2405.18065 • Published May 28, 2024
Fast Autoregressive Video Diffusion and World Models with Temporal Cache Compression and Sparse Attention Paper • 2602.01801 • Published 16 days ago • 28