Collections

Discover the best community collections!

Collections including paper arxiv:2502.13923

EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters

Paper • 2402.04252 • Published Feb 6, 2024 • 29
Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models

Paper • 2402.03749 • Published Feb 6, 2024 • 14
ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Paper • 2402.04615 • Published Feb 7, 2024 • 44
EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss

Paper • 2402.05008 • Published Feb 7, 2024 • 23

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 429
Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 317
Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 264

LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update

Paper • 2106.13914 • Published Jun 26, 2021 • 1
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

Paper • 2506.15196 • Published Jun 18 • 3
Ascend HiFloat8 Format for Deep Learning

Paper • 2409.16626 • Published Sep 25, 2024 • 1
Recipes for Pre-training LLMs with MXFP8

Paper • 2506.08027 • Published May 30 • 1

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211
Gemma 3 Technical Report

Paper • 2503.19786 • Published Mar 25 • 54

AI Paper of the Day

A collection of papers that I think are interesting, one added each day

about 21 hours ago

Can Large Language Models Understand Context?

Paper • 2402.00858 • Published Feb 1, 2024 • 23
OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 85
Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18, 2024 • 151
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity

Paper • 2401.17072 • Published Jan 30, 2024 • 25

Personal Favourites

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 211

Vision Language Models: 2025 Update

This collection includes all the models, datasets and Spaces mentioned in the blog Vision Language Models: 2025 Update

Qwen/Qwen2.5-Omni-7B

Any-to-Any • 11B • Updated Apr 30 • 149k • 1.83k
Qwen2.5 Omni 7B Demo

Running • Featured • 363 • Generate text and speech from text, audio, images, and videos
Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published Mar 26 • 166
openbmb/MiniCPM-o-2_6

Any-to-Any • 9B • Updated Oct 5 • 101k • 1.27k

R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization

Paper • 2503.10615 • Published Mar 13 • 17
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Paper • 2503.10630 • Published Mar 13 • 6
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning

Paper • 2503.09516 • Published Mar 12 • 36
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 88
