Collections
Discover the best community collections!
Collections trending this week
-
Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale
📝29 -
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
Paper • 2601.04890 • Published • 40 -
tiiuae/Falcon-H1-Tiny-90M-Instruct
Text Generation • 91.1M • Updated • 775 • 11 -
tiiuae/Falcon-H1-Tiny-90M-Instruct-GGUF
91.1M • Updated • 2.39k • 9
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 31.1k • 78 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 23.9k • • 394 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 477k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 80.4k • • 750
-
Qwen3 VL Demo
😻364Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-to-Text • 236B • Updated • 77.2k • 362 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-to-Text • 236B • Updated • 146k • 355 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 6.06k • 24
-
Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale
📝29 -
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
Paper • 2601.04890 • Published • 40 -
tiiuae/Falcon-H1-Tiny-90M-Instruct
Text Generation • 91.1M • Updated • 775 • 11 -
tiiuae/Falcon-H1-Tiny-90M-Instruct-GGUF
91.1M • Updated • 2.39k • 9
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 31.1k • 78 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 23.9k • • 394 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 477k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 80.4k • • 750
-
Qwen3 VL Demo
😻364Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-to-Text • 236B • Updated • 77.2k • 362 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-to-Text • 236B • Updated • 146k • 355 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 6.06k • 24