Collections
Discover the best community collections!
Collections trending this week
-
Falcon-H1-Tiny: A series of extremely small, yet powerful language models redefining capabilities at small scale
📝28 -
Learnable Multipliers: Freeing the Scale of Language Model Matrix Layers
Paper • 2601.04890 • Published • 40 -
tiiuae/Falcon-H1-Tiny-90M-Instruct
Text Generation • 91.1M • Updated • 706 • 11 -
tiiuae/Falcon-H1-Tiny-90M-Instruct-GGUF
91.1M • Updated • 2.3k • 9
-
Qwen/Qwen3-235B-A22B-Thinking-2507-FP8
Text Generation • 235B • Updated • 29.9k • 78 -
Qwen/Qwen3-235B-A22B-Thinking-2507
Text Generation • 235B • Updated • 24.4k • 394 -
Qwen/Qwen3-235B-A22B-Instruct-2507-FP8
Text Generation • 235B • Updated • 474k • 139 -
Qwen/Qwen3-235B-A22B-Instruct-2507
Text Generation • 235B • Updated • 78k • 749
-
Qwen3 VL Demo
😻 364 • Interact with a chatbot that handles text and images
-
Qwen/Qwen3-VL-235B-A22B-Thinking
Image-to-Text • 236B • Updated • 73.9k • 361 -
Qwen/Qwen3-VL-235B-A22B-Instruct
Image-to-Text • 236B • Updated • 136k • 354 -
Qwen/Qwen3-VL-235B-A22B-Thinking-FP8
Image-Text-to-Text • 236B • Updated • 6.19k • 24
-
kakaocorp/kanana-2-30b-a3b-thinking-2601
Text Generation • 31B • Updated • 538 • 50 -
kakaocorp/kanana-2-30b-a3b-instruct-2601
Text Generation • 31B • Updated • 302 • 48 -
kakaocorp/kanana-2-30b-a3b-mid-2601
Text Generation • 31B • Updated • 37 • 29 -
kakaocorp/kanana-2-30b-a3b-base-2601
Text Generation • 31B • Updated • 933 • 28