Ai - a chethan62 Collection

Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

chethan62 's Collections

STT

TTS

Ai

spaces

webgpu

papers

models

Ai

updated about 10 hours ago

bosonai/higgs-audio-v2-generation-3B-base

Text-to-Speech • Updated Jul 28, 2025 • 167k • 658
rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 255k • 1.25k
Skywork/Skywork-UniPic-1.5B

Any-to-Any • Updated Sep 8, 2025 • 32 • 114
OmniSVG/OmniSVG

Text Generation • Updated Jul 21, 2025 • 261 • 188
LiquidAI/LFM2-VL-450M

Image-Text-to-Text • 0.5B • Updated Jan 5 • 12.6k • 144
Running

19

onnx-asr demo

🐢

19

ASR demo using onnx-asr
Running on Zero

43

Canary 1B Flash

🐤

43

Canary 1B Flash demo
AIDC-AI/Ovis2.5-9B

Image-Text-to-Text • Updated 7 days ago • 5.31k • 304
Running on Zero

Featured

80

LIA-X

🐠

80

Interactive Portrait Animation and Editing
Runtime error

340

NSFW Face Swap

📈

340

Swap faces in images and enhance them if desired
Running

Featured

1.75k

Realistic Text To Speech Unlimited

🔥

1.75k

Free Text-To-Speech generator with Emotion control (OpenAI)
Runtime error

109

Ovis2.5 2B

📚

109

Lightweight vision for efficient deployment
lodestones/Chroma1-HD

Text-to-Image • Updated Oct 23, 2025 • 10.5k • 331
silveroxides/Chroma-GGUF

Text-to-Image • 9B • Updated Sep 11, 2025 • 7.38k • 236
Clybius/Chroma-GGUF

9B • Updated Apr 29, 2025 • 291 • 28
QuantStack/Chroma1-Base-GGUF

Text-to-Image • 9B • Updated Aug 23, 2025 • 532 • 8
stepfun-ai/Step-Audio-2-mini

Any-to-Any • 8B • Updated 5 days ago • 2.1k • 250
QuantStack/Chroma1-Flash-GGUF

Text-to-Image • 9B • Updated Aug 24, 2025 • 597 • 10
mradermacher/NuMarkdown-8B-Thinking-GGUF

8B • Updated Aug 7, 2025 • 1.48k • 13
Running on Zero

Featured

268

granite-docling-258M demo

📝

268

Extract and query structured data from document images
openbmb/VoxCPM-0.5B

Text-to-Speech • Updated Sep 19, 2025 • 888 • 766
CypressYang/SongBloom

Text-to-Audio • Updated Oct 11, 2025 • 569 • 125
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10, 2025 • 49.6k • 800
decart-ai/Lucy-Edit-Dev

Video-to-Video • Updated Nov 20, 2025 • 492 • 326
XiaomiMiMo/MiMo-Audio-7B-Instruct

Any-to-Any • 8B • Updated Sep 23, 2025 • 1.6k • 149
Running on CPU Upgrade

76

MiMo-Audio-Chat

💬

76

Chat with Xiaomi MiMo-Audio using voice
QuantStack/Qwen-Image-Edit-2509-GGUF

Image-to-Image • 20B • Updated Oct 18, 2025 • 55.5k • 327
Running on Zero

746

IndexTTS 2 Demo

🏢

746

Generate expressive speech from text and voice reference
tencent/HunyuanImage-3.0

Text-to-Image • Updated 23 days ago • 711k • • 640
Kwai-Klear/Klear-46B-A2.5B-Instruct

Text Generation • Updated Sep 7, 2025 • 11 • 81
deepseek-ai/DeepSeek-V3.1-Terminus

Text Generation • Updated Sep 29, 2025 • 8.69k • • 361
Running

Featured

180

HunyuanImage-3.0

📊

180

Generate images from prompts (PRO users only)
neuphonic/neutts-air

Text-to-Speech • 0.7B • Updated 7 days ago • 10.6k • 849
Kwai-Klear/Klear-46B-A2.5B-Base

Text Generation • 46B • Updated Sep 7, 2025 • 6 • 30
nineninesix/kani-tts-370m

Text-to-Speech • 0.4B • Updated 2 days ago • 868 • 160
LiquidAI/LFM2-Audio-1.5B

Audio-to-Audio • 1B • Updated 27 days ago • 189 • 345
kyutai/stt-2.6b-en

Automatic Speech Recognition • Updated Jun 26, 2025 • 119
Running

18

Fathom DeepResearch

📊

18

DeepResearch with the fathom search and synthesizer models
LiquidAI/LFM2-VL-1.6B

Image-Text-to-Text • 2B • Updated 27 days ago • 2.85k • 221
bartowski/LiquidAI_LFM2-8B-A1B-GGUF

Text Generation • 8B • Updated Oct 8, 2025 • 1.06k • 8
prithivMLmods/DeepCaption-VLA-V2.0-7B

Image-Text-to-Text • 8B • Updated Oct 15, 2025 • 14 • 7
numind/NuMarkdown-8B-Thinking

Image-to-Text • Updated Nov 13, 2025 • 648k • 441
microsoft/BiomedParse

Updated Oct 10, 2025 • 587 • 104
prithivMLmods/Perseus-Doc-VL-0712

Image-Text-to-Text • 8B • Updated Oct 10, 2025 • 4 • 3
Running on Zero

Featured

224

Ovi [local]

🎥

224

Generate Hollywood Style Actors on your Local Machine
microsoft/UserLM-8b

Text Generation • Updated Oct 9, 2025 • 590 • 362
fka/prompts.chat

Viewer • Updated about 17 hours ago • 1.27k • 18.7k • 9.59k
prithivMLmods/Kepler-Qwen3-4B-Super-Thinking

Text Generation • 4B • Updated Sep 27, 2025 • 4 • 5
PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 15 days ago • 13.9k • 1.55k
facebook/MobileLLM-Pro

Text Generation • Updated Nov 11, 2025 • 231 • 160
Open-Bee/Bee-8B-RL

Image-Text-to-Text • 9B • Updated 19 days ago • 44.3k • 77
Running on Zero

MCP

201

Qwen3-VL-Outpost

🔥

201

demo of a collection of qwen3-vl models
lightonai/LightOnOCR-1B-1025

Image-to-Text • Updated 8 days ago • 130k • 234
internlm/JanusCoderV-8B

Image-Text-to-Text • 9B • Updated Oct 30, 2025 • 36 • 13
moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated Dec 16, 2025 • 29.7k • 541
nvidia/omnivinci

Feature Extraction • Updated 21 days ago • 1.04k • 170
nvidia/audio-flamingo-3-hf

Audio-Text-to-Text • Updated 23 days ago • 129k • 173
tencent/HunyuanOCR

Image-Text-to-Text • Updated Jan 13 • 1.08M • 553
Running on A100

235

Omnilingual ASR Media Transcription

🌍

235

Transcribe audio/video files into text across many languages
zai-org/GLM-TTS

Text-to-Speech • Updated Jan 12 • 274 • 317
mradermacher/Dolphin-v2-GGUF

3B • Updated Dec 12, 2025 • 333 • 4
LiquidAI/LFM2-2.6B-Exp-GGUF

Text Generation • 3B • Updated Dec 26, 2025 • 35.8k • 62
Running

54

Nemotron Speech Streaming

🎤

54

Real-time speech recognition with NVIDIA Triton
Running on Zero

Featured

94

LightOnOCR 2 1B Demo

🐨

94

Extract text from images and PDFs with OCR
prithivMLmods/GutenOCR-3B-AIO-GGUF

Image-Text-to-Text • 3B • Updated 25 days ago • 1.6k • 3
zai-org/GLM-OCR

Image-to-Text • Updated 11 days ago • 1.15M • 1.09k
shallowdream204/BitDance-14B-64x

Text-to-Image • 15B • Updated 2 days ago • 180 • 34

Collection guide
Browse collections

Company

TOS Privacy About Careers

Website

Models Datasets Spaces Pricing Docs