bosonai/higgs-audio-v2-generation-3B-base
Text-to-Speech
β’
Updated
β’
167k
β’
658
ASR demo using onnx-asr
Canary 1B Flash demo
Interactive Portrait Animation and Editing
Swap faces in images and enhance them if desired
Free Text-To-Speech generator with Emotion control (OpenAI)
Lightweight vision for efficient deployment
Extract and query structured data from document images
Chat with Xiaomi MiMo-Audio using voice
Generate expressive speech from text and voice reference
Generate images from prompts (PRO users only)
DeepResearch with the fathom search and synthesizer models
Generate Hollywood Style Actors on your Local Machine
demo of a collection of qwen3-vl models
Transcribe audio/video files into text across many languages
Real-time speech recognition with NVIDIA Triton
Extract text from images and PDFs with OCR