Kyutai
non-profit
Verified
Papers
CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
ARC-Encoder: learning compressed text representations for large language models
CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion on long-context streaming inputs
- CASA Gallery
  🏠 Video Gallery for CASA: Cross-Attention over Self-Attention • 2
- CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion
  Paper • 2512.19535 • Published • 12
- kyutai/CASA-Helium1-VL-2B
  Image-Text-to-Text • 3B • Updated • 30 • 8
- kyutai/CASA-Qwen2_5-VL-3B
  Image-Text-to-Text • 4B • Updated • 62 • 2
Spaces 5
🏠 CASA Gallery • Running • 2
Video Gallery for CASA: Cross-Attention over Self-Attention
🏆 Hibiki Zero Samples • Running • 10
Demo samples of the speech translation model Hibiki-Zero.
🤗 CALM Samples • Running • 6
💻 Unmute Samples • Running • 1
Examples of conversations with Unmute (unmute.sh)
🤗 Hibiki Samples • Running • 52
Translate speech in real-time with high fidelity
Models 61
kyutai/tts-voices
Updated • 133
kyutai/CASA-Helium1-VL-2B
Image-Text-to-Text • 3B • Updated • 30 • 8
kyutai/pocket-tts-without-voice-cloning
Updated • 36.1k • 18
kyutai/pocket-tts
Updated • 32k • 521
kyutai/hibiki-zero-3b-pytorch-bf16
Audio-to-Audio • Updated • 1.22k • 42
kyutai/CASA-Qwen2_5-VL-3B-LiveCC
Video-Text-to-Text • 4B • Updated • 22 • 4
kyutai/Helium1-VL-2B
Image-Text-to-Text • 3B • Updated • 10 • 1
kyutai/CASA-Qwen2_5-VL-3B
Image-Text-to-Text • 4B • Updated • 62 • 2
kyutai/stt-1b-en_fr
Automatic Speech Recognition • Updated • 120
kyutai/ARC8_Encoder_multi
Feature Extraction • Updated • 13 • 6
Datasets 6
kyutai/Audio-NTREX-4L
Viewer • Updated • 3.6k • 404 • 3
kyutai/librispeech_test_clean_enhanced
Viewer • Updated • 448 • 218 • 1
kyutai/ARC_finetuning
Preview • Updated • 12
kyutai/voices_tts_longeval
Viewer • Updated • 1.54k • 25 • 1
kyutai/DailyTalkContiguous
Preview • Updated • 11.2k • 19
kyutai/Babillage
Viewer • Updated • 465k • 66 • 13