view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context +6 Jul 23, 2024 β’ 239
microsoft/VibeVoice-Realtime-0.5B Text-to-Speech β’ 1B β’ Updated about 7 hours ago β’ 40.5k β’ 487
Baseer: A Vision-Language Model for Arabic Document-to-Markdown OCR Paper β’ 2509.18174 β’ Published Sep 17 β’ 128