soda-research

community

https://soda-audio.github.io/

Activity Feed Request to join this org

AI & ML interests

multimodal, audio, speech, llms

Recent Activity

potsawee authored a paper 1 day ago

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

potsawee updated a model 1 day ago

soda-research/soda-135m-base

potsawee updated a model 1 day ago

soda-research/soda-600m-base

View all activity

Papers

Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens

View all Papers

soda-research 's datasets 15

soda-research/yodas2-mm-semantic

Viewer • Updated Dec 23, 2025 • 896k • 182 • 1

soda-research/yodas2-mm-acoustic

Viewer • Updated Dec 23, 2025 • 896k • 691 • 1

soda-research/commonvoice17-mm-pretrain

Viewer • Updated Dec 8, 2025 • 3.6M • 309

soda-research/peoples-speech-mm-pretrain

Viewer • Updated Dec 8, 2025 • 3.59M • 36

soda-research/libritts-r-mm-pretrain

Viewer • Updated Dec 8, 2025 • 678k • 21

soda-research/emilia-mm-pretrain-fix

Viewer • Updated Dec 6, 2025 • 12.6M • 1.53k

soda-research/libritts-r-mm-tts0

Viewer • Updated Dec 5, 2025 • 334k • 36

soda-research/yodas2-mm-asr

Viewer • Updated Dec 4, 2025 • 4.26M • 71 • 1

soda-research/mls-en-mm-tts0

Viewer • Updated Dec 4, 2025 • 552k • 18

soda-research/emilia-mm-conversational

Viewer • Updated Dec 3, 2025 • 599k • 1

soda-research/emilia-mm-pretrain

Viewer • Updated Nov 3, 2025 • 12.6M • 2 • 1

soda-research/librispeech-mm-eval

Viewer • Updated Nov 2, 2025 • 22.3k • 4

soda-research/librispeech-mm-pretrain

Viewer • Updated Nov 2, 2025 • 562k • 17

soda-research/mls-en-mm-pretrain

Viewer • Updated Oct 30, 2025 • 1.1M • 1

soda-research/yodas2-mm-pretrain

Viewer • Updated Oct 15, 2025 • 8.52M • 2 • 1