Fully open Whisper-style speech foundation models developed by CMU WAVLab: https://www.wavlab.org/activities/2024/owsm/
Yifan Peng
pyf98
AI & ML interests
Multimodal LLMs, Speech-to-Speech, Speech Recognition
Recent Activity
liked a dataset 4 days ago
inclusionAI/AudioMCQ
liked a model 28 days ago
espnet/owsm_ctc_v3.2_ft_1B
liked a Space 4 months ago
HuggingFaceTB/smol-training-playbook