Xiaoyu Yang

marcoyang

AI & ML interests

ASR, Machine learning

Recent Activity

updated a dataset about 1 month ago

marcoyang/podcast-data

Preview • Updated about 1 month ago • 48

published a dataset about 1 month ago

marcoyang/podcast-data

Preview • Updated about 1 month ago • 48

updated a model about 1 month ago

marcoyang/spear-encoder-streaming-600M-speech-only

Updated Nov 7

authored 11 papers about 1 month ago

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Paper • 2211.00508 • Published Oct 31, 2022

Blank-regularized CTC for Frame Skipping in Neural Transducer

Paper • 2305.11558 • Published May 19, 2023

LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization

Paper • 2409.00819 • Published Sep 1, 2024

Delay-penalized CTC implemented based on Finite State Transducer

Paper • 2305.11539 • Published May 19, 2023

k2SSL: A Faster and Better Framework for Self-Supervised Speech Representation Learning

Paper • 2411.17100 • Published Nov 26, 2024

SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation

Paper • 2411.18138 • Published Nov 27, 2024 • 1

SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM

Paper • 2406.06571 • Published Jun 3, 2024

SPEAR: A Unified SSL Framework for Learning Speech and Audio Representations

Paper • 2510.25955 • Published Oct 29

updated 2 models about 1 month ago

marcoyang/spear-base-speech-audio

93.3M • Updated Nov 3 • 23

marcoyang/spear-base-speech

93.3M • Updated Nov 3 • 6

updated a collection about 1 month ago

SPEAR encoders

Collection

The SPEAR encoder models (https://arxiv.org/abs/2510.25955) • 5 items • Updated Nov 3 • 1

published a model about 1 month ago

marcoyang/spear-base-speech-audio

93.3M • Updated Nov 3 • 23

published a model about 1 month ago

marcoyang/spear-base-speech

93.3M • Updated Nov 3 • 6
