How to use hbredin/api-test with pyannote.audio:
from pyannote.audio import Pipeline
pipeline = Pipeline.from_pretrained("hbredin/api-test")

# inference on the whole file
pipeline("file.wav")

# inference on an excerpt
from pyannote.core import Segment
excerpt = Segment(start=2.0, end=5.0)

from pyannote.audio import Audio
waveform, sample_rate = Audio().crop("file.wav", excerpt)
pipeline({"waveform": waveform, "sample_rate": sample_rate})
pyannoteAI premium speaker diarization
Warning: this is a work in progress, though it should already be functional.
- Create an API key on the pyannoteAI dashboard.
- Install pyannote.audio 4.x (yet to be released, but the develop branch should do).
- Enjoy the pyannoteAI premium speaker diarization pipeline by changing one single line of code!
from pyannote.audio import Pipeline
- pipeline = Pipeline.from_pretrained('pyannote/speaker-diarization-3.1', token="{huggingface-token}")
+ pipeline = Pipeline.from_pretrained('hbredin/api-test', token="{pyannoteAI-api-key}")
diarization = pipeline("/path/to/audio.wav")
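
The one-line change above puts the API key directly in the source. A minimal sketch of one way to keep it out of the code is to read it from an environment variable first. The variable name `PYANNOTEAI_API_KEY` and the helper names `read_api_key` and `load_premium_pipeline` are assumptions made for illustration and are not part of pyannote.audio; only the `Pipeline.from_pretrained(..., token=...)` call comes from the snippet above.

```python
import os


def read_api_key(env_var: str = "PYANNOTEAI_API_KEY") -> str:
    """Return the API key stored in `env_var` (hypothetical variable name)."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        # Fail early with a readable message instead of sending an empty token.
        raise RuntimeError(f"Set the {env_var} environment variable first.")
    return key


def load_premium_pipeline(env_var: str = "PYANNOTEAI_API_KEY"):
    """Load hbredin/api-test with the key taken from the environment."""
    # Imported lazily so read_api_key() can be used without pyannote.audio installed.
    from pyannote.audio import Pipeline

    return Pipeline.from_pretrained("hbredin/api-test", token=read_api_key(env_var))
```

With the variable set, `load_premium_pipeline()("/path/to/audio.wav")` should behave like the two-line version above, assuming a pyannote.audio build that accepts `token=` as described.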