Podcast speaker prediction with Prodigy and scikit-learn

ines (Ines Montani) December 17, 2018, 2:12pm

Just came across this really cool project that uses Prodigy to label audio snippets from the Syntax.fm podcast and then trains a scikit-learn model to predict who is speaking. The repo includes the whole pipeline: Prodigy recipes, config and notebooks.

Twitter thread with more details:
