Audio transcription

bigdatabaracus · April 22, 2019, 6:15am

Just found Prodigy and looks very nice. I’m wondering if it is possible to create an annotation task for audio transcription (for speech recognition). Looks like embedding audio is supported however editing the text shown to the annotator is not. In other words I’m looking to create gold-standard speech transcription data. This would require the annotator to be able to edit the text shown by prodigy.

Thanks a lot.

ines · April 22, 2019, 9:28am

Hi! You could definitely build something like that using a custom HTML template plus a few lines of JavaScript. For example, a textarea that shows some text and an action to update the current task data (slightly simplified example that shows the idea):

<textarea class="textarea">{{text}}</textarea>
<button onClick="updateTask()">Done!</button>

function updateTask() {
    const text = document.querySelector('.textarea').value
    window.prodigy.update({ corrected: text })
}

The record saved in the database could then look something like this:

{"text": "MLP is cool", "corrected": "NLP is cool"}

That said, it’s definitely true that the Prodigy framework focuses a lot on automation and collecting very structured data for machine learning tasks. So it might not be the best fit if you’re looking for an editor for fully manual audio transcription or to collect mostly free-form written user input.

bigdatabaracus · April 23, 2019, 9:27am

Thank you Ines.

Topic		Replies	Views
Editing Text and Linking Audio via Annotation Instructions usage , audio	2	484	August 3, 2022
Multiple annotation types on the same data usage , ner , custom , solved , audio	5	1374	June 25, 2020
Combine audio.manual and audio.transcribe? solved	4	482	September 30, 2022
Upload existing text (previously transcribed) and editing it in Prodigy Audio Transcription recipe enhancement , audio	3	301	November 9, 2023
Question and Answer Tutorial usage , custom , front-end	3	6137	August 10, 2019

Audio transcription

Related topics