Editing Text and Linking Audio via Annotation Instructions

ryanwesslen · August 2, 2022, 6:37pm

Thanks for your question and welcome to the Prodigy community

For audio transcription, you could use the audio.transcribe recipe. If all of your audio files are unique, you could load them along with the transcription as files like .jsonl. You may need to do a small amount of python pre-processing but check out the file loader docs for audio. Just had a similar request earlier this week on how to handle a raw .csv file:

You can combine different recipes to create custom recipe/interfaces using blocks. So if you wanted, you could combine different interfaces like the audio.transcribe with the rel.manual, which would enable labeling spans/relations.

The one tricky part is that if the user had to correct to a transcription, you'd need to update (refresh) the text passed to the rel.manual after the user has used the textbox to correct the transcription. Does this sound right?

For this, you'd likely need to use an update / callback using some JavaScript. There is an example of something similar where we show how you can use a button to change existing text to a different case. In theory, I suspect you could try to do the same with a text box that first provides the original transcription, then a user can edit/correct it. Then they could click the button to activate the call back which then updates the corrected transcript and resends to the rel.manual. I haven't tried this but would be interested to see if it's possible.

Alternatively, perhaps the simplest solution would be to run this in two rounds. Round 1, you simply fix/correct transcriptions with audio.transcribe. Round 2, you use corrected transcriptions only in rel.manual and treat it like a typical span/relations annotation. I tend to prefer simpler tasks then trying to do everything at the same time, so I would likely choose this route.

Thanks again for your question and let us know if you have further questions!

Topic		Replies	Views
Audio transcription usage , custom , front-end , solved	2	836	April 23, 2019
How to use different labels for individual audio files? usage	4	240	October 25, 2023
Combine audio.manual and audio.transcribe? solved	4	478	September 30, 2022
Upload existing text (previously transcribed) and editing it in Prodigy Audio Transcription recipe enhancement , audio	3	301	November 9, 2023
Multi-stage speaker audio classification with `pyannote.sad.manual` and `audio manual` usage , custom , audio	13	2101	September 28, 2020

Editing Text and Linking Audio via Annotation Instructions

Related topics