Change a label in audio.manual recipe

jdddog · July 28, 2021, 12:49am

Is it possible to swap a label in the audio.manual recipe? For instance, to swap an annotation's SPEAKER_1 label to a SPEAKER_2 label?

For instance, I've already used pyannote-audio to automatically annotate speakers. The start and end of the speaker labels are often correct, however, the speaker assignment is more often incorrect. It would be easiest to be able change the label of an annotation to another speaker, rather than deleting and re-creating the speaker annotation from scratch each time.

Thanks for your help.

ines · July 29, 2021, 2:48am

Thanks, this is a really good point! At the moment, we don't have a "selected" state of audio segment, so there's no way to select a region and then change the label (which is how it currently works in the image_manual UI). I think it'd be a very useful feature because re-annotating a region is obviously very inconvenient.

I will add this to my list of enhancements!

jdddog · August 6, 2021, 10:37am

Cool thanks! That would be great

jdddog · August 6, 2021, 8:10pm

Another feature that would be very useful for audio and video annotation is being able to increase the speed of playback, e.g. 1.25x, 1.5x, 2x etc.

ines · August 7, 2021, 3:24am

We currently have this available as the audio_rate setting, which you can define in the config on startup (and which will then apply to all playbacks): Annotation interfaces · Prodigy · An annotation tool for AI, Machine Learning & NLP

It could be nice to also expose this as a setting to toggle at runtime – although, we're always conscious of keeping the UI straightforward and avoiding too many distracting runtime settings and toggles.

jdddog · August 8, 2021, 2:11am

Thanks Ines, I didn't know about that.

I think it would be useful to have that as a UI setting (or keyboard shortcut), so that you can listen quickly to find the specific part of the audio to annotate and then slow it down when doing the actual annotation. It also gets harder to tell the difference between different speakers when the audio is sped up so being able to change the speed at will would be useful in that sense.

Topic		Replies	Views
Multi-stage speaker audio classification with `pyannote.sad.manual` and `audio manual` usage , custom , audio	13	2100	September 28, 2020
How to use different labels for individual audio files? usage	4	240	October 25, 2023
variable audio_rate for audio annotation enhancement , usage , done , audio	15	2109	September 25, 2020
Editing Text and Linking Audio via Annotation Instructions usage , audio	2	477	August 3, 2022
Error with annotation for speaker diarization usage , front-end , audio	5	641	June 24, 2021

Change a label in audio.manual recipe

Related topics