Hello everybody,
I'm about to use Prodigy to annotate segments within audio files. The files contain news broadcasts. I want to annotate the start and end time of each news segment and assign it the ID of the speaker's manuscript, since I need to be sure that each manuscript is linked to the correct segment of the audio.
The manuscripts, with the text and ID of each news segment, are stored in the metadata of each audio file. See below for the data structure.
Today I dug into the workings of custom recipes, but unfortunately I could not find a way to set the labels from within my recipe. Are recipes even the right place to start?
I also want to display the title and text of all the manuscripts that appear in the audio file below the annotation view. I experimented with blocks, but ran into the same problem: I could not find a way to access the metadata from within the recipe.
The data I pass in with each annotation task looks like this:
{
    "audio": "file.mp3",
    "text": "file",
    "meta": {
        "file": "file.mp3",
        "manuscript": {
            "id1": "news text",
            "id2": "more news text from another item"
        }
    },
    "path": "file.mp3",
    "_input_hash": 1234,
    "_task_hash": 1235,
    "_is_binary": False,
    "_view_id": "blocks",
    "audio_spans": [
        {
            "start": 0.0,
            "end": 10.0,
            "label": "id1",
            "id": "id1",
            "color": "rgba(255,215,0,0.2)",
        },
        {
            "start": 10.0,
            "end": 20.0,
            "label": "id2",
            "id": "id2",
            "color": "rgba(255,215,0,0.2)",
        }
    ],
    "answer": "accept",
    "_timestamp": 1698064248,
}
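To make the structure concrete, here is a minimal sketch in plain Python (no Prodigy calls) of how the manuscript IDs could be read from a task like the one above, and how the labels in "audio_spans" could be checked against them. The function names manuscript_labels and unknown_span_labels are hypothetical helpers made up for illustration, and the example task is a trimmed copy of the data shown above.

```python
def manuscript_labels(task):
    """Return the manuscript IDs stored in the task's metadata, in insertion order."""
    # Hypothetical helper: assumes the IDs are the keys of task["meta"]["manuscript"].
    return list(task.get("meta", {}).get("manuscript", {}).keys())


def unknown_span_labels(task):
    """Return span labels that do not match any manuscript ID of the same task."""
    known = set(manuscript_labels(task))
    return [
        span["label"]
        for span in task.get("audio_spans", [])
        if span["label"] not in known
    ]


# Trimmed copy of the task shown above.
example_task = {
    "audio": "file.mp3",
    "meta": {
        "file": "file.mp3",
        "manuscript": {
            "id1": "news text",
            "id2": "more news text from another item",
        },
    },
    "audio_spans": [
        {"start": 0.0, "end": 10.0, "label": "id1"},
        {"start": 10.0, "end": 20.0, "label": "id2"},
    ],
}

print(manuscript_labels(example_task))    # ['id1', 'id2']
print(unknown_span_labels(example_task))  # []
```

A check like this only confirms that every annotated segment points at a manuscript that exists for the same file. It does not tell me how to get these IDs into the interface as labels.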
I mainly have two questions:
- How can I display different labels (the manuscripts' IDs) for each audio file?
- Is it possible to display the text of each manuscript to support the correct assignment of the segments?
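For the second question, this is roughly what I have in mind: a minimal sketch of a stream wrapper that copies the manuscripts from "meta" into an "html" field on each task. It is plain Python and does not call Prodigy itself. The function name add_manuscript_html is hypothetical, and the sketch rests on the assumption that an HTML block in a blocks setup would render a task's "html" field, which I have not verified. It also does not solve the per-file labels from the first question.

```python
import html


def add_manuscript_html(stream):
    """Yield copies of each task with an "html" field listing its manuscripts."""
    for task in stream:
        # Assumption: manuscripts live under task["meta"]["manuscript"] as {id: text}.
        manuscripts = task.get("meta", {}).get("manuscript", {})
        items = "".join(
            # Escape IDs and text so manuscript content cannot break the markup.
            f"<li><strong>{html.escape(str(ms_id))}</strong>: {html.escape(str(text))}</li>"
            for ms_id, text in manuscripts.items()
        )
        # Copy the task instead of mutating the incoming dict.
        yield {**task, "html": f"<ul>{items}</ul>"}


# Usage with a single hand-written task:
tasks = [
    {
        "audio": "file.mp3",
        "meta": {"manuscript": {"id1": "news text", "id2": "more news text from another item"}},
    }
]
for task in add_manuscript_html(tasks):
    print(task["html"])
```

Inside a custom recipe, the idea would be to wrap the stream with this generator before returning it, so that every audio file carries its own manuscript list.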
Thank you very much for your answers and support!
Best,
redadmiral