Annotate multiple JSONL into multiple Datasets

vaibhav-01 · September 23, 2021, 5:37am

Hi,

I am currently using Prodigy version 1.10 and need some help with my UseCase. What I am doing is embedding Prodigy UI within an iframe in Plotly Dash and every time an annotator starts visits the dashboard, I have to supply him/her with 2 JSONL files of text strings to annotate. Annotated data from these 2 JSNOL files should go into 2 different datasets.

Is there a recipe (could not find anything in the documentation related to this) already coded for where I can supply the names of 2 datasets along with 2 JSONL files and the rest will be taken care of? OR is there a way to pythonically find out when all the strings available in the JSONL file have been annotated, i.e., No Tasks Available?

ines · September 28, 2021, 9:47am

Hi! Sorry for only getting to this now, I think I missed this thread earlier. At the moment, Prodigy expects you to pick one dataset per instance to save the annotations to, but you could work around that by calling db.add_examples explicitly in the update callback of your recipe, based on values in the data.

If you're generating the JSONL files programmatically and can add the destination datasets to the JSON record, you could do something like this:

from prodigy.components.db import connect

# in your recipe, and returned as "update": update
def update(answers):
    db = connect()
    for eg in answers:
        dataset = eg["dataset"]  # name of target dataset in example
        db.add_examples([eg], [dataset])

vaibhav-01 · October 7, 2021, 9:21am

Hi, I tried something like this and it worked. Thanks a lot for helping out. You people are best. Never seen this intensity of you guys in replying to every thread. Thanks.

Topic		Replies	Views
Annotation tasks finish even when more samples are in the jsonl dataset usage , solved , streams	5	445	April 8, 2022
Bulk import textcat examples	2	24	April 29, 2025
Adding data to a Prodigy dataset using db-in - is there a way to filter out/remove duplicate annotations? usage , solved	2	418	January 4, 2023
How to export my annotations?	2	17	March 3, 2025
Image Manual (How to use my .jsonl after I import them) usage , image , solved	6	462	April 4, 2020

Annotate multiple JSONL into multiple Datasets

Related topics