Annotated jsonl as source

Cristiano74 · September 17, 2018, 3:12pm

Dear support,
I’m trying to use the annotated JSONL as the source for manual NER, but I’m not sure I’m following the right flow. For example, I used:

prodigy dataset my_it9
prodigy ner.manual my_it9 it_core_news_sm covered_warrant.txt -l FIN
prodigy db-out my_it9 export2 -a accept

Then I review the annotations and change them partially:

prodigy ner.manual my_it9 it_core_news_sm ./export2/my_it9.jsonl -l FIN

Now I want to review again the annotations:

prodigy db-out my_it9 export2 -a accept
prodigy ner.manual my_it9 it_core_news_sm ./export2/my_it9.jsonl -l FIN

But at this step I can’t re-use the annotated JSONL as the source: I don’t see the entities I marked last.

Maybe this flow is wrong?

Thanks in advance for any suggestions.

C.

Cristiano74 · September 17, 2018, 4:11pm

I’ve seen a solution here, in the thread on overwriting annotations, but I’m wondering whether there is a recommended flow for this task as well.

Thanks

ines · September 17, 2018, 7:19pm

Yes, I think what's going on here is that you're adding your new, reviewed annotations to the same dataset. So your dataset now contains the old annotations, as well as the new ones. By default, Prodigy is designed to always keep a record of each individual annotation decision so you can always reproduce it – that’s also why it doesn’t just silently overwrite existing records in your dataset.
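To illustrate the point about old and new annotations living side by side, here is a minimal sketch of filtering an exported JSONL so that only the most recent annotation per input text remains. It is not a built-in Prodigy command. It assumes that each exported record carries the `_input_hash` field Prodigy assigns to tasks, and that later lines in the export are the newer annotations; both assumptions are worth checking against your own export. The function name `latest_per_input` and the sample records are hypothetical.

```python
import json


def latest_per_input(lines):
    """Keep only the last record seen for each input (hypothetical helper).

    Assumes later lines are newer annotations of the same text.
    """
    latest = {}
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines in the export
        record = json.loads(line)
        # Fall back to the raw text if the record has no _input_hash.
        key = record.get("_input_hash", record.get("text"))
        latest[key] = record  # a later record replaces an earlier one
    return list(latest.values())


# Made-up sample: the text "a" was annotated twice, the second time with a span.
exported = [
    '{"text": "a", "_input_hash": 1, "spans": [], "answer": "accept"}',
    '{"text": "b", "_input_hash": 2, "spans": [], "answer": "accept"}',
    '{"text": "a", "_input_hash": 1, '
    '"spans": [{"start": 0, "end": 1, "label": "FIN"}], "answer": "accept"}',
]

deduped = latest_per_input(exported)
for record in deduped:
    print(json.dumps(record))
```

The filtered records could then be written to a new file and loaded into a fresh dataset rather than back into `my_it9`, so the original annotation history stays intact.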

You might find this thread interesting, which discusses a very similar workflow. I've also posted a more advanced recipe to automate the reviewing process and take random samples.
