Review dataset with multiple input hashes

adamkgoldfarb · June 6, 2021, 3:14pm

Thanks for this helpful clarification!

So just to check my understanding, in this post you note that "by default, Prodigy will skip examples that are already in the dataset you're saving to: so if you've already reviewed the same example before, you won't be asked about it again." But the review recipe considers an example 123 (input hash xyz) with three conflicting annotations as distinct from example 123 (input hash xyz) with four conflicting annotations, and so the recipe will serve example 123 with four annotations, even though input hash xyz is already in the dataset?

I swear I'm not trying to belabor this-- I should probably just look at the code!

Topic		Replies	Views
Prodigy review recipe not entirely clear to me	8	685	June 22, 2023
Review recipe: auto accept identical annotations enhancement , usage , ner , done , solved , review	6	809	August 12, 2021
NER review datasets with partial overlap while keeping all texts usage , ner , best-practices , review	7	638	February 20, 2023
Bug with review recipe in 1.10.2+ done , review	8	703	September 8, 2020
Duplicate entity annotations ner	4	1978	March 13, 2019

Review dataset with multiple input hashes

Related topics