Proper way to add new samples to an existing database

klopez · July 1, 2022, 7:08pm

Okay, I actually tested this on a dummy db. Here is what I did:

I set the exclude_by setting to input and feed_overlap to true
I created a new text file with some sentences, and started a ner.manual task with labels
I then annotated it with 2 annotators
I then started a review task to adjudicate the annotators
I then added/appended text to the original text file
The annotators only saw the new text in their tasks!
I then ran the review on the newly updated db to see if it would make me review everything again. It only loaded the new samples.
I then took the input file, copied its contents into a new file, added some new sentences, and used this as input to the task (same command, just a different input file name containing the same data plus the new sentences).
This only showed the annotators the new samples.
Same with the review task: only new samples.
I then tried adding duplicate samples to the text file to see whether the duplicates would be added to the annotators' tasks. They were not!
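
For anyone wondering why this works, here is a rough sketch of the idea as I understand it: when excluding by input, each example is identified by a hash of its input text, so the file name does not matter and repeated sentences collapse into one. This is only an illustration in plain Python, not Prodigy's actual code. The input_hash and filter_new helpers and the SHA-1 choice are my own assumptions for the example.

```python
import hashlib


def input_hash(text: str) -> str:
    # Hypothetical stand-in for an input hash: depends only on the raw text,
    # not on the file it came from or on any annotations.
    return hashlib.sha1(text.strip().encode("utf-8")).hexdigest()


def filter_new(lines, seen_hashes):
    # Yield only lines whose hash has not been seen yet.
    # The hash is recorded as soon as a line is queued, so a duplicate
    # later in the same file is skipped as well.
    for line in lines:
        text = line.strip()
        if not text:
            continue
        h = input_hash(text)
        if h in seen_hashes:
            continue
        seen_hashes.add(h)
        yield text


# "seen" plays the role of the hashes already stored in the dataset.
seen = set()

# First run: the original text file.
first_run = list(filter_new(["Apple is a company.", "Berlin is a city."], seen))

# Second run: a different file with the same data, one new sentence, and a duplicate of it.
second_run = list(
    filter_new(
        [
            "Apple is a company.",
            "Berlin is a city.",
            "Paris is a city.",
            "Paris is a city.",
        ],
        seen,
    )
)

print(first_run)
print(second_run)
```

In this sketch the second run only yields the one new sentence, which matches what I saw in the annotation and review tasks above.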

Thank you @ryanwesslen for all the help!
