Proper way to add new samples to an existing database

Okay, I have actually tested this on a dummy db. Here is what I did:

  1. I set the exclude_by setting to "input", and set feed_overlap to true
  2. I created a new text file with some sentences, and started a ner.manual task with labels
  3. I then annotated it with 2 annotators
  4. I then started a review task to adjudicate the annotators
  5. I then added/appended text to the original text file
  6. The annotators only saw the new text in their tasks!
  7. I then ran the review on the newly updated db to see if it would make me review everything again. It only loaded the new samples.
  8. I then took the input file, copied its contents into a new file, added some new sentences, and used this as input to the task (same command, just a different input file name, with the same data plus the new sentences).
  9. This only showed the annotators new samples
  10. Same with the review task, only new samples.
  11. I then tried adding duplicate samples to the text file to see if they would be added to the task for the annotators. They were not!
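
For anyone wondering why the steps above behave this way, here is a rough sketch of how I understand exclusion by input to work. This is not Prodigy's actual code: `input_hash`, `new_examples` and the `seen` set are hypothetical names I made up, and the SHA-1 hash is just a stand-in. The idea is that each sample is identified by a hash of its text only, so the file name does not matter, and anything already in the dataset (or repeated in the file) gets skipped.

```python
import hashlib


def input_hash(text: str) -> str:
    """Hypothetical stand-in for an input hash: depends only on the raw text."""
    return hashlib.sha1(text.encode("utf-8")).hexdigest()


def new_examples(lines, seen_hashes):
    """Yield only texts whose hash has not been seen yet, recording each one."""
    for text in lines:
        text = text.strip()
        if not text:
            continue  # skip blank lines
        h = input_hash(text)
        if h in seen_hashes:
            continue  # already annotated, or a duplicate within this file
        seen_hashes.add(h)
        yield text


# Stands in for the hashes already stored in the dataset (assumption).
seen = set()

# First session: the original text file.
first_run = list(new_examples(["Alice met Bob.", "Carol lives in Paris."], seen))

# Later session: same sentences copied into a "new" file, plus one new
# sentence that also appears twice (as in steps 8 and 11).
second_run = list(
    new_examples(
        [
            "Alice met Bob.",
            "Carol lives in Paris.",
            "Dave works at Acme.",
            "Dave works at Acme.",
        ],
        seen,
    )
)

print(first_run)
print(second_run)
```

In this sketch the second session only yields the one new sentence, which matches what I saw: renaming the input file or appending to it makes no difference, because only the text itself is compared.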

Thank you @ryanwesslen for all the help!
