I am trying to label another round of data with the existing training dataset using ner.teach. I already have one set annotated in dataset “training_1” (silver). My input file has a lot of text data in csv which was used as input for “training_1” (a part of it was done in first round). Now, when i use this command with these args, prodigy should consider the text that is not in ‘training_1’. But in the interface, i am getting the text that was already labeled in ‘training_1’ dataset.
prodigy ner.teach training_2 trained_models_spacy long_text_train.csv --label Labels.txt --patterns Prodigy_Patterns.jsonl --exclude training_1
I dont know why this is not working. Should i match and drop the already tagged text before i give input as csv?