NER Annotation with ner.teach

ines · February 4, 2019, 11:06am

Hi! If you want to use binary annotation with a model in the loop, you’re always giving feedback on the suggestion in this exact context. So you should definitely reject incomplete spans. This way, you’re telling the model “no, this particular analysis of the text is incorrect”, the weights will be updated to reflect that particular decision and the model will “try again” with a different analysis, hopefully moving towards more correct entity boundaries.

That said, if your data contains a lot of fairly abstract multi-token entities like that and the model struggles, it might take pretty long until it converges (or it might not converge at all). You could try adding some --patterns, or collect a small set with ner.manual or ner.make-gold that covers the especially complex entities, pre-train the model with that and then improve that pre-trained model further with ner.teach. You might also want to check out this thread, which discusses an approach for extending entity boundaries with rules: Expanding NER to include neighbouring tokens

Topic		Replies	Views
ner.teach does not suggest multiple tokens usage , ner	4	1389	October 16, 2018
ner.teach - couple of questions ner , done , solved , nightly	9	2720	December 30, 2021
Custom multi-word NER model pipeline usage , ner	2	1024	March 8, 2019
Advice on training NER models with new entities usage , ner , hr	13	4060	January 25, 2019
Two Questions on Teach recipes usage , ner , textcat , solved	5	812	January 27, 2020

NER Annotation with ner.teach

Related topics