Adding new named entities to existing model

Pragma_Tom · November 18, 2021, 4:25pm

Hi, I'm having issues when adding a new entity type to a previously trained model. Let's say I have an entity "A" that I want to add to the en_core_web_lg model. This entity is completely new, comprising words that the model very rarely recognizes in any of its own entity categories. I created annotations for this entity using ner.manual and then used the following command to train and update the base model:

prodigy train A-model --ner annotation_dataset -m en_core_web_lg -L -V

The newly trained model "A-model" does not seem to remember any of the original en_core_web_lg entities, both when I evaluated it using the print-stream recipe and when I verified it directly in spaCy.

My ultimate goal is to add one more NER category to this model (so two in total: A plus a new one, B, both added to en_core_web_lg) without losing any of the native categories. I was planning to do this iteratively, i.e., training A first and then building on the A-model with the B entity to create an AB-model.

What am I doing wrong? Do I need to use en_core_web_lg to pre-annotate, then add my own entities and train a model from scratch?

ljvmiranda921 · November 19, 2021, 2:07am

Hi @Pragma_Tom, welcome to Prodigy!

This is often a case of "catastrophic forgetting", which becomes apparent when new entities are added to an existing model.

Yes. The best practice is "pseudo-rehearsal", i.e., using the original model to label examples and mixing them into your fine-tuning updates. As for other strategies, you can check the following threads:

We also published a blog post on pseudo-rehearsal that explains how it addresses catastrophic forgetting.
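
To make the idea concrete, here is a minimal sketch of pseudo-rehearsal in Python. It is not an official recipe: the helper names (spacy_predictor, add_original_entities, build_training_set) are made up for illustration, and it assumes your manual annotations are dicts with "text" and "spans" keys, where each span has "start", "end" and "label". The original model's predictions are added to your manual examples wherever they don't overlap your own spans, and some extra raw texts labelled only by the original model are mixed in.

```python
import random


def spacy_predictor(nlp):
    """Wrap a loaded spaCy pipeline, e.g. spacy.load("en_core_web_lg"),
    so that it returns entity spans as plain dicts."""
    def predict(text):
        return [
            {"start": ent.start_char, "end": ent.end_char, "label": ent.label_}
            for ent in nlp(text).ents
        ]
    return predict


def overlaps(a, b):
    """True if two character spans share at least one character."""
    return a["start"] < b["end"] and b["start"] < a["end"]


def add_original_entities(example, predict):
    """Keep every manual span and add the original model's spans
    that do not overlap any of them (manual annotations win)."""
    manual = list(example.get("spans", []))
    extra = [
        span for span in predict(example["text"])
        if not any(overlaps(span, m) for m in manual)
    ]
    spans = sorted(manual + extra, key=lambda s: s["start"])
    return {"text": example["text"], "spans": spans}


def build_training_set(manual_examples, rehearsal_texts, predict, seed=0):
    """Mix manual examples (enriched with original-model entities) with
    raw texts labelled only by the original model, then shuffle."""
    mixed = [add_original_entities(eg, predict) for eg in manual_examples]
    mixed += [{"text": text, "spans": predict(text)} for text in rehearsal_texts]
    random.Random(seed).shuffle(mixed)
    return mixed
```

With spaCy and en_core_web_lg installed, you would call build_training_set(manual_examples, rehearsal_texts, spacy_predictor(nlp)), write the result to a JSONL file, and load it into a new dataset (for example with prodigy db-in) before training. How many rehearsal texts to mix in is a judgment call; it is worth reviewing a sample of the model-labelled examples, since any mistakes the original model makes will be rehearsed too.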

Pragma_Tom · November 22, 2021, 1:58pm

LJ, Thank you for the welcome, the quick response, and the references! Much appreciated.
