Using previous model-last as base model in prodigy train

hi @joebuckle!

Great question! In general, we recommend against this:

Yes - it's typically better to train a model from scratch on the same full corpus each time, instead of updating the same artifact over and over again, which makes it much harder to avoid overfitting and catastrophic forgetting.
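As a rough sketch, the from-scratch workflow just points `prodigy train` at the full dataset every time (the dataset name `my_annotations` and the output path here are placeholders):

```shell
# Retrain from scratch on the complete annotated dataset each run,
# rather than continuing training from the previous run's artifact.
python -m prodigy train ./model-output --ner my_annotations
```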

The one thing you may want to do (if you're not already) is to initialize your base model with pretrained word vectors (e.g., from en_core_web_md or en_core_web_lg). There's a bit about it in the docs:

> Using pretrained word embeddings to initialize your model is easy and can make a big difference. If you’re using spaCy, try using the en_core_web_lg model as the base model. If you’re working with domain-specific texts, you can train your own vectors and create a base model with them.
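For the domain-specific case, a minimal sketch with spaCy v3 might look like this - `custom_vectors.txt` (a word2vec-style text file), the dataset name, and the paths are all placeholders:

```shell
# Build a base pipeline that contains your domain-specific vectors
python -m spacy init vectors en ./custom_vectors.txt ./base_with_vectors

# Train from scratch, initializing from that base model's vectors
python -m prodigy train ./model-output --ner my_annotations --base-model ./base_with_vectors
```

If you're using the generic vectors instead, you can skip the first step and pass `--base-model en_core_web_lg` directly.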