No pre-trained model to import when ner.batch-train

Quanti · July 16, 2019, 2:58am

Hi,

As we are exploring how we can improve a current annotation, we are wondering is there a way to not import en_core_web_sm? i.e. without a pre-trained model.

ines · July 16, 2019, 8:30am

Sure, but you’ll still need to pass in a base model to start with that includes the language data, tokenization rules etc. This can be a completely blank model with no weights – but you always need to start with something.

To save out a blank model, you can run the following:

import spacy
nlp = spacy.blank("en")  # or whichever language you want to use
nlp.to_disk("/path/to/model")

Or a handy one-liner on the command line:

python -c "import spacy;spacy.blank('en').to_disk('/path/to/model')"

You can then load in /path/to/model as the base model.

Topic		Replies	Views
Blank spacy model vs en_core_web_xx usage , ner , spacy , custom	2	881	October 25, 2021
Blank spacy model without being trained usage , ner , spacy , solved	6	3340	July 29, 2021
How do I train a custom ner model? usage , ner , spacy , solved	7	2396	June 25, 2019
ner.batch-train not to use default labels but just the ones from a training sample ner , spacy , solved	8	739	July 30, 2018
Annotate using ner.manual for a new language usage , ner , spacy , solved	2	671	October 27, 2019

No pre-trained model to import when ner.batch-train

Related topics