Changing the window size of a NER model

nix411 · April 6, 2020, 7:34pm

Can the window size be changed easily? E.g. if I want to use the 8 preceding and 8 following tokens as context to determine if a span is some entity.

I assume I'd have to write my own train recipe exploiting some input from spacy train?

nix411 · April 9, 2020, 9:10pm

Did this one fly under the radar?

honnibal · April 11, 2020, 10:31am

Hey,

Sorry, yes it did! I've experimented a bit with the window size, which is basically the depth of the CNN. It's tricky to get benefits without also making the layers wider, which in turn makes things slower and prone to overfitting. So you end up having to tweak a number of other parameters. The best solution is to use the spacy pretrain command to do language model pretraining, to avoid the overfitting. You can find a worked example along with the weights required here: https://github.com/explosion/projects/tree/master/ner-fashion-brands

In general it will be much easier to experiment with these things in Prodigy v2, when we switch over to spaCy v3. We did a lot of work over December and January to update Thinc machine learning library underneath spaCy and Prodigy, so that it will be easier to bring your own model. We're looking forward to updating the stack to use it.

Topic		Replies	Views
How can i change conv_depth of a ner model for training ner , spacy , solved	1	445	July 22, 2020
NER parameter conv_window does not change using en_core_web_lg ner , spacy	2	404	October 28, 2020
Size of context window for NLP	4	26	October 14, 2024
Ner Training with Prodigy vs Spacy ner , spacy , best-practices	2	1204	July 2, 2020
Split a ner.manual dataset, into smaller texts usage , ner , spacy	3	1138	June 24, 2022

Changing the window size of a NER model

Related topics