I am wondering if it’s possible to provide a patterns file or something similar with ‘negative’ terms for the model to reject/ignore.
I am currently using a language model with no word embeddings so I cannot use terms.teach therefore I am creating my own patterns file to help start off ner.teach. I see this file just has accepted terms not any rejected terms, i.e. there is no ‘answer’ like from the annotation files. The issue I am having is that my model is picking up a lot of stopwords - even after I have done 1000+ annotations when I created the model when try make-gold it is still identifying stopwords. I don’t want to remove them from my corpus however is there a way to force the model not to identify them as possible named entities?