Bad results with terms.teach

ines · August 20, 2018, 10:55am

To find other terms , the terms.teach recipe will iterate over the entries in the model's vocab. So maybe your custom model doesn't actually have the words present in its vocabulary?

This would explain why the only terms you see are the seed terms (which were added to the target Doc and are then part of the vocab) and why it works when you use the model manually (because words you process are then added to the vocabulary). You can test this by looking at len(nlp.vocab) – the number should be roughly the number of word you've added vectors for.

In your code, make sure to use the vocab.set_vector method to also add the word to the vocab. Alternatively, you could also use vocab.strings.add to add strings to the vocabulary directly.

Topic		Replies	Views
Error when adding seed terms to terms.teach done , terms , solved	8	1990	September 5, 2021
Web UI for pre-trained Chinese vectors spacy , terms	6	1555	August 22, 2018
terms.teach bigrams returning noisy results spacy , terms	6	1143	October 5, 2018
Error while running terms.teach (E018) spacy , terms , solved	14	2180	September 5, 2021
terms.teach not working for nightly spacy , nightly	3	538	April 25, 2021

Bad results with terms.teach

Related topics