Access to word embeddings

drdileepunni · April 21, 2020, 3:43pm

Does Prodigy use word embeddings for its automatic annotations? If so, do we have access to those word embeddings?

Thanks much

adriane · April 22, 2020, 12:21pm

The word embeddings are part of the spaCy model provided to the recipe. If you're using a blank model or one of spaCy's provided sm models, there won't be any word embeddings, but the md and lg models contain vectors.

If you load the model with spaCy, you can access the vector for an individual word through the vocab with nlp.vocab["word"].vector, or the full vectors table under nlp.vocab.vectors. See https://spacy.io/api/vectors.
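As a rough sketch of the access pattern described above: the snippet below uses a blank pipeline with a few made-up toy vectors so that it runs without downloading anything. The toy words and values are assumptions for illustration only; in practice you would load a model that ships vectors (for example an md or lg model) and skip the set_vector step.

```python
import numpy as np
import spacy

# A blank pipeline has no vectors. In a real project you would load a
# model that includes them instead, e.g. spacy.load("en_core_web_md").
nlp = spacy.blank("en")

# Hypothetical toy vectors, added only so the example is self-contained.
toy_vectors = {
    "apple": np.array([1.0, 0.0, 0.0], dtype="float32"),
    "banana": np.array([0.9, 0.1, 0.0], dtype="float32"),
}
for word, vector in toy_vectors.items():
    nlp.vocab.set_vector(word, vector)

# Per-word access through the vocab.
lexeme = nlp.vocab["apple"]
apple_vector = lexeme.vector      # numpy array holding the word's vector
has_vector = lexeme.has_vector    # True if the word is in the vectors table

# The whole vectors table; shape is (rows, vector width).
table_shape = nlp.vocab.vectors.shape

# Words without an entry report has_vector == False.
oov_has_vector = nlp.vocab["zebra"].has_vector
```

Checking has_vector first is useful because words missing from the table do not raise an error when you ask for their vector.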

ines · April 22, 2020, 2:36pm

To add to Adriane's comment, if you have your own embeddings and model (fine-tuned transformer, custom word vectors etc.) and want to use that to suggest examples for annotation, you can set up a custom recipe and load in your data however you like. Here are two examples for NER and text classification:
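This is not one of the linked examples, just a minimal sketch of the general idea, assuming your own model can return character offsets and labels for a text. The suggest_spans function is a hypothetical stand-in (a simple regex) for whatever model or embeddings you use, and the label name is made up. The generator yields tasks as dictionaries with "text" and "spans" keys, which is the shape Prodigy's NER interfaces work with.

```python
import re


def suggest_spans(text):
    """Hypothetical stand-in for your own model's predictions.

    Here it just marks capitalized words; replace it with calls to
    your fine-tuned transformer, custom vectors, etc.
    """
    return [
        {"start": m.start(), "end": m.end(), "label": "CANDIDATE"}
        for m in re.finditer(r"\b[A-Z][a-z]+\b", text)
    ]


def make_stream(texts):
    """Yield one annotation task per text, with pre-filled suggestions."""
    for text in texts:
        yield {"text": text, "spans": suggest_spans(text)}


# In a custom recipe, a generator like this would be returned as the
# "stream" component, alongside the "dataset" and "view_id" settings.
tasks = list(make_stream(["Prodigy works with spaCy models.", "no suggestions here"]))
```

The point of keeping the model call in its own function is that the recipe itself does not care where the suggestions come from, so you can swap in any model you like.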
