Quick newbie training question.
I have created industry-specific word2vec vectors and initialized an empty model. I collected annotations with ner.manual and pattern matching to get things going. I trained a new model with those annotations and then used ner.teach to annotate another 5000 entities.
Am I correct in assuming that when ner.teach updates the model, it is in effect doing the same thing as the prodigy train command? If so, training the model on the new annotations afterwards would mean training on the same data twice, which sounds like a bad idea. I am not completely sure what the right workflow is for updating a model that is already working pretty well.
Also, my empty model initialized from the word2vec vectors does not have a sentencizer. How can I make a blank model from vectors that also has a sentencizer?
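To make the second question concrete, here is a minimal sketch of the kind of thing I am after, assuming spaCy v3 (in spaCy v2 the equivalent call would be nlp.add_pipe(nlp.create_pipe("sentencizer"))). The blank English pipeline stands in for the vectors model, and the paths in the comments are hypothetical.

```python
import spacy

# Stand-in for the vectors-only model. In practice this would be something
# like spacy.load("/path/to/vectors_model"), where the path is hypothetical.
nlp = spacy.blank("en")

# Add the rule-based sentence splitter; its registered name is "sentencizer".
nlp.add_pipe("sentencizer")

# Check that sentence boundaries are now set on the doc.
doc = nlp("This is one sentence. This is another one.")
sentences = [sent.text for sent in doc.sents]
print(sentences)

# The pipeline could then be saved and used as the base model, for example:
# nlp.to_disk("/path/to/output_model")  # hypothetical output path
```

If that is roughly right, the saved directory would be what I pass to the recipes as the base model.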
Thanks for Prodigy. It is a great tool.