Hello @ines, I am successfully using Prodigy and spaCy in my projects, and I have 25k domain-specific documents for one particular problem. I trained two NER models on them, one with vectors and one blank:
prodigy train ner dataset1 en_vectors_web_lg --output dataset1_Model --n-iter 10 --eval-split 0.2 --dropout 0.2
Best F-Score 85.036
prodigy train ner dataset1 blank:en --output dataset1_Model --n-iter 10 --eval-split 0.2 --dropout 0.2
Best F-Score 84.829
I also ran prodigy train-curve ner:
50% 94.44 +0.95
75% 94.61 +0.17
100% 94.79 +0.18
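To make the trend in the curve easier to read, the per-step gains can be recomputed from the reported accuracies. This is only a small sketch: the numbers are copied from the output above, and the variable names are mine, not part of Prodigy.

```python
# Accuracies copied from the train-curve output above
# (50%, 75% and 100% of the training data).
curve = [(50, 94.44), (75, 94.61), (100, 94.79)]

# Gain between consecutive steps, rounded to two decimals as in the output.
gains = [
    (pct, round(acc - prev_acc, 2))
    for (_, prev_acc), (pct, acc) in zip(curve, curve[1:])
]

for pct, gain in gains:
    print(f"{pct}%: +{gain:.2f}")
```

The last two steps each add less than 0.2 points, which may mean that more data of the same kind would bring only small improvements.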
I was expecting higher accuracy from the vectors model. Do you think accuracy will improve if I use spacy pretrain? Would pretraining help in my case?
One more thing: I am following the workflow below, but the accuracy sometimes differs between prodigy train and spacy train. The Prodigy score is a bit higher than the spaCy one. Is there a step missing before using spacy train?
prodigy train ner
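One possible source of the gap is that the two runs are not scored on the same held-out examples, since --eval-split 0.2 holds back a portion of the data at training time. Below is a minimal sketch of a fixed, seeded split that could be exported once and reused for both tools. It is an assumption about the cause, not a confirmed one, and `split_examples`, the seed and the stand-in examples are hypothetical rather than part of Prodigy or spaCy.

```python
import random


def split_examples(examples, eval_split=0.2, seed=0):
    """Shuffle once with a fixed seed and hold out `eval_split` of the examples.

    Hypothetical helper: returns (train, dev) so the same dev set can be
    reused for every training run that is being compared.
    """
    shuffled = list(examples)
    random.Random(seed).shuffle(shuffled)
    n_eval = int(len(shuffled) * eval_split)
    return shuffled[n_eval:], shuffled[:n_eval]


# Stand-in for annotated examples exported from dataset1 (hypothetical data).
examples = [{"text": f"doc {i}"} for i in range(10)]

train, dev = split_examples(examples)
print(len(train), len(dev))
```

With the same seed the split is identical on every run, so any remaining difference in scores would come from the training setup rather than from the evaluation data.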