Training NER model with prodigy

mihaivinaga · August 19, 2020, 2:31pm

Hello,

We are a company that uses prodigy for our tagging and training of NER models. I have two questions regarding the usage of prodigy:

Is there a way to test which parameters are best suited for a certain dataset when training it. Does prodigy have a tool in which we specify a dataset and after running a command it lets you know what are the best values for parameters like losses or batch size
prodigy seems limited when it comes to customising the way you train your model, I was wondering what do you guys recommend when training a model for production, prodigy or a python script?

Thanks you @ines for your support and patience!

honnibal · August 21, 2020, 12:11pm

Hi @mihaivinaga,

It's always difficult to decide where to limit the scope of a tool. On the one hand, it's useful to do things in one place rather than assembling a workflow out of many pieces, and so it's tempting to put in features that a high percentage of users will use in their workflows. But on the other hand, it's good for tools to stay more limited, as no one tool can be the best at everything.

For hyper-parameter optimisation, we see this as a topic that's continuing to develop, and it's also one that requires integration into a remote execution environment, because you want to use multiple machines to execute the hyper-parameter search in parallel. We therefore have not implemented any hyper-parameter search into Prodigy. We recommend exploring Polyaxon and Ray as different approaches to hyper-parameter tuning and experiment management. Polyaxon is more of a full-featured environment, while Ray is a smaller tool that gives you primitives to code solutions yourself.

The prodigy train command was shaped by similar considerations. We did decide it was worth the convenience to have a simple train command to train directly from the database. But we haven't tried to cover every use-case, and you can easily replace the command with your own scripts (or export your data using data-to-spacy and train with spacy train directly). We recommend doing that for many situations, for example running training tasks under automation, which would normally be the right process for production deployments.

Topic		Replies	Views
Ner Training with Prodigy vs Spacy ner , spacy , best-practices	2	1212	July 2, 2020
Reproducing prodigy ner.batch-train in spacy: cross-validation results and outputted model usage , ner	3	1880	October 5, 2018
ner.batch_train vs spacy nlp.begin_training ner , spacy	1	1099	January 26, 2018
Flag --batch-size not recognized by prodigy train spacy , solved , nightly	3	927	May 20, 2021
Prodigy ner.batch-train vs Spacy train usage , spacy , best-practices	13	3505	June 2, 2020

Training NER model with prodigy

Related topics