how to test my model on new dataset ?

nehag0317 · April 24, 2020, 11:45am

Hi Team,

I want to export the model and want to test on new dataset. In spacy it is possible by using "GoldParse" and "Scorer". Is there anything similar in prodigy.
Here you can see that I am trying by giving some sample example but I want to pass a test file and want to evaluate the model on that.

Thanks
Neha

ines · April 24, 2020, 1:02pm

Hi! Under the hood, Prodigy's train recipe calls into spaCy and then runs nlp.update and nlp.evaluate (which returns a Scorer object with the scores).

If you want to use the train command to train your model, the easiest way is to use a custom evaluation set (instead of just the --eval-split 0.2, which holds back a random 20%). The train command accepts an --eval-id argument, which lets you point to a Prodigy dataset to use for evaluation. So if you have test data in Prodigy's format, you can import it to a new dataset and then use --eval-id name_of_your_test_dataset to evaluate on that data and report those results. This approach is also very useful if you're using Prodigy to create both your training and test data.

Alternatively, if you already have your own evaluation pipeline set up in your spaCy code, you could also export your Prodigy annotations with db-out and use them to train your model.

nehag0317 · April 26, 2020, 3:56pm

Thank you so much!!
I will try it out and update you

Topic		Replies	Views
How to evaluate the model accuracy with test data (not part of training) usage , ner , spacy	8	724	March 12, 2024
export predictions from prodigy model usage , spacy , solved	1	476	April 24, 2020
Gold notation, Test/Eval set for already trained model usage , ner	3	930	May 14, 2019
Formatting Prodigy annotations for evaluation of external NER models using spaCy usage , ner , spacy	4	595	April 13, 2022
Handling train / dev / test in Prodigy usage , ner , training	3	580	July 22, 2021

how to test my model on new dataset ?

Related topics