I have two textcat models. One was trained on manually annotated data, the other via active learning. The actively trained model only used a subset of the full dataset. Now I want to compare the performance of the two models.
`prodigy train` won't do it, because it only evaluates against its own data (or a held-out split of it). In my case that would just be comparing apples with oranges.
It would make more sense to evaluate both models on the same unseen dataset, but I haven't seen how to do that in Prodigy. It looks like I'd have to export the two models to spaCy and evaluate them in spaCy against the unseen evaluation set. Am I right?
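For reference, here is the workflow I'm imagining, as a rough sketch. The dataset name `eval_set` and the output paths are placeholders, and I'm assuming `data-to-spacy` and `spacy evaluate` work the way I think they do:

```shell
# Export the held-out annotations from Prodigy into spaCy's binary format.
# (eval_set is a hypothetical Prodigy dataset holding the unseen examples.)
prodigy data-to-spacy ./eval_export --textcat eval_set

# Evaluate each trained pipeline against the same exported eval data.
python -m spacy evaluate ./model_manual ./eval_export/train.spacy
python -m spacy evaluate ./model_active ./eval_export/train.spacy
```

Since both `spacy evaluate` runs score against the identical `.spacy` file, the resulting metrics should be directly comparable.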