Viewing Model Training Stats

aureliusnoble · October 20, 2021, 9:29pm

Hi,

I just started using prodigy and was watching some of the videos/reading (e.g. this one TRAINING AN INSULTS CLASSIFIER with Prodigy in ~1 hour - YouTube) the documentation and saw that running the "train" command generates a number of stats about the model, (e.g. recall, precision, f-stat, accuracy, #correct, #incorrect, etc.)

However, when I run:
$prodigy train ./model --textcat annotations
It only gives stats on loss, CATS_SCORE and SCORE.

I feel like I am missing something very obvious. Is there an easy way to generate these stats?

Thanks,

Aurelius

ljvmiranda921 · October 25, 2021, 12:25am

Hi @aureliusnoble !

The tutorial was created using an older version of Prodigy. that's why the console output differs. To see the per-label result, you can pass the --label-stats parameter in your train command:

prodigy train ./model --textcat annotations --label-stats

Note that you can also pass a --base-model for tokenization and sentence segmentation.

The reason why it didn't show up in the previous command is because the reported textcat scores depend upon the text classification architecture used. In addition, the training scheme for Prodigy v1.11+ is now identical to the v3 spacy train—the outputs are now similar for consistency.

Topic		Replies	Views
Print accuracy in prodigy train textcat	2	430	November 11, 2022
Train results accuracy dont appears for 1.11.0 usage , solved , training	2	249	August 17, 2021
Textcat results seems worse in new prodigy version usage , textcat , spacy , solved , training	4	648	August 30, 2021
TextCat Training Results on a per label basis. usage , textcat	1	443	February 18, 2019
train textcat doesn't show precision and recall enhancement , textcat , spacy	1	657	March 23, 2020

Viewing Model Training Stats

Related topics