NER train curve with label stats?

yihzhang-suncor · December 9, 2021, 3:47am

Is it possible to combine train-curve and label-stats such that the output provides the train-curve plots for each label individually?

Also is there a way to save the metrics from train-curve in a file?

ines · December 12, 2021, 5:25pm

Hi! At the moment, the per-label stats are only available in the regular training, since it'd otherwise get very verbose very quickly. The per-label stats can also be a bit less representative when training with small portions of the data, because you can easily end up with very few instances of a given label.

That said, you can take a look at the implementation in recipes/train.py (you can run prodigy stats to find the location of your Prodigy installation) and make a small adjustment to how the training is run in train_curve. If you change the call to the _train helper like this, it'll show all detailed stats for each training run, including the per-label stats:

baseline, scores = _train(
    config, gpu_id=gpu_id, overrides=overrides, silent=False, show_label_stats=True
)

If you just want to save the output in the same format it's shown on the CLI, one option would be to just redirect the output to a file:

prodigy train-curve ... > train_curve.txt

Topic		Replies	Views
save train-curve results usage , solved , training	1	358	January 5, 2022
Viewing Model Training Stats usage , training	1	356	October 25, 2021
--label-stats for spaCy train ner , spacy , solved , transformers	2	20	July 7, 2024
ner.batch-train callback? usage , ner	3	431	August 10, 2018
NER Trained Model Analysis ner , spacy	9	542	July 30, 2023

NER train curve with label stats?

Related topics