train textcat baseline accuracy

Where can I find a description of the "baseline accuracy" when using "prodigy train textcat"? Is it just the naive approach of classifying everything as the success class or something else?

Hi! The score reported as the baseline accuracy in the regular train recipe is the result of evaluating the base model on the evaluation set. If you're using a blank model, this is the accuracy with randomly initialized weights. Or, expressed in code, the equivalent of this:

import spacy

nlp = spacy.blank("en")  # blank English pipeline, no trained components
textcat = nlp.create_pipe("textcat")
textcat.add_label("LABEL_A")  # one add_label call per category
nlp.add_pipe(textcat)
nlp.begin_training()  # initializes the weights randomly
scores = nlp.evaluate(eval_data)  # eval_data: list of (text, annotations) pairs

I think in the previous textcat.batch-train recipe, Prodigy was actually calculating a majority-class baseline, which is probably a more useful metric here and something we should add back (at least as an option).
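For reference, a majority-class baseline is just the accuracy you'd get by always predicting the most frequent label. A minimal sketch of that calculation, assuming eval_labels is a list of gold label strings (the name is made up for illustration):

from collections import Counter

def majority_class_accuracy(eval_labels):
    # Accuracy of always predicting the most frequent gold label
    majority_count = Counter(eval_labels).most_common(1)[0][1]
    return majority_count / len(eval_labels)

# e.g. majority_class_accuracy(["POS", "POS", "NEG"]) == 2 / 3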


@ines just to clarify, is the baseline accuracy the predictive accuracy on the validation set? I'm a little confused because mine doesn't change after training... does this mean my model isn't actually improving, despite a ROC score of .9 (i.e., that it's basically just overfitting)?

Yes, in this case, it's the result of evaluating the model on the evaluation data before training. If you start with a blank model, it'll be the accuracy of a model with randomly initialized weights. So basically, the accuracy if you did nothing.

Do you mean the accuracy after training is lower than the baseline accuracy? If that's the case, that would indicate that something is wrong (either in the training or the evaluation), because the weights you trained ended up performing worse than the randomly initialized weights.
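To make that concrete, here's a rough sketch of the before/after comparison, using the same spaCy v2-style API as the snippet above. train_data and eval_data are placeholders for your own lists of (text, annotations) pairs, and the training loop details are illustrative, not Prodigy's exact internals:

import spacy
from spacy.util import minibatch

nlp = spacy.blank("en")
textcat = nlp.create_pipe("textcat")
textcat.add_label("LABEL_A")
nlp.add_pipe(textcat)
optimizer = nlp.begin_training()

baseline = nlp.evaluate(eval_data)  # baseline: randomly initialized weights

for epoch in range(10):
    for batch in minibatch(train_data, size=8):
        texts, annotations = zip(*batch)
        nlp.update(texts, annotations, sgd=optimizer)

trained = nlp.evaluate(eval_data)
# trained scores should beat the baseline; if they're lower,
# something is off in the training or the evaluation setup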