I would like to see training curves for each classification label using AUC as the metric. It doesn't seem like spaCy supports this out of the box. If that's the case, where would I start if I wanted to add reporting functionality that looked something like the below?
Hi! The new train recipe function returns a (best_scores, baseline) tuple – best_scores is an instance of spaCy's Scorer, which includes overall accuracy scores, as well as scores per label.
The train-curve recipe mostly just runs train with different portions of the data and then outputs the best score for each training run at the end. So you could write your own version of the recipe that prints best_scores.textcats_per_cat instead of using the default results printer. (You can find the recipe in recipes/train.py in your Prodigy installation. It should be pretty straightforward to copy, because it mostly just calls train() with arguments. To find the location of your Prodigy installation, you can run prodigy stats.)
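If the per-label scores on the Scorer don't include AUC in your spaCy version, one option is to compute it yourself from the model's predictions after each training run. Here's a minimal sketch in plain Python. It assumes you've already collected the gold labels and the predicted scores (for example from doc.cats) for your evaluation examples as lists of dicts. The helper names roc_auc and auc_per_label are hypothetical and not part of spaCy or Prodigy.

```python
def roc_auc(gold, scores):
    """ROC AUC for one label: the probability that a randomly chosen
    positive example scores higher than a randomly chosen negative one
    (ties count as 0.5). Returns None if only one class is present."""
    pos = [s for g, s in zip(gold, scores) if g]
    neg = [s for g, s in zip(gold, scores) if not g]
    if not pos or not neg:
        return None
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def auc_per_label(gold_cats, pred_cats):
    """Compute AUC separately for every label.

    gold_cats: list of dicts like {"LABEL": True/False} (assumed format)
    pred_cats: list of dicts like {"LABEL": 0.87}, e.g. taken from doc.cats
    """
    labels = sorted({label for cats in gold_cats for label in cats})
    result = {}
    for label in labels:
        gold = [bool(cats.get(label, False)) for cats in gold_cats]
        scores = [cats.get(label, 0.0) for cats in pred_cats]
        result[label] = roc_auc(gold, scores)
    return result


if __name__ == "__main__":
    # Toy data for illustration only, not real results
    gold_cats = [
        {"A": True, "B": False},
        {"A": False, "B": True},
        {"A": True, "B": False},
        {"A": False, "B": True},
    ]
    pred_cats = [
        {"A": 0.9, "B": 0.2},
        {"A": 0.3, "B": 0.8},
        {"A": 0.6, "B": 0.7},
        {"A": 0.7, "B": 0.4},
    ]
    for label, auc in auc_per_label(gold_cats, pred_cats).items():
        print(label, auc)
```

In a custom version of the recipe, you could call something like auc_per_label once per training run (one run per portion of the data) and print or plot the values per label. The pairwise comparison is quadratic in the number of examples, which should be fine for typical evaluation sets. For larger ones, a library implementation such as scikit-learn's roc_auc_score would be a better fit.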