TLDR - Does using both sent.correct and sent.teach for the same dataset reduce training performance?
I have 100 annotations in a job postings dataset that explicitly delineate the correct sentence start via sent.correct. Training on these with the default config and a 70/30 train/eval split yields ~80% accuracy.
However, I noticed a significant decrease in training scores after I created 100+ additional annotations using sent.teach.
It appears that the rejected answers from sent.teach affected training performance. I was able to copy the examples from the db, filter them, drop the dataset, and import them back in.
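In case it's useful to anyone else, here's a rough sketch of that export/filter/re-import step using Prodigy's Python database API instead of db-out/db-in. The dataset names are made up, and it assumes the sent.teach examples carry the usual "answer" field ("accept"/"reject"/"ignore"):

```python
from prodigy.components.db import connect

# Hypothetical dataset names; adjust to your own.
SOURCE = "job_postings_sents"
CLEANED = "job_postings_sents_accept"

db = connect()  # uses the database settings from prodigy.json

# Pull every annotation stored for the source dataset.
examples = db.get_dataset(SOURCE)

# Keep only accepted answers; rejected/ignored sent.teach answers are dropped
# so they can't skew training.
accepted = [eg for eg in examples if eg.get("answer") == "accept"]

# Write the filtered examples into a fresh dataset for training.
if CLEANED not in db:
    db.add_dataset(CLEANED)
db.add_examples(accepted, datasets=[CLEANED])

print(f"Kept {len(accepted)} of {len(examples)} examples")
```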
How many examples with labels do you have? I usually prefer to have at least ~500 examples in a validation set before I take performance numbers very seriously. The reason is that there's a risk of overfitting on a subset of the data that isn't representative of the task.
That said, if you have a relatively small dataset to train/score on, it's possible that the active learning approach (the one learning on newly labelled subsets) overfits a bit on those subsets.
My gut feeling is that this issue will go away once you have more labels. That's just a gut feeling, though; if the issue persists once you have a much larger labelled dataset, it would certainly be interesting to dig into further.
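For a quick way to answer the "how many labelled examples" question, a small sketch like this (dataset name hypothetical) prints the answer breakdown for a dataset:

```python
from collections import Counter

from prodigy.components.db import connect

db = connect()
# Hypothetical dataset name; replace with your own.
examples = db.get_dataset("job_postings_sents")

# Count accepted vs. rejected vs. ignored answers to see how many usable labels there are.
print(len(examples), Counter(eg.get("answer") for eg in examples))
```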