I am using NER and wonder whether active learning affects how the accuracy figures should be interpreted.
My thoughts are the following:
- active learning only selects the most challenging examples
- the accuracy on the evaluation set might therefore be lower than if I used randomly drawn examples for evaluation
- that might mean that, e.g., a measured 65% is in reality 65+x%
Overall, the question is mostly theoretical: with print stream I can look at the results and I like what I see. However, I started wondering when, after going from 500 to 1000 examples, I only saw a minimal increase in accuracy (which might also simply be correct).
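To make the concern concrete, here is a toy simulation I put together. It is purely illustrative: there is no real NER model, and the pool, the difficulty scores, and the selection rule are all made up. It just shows how picking the "hardest" examples for evaluation can drag the measured accuracy below what a random sample would show.

```python
import random

random.seed(0)

# Toy pool: each example has a "difficulty" in [0, 1] that drives both
# how uncertain the model is about it and how likely the model is to get it wrong.
pool = [random.random() for _ in range(10_000)]

def model_is_correct(difficulty):
    # Assume the model is right more often on easy examples.
    return random.random() > difficulty

def accuracy(examples):
    return sum(model_is_correct(d) for d in examples) / len(examples)

random_sample = random.sample(pool, 500)
# "Active learning"-style selection: take the 500 hardest examples.
hardest_sample = sorted(pool, reverse=True)[:500]

print(f"accuracy on random sample:  {accuracy(random_sample):.2f}")   # roughly 0.5
print(f"accuracy on hardest sample: {accuracy(hardest_sample):.2f}")  # much lower
```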
Sorry for the delay getting back to you on this — it slipped through, so I’m only seeing this now.
The simple answer to your question is yes: active learning does select a biased sample, so for a reliable estimate of accuracy you should annotate a separate, held-out data set without using active learning to select the examples. Random splitting is a useful option at the start of a project as a quick-and-dirty measure of progress, but after the first day or two of work, I would suggest making a dedicated evaluation set.
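Just to sketch the workflow (this is not a Prodigy recipe; the file names, the eval size of 500 and the seed are placeholders): carve out the evaluation set with a single random split before any active-learning selection happens, annotate it exhaustively, and then keep it fixed while the rest of the data goes through the active-learning loop.

```python
import json
import random

def split_eval_set(examples, eval_size=500, seed=42):
    """Shuffle once and set aside a fixed, randomly drawn evaluation set."""
    rng = random.Random(seed)
    shuffled = list(examples)
    rng.shuffle(shuffled)
    return shuffled[eval_size:], shuffled[:eval_size]  # (annotation pool, eval set)

# Hypothetical input file: one {"text": ...} record per line.
with open("raw_texts.jsonl", encoding="utf8") as f:
    examples = [json.loads(line) for line in f]

pool, eval_set = split_eval_set(examples)

# Annotate eval_set exhaustively (no active learning) and keep it fixed;
# only feed `pool` into the active-learning loop, and always report
# accuracy against the annotated evaluation set.
with open("eval_raw.jsonl", "w", encoding="utf8") as f:
    for eg in eval_set:
        f.write(json.dumps(eg) + "\n")
```

Because the evaluation set is drawn randomly and never touched by the selection strategy, the accuracy you measure on it stays comparable as you add more training examples.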