make-gold workflow

mbickel · June 11, 2018, 10:19am

Hi,

I am a bit confused by the ner.make-gold workflow. If the model gives me a suggestion where the labels are incorrect, am I supposed to correct the suggestions and then click accept or should I just reject it?

ines · June 11, 2018, 11:17am

The goal of the ner.make-gold workflow is to produce gold-standard data – i.e. annotations that are complete and “perfect”. In ner.teach, you just give the model binary feedback on different analyses of the text – but in ner.make-gold, the idea is that you correct the entities until the example is complete and all entities are labelled, and then accept it. If you come across a sentence that includes no entities, you would simply accept the unlabelled sentence.

I normally use the “reject” action to explicitly mark examples that are wrong for other reasons – for example, if the tokenization is bad or if it includes bad markup etc.

(Btw, you could also create gold-standard data by hand using ner.manual, but correcting the model’s predictions is often faster, because there’s always a chance that the model gets at least some of the entities right.)

Topic		Replies	Views
When to reject in ner.manual or ner.make-gold? usage , ner , solved	1	1291	October 17, 2018
Annotation strategy for gold-standard data usage , ner , solved , best-practices	5	2704	October 26, 2018
ner.make-gold to re-evaluate pre-annotated dataset ner , solved	2	666	July 25, 2018
training of annotated dataset with ner.make-gold usage , ner	6	1792	August 7, 2019
Manual Annotation Response for Text Without Entities usage , ner , solved	6	1005	March 16, 2018

make-gold workflow

Related topics