Prodigy 1.90 train recipe --ner-missing argument

In a typical training scenario, you're updating a model with examples and the correct answer – e.g. a text and all the entities in it. In some cases, you may also have partial annotations: you know some of the entities, but not all of them.
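For example, a partially annotated record might look roughly like the sketch below (the structure follows Prodigy's JSONL task format; the text, offsets and labels are made up for illustration). This is the situation the --ner-missing argument is about: it tells the recipe to treat unannotated tokens as missing values, instead of as "outside" (definitely not an entity).

```python
# Partially annotated example: one entity is known to be correct, but the
# rest of the text hasn't been checked, so other entities may be missing.
partial_example = {
    "text": "Uber hired a new CTO in London last week.",
    "spans": [
        # The only span we've annotated – we know this one is an ORG.
        {"start": 0, "end": 4, "label": "ORG"},
        # "London" is also an entity, but it isn't annotated here, so the
        # model shouldn't be updated as if it were "not an entity".
    ],
}
```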

Prodigy's active learning recipes like ner.teach also let you collect binary yes/no decisions. The data you create here is different again: for the spans you accepted, you know that they are entities of the given label. For the ones you rejected, you only know that they're not of type X – they could still be something else. This requires a different way of updating the model: you want to update it with the positive examples where you know the answer, and proportionally with the "negative" examples where you only know that a certain label doesn't apply. That's the type of training the --binary flag enables.
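To make the difference concrete, here's roughly what two binary decisions from ner.teach could look like in your dataset. The "answer" key follows Prodigy's task format; the text, offsets and labels are made up for illustration:

```python
# Accepted decision: we know this span IS a PERSON entity.
accepted = {
    "text": "Talks with Tim Cook continued on Monday.",
    "spans": [{"start": 11, "end": 19, "label": "PERSON"}],
    "answer": "accept",
}

# Rejected decision: we only know this span is NOT an ORG –
# it could still be an entity of a different type (here, a person).
rejected = {
    "text": "Talks with Tim Cook continued on Monday.",
    "spans": [{"start": 11, "end": 19, "label": "ORG"}],
    "answer": "reject",
}
```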

Fine-grained per-label accuracy is a very new feature in spaCy, so we only just added it to the regular training recipe in Prodigy v1.9. Binary training requires a very different evaluation (for the reasons explained above), so if we wanted per-label accuracy there as well, we'd have to come up with our own implementation and logic for it. It's also not clear whether per-label scores translate well to the binary case, or whether they'd make the results easier to reason about.
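For the regular (non-binary) case, the per-label numbers come from spaCy's Scorer, which you can also use yourself on a gold-standard evaluation set. Here's a minimal sketch, assuming spaCy v2.2 and a small list of (text, entity offsets) examples I made up:

```python
import spacy
from spacy.gold import GoldParse
from spacy.scorer import Scorer

# Gold-standard evaluation examples with full, correct annotations
# (hypothetical data – character offsets into the text).
eval_data = [
    ("Apple opened a new store in Berlin.",
     [(0, 5, "ORG"), (28, 34, "GPE")]),
]

nlp = spacy.load("en_core_web_sm")
scorer = Scorer()
for text, entities in eval_data:
    gold = GoldParse(nlp.make_doc(text), entities=entities)
    scorer.score(nlp(text), gold)

# Overall NER precision/recall/F-score, plus the per-label breakdown
print(scorer.ents_p, scorer.ents_r, scorer.ents_f)
print(scorer.ents_per_type)
```

The ents_per_type property gives you a dict keyed by label with precision, recall and F-score per entity type – that's the kind of breakdown the regular training recipe can report, and it only makes sense if the evaluation data has complete, gold-standard annotations.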