Easy way to evaluate the output of ner.correct against its initial input

Given the following situation, what is the easiest strategy to get evaluation metrics (precision, recall, F-measure) for NER performance?

  • We have some preannotated material A (not produced by a spaCy or prodigy model)
  • We use ner.manual recipe to produce the corrected version B
  • How can we measure the performance of A against gold B without applying too many conversion steps?

Thanks for any hints.

Hi! I think the easiest way to do that would be to import your preannotated dataset into a separate Prodigy dataset and then run two training experiments with prodigy train and the same settings: one for the preannotated dataset and one for the corrected dataset. You'll then be able to compare the reported scores for both experiments and see how the model trained on your corrections compares.
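If you want raw precision/recall/F-measure of A against gold B without training anything, another option is to score the spans directly. The sketch below is not part of the Prodigy API — it's a minimal, self-contained example assuming you've exported both datasets (e.g. with prodigy db-out) and reduced each annotation to (doc_id, start, end, label) tuples; the strict exact-match criterion is the usual one for NER evaluation, and the example data at the bottom is hypothetical.

```python
# Minimal sketch (not a Prodigy API): strict entity-level
# precision/recall/F1 for preannotated spans A against gold spans B.
# A span counts as a true positive only on exact
# (doc_id, start, end, label) agreement.

def prf(pred_spans, gold_spans):
    """Each argument is an iterable of (doc_id, start, end, label) tuples."""
    pred, gold = set(pred_spans), set(gold_spans)
    tp = len(pred & gold)  # exact matches between A and gold B
    precision = tp / len(pred) if pred else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Hypothetical example: A and gold B each have two spans on one
# document and agree on exactly one of them.
a = {("doc1", 0, 5, "ORG"), ("doc1", 10, 15, "PERSON")}
b = {("doc1", 0, 5, "ORG"), ("doc1", 20, 25, "GPE")}
print(prf(a, b))  # (0.5, 0.5, 0.5)
```

This avoids the variance of training runs entirely, at the cost of only measuring annotation agreement rather than downstream model quality.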