Recall and Precision (TN, TP, FN, FP)

The prodigy.models.ner.EntityRecognizer.evaluate() method will tell you the model's accuracy, but it doesn't currently return P/R/F scores. The method supports the use-case where the gold standard contains only entities known to be correct, without necessarily containing all of the correct entities, i.e. a gold standard with missing values. If your gold standard has no missing values, you should pass the flag no_missing=True.

Here’s some code to return P/R/F, assuming you have no missing values in your gold standard:


# Assumes `nlp` is a loaded spaCy pipeline and `test_examples` is a list of
# dicts with a "text" key and gold-standard "spans" (start/end/label).
tp = 0.0
fp = 0.0
fn = 0.0
for eg in test_examples:
    doc = nlp(eg["text"])
    guesses = set((ent.start_char, ent.end_char, ent.label_) for ent in doc.ents)
    truths = set((span["start"], span["end"], span["label"]) for span in eg["spans"])
    tp += len(guesses.intersection(truths))  # exact span + label matches
    fn += len(truths - guesses)              # gold spans the model missed
    fp += len(guesses - truths)              # predicted spans not in the gold
precision = tp / (tp + fp + 1e-10)
recall = tp / (tp + fn + 1e-10)
fscore = (2 * precision * recall) / (precision + recall + 1e-10)
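If you'd rather reuse the arithmetic, the counting logic above can be factored into a small helper that works on any pair of span sets. This is just a sketch with a made-up function name (`span_prf`) and toy spans, not part of Prodigy's API:

```python
def span_prf(truths, guesses, eps=1e-10):
    """Compute precision/recall/F1 over sets of (start, end, label) tuples."""
    tp = len(guesses & truths)   # spans the model got exactly right
    fp = len(guesses - truths)   # spans the model predicted but the gold lacks
    fn = len(truths - guesses)   # gold spans the model missed
    precision = tp / (tp + fp + eps)
    recall = tp / (tp + fn + eps)
    fscore = (2 * precision * recall) / (precision + recall + eps)
    return precision, recall, fscore

# Toy example: one correct span, one spurious prediction, one missed span.
gold = {(0, 5, "PERSON"), (10, 15, "ORG")}
pred = {(0, 5, "PERSON"), (20, 25, "GPE")}
p, r, f = span_prf(gold, pred)  # each is approximately 0.5
```

Because matching is done on exact (start, end, label) tuples, a prediction that overlaps a gold span but has slightly different boundaries counts as both a false positive and a false negative.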