Getting the Negative Instances in a Trained Model

aball123 · July 30, 2021, 6:43pm

Hello,

I trained a model on my dataset and the results are great! I'm trying to tighten things up a bit and would like to review the actual annotations that the model got wrong. How do I go about doing this? I've looked in my model's folder, where ner, parser, tagger, etc. are, but cannot find a file that provides this information.

ines · August 1, 2021, 1:42am

Hi! The process of comparing the model's predictions to the correct examples in the evaluation data just runs during evaluation and is not something that's typically saved out with the model (because it's only relevant for computing the score).

However, you can always get that information by running your trained model over your evaluation data and then comparing its predictions (e.g. doc.ents, doc.cats) against the correct answers you have in your evaluation data. If the model predicts something that's not in the evaluation data, that's a false positive. If the evaluation data contains something that's not predicted by your model, that's a false negative.

For example, if you're annotating named entities, you could do something like this:

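# nlp is your trained pipeline (e.g. loaded with spacy.load) and examples is
# your evaluation data: a list of dicts with "text" and optional "spans"
for eg in examples: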
    doc = nlp(eg["text"])
    predicted_tuples = [(ent.start_char, ent.end_char, ent.label_) for ent in doc.ents]
    gold_tuples = [(span["start"], span["end"], span["label"]) for span in eg.get("spans", [])]

    # Output the information however you like
    print(doc.text)
    for ent in predicted_tuples:
        if ent not in gold_tuples:  # predicted by the model but not in evaluation data
            print("false positive:", ent)
    for ent in gold_tuples:
        if ent not in predicted_tuples:  # in evaluation data but not predicted
            print("false negative:", ent)
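
If you'd rather collect the errors than print them, the same comparison can be written with set differences. This is only a minimal sketch: `compare_spans` is a hypothetical helper name, and the tuples below are made-up stand-ins for the `predicted_tuples` and `gold_tuples` of a single example.

```python
from collections import Counter


def compare_spans(predicted, gold):
    """Compare two lists of (start, end, label) tuples.

    Returns (false_positives, false_negatives), each sorted by position.
    """
    predicted_set, gold_set = set(predicted), set(gold)
    false_positives = sorted(predicted_set - gold_set)  # predicted, not in gold
    false_negatives = sorted(gold_set - predicted_set)  # in gold, not predicted
    return false_positives, false_negatives


# Made-up data standing in for one example's predictions and gold annotations
predicted_tuples = [(0, 5, "PERSON"), (20, 26, "ORG")]
gold_tuples = [(0, 5, "PERSON"), (20, 26, "GPE"), (30, 36, "DATE")]

false_positives, false_negatives = compare_spans(predicted_tuples, gold_tuples)

# Count missed entities per label to see where the model struggles most
missed_by_label = Counter(label for _, _, label in false_negatives)
```

Note that a span with the right boundaries but the wrong label shows up twice with this kind of exact matching: once as a false positive (the predicted label) and once as a false negative (the gold label).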
