transformers based ner evaluation result is 90 but while predicting using nlp pipeline it's showing no output

sagornsl · July 28, 2021, 8:52am

Hi,
I trained my NER data spacy with roberta-base transformers.
evaluation result is 90 F1 score. But while predicting using normal spacy nlp pipeline no entity output is showing.
My training data is in "word label" format. where "word" get from whitespace tokenizer and label in BLIOU format.
Can anyone point out the problem here.
Thanks in advance.
regards

ines · July 29, 2021, 11:55pm

Hi! This is a bit difficult to debug from afar because there can be many different causes an explanations. Some things to check would be:

How is your evaluation done, and is it representative? If your evaluation set is really small, or doesn't contain many entities, you can end up with a score that doesn't actually tell you very much about how useful your model actually is. For instance, if your evaluation data doesn't contain any entities, your model may report an accuracy of 100%, because it has just learned to never predict entities.
How do you perform the checks with our nlp object? Are you just testing on some random examples you can think of? This isn't always the best and most representative way to check if things are working as expected. Even if your model is 90% accurate, it'll still make mistakes.

Topic		Replies	Views
Review Approaches to NER on Unstructured Data (and Discussing Amazon Comprehend vs spaCy + Prodigy) ner , spacy , aws	6	1169	August 2, 2022
Ner evaluation probability threshold usage , ner , spacy	2	427	September 15, 2020
How to evaluate the model accuracy with test data (not part of training) usage , ner , spacy	8	706	March 12, 2024
--label-stats for spaCy train ner , spacy , solved , transformers	2	20	July 7, 2024
Missing entity result ner , solved	7	918	August 29, 2022

transformers based ner evaluation result is 90 but while predicting using nlp pipeline it's showing no output

Related topics