Hello,
I created 24908 documents with labels using the EntityRuler and PhraseMatcher. Then I ran ner.batch-train with these options: "--n-iter 10 --eval-split 0.2 --dropout 0.2 --unsegmented --no-missing"
The accuracy is not improving much in the last iterations. Do I need to run more iterations or add more data? I think a dataset of 24,000 examples should be plenty for batch-train. Why might the accuracy not be improving?
15:05:54 - MODEL: Using 24908 examples (without 'ignore')
Using 20% of accept/reject examples (4929) for evaluation
15:05:59 - RECIPE: Temporarily disabled other pipes: ['tagger', 'parser']
15:05:59 - RECIPE: Initialised EntityRecognizer with model en_core_web_sm
15:05:59 - MODEL: Merging entity spans of 4929 examples
15:05:59 - MODEL: Using 4929 examples (without 'ignore')
15:08:50 - MODEL: Evaluated 4929 examples
15:08:50 - RECIPE: Calculated baseline from evaluation examples (accuracy 0.00)
Using 100% of remaining examples (19719) for training
Dropout: 0.2 Batch size: 16 Iterations: 10
BEFORE 0.000
Correct 0
Incorrect 48658
Entities 189068
Unknown 0
I think your dataset size is fine, so the settings are probably the thing to look at.
Try using en_vectors_web_lg instead of en_core_web_sm: with en_core_web_sm, I think the model is trying to train the classifier on top of the existing entities, while en_vectors_web_lg starts you off with word vectors and a blank model, which should work better.
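For example, roughly something like this (the dataset name and output path here are placeholders; the other settings are the ones you already used):

prodigy ner.batch-train your_dataset en_vectors_web_lg --output /path/to/ner-model --n-iter 10 --eval-split 0.2 --dropout 0.2 --unsegmented --no-missing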
You might also find it helpful to convert the data over so that you can train spaCy directly. This lets you use whichever version of spaCy you want, and lets you use the extra features in spacy train. The easiest way to do this is:
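Something along these lines should work, assuming your annotations live in a Prodigy dataset (the dataset and file names are placeholders, and it's worth checking spacy convert --help for the exact arguments your version expects):

prodigy db-out your_dataset > annotations.jsonl
python -m spacy convert annotations.jsonl training-data --lang en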
Thanks for the reply. I will try the vectors and run batch-train again. I did convert to the spaCy format. See my steps below and let me know if anything seems wrong.
python -m spacy convert dataset_03.jsonl data_03.json --lang en
Split data_03.json into data_train.json (80%) and data_test.json (20%)
python -m spacy debug-data en data_train.json data_test.json -V
python -m spacy train en data_03_model data_train.json data_test.json
But each iteration is taking at least 1 hour. I thought something was wrong and stopped the training. I have never used the spacy train command before. The spaCy docs say the default is 30 iterations; does this mean my whole spaCy training run would take around 30 hours? Is that common?
spaCy train command with default options:
python -m spacy train en data_03_model data_train.json data_test.json
Result:
Itn Dep Loss NER Loss UAS NER P NER R NER F Tag % Token % CPU WPS GPU WPS
..............................
..............................
21 0.000 641.899 0.000 59.259 16.593 25.927 94.536 100.000 10626 0
22 0.000 521.809 0.000 58.926 16.461 25.733 94.536 100.000 10578 0
23 0.000 602.534 0.000 58.672 16.571 25.843 94.536 100.000 10632 0
24 0.000 508.629 0.000 57.198 16.483 25.591 94.536 100.000 10657 0
25 0.000 511.100 0.000 56.297 16.571 25.605 94.536 100.000 10570 0
26 0.000 393.782 0.000 55.242 16.395 25.285 94.536 100.000 10608 0
27 0.000 405.062 0.000 55.898 16.417 25.379 94.536 100.000 10577 0
28 0.000 459.113 0.000 55.422 16.240 25.119 94.536 100.000 10594 0
29 0.000 412.692 0.000 55.365 16.395 25.298 94.536 100.000 10533
If I increase the number of iterations to 50, the accuracy of the last 10 iterations goes down from 25 to 24. Is there anything I'm missing when I train with the spacy train command?
I think this is trying to train a whole pipeline, perhaps? If you add the --pipeline ner argument, it should only train the NER, which should speed things up.
I'm not sure why the results are like that though. Maybe try the --vectors en_vectors_web_lg argument, and see if that helps?
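So, something like this might be worth a try (same files as your command above, just with the two extra arguments):

python -m spacy train en data_03_model data_train.json data_test.json --pipeline ner --vectors en_vectors_web_lg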
Aah, I think I know what's wrong. Sorry, there's a step missing that Prodigy normally handles for you: I think we need to call prodigy.models.ner.merge_spans on your data. If you have multiple examples that refer to the same text with different annotations, this function consolidates them into one example.
Try this:
import json
import prodigy.components.db
import prodigy.models.ner

dataset_name = "your_dataset_name"  # replace with the name of your Prodigy dataset

DB = prodigy.components.db.connect()
examples = DB.get_dataset(dataset_name)
print(len(examples), "before merging")

# Combine all annotations that refer to the same text into one example
examples = prodigy.models.ner.merge_spans(examples)
print(len(examples), "after merging")

# Print the merged examples as JSONL (one JSON object per line)
for example in examples:
    print(json.dumps(example))
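If that looks right, you can save it as a small script and redirect the output to a new JSONL file (the file names here are just suggestions), then convert and train from the merged file instead of the unmerged one:

python merge_dataset.py > dataset_03_merged.jsonl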
The above code merges the spans for Prodigy, right? Then do I need to run batch-train again with the new dataset?
Or do I need to run it for the spacy train command? Sorry for asking; it's just not clear to me.
The above code merges the annotations you collected so that each example only exists once and contains all annotated spans (accepted and rejected). Prodigy's ner.batch-train will run that automatically (you can also see that when you look at the recipe).
But if you've collected binary annotations with Prodigy and you don't merge them before converting them to train with spaCy, you may end up with multiple conflicting annotations and worse results. So if you're not doing this already, you should merge the spans before you convert your data for spaCy.
Thank you so much for the clarification. I didn't use Prodigy for the annotations. I already have most of the required entities, so I used the EntityRuler to create Prodigy-style spans to feed to batch-train, and the spaCy format for spacy train. Basically, I am using existing data to build the training data and then trying to train the model via the CLI.
Sorry I'm having trouble understanding your workflow. So you created patterns, ran spaCy's EntityRuler over some texts, and then you're training a model to predict what the EntityRuler recognised? Won't the model just learn to repeat the rules?
I am really sorry if it's confusing. Let me explain it clearly.
We want to use NER to replace a feeds-based system in production, so we already have a pipeline that identifies the relevant information through some fairly complicated rules. At first I tried annotating with Prodigy's ner.manual, but it was taking ages, so I started using the existing data to create spans for each text.
So far my workflow:
-> Step 1: Fetch the actual text and entities from the database and create a JSONL file in Prodigy-style format with just the spans (no tokens). Repeat this step until there is enough data; I fetched around 24000 examples. For example:
# label and value come from each database record; nlp, entity_ruler,
# text and span_list are set up earlier in the script
my_regex_patterns.append({"label": label.upper(), "pattern": value.lower()})
entity_ruler.add_patterns(my_regex_patterns)
nlp.add_pipe(entity_ruler)
doc = nlp(text.lower())
entities = list(doc.ents)
for ent in entities:
    span_list.append({"start": ent.start_char, "end": ent.end_char, "label": ent.label_})
-> Then run ner.batch-train. I used this because, before arriving at this workflow, I had used ner.manual and batch-train, so I was very comfortable with Prodigy's output.
-> Convert that Step 1 JSONL file into spaCy format and create the train and test datasets using the commands below:
python -m spacy convert dataset_03.jsonl data_03.json --lang en
Split data_03.json into data_train.json (80%) and data_test.json (20%)
-> Then, finally, run the spacy train command.
Could you please let me know where I am going wrong? Also, please suggest any better approaches for this work.
I had an issue training my model, and I realised it was because I was not merging entities for the same text. From the way you are collecting your training data, I can see you might run into a similar issue.
For example, you cannot train your model with these two training examples:
("Google and Apple are companies.", {'entities': [(0, 6, 'Company')]})
("Google and Apple are companies.", {'entities': [(11, 16, 'Company')]})
Instead you need to have:
("Google and Apple are companies.", {'entities': [(0, 6, 'Company'), (11, 16, 'Company')]})
Also don't remove any training data with no entities, for example:
("Apple is a fruit.", {'entities': []})
is still a useful training example.
I'm new to spaCy, so please correct me if I'm wrong.
I hope it helps.
Hi PEDRAM, thanks for your reply. As I am programmatically creating the entities from Oracle, my script already maps them into the form below; I do not have individual, unmerged entities in the JSONL file.
("Google and Apple are companies.", {'entities': [(0, 6, 'Company'), (11, 16, 'Company')]})
At the same time, I cannot produce an empty list, as every text in Oracle contains at least one field corresponding to the text. Is it a problem not to have any empty-entity examples in the dataset? I feel like I am nearly there with training a model.
Okay, thanks! I think I understand better now. I wonder whether the problem could be in the data splitting? Did you make sure the split is on a random 20%? It might be easier to shuffle the jsonl lines before you convert them, instead of splitting the json file afterwards.
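For example, a small script along these lines could do the shuffle and 80/20 split on the JSONL before converting (the output file names are just suggestions):

import random

# Shuffle the annotation lines and split them 80/20 into train and test sets
random.seed(0)
with open("dataset_03.jsonl", encoding="utf8") as f:
    lines = f.readlines()
random.shuffle(lines)
split = int(len(lines) * 0.8)
with open("train_03.jsonl", "w", encoding="utf8") as f:
    f.writelines(lines[:split])
with open("test_03.jsonl", "w", encoding="utf8") as f:
    f.writelines(lines[split:])

You can then run spacy convert on each of the two files to get the train and dev JSON files.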
More generally, I think training an NER model on the output of regex rules is unlikely to get you a better result than the rules themselves, unless you first manually correct any mistakes there might be in the rule output.
Have you tried using the ner.make-gold recipe? This would let you work through the data with the suggestions from the EntityRuler, and approve them. This should be quicker than ner.manual, while giving you results that are more correct than the rules.
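If you go that route, one possible setup (the dataset name, paths, and labels here are placeholders) is to save the pipeline that contains your EntityRuler to disk and point the recipe at it, so its matches show up as pre-highlighted suggestions:

# in the script where you build the ruler: save the pipeline so Prodigy can load it
nlp.to_disk("./model_with_ruler")

prodigy ner.make-gold gold_dataset ./model_with_ruler source_texts.jsonl --label COMPANY,YOUR_OTHER_LABELS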