Hello,
I had an issue training my model, I realised it was because I was not merging entities for the same text. I can see from the way you are collecting your training data, you might get similar issue.
For example, you cannot train your model with these two training data:
("Google and Apple are companies.", {'entities': [(0, 6, 'Company')]})
("Google and Apple are companies.", {'entities': [(11, 16, 'Company')]})
Instead you need to have:
("Google and Apple are companies.", {'entities': [(0, 6, 'Company'), (11, 16, 'Company')]})
Also don't remove any training data with no entities, for example:
("Apple is a fruit.", {'entities': []})
is a useful training data.
I'm new to Spacy, please correct me if I'm wrong.
I hope it helps.