Prodigy NER train recipe getting killed for no apparent reason

Hello everyone,

I am trying to train an NER model in Prodigy, for which I have 6 datasets available (there could be more later), obtained through ner.manual. Some details about this data:

  • Each file has 1000 samples.
  • The text in each sample is "lengthy": ~3500 characters (~450 words) on average (I know that shorter texts would be better, but for my application, I need them to remain as long as they currently are).
  • 4 labels are being recognized.

Then I use the train recipe to start training, but it suddenly stops with the following (rather uninformative) output:

---  ------  ------------  --------  ------  ------  ------  ------
Killed

BTW, when I reduce the training data to 5 datasets or fewer, training runs normally. I was guessing at some memory issue, and this post seems to confirm it; however, that post does not explain clearly what to do to diagnose and confirm such a problem (Ines' suggestion only extends a list, while Guillaume briefly mentions the psutil library to confirm a memory issue, without showing any code snippet).

What should I do?

It's hard to say for sure, but given that training with one dataset fewer works fine, I'm indeed guessing it's a memory issue.

Are you able to export the datasets to the .spacy format via the data-to-spacy recipe? If so, we might be able to pick it up from spaCy.
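In the meantime, to check whether you're actually running short of memory, here's a minimal sketch using the psutil library mentioned in that other thread (assuming it's installed via pip install psutil; the function and label names are just examples):

```python
# Minimal sketch: log memory headroom around a memory-hungry step.
# Assumes `pip install psutil`; adapt the labels to your own workflow.
import psutil

def log_memory(label: str) -> None:
    """Print this process's resident memory and the system's available memory."""
    rss_gb = psutil.Process().memory_info().rss / 1024**3
    vm = psutil.virtual_memory()
    print(f"{label}: process uses {rss_gb:.2f} GB, "
          f"{vm.available / 1024**3:.2f} GB available ({vm.percent}% of RAM in use)")

log_memory("before loading corpus")
# ... load your annotations / run a training step here ...
log_memory("after loading corpus")
```

If the available memory drops toward zero right before the process dies with a bare Killed, the Linux OOM killer is the usual culprit; on most systems dmesg will then show a corresponding "Out of memory" entry for the process.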

Hello @koaning ,

I realized a couple of things:

  1. data-to-spacy managed to build the .spacy files required for training in spaCy.
  2. When training the model, however, I ran into that Killed message again.

Having the .spacy files already generated, however, I decided to move to another cloud-based VM, this time with more memory... and the training completed successfully. That indirectly confirms the root cause of my issue.

Still, it would be awesome to have some updated code snippet to diagnose this problem (i.e., a snippet which can tell you whether you are actually running short of memory for your training dataset[s]), and some suggestions to avoid this problem with "big" training datasets.

Thank you.

When you're training locally, you can pass a custom config.cfg file to train a spaCy model. It has a few parameters that might be worth exploring further. For example, it allows you to pick smaller weights, which could help, but this setting might be the most useful:

[training.batcher.size]
@schedules = "compounding.v1"
start = 100
stop = 1000
compound = 1.001

It could be that the batch size is too big for your machine, so you could set it to be much smaller.
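For example, a more conservative batcher section in your config.cfg could look like this (the section and function names follow spaCy's default config; the exact numbers below are just a guess for you to experiment with):

```ini
[training.batcher]
@batchers = "spacy.batch_by_words.v1"
discard_oversize = false
tolerance = 0.2
get_length = null

[training.batcher.size]
@schedules = "compounding.v1"
start = 10
stop = 100
compound = 1.001
```

With spacy.batch_by_words.v1, the size schedule is measured in words per batch: training starts at start words and compounds up to stop, so lowering stop is the quickest way to cap peak memory during training.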

Hello @koaning ,

Thank you, I'll keep an eye on it. In general, is there any section in the documentation where all the parameters in the config.cfg file are detailed / explained? (e.g., in your code segment, which variable controls the batch size?). The only pieces of information I have found so far are this, this and, more informally, this, but even though they're nicely documented for aspects closely related to the spaCy architecture, they miss some other aspects more related to the modeling itself.

It would be awesome to know if I am missing some other section in the documentation that could add more info regarding that.

Best regards.

I understand where you're coming from. The config.cfg can be a bit intimidating, just because there are so many settings in ML models these days.

I usually rely on the Model Architectures section of the spaCy docs to understand the hyperparameters a bit better. There are some ideas for better educational content in this domain, but for now that part of the docs is the best reference for understanding all the settings.


Hi @dave-espinosa

I am facing a similar memory problem while creating a model with NER train.

My database with annotated data is relatively small, 200 MB. I have never used a VM for computing before.

Can you share which cloud-based VM you used and how you set up such an environment?

If you can share your experience, that would be great help.


Hello @rahul1 ,

Allow me to answer your questions:

My company uses Google Cloud Platform products; specifically for VMs, we usually use either Compute Engine (my current choice for Prodigy) or Vertex AI Workbench. I think Google grants new users USD 300 in credits, which in my own experience is enough to run ~3 months' worth of experiments (important to say that Google charges by hourly rate, so my estimate might fluctuate greatly depending on the intensity of your own experiments).

I used Prodigy's official documentation for the installation and setup.

Hope it helps, and sorry about the delay!

Hi @dave-espinosa
Thank you very much for the information.
I will go for this.


Hi @dave-espinosa,

It worked! Thanks for the help. I used Compute Engine with 64 GB RAM and an Ubuntu boot disk. Inside the VM instance over SSH, installing Prodigy is similar to doing it on any local Ubuntu laptop.

gr. Rahul