I guess it's more of a spaCy issue, but since I didn't try training directly with spaCy, I'm posting here :). Anyway, I'm using the ner.batch-train recipe with the following command: prodigy ner.batch-train ner-3 en_core_web_md --output /tmp/ner-3.model -es 0.3 -n 15 -b 32
I'm seeing HUGE RAM consumption, see the screenshot below (it's at 15 GB right now)… Luckily OSX compresses most of it, which is why I didn't notice at first. But it's quite problematic…
Memory use in spaCy parser beam training looks stable, so either the memory leak is within Prodigy, or it’s something to do with serialising the vectors.
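If you want to narrow it down, one quick check is to log the process's resident memory after each training epoch and see whether it grows linearly (a leak) or plateaus. Just a sketch using psutil, not part of Prodigy or spaCy:

```python
import os
import psutil

def log_memory(epoch):
    # Resident set size of the current process, in MB
    rss_mb = psutil.Process(os.getpid()).memory_info().rss / 1024 ** 2
    print(f"epoch {epoch}: {rss_mb:.0f} MB")
```

Calling log_memory(epoch) at the end of every epoch in whichever loop you're running should make it obvious where the growth happens.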
Just to confirm: I switched to training directly with spaCy and memory usage is OK, so the leak must be in Prodigy. Maybe it's related to the part of training that includes negative examples…
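For reference, this is roughly what I mean by training directly with spaCy (a minimal sketch of the spaCy v2 training loop; the label, example data and output path are placeholders, not my actual setup):

```python
import random
import spacy

nlp = spacy.load("en_core_web_md")
ner = nlp.get_pipe("ner")
ner.add_label("MY_LABEL")  # placeholder label

# Placeholder training data; in practice I export my Prodigy dataset
TRAIN_DATA = [
    ("Uber blew through $1 million a week", {"entities": [(0, 4, "ORG")]}),
]

# Only update the NER component, leave the rest of the pipeline untouched
other_pipes = [p for p in nlp.pipe_names if p != "ner"]
with nlp.disable_pipes(*other_pipes):
    optimizer = nlp.resume_training()
    for epoch in range(15):
        random.shuffle(TRAIN_DATA)
        losses = {}
        for text, annotations in TRAIN_DATA:
            nlp.update([text], [annotations], sgd=optimizer,
                       drop=0.2, losses=losses)
        print(epoch, losses)

nlp.to_disk("/tmp/ner-3-spacy.model")
```

With a loop like this, memory stays flat across epochs, whereas the ner.batch-train run keeps growing.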
Actually I'm wondering: is using negative examples something you added only for bootstrapping models in Prodigy (with few examples), or is it something you would recommend in general? I couldn't find any documentation on this training scenario in spaCy. I guess the idea is to use negative examples to constrain the beam search, but I'm not sure…
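To be clear, by "negative examples" I mean the tasks I answered "reject" in the Prodigy UI. As an illustration (hypothetical text and label), a rejected span looks roughly like this in the exported JSONL, written here as a Python dict:

```python
# Hypothetical example of one rejected NER task as exported by `prodigy db-out`
rejected_task = {
    "text": "Apple is looking at buying a U.K. startup",
    "spans": [{"start": 0, "end": 5, "label": "PERSON"}],  # wrong label, so I hit reject
    "answer": "reject",
}
```

My (unverified) understanding is that during the beam update, parses containing a rejected span get penalised while parses consistent with accepted spans get boosted, but I'd appreciate confirmation.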