Hi @dad766!
Yes, very likely that's the problem. Have you tried training on CPU? Also, can you try training on just the X shortest docs and see if it still runs? That would at least confirm whether doc length is the issue.
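A minimal sketch of that experiment, assuming your training data is in a `.spacy` `DocBin` file (the paths and the helper name `keep_shortest` are just for illustration):

```python
import spacy
from spacy.tokens import DocBin

def keep_shortest(in_path: str, out_path: str, n: int = 100) -> int:
    """Write the n shortest docs from in_path to out_path; return how many were kept."""
    nlp = spacy.blank("en")  # vocab only; no pipeline components needed
    docs = sorted(DocBin().from_disk(in_path).get_docs(nlp.vocab), key=len)
    DocBin(docs=docs[:n]).to_disk(out_path)
    return min(n, len(docs))
```

Then point your training config at the new file and see whether the run completes.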
One possible option is to try a different suggester function than the n-gram suggester, since it blows up the number of candidate spans on long documents.
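For example, spaCy ships an `ngram_range_suggester` that lets you cap the span width. A sketch of the config change, assuming your span categorizer component is named `spancat`:

```
[components.spancat.suggester]
@misc = "spacy.ngram_range_suggester.v1"
min_size = 1
max_size = 3
```

Keeping `max_size` small limits how many candidates are generated per token, which helps memory usage on long docs.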
Is there any way you can break up your transcripts? I know transcripts often lack sentences/punctuation, but even a few simple rules might work. A clever way to segment your data may help your model more than a different suggester would, though.
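As one illustration of "simple rules", here's a sketch of a custom pipeline component that starts a new pseudo-sentence after common filler words. The component name and the filler-word set are just assumptions; tune them to your data:

```python
import spacy
from spacy.language import Language

@Language.component("transcript_segmenter")
def transcript_segmenter(doc):
    # Mark a token as a sentence start if the previous token is a filler word.
    fillers = {"okay", "so", "right"}
    for i, token in enumerate(doc):
        token.is_sent_start = i == 0 or doc[i - 1].lower_ in fillers
    return doc

nlp = spacy.blank("en")
nlp.add_pipe("transcript_segmenter")
doc = nlp("okay we start here so then we moved on right that was it")
print([sent.text for sent in doc.sents])
```

Shorter segments like these keep the number of candidate spans per doc manageable, whichever suggester you use.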
As you have more questions on training, you may also want to check out spaCy's discussion forum. There are more posts on optimizing spaCy training there.
Hope this helps!