feature request: early stopping, learning rate

mhigginslp · March 20, 2018, 5:33pm

It would be nice to have more control over optimization with batch-train, especially the learning rate. Right now, If I run batch-train for 10 epochs and see that the model is still improving I can’t reload the saved model and continue training (efficiently) because the learning rates default is too large. I have no choice but to rerun training for more epochs.

Another nice-to-have would be early-stopping, where the model keeps training until there is no improvement over --wait epochs.

honnibal · March 21, 2018, 12:27pm

I agree that these things are nice. We used to write out the model after each epoch, but if the pretrained vectors are large this gets annoying.

I’m reluctant to make the command too complicated though. I suggest customising the recipe will work better for you.

You can set the learning rate by writing to the optimizer.alpha attribute within the recipe (i know, this should be named better…). Advice about spaCy’s hyper-parameters can be found here: https://spacy.io/usage/training#tips

Note that there are several settings that interact. In particular, the parameter averaging means later iterations have less impact on the model, which is a bit like annealing the learning rate. The adam solver, gradient clipping and batch size all interact too. I usually find the model isnt that sensitive to the learning rate if you dont change other settings. I actually dont usuaslly modify the LR, but maybe i should.

Topic		Replies	Views
epochs spacy , training	1	634	May 17, 2023
Comprehensive guide or course to model finetuning/training results spacy , solved , best-practices , training	2	480	June 23, 2022
Ner Training with Prodigy vs Spacy ner , spacy , best-practices	2	1209	July 2, 2020
ner.batch_train vs spacy nlp.begin_training ner , spacy	1	1098	January 26, 2018
ner.batch-train random Python has stopped/Segmentation Fault ner , done , windows	1	571	September 24, 2018

feature request: early stopping, learning rate

Related topics