hi @alvaro.marlo!
Check out this previous post:
The "E" column is epochs, and it seems your dataset is so large (240,000+ training examples) that training never completes a full pass (epoch) over it.
I found this spaCy discussion post with advice on training large NER datasets. They recommended a few things, like creating a labels file (mostly to speed up the process), modifying the learning rate (this will need some experimentation), and reducing the size of the evaluation set. If you have further questions related to training, I'd recommend posting on the spaCy discussions, as the spaCy core team answers those posts (and `prodigy train` is really just a wrapper for `spacy train`, so your questions are really spaCy questions).
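For reference, the learning rate and evaluation frequency live in the training config that `prodigy train` hands off to `spacy train`. A minimal sketch of the relevant section (the values here are just illustrative starting points to experiment with, not recommendations):

```ini
# Fragment of a spaCy training config (config.cfg).
# Values are placeholders -- tune them for your data.
[training]
max_epochs = 10        # cap the number of passes over the data
eval_frequency = 2000  # evaluate less often on large datasets

[training.optimizer]
@optimizers = "Adam.v1"
learn_rate = 0.001     # experiment with smaller/larger values
```

Shrinking the evaluation set itself is done when you split your data, not in the config, but a higher `eval_frequency` also cuts down time spent evaluating.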