batch-train from the UI

ines · January 31, 2019, 8:24pm

Not really – the Prodigy UI is really just the annotation interface for labelling data. Batch training a model is a Python process that’s completely separate from the data collection, and it potentially takes a while, so there’s not really a logical path to integrating the two.

I guess if you wanted to, you could implement this in the stream generator – after X annotations, you fetch the current dataset, call the batch_train function on it and once you have an updated artifact, load it and swap out the existing nlp object. But there are a lot of open questions around this – for example, what do you do with annotations that are collected while your model is training? Are you going to keep track of those and then perform another update afterwards?

Btw, on a related note: Prodigy Scale comes with built-in training runs that you can set up in the app and run with the latest state of the dataset to keep track of the results. But this feature is also mostly designed to have continuous training experiments running to see how the annotations you collect improve the model.

Topic		Replies	Views
Can we use prodigy for labeling the data while model is not training at the back end usage , solved	2	457	November 8, 2018
Is it possible to run batch train on a file enhancement , usage , solved	3	533	January 31, 2019
Automatically run train command usage , database	2	313	May 20, 2021
How to get specific data samples to show up on the annotation UI usage , custom , streams	1	392	July 15, 2021
Prodigy Version 1.5.1 vs 1.4.2 api , solved	3	852	July 21, 2018

batch-train from the UI

Related topics