integration with guildai for storing runs

cristianmtr · July 20, 2020, 11:12am

Hey

I have been using prodigy for annotating and building models.

However, I am noticing that, as the need for training and comparing various models increases, I would like to be able to store these as guild runs.

(Guild is a framework for storing ML experiments https://my.guild.ai/t/get-started-with-guild-ai/35 , similar to MLFlow or Sacred).

Do you know how this could be done?

ines · July 21, 2020, 9:16am

Hi! I haven't used guild.ai myself but it does indeed look pretty similar to other experiment management tools, so it shouldn't be difficult to integrate It just depends on what you want to log, and where.

I just had a brief look at the docs and I didn't immediately find details on the "track without changing your code" part. But if it lets you "wrap" commands and capture the output, you could probably just run your Prodigy commands with it.

The train recipe is really just a Python function and it returns a (best_scores, baseline) tuple after each training run. You can see how it looks in recipes/train.py in your Prodigy installation. So if you want to log the final best accuracy, you could write a script that calls into train() with the respective arguments and then logs the result via Guild. You could also send other info, like the name of the Prodigy dataset used, the Prodigy version and all other settings

Not sure if it makes sense to log "annotation runs" in the same way, but it's definitely possible with a similar approach, using a custom recipe.

cristianmtr · July 22, 2020, 9:34am

Ah cool, thanks! That makes perfect sense. Didn't think of train as a recipe that could be customized

Topic		Replies	Views
Inquiries about Prodigy 1.12 and Future GPT-4 Integration ner , spacy , relations	1	304	July 10, 2023
Storing training scores? usage , training	2	353	January 14, 2022
Automatically run train command usage , database	2	313	May 20, 2021
Prodigy 1.12.0rc2 release candidate available for download! news	5	662	July 5, 2023
ML-FLOW Integration with prodigy usage , ner , training	1	228	September 12, 2022

integration with guildai for storing runs

Related topics