Exporting a NER model with training.jsonl & evaluation.jsonl

hariesh23 · June 2, 2020, 5:34am

Hi! I was wondering if it’s possible to export training.jsonl & evaluation.jsonl to the output directory after creating a NER model from scratch. The model I exported has the following: meta.json; ner/; tokenizer; vocab/, and the import works great. Many thanks!

ines · June 2, 2020, 2:03pm

If you're using the train recipe without a dedicated evaluation set, so that Prodigy just holds back a random sample for evaluation, it currently doesn't save those two sets out again as separate files.

Once you're serious about training and evaluation, you can use a separate Prodigy dataset for your evaluation examples, and pass that in as the --eval-id. This also makes your experiments more stable and repeatable, because you're always evaluating on the same data. You can later save out the training and evaluation set using the db-out command.
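
If you want both files to sit next to the exported model, something like the following could work. This is only a rough sketch: it assumes you've already exported the training and evaluation datasets to two JSONL files with db-out, and the function names and paths are made up for illustration, not part of Prodigy.

```python
import json
from pathlib import Path


def read_jsonl(path):
    """Read one JSON object per non-empty line."""
    with Path(path).open(encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]


def write_jsonl(path, examples):
    """Write one JSON object per line."""
    with Path(path).open("w", encoding="utf-8") as f:
        for eg in examples:
            f.write(json.dumps(eg, ensure_ascii=False) + "\n")


def save_next_to_model(train_export, eval_export, model_dir):
    """Hypothetical helper: store both exports in the model directory
    as training.jsonl and evaluation.jsonl, and return the example counts."""
    model_dir = Path(model_dir)
    model_dir.mkdir(parents=True, exist_ok=True)
    counts = {}
    for name, src in (("training.jsonl", train_export),
                      ("evaluation.jsonl", eval_export)):
        examples = read_jsonl(src)  # also checks that every line is valid JSON
        write_jsonl(model_dir / name, examples)
        counts[name] = len(examples)
    return counts
```

Calling save_next_to_model("train_export.jsonl", "eval_export.jsonl", "./model") would then leave training.jsonl and evaluation.jsonl alongside meta.json, ner/ and the rest of the model directory.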

If you use the data-to-spacy recipe to convert your dataset to a JSON-formatted training file for spaCy, you can also specify an --eval-split and Prodigy will shuffle the examples and save out 2 separate files: a training file and an evaluation set (e.g. if you set --eval-split 0.2, 20% of examples will become the evaluation set).
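
To make the shuffle-and-split idea concrete, here is a minimal sketch of a reproducible 80/20 split. It is not Prodigy's actual implementation; split_examples and its seed argument are assumptions for illustration only.

```python
import random


def split_examples(examples, eval_split=0.2, seed=0):
    """Shuffle a copy of the examples and hold back a fraction for evaluation.

    Returns (train_examples, eval_examples). A fixed seed gives the same
    split on every run, so scores stay comparable between experiments.
    """
    examples = list(examples)  # copy, so the caller's list is left untouched
    random.Random(seed).shuffle(examples)
    n_eval = int(len(examples) * eval_split)
    return examples[n_eval:], examples[:n_eval]
```

With 10 examples and eval_split=0.2, this returns 8 training examples and 2 evaluation examples, and running it again with the same seed returns the same two lists.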

hariesh23 · June 2, 2020, 7:21pm

That's awesome - thanks, @ines!
