I'm following this tutorial - https://www.youtube.com/watch?v=59BKHO_xBPA - and have successfully run the annotation stage (with the difference being that I used my own data rather than the reddit data, and I called my dataset
school_data rather than
prodigy ner.manual school_data blank:en texts_containing_school_stuff.txt --label SCHOOL --patterns ./school_problems/school_pattern_file.jsonl
I've just tried running the next step, which is to run the following:
prodigy train ner school_data en_vectors_web_lg --init-tok2vec ./tok2vec_cd8_model289.bin --output ./tmp_model --eval-split 0.2
...and I'm seeing the following error:
Invalid config override 'school_data': name should start with --
I've triple-checked that the commands I've written are the same as those in the tutorial video (except for the name of my dataset of course) and I can't see what I've done wrong. I've searched for a similar support ticket and can't find one.
In case it helps, here is a pic of the annotation UI at the end of my annotation stage, showing the prodigy version at the bottom.
Can anyone help me to correct whatever I've done wrong?
PS - Thank you prodigy team for a great tool and useful video tutorials! I find them so much more useful than documentation (for getting started with a new tool at least)