I started by creating a dataset with
prodigy dataset ...
and then annotated a batch of examples with my own labels using
prodigy ner.manual ...
I was initially planning to use the BERT model en_trf_bertbaseuncased_lg, but when I tried to run batch-train with:
prodigy ner.batch-train demo_v01 en_trf_bertbaseuncased_lg --output /tmp/model --eval-split 0.2 --dropout 0.2
I got the following error:
KeyError: "[E001] No component 'trf_tok2vec' found in pipeline. Available names: ['sentencizer', 'ner']"
Is there some missing import in Prodigy?
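For reference, here is a minimal sketch of the check the error message describes: comparing the component names a pipeline exposes against the one being requested. The helper `missing_components` is hypothetical (it is not part of Prodigy or spaCy), and the name lists are copied from the error above rather than read from a loaded model.

```python
def missing_components(pipe_names, required):
    """Return the required component names that are absent from pipe_names."""
    present = set(pipe_names)
    return [name for name in required if name not in present]


# Component names reported as available in the error message.
available = ["sentencizer", "ner"]

# Component the training recipe tried to look up, per the error message.
missing = missing_components(available, ["trf_tok2vec"])
print(missing)

# With spaCy installed, the real list for a loaded pipeline is nlp.pipe_names:
#   import spacy
#   nlp = spacy.load("en_trf_bertbaseuncased_lg")
#   print(nlp.pipe_names)
```

Running the commented-out spaCy lines on the model directly, outside Prodigy, would show whether trf_tok2vec is present when the model is loaded on its own.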