Hi, there's a built-in spacy converter that's intended for use with prodigy NER data:
spacy convert --lang en data.jsonl .
This should create data.json
in spacy's training format. You need to specify the language so that the converter can tokenize the texts.
See a more detailed example here: Unable to use Prodigy annotations with SpaCy CLI train