Hi @PuneethaPai,
Yes, I think using spacy train
is better once you have a reasonably sized data set.
I think the easiest way is to use the spacy convert
command, which supports the jsonl format that Prodigy produces. So you should be able to just use prodigy db-out
, and then pass that file through spacy convert
. If you set the extension to .jsonl, it should select the correct converter automatically. But in case it doesn't, you can also specify it explicitly with --converter jsonl