I'm trying to use the new spacy-nightly training pipeline. How can I convert a Prodigy dataset to a .spacy object? Thanks.
You can use Prodigy's data-to-spacy to convert one or more datasets to a JSON training file, and then convert that file to .spacy using spacy convert. Also see here:
Under the hood, the .spacy format is just a serialized DocBin object, so for full flexibility, you can also create Doc objects from your existing annotations and add them to a DocBin. See here for details: https://nightly.spacy.io/api/docbin
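For the DocBin route, here is a minimal sketch of what that could look like for NER annotations. It assumes spaCy v3 (spacy-nightly at the time) is installed and that your annotations have already been pulled out of Prodigy into simple (text, [(start, end, label)]) tuples. The EXAMPLES data, the build_docbin helper and the ./train.spacy output path are made up for illustration and are not part of either library.

```python
import spacy
from spacy.tokens import DocBin

# Hypothetical annotations: (text, [(start_char, end_char, label), ...]).
# Records exported from Prodigy would need to be mapped into this shape first.
EXAMPLES = [
    ("Apple is opening an office in Berlin.", [(0, 5, "ORG"), (30, 36, "GPE")]),
    ("Sarah joined Google last year.", [(0, 5, "PERSON"), (13, 19, "ORG")]),
]


def build_docbin(examples, lang="en"):
    """Create Doc objects with entity spans and collect them in a DocBin."""
    nlp = spacy.blank(lang)  # blank pipeline, used only for tokenization
    doc_bin = DocBin()
    for text, spans in examples:
        doc = nlp.make_doc(text)
        ents = []
        for start, end, label in spans:
            # char_span returns None if the offsets don't align with token boundaries
            span = doc.char_span(start, end, label=label)
            if span is not None:
                ents.append(span)
        doc.ents = ents
        doc_bin.add(doc)
    return doc_bin


doc_bin = build_docbin(EXAMPLES)
# Serialize to the binary .spacy format (example output path)
doc_bin.to_disk("./train.spacy")
```

Spans whose character offsets don't line up with token boundaries are silently skipped here, so you may want to log those cases instead. Other annotation types (text categories, tags and so on) would need to be set on the Doc in the same spirit before adding it to the DocBin.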