data-to-spacy for transformers

koaning · October 12, 2022, 1:24pm

This command should already take care of that:

prodigy data-to-spacy my_directory --ner my_dataset

If you're training a spaCy pipeline with a transformer then spaCy will take care of all the token translation on your behalf.

You can see me report on all of the required steps here. Since you mentioned you're running similar steps but on Colab ... that's why I'm thinking this might be spaCy issue on top of colab.

Topic		Replies	Views
Transform annotations to match tokenization required for SpanBERT/BERT spacy , transformers , spancat	19	1610	July 30, 2023
config.cfg for bert.ner.manual usage , ner , transformers	5	831	September 30, 2022
can I use prodigy train-curve with a transformers model? usage , spacy , transformers	2	606	June 11, 2021
Convert annotated Data To Spacy ner , spacy	1	324	May 4, 2023
SpaCy3 models evaluation on a custom dataset usage , spacy , solved , training	3	641	July 7, 2021

data-to-spacy for transformers

Related topics