Default Prodigy NER Format to BERT Format

yaamin6236 · December 7, 2022, 7:39am

Hi, is there a way to export the default NER annotation format to a format that works with BERT? I saw some other posts similar to mine, but I wasn't sure whether there is a newer feature that lets me convert formats. I'm trying to reuse the annotations I already have to train a BERT model for NER.

koaning · December 7, 2022, 8:59am

Hi Yaamin,

have you seen this section of our docs?

In particular, it shows that if you want to use BERT in spaCy you can just use the annotations as-is. To quote the page:

New in Prodigy v1.11 and spaCy v3
spaCy v3 lets you train a transformer-based pipeline and will take care of all tokenization alignment under the hood, to ensure that the subword tokens match to the linguistic tokenization. You can use `data-to-spacy` to export your annotations and train with spaCy v3 and a transformer-based config directly, or run `train` and provide the config via the `--config` argument.
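
If you would rather train a BERT model outside spaCy, token-classification setups usually expect one BIO tag per word. Here is a minimal sketch of that conversion. It assumes each record is a Prodigy NER example with a `tokens` list and spans carrying `token_start` and `token_end` (with `token_end` inclusive), as in a `db-out` export of manual annotations. The helper name `prodigy_to_bio` and the sample record are made up for illustration and are not part of Prodigy.

```python
def prodigy_to_bio(example):
    """Convert one Prodigy NER example (a dict) to (tokens, BIO tags).

    Assumes spans carry token_start/token_end indices and that
    token_end is inclusive.
    """
    tokens = [tok["text"] for tok in example["tokens"]]
    tags = ["O"] * len(tokens)
    for span in example.get("spans", []):
        start, end = span["token_start"], span["token_end"]
        tags[start] = "B-" + span["label"]
        # Every following token of the same entity gets an I- tag.
        for i in range(start + 1, end + 1):
            tags[i] = "I-" + span["label"]
    return tokens, tags


# Hypothetical record shaped like one line of an exported JSONL file.
example = {
    "text": "Apple opens office in New York",
    "tokens": [
        {"text": "Apple", "start": 0, "end": 5, "id": 0},
        {"text": "opens", "start": 6, "end": 11, "id": 1},
        {"text": "office", "start": 12, "end": 18, "id": 2},
        {"text": "in", "start": 19, "end": 21, "id": 3},
        {"text": "New", "start": 22, "end": 25, "id": 4},
        {"text": "York", "start": 26, "end": 30, "id": 5},
    ],
    "spans": [
        {"start": 0, "end": 5, "token_start": 0, "token_end": 0, "label": "ORG"},
        {"start": 22, "end": 30, "token_start": 4, "token_end": 5, "label": "GPE"},
    ],
    "answer": "accept",
}

tokens, tags = prodigy_to_bio(example)
for token, tag in zip(tokens, tags):
    print(token, tag)
```

Note that these tags are per word, so a BERT tokenizer's subword pieces still have to be aligned to them on the training side. That alignment is the step spaCy v3 handles for you if you stay with the `data-to-spacy` route.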
