Output Annotations in a dataframe form

konstantinidis.alexa · April 29, 2020, 1:39pm

Is it possible that the annotated documents through Prodigy are returned in this format?

ines · April 30, 2020, 9:39am

Sure – Prodigy gives you all the underlying information in JSON, and you can then set up your output however you like.

You can find an example of the format produced by the manual NER interface here: https://prodi.gy/docs/api-interfaces#ner_manual It includes the tokens and "spans" describing the annotated entities. You can then convert the entity offsets to IOB or BILUO tags and add one row per token to your dataframe.

See this section for how to convert chacter offsets to IOB or BILUO: https://prodi.gy/docs/named-entity-recognition#tip-offsets-biluo

konstantinidis.alexa · May 2, 2020, 9:11pm

Thank you Ines

Topic		Replies	Views
convert prodigy annotation file to iob format usage , ner , solved , transformers	2	2816	April 16, 2020
Ner format to CONLL usage , ner , solved	7	5365	June 4, 2019
NER Prodigy to IOB2 format usage , ner , spacy	1	1118	August 4, 2020
Creating a revised annotation dataset, from the output of another NER model usage , ner , solved	1	405	September 20, 2020
Getting annotated output from ner.match usage , ner	1	409	November 8, 2019

Output Annotations in a dataframe form

Related topics