Output Annotations in a dataframe form

Is it possible that the annotated documents through Prodigy are returned in this format?


Sure – Prodigy gives you all the underlying information in JSON, and you can then set up your output however you like.

You can find an example of the format produced by the manual NER interface here: https://prodi.gy/docs/api-interfaces#ner_manual It includes the tokens and "spans" describing the annotated entities. You can then convert the entity offsets to IOB or BILUO tags and add one row per token to your dataframe.

See this section for how to convert chacter offsets to IOB or BILUO: https://prodi.gy/docs/named-entity-recognition#tip-offsets-biluo

Thank you Ines