Getting annotated output from ner.match

daniyalSelani · November 7, 2019, 4:32am

Im trying to get annotated documents vetted from experts using ner.match. I want to get the output of the ner.match session as a parsable document. Is there a way to make a custom recipe. or edit the ner.match recipe to get an output in a .json file in the following format:
[{'text': text1, 'annotations': {'annotation term': term, 'span': (n1, n2)}, 'positive': True/False}, .....]

ines · November 8, 2019, 10:06am

Prodigy's db-out command (or Python database API – see the PRODIGY_README.html for details) lets you download the annotations as a JSONL file (or list of dicts). You can then convert that to any format you need using a custom script etc.

Each example will have an "_input_hash" propery, which makes it easy to find different annotations on the same text. So you can combine all examples with the same input hash. Examples also include a list of "spans" (the highlighted entities) and an "answer" (whether you accepted or rejected the suggestion). So creating the format you need should be pretty straightforward in a few lines of code

Topic		Replies	Views
Create a jsonl pre-populated with annoatations from .txt file usage , ner	4	1066	March 1, 2021
Using a handmade annotation file for model training ner , best-practices	3	1626	June 22, 2018
Creating a custom recipe to integrate bespoke model usage , ner , custom , solved	3	713	November 12, 2019
Prodigy present text with no matching pattern (ner.manual) usage , ner , solved	5	458	April 12, 2020
Edit saved annotations ner , solved	4	1372	March 2, 2018

Getting annotated output from ner.match

Related topics