I’m struggling to understand how to fit prodigy in the picture here, let’s say I want to use it as an annotation tool, but for pre-annotated text.
In the documentation I saw it’s possible to 1) annotate your text and 2) import your annotations, however I’m missing some blocks here.
Let’s take an example, I have the following text:
{'text':'This is a text talking about the strange behaviour of superconductor material Br2A2, which has Tc (Critical Temperature) = 23K'}
And the following annotations
{'text': 'Br2A2', 'label': 'supercon', 'spans': [{'start': 123, 'end': 456, 'label': 'supercon'}], 'meta': {'source': 'Semi-automatic generation supercon'}}
{'text': 'Tc (Critical Temperature)', 'label': 'abbreviation', 'spans': [{'start': 1000, 'end': 1020, 'label': 'abbreviation'}], 'meta': {'source': 'Semi-automatic generation abbreviation'}}
(FYI the offsets are just random)
Let’s suppose we have multiple set of text with the corresponding annotations.
How can this be loaded in prodigy and an user can correct that?
Here my questions:
- The
mark
recipt cannot work because I need also to load annotations. - The part where how to load annotations is explained, is not clear, because it is assumed that there is a model in spacy that we can use (https://prodi.gy/docs/workflow-first-steps#load-data), this is not my case
- at the same page: https://prodi.gy/docs/workflow-first-steps#import-annotations if I have multiple sets of annotations (for example coming from different files) how can I relate each text with their respective batch of annotations? Should I use the ‘meta’ / ‘source’? if so, how? with a recipe?
- I’d like to create a recipe and add it to the ‘registered’ ones but again, this doesn’t seems to be the way it has been designed, so I can use a recipe only for annotation, correct?
Thank you very much
Luca