ayaniwa
(Ayana Niwa)
March 12, 2020, 11:10am
1
Hi there,
Is there any way to modify already annotated texts after closing the sessions on Prodigy?
The present steps are:
(1)annotate some text as below:
(2)close the session
(3)after reopening the session, total number remains 5, but there is no history and I cannot go back to previous text.
I would like to know how to go back to previous text like (1) without db-out
exporting.
I read this QA but I need to edit previously saved annotations with simple and quick GUI operations.
Thanks!
ines
(Ines Montani)
March 12, 2020, 2:09pm
2
Hi! See my answer on this thread for details:
Datasets in Prodigy are append-only by design: you typically don't want to overwrite existing records, because that means you'd lose a datapoint you've collected. And it'd also make it too easy to erase work. Instead, you can re-annotate and correct the data, and save the results to a new dataset. If you make a mistake, you still have the previous data and can start again.
Prodigy's input and output formats are the same – so you can always export a dataset and load the data back in. For example, if you load a manually-annotated NER dataset back into ner.manual
, the entities will be pre-highlighted and you can correct them.
If it's possible to automate some of the changes, that's great, too – for instance, if you removed label X
from your label scheme, you can iterate over the "spans"
and remove all entries that contain "label": "X"
before you send them out for correction again.
If you have conflicting annotations that you want to resolve to one final "master corpus", you can also use the review
recipe. It takes one or more datasets with one or more sessions and will group annotations on the same input together. So if annotator A has labelled a span and annotator B hasn't, you can see both and decide what the correct answer is (or even label something entirely different by hand).
1 Like