Skip Functionality

cheyanneb · September 26, 2022, 5:13pm

I was wondering if there was a way to build a skip option in addition to, or in place of, ignore. ignore, afaik, doesn't present skipped documents to the annotator again, and instead allows me (the admin) to gather the ignored documents and create a new task.

Use case: we have annotators who have questions about tricky cases, and since our annotators often work off-hours, there may a delay in a response back to them. It would be great if they could skip those and be presented with them again at the end.

Thanks!
Cheyanne

koaning · September 27, 2022, 11:47am

If examples are tricky but deserve another look, wouldn't it be best to allow users to flag examples? Flagged examples can be queried with the db-out command for re-use by setting the --flagged-only flag.

There's a Prodigy Short that explains how to set this up too.

cheyanneb · September 27, 2022, 12:44pm

We already flag them and discuss them, but I wanted annotators to be able to revisit them later because sometimes there is a clear answer by the end of the task.

koaning · September 28, 2022, 12:42pm

Prodigy doesn't allow too much interaction with the database, as explained here, because it easily gets messy. If users are able to make changes to annotations, you probably also need a way to track who made what change and when.

So instead, here's how I've dealt with this in the past. I make two datasets, say ner_v1 and ner_v2. When I start annotating, everything goes into ner_v1. I'm fully aware that this v1 data will be a first draft. Many annotations are correct, but some might need to change later after understanding the problem better.

Then, once there are a few flagged examples, or when some bad labels have been detected, I re-label the relevant candidates and move these annotations to ner_v2.

Then, when it's time to make a model, I have a custom script that gets the examples from ner_v1 and ner_v2. If an example appears in both sets, I always prefer the annotation from ner_v2. This gives me a final dataset that can be used to train a model.

Other people might have another way to handle their data, but for my projects, this approach has worked quite well.

Topic		Replies	Views
Skip annotation and annotate later usage , ner	2	417	May 23, 2022
Undesirable "ignore" examples build up with low quality input streams enhancement	5	1762	September 27, 2022
Reviewing Ignored Cases enhancement , usage , textcat , done , review	14	1260	July 28, 2023
Review recipe: Ignore for now, but go over later. usage , ner , solved , review	2	442	January 21, 2023
Multi-user sessions and excluding annotations by session enhancement , usage , streams	7	1679	December 25, 2019

Skip Functionality

Related topics