I am running the ner.correct recipe, but the same ~20 examples keep cycling (and when they reappear, they don't show the corrected labels).
I can confirm that this occurs with multiple models (both locally trained and pretrained). My input JSONL does not contain these repeats, and each line has only a text field. My ~/.prodigy/prodigy.json is empty. I'm on Prodigy 1.11.3.
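For anyone else hitting this who wants to rule out the input file, here's a minimal stdlib-only sketch for checking a JSONL source for duplicate texts (it assumes each line is a JSON object with a `text` field, as described above; the function name is just illustrative):

```python
import json
from collections import Counter

def find_duplicate_texts(jsonl_path):
    """Return {text: count} for any text that appears more than once in a JSONL file."""
    counts = Counter()
    with open(jsonl_path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if not line:  # skip blank lines
                continue
            counts[json.loads(line)["text"]] += 1
    return {text: n for text, n in counts.items() if n > 1}
```

If this returns an empty dict for your source file, the repeats are being introduced somewhere in the stream/session handling, not by the data.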
Thank you both for the reports. It looks like we shipped a bug along with a fix in version 1.11.3. This will be fixed in the next release (1.11.4), but in the meantime you can work around it by adding a ?session={session_id} query parameter.
So if you're running Prodigy locally on port 8080 (the default), then instead of doing your annotations at http://localhost:8080, do them at http://localhost:8080?session={session_id} (e.g. I use kab as my session ID: http://localhost:8080?session=kab).
We recommend using a session parameter whenever possible anyway, but we'll definitely get a fix out so that omitting the session works again.
I think the issue is still there in v1.11.6. I'm using a custom recipe based on ner.manual (I simply added one additional block). About 10% of the examples are shown repeatedly.
@chuanw24 Does it make a difference if you save the results to a new dataset? This particular issue is mysterious: we've tried all kinds of things and haven't been able to reproduce it, which makes it very tricky to debug.