Hi @axsmarine!
Thanks for your message and welcome to the Prodigy community!
Have you seen my post on duplicate or missing data?
One thing I suspect it could be is work stealing. I noticed you have that in your config ("allow_work_stealing": false), but that setting wasn't implemented until v1.12.0. Since you're using v1.11.11, can you upgrade to v1.12.7 and retry?
But be aware: although work stealing may sound like something you don't want, it is a preventive mechanism to avoid losing records in a stream, which can be much worse than duplicates!
The docs explain this well:
Without work stealing you might be able to guarantee annotations occur at most once while accepting losing a few examples in the process. By enabling work stealing you ensure all examples will be annotated in the data stream at least once.
Unfortunately, the best answer is better training for your annotators: save annotations when they're done, then close the browser window.
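If a few duplicates still slip through after you retry, you can count them in an export. This is only a rough sketch: it assumes you have exported the dataset to JSONL (for example with prodigy db-out) and that each example carries the _task_hash field Prodigy adds. The file name and the helper names are made up for illustration.

```python
import json
from collections import Counter


def load_jsonl(path):
    """Read one JSON object per non-empty line."""
    with open(path, encoding="utf8") as f:
        return [json.loads(line) for line in f if line.strip()]


def find_duplicate_tasks(examples, key="_task_hash"):
    """Return {hash: count} for every hash that appears more than once.

    Examples without the key are skipped.
    """
    counts = Counter(eg[key] for eg in examples if key in eg)
    return {h: n for h, n in counts.items() if n > 1}


# Hypothetical usage, assuming an export named annotations.jsonl:
# duplicates = find_duplicate_tasks(load_jsonl("annotations.jsonl"))
# print(len(duplicates), "task hashes were annotated more than once")
```

Keep in mind that if several annotators are meant to see the same examples, repeated hashes across sessions are expected, so you may want to compare within each session instead.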
One other point (it won't contribute to your issue): "force_stream_order": true won't do anything, as it was deprecated in v1.11.0.
Hope this helps!