Again thank you for taking the time to answer.
Yes, we are using default SQLite. We tried with no luck to make it work with an external db (Redshift). And maintaining a different external DB for this didn't seem like a good idea.
The other reason we are doing this is that we need is observability. That is why we need to keep in sync the progress made by different sessions with our external DB. We have dashboards to know what's been annotated, by whom and how.
I understand that saving annotations in the update method could be causing other issues for us (repeated examples across sessions even with feed_overlap = False).
Would you say that using some kind of schedule python library and the programmatic way of accessing the datasets could be the way to go?
Thank you.