Hi @rolisz!
Thanks for the heads up. This is interesting. It should happen by default since v1.11 (August 2021).
Can you show an example of your data? You can pull one from the database like this:
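from prodigy.components.db import connect
db = connect()  # connects using the database settings in your prodigy.json
all_dataset_names = db.datasets  # names of all datasets in the database
examples = db.get_dataset("dataset_name")  # replace with your dataset's name
examples[0]  # first saved example, including its metadata fields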
You can mask/change any fields. I'm more interested in seeing the _session_id or other fields.
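If it's easier than masking by hand, here is a rough sketch of how you could strip each example down to its metadata before sharing. The helper names (`summarize_examples`, `count_sessions`) are hypothetical, the sample records are made up to stand in for the output of `db.get_dataset`, and the fields kept besides `_session_id` are just my guess at what would be useful, so adjust them to whatever your data actually contains.

```python
from collections import Counter

# Fields to keep when sharing; everything else (text, spans, ...) is dropped.
# Assumption: adjust this tuple to the fields present in your own data.
KEEP_FIELDS = ("_session_id", "_view_id", "answer")


def summarize_examples(examples, keep=KEEP_FIELDS):
    """Return a copy of each example containing only the metadata fields in `keep`."""
    return [{key: eg.get(key) for key in keep} for eg in examples]


def count_sessions(examples):
    """Count how many examples were saved under each _session_id (None if missing)."""
    return Counter(eg.get("_session_id") for eg in examples)


# Made-up records standing in for db.get_dataset("dataset_name").
examples = [
    {"text": "private text 1", "_session_id": "my_dataset-alice",
     "_view_id": "classification", "answer": "accept"},
    {"text": "private text 2", "_session_id": None,
     "_view_id": "spans_manual", "answer": "accept"},
]

for summary in summarize_examples(examples):
    print(summary)
print(count_sessions(examples))
```

The per-session counts would also show quickly whether the missing values are limited to one recipe or interface.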
Have you noticed any other patterns with this problem? For example, you mentioned textcat and spancat -- does it work sometimes but not others?
The fact that it's happening is indicative of a bug that our testing is missing. Worst case, I'll log this and queue it up. We greatly appreciate you bringing this to our attention!