Hi, I ran into an issue when using data-to-spacy.
After exporting with data-to-spacy, when I run spacy debug data, I get the warning:
⚠ 411 training examples also in evaluation data.
Can this be caused by duplicate _input_hash values in the Prodigy database?
I've already made sure there are no duplicate _task_hash values in the database.
And could data-to-spacy be made to check for this and deduplicate examples before export?
I'm using Prodigy 1.11.6 and spaCy 3.2.0.
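
To illustrate the duplicate _input_hash check asked about above, here is a minimal sketch. It assumes the annotations have already been exported from the database and loaded as a list of dicts, each carrying the _input_hash and _task_hash keys. The function name find_duplicate_inputs and the sample records are hypothetical and only there for illustration; this is not what data-to-spacy does internally.

```python
from collections import defaultdict


def find_duplicate_inputs(examples):
    """Group examples by _input_hash and keep only hashes seen more than once.

    Hypothetical helper: `examples` is assumed to be a list of dicts
    that each contain an "_input_hash" key.
    """
    groups = defaultdict(list)
    for eg in examples:
        groups[eg["_input_hash"]].append(eg)
    return {h: egs for h, egs in groups.items() if len(egs) > 1}


# Made-up sample records: the first two share an input but differ in task hash.
examples = [
    {"text": "A", "_input_hash": 1, "_task_hash": 10},
    {"text": "A", "_input_hash": 1, "_task_hash": 11},
    {"text": "B", "_input_hash": 2, "_task_hash": 12},
]

duplicates = find_duplicate_inputs(examples)
for input_hash, egs in duplicates.items():
    task_hashes = [eg["_task_hash"] for eg in egs]
    print(f"_input_hash {input_hash} appears {len(egs)} times, task hashes: {task_hashes}")
```

If a check like this reports the same _input_hash under several _task_hash values, the same text was annotated more than once, which is one plausible way for it to end up in both the training and the evaluation split.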