I'm using prodigy 1.9.9. I don't want to see the same text to be selected by pattern and model in textcat.teach so I modified prodigy.json and added "exclude_by": "input". That didn't work. I still see the same text be showed twice, one with the pattern number, one with the model score.
I exported the data and I could see that the input hash was the same for these 2 records. What am I missing? One more detail. In order to test this easily. I only annotated 5 records. The batch_size was set to 10. I wonder whether the size of the data set caused the problem.