This is totally fine, you don't need to add such a settings file. I'm mainly asking because it's relevant to being able to reproduce your situation on my machine.
I fear that I do not see the same error as you do. I've taken your file, then I passed this command:
python -m prodigy textcat.teach example blank:en ./samplefile.jsonl --label POSITIVE,NEGATIVE
Using 2 label(s): POSITIVE, NEGATIVE
✨ Starting the web server at http://localhost:8080 ...
Open the app in your browser and start annotating!
Even when I annotate a few examples I do not see the same warnings that you see. Just to confirm: the subset that you gave me. Do you see the warnings appear on that small samplefile.jsonl subset?
Detail
One interesting thing is that it does seem to skip the first example. The one containing the 0xFEFF character. But I don't think this is currently related to your warnings.
Might be good to check
Just to check. When you annotate some examples, are you able to confirm that the annotations are saved as expected? When you run this command:
python -m prodigy db-out <name-of-dataset>
Do you see the annotations that you would expect? No examples missing?
Even when I annotate a few examples I do not see the same warnings that you see. Just to confirm: the subset that you gave me. Do you see the warnings appear on that small samplefile.jsonl subset?
Not actually, I didn’t try this sample data file.
Just to check. When you annotate some examples, are you able to confirm that the annotations are saved as expected? When you run this command:
Yes, I saved the annotations and successfully executed the db-out command
Do you see the annotations that you would expect? No examples missing?
Annotated file is there, same as I annotated but Prodigy combined it with a previous file, I don’t know how it happen. Just trying to figure it out and will let you know, if unsuccessful.
Thanks a lot for the timely support on every step.