I'm planning to run a binary textcat.teach
on a corpus of raw texts, and I'm a bit confused by the process.
$ prodigy dataset social-texts
✨ Successfully added 'social-texts' to database SQLite.
$ prodigy db-in social-texts ./data/social_text_data_1.jsonl
✨ Imported 10000 annotations for 'social-texts' to database SQLite
Added 'accept' answer to 10000 annotations
Session ID: 2019-11-21_13-14-54
These are raw texts that don't have any annotations yet, where one line of social_text_data_1.jsonl
is like:
{"text": "i can't believe the service on American Airlines! It's so terrible @aa #badflights"}
I'm confused as to this message upon loading the dataset: Added 'accept' answer to 10000 annotations
Is there a different way to load a corpus of raw texts for annotation that doesn't assume the examples are all 'Accept'?