After doing annotations with
textcat.teach, I wanted to output the data so I could train my model.
When doing a
db-out, I get 247 positive matches for my label, but when doing
data-to-spacy I only get 60. It's the same dataset in both cases so I don't quite understand why the results differ here.
For another label there is 53 matches with
db-out and 39 with
Exemplary commands I use are:
prodigy db-out nar_5_proration ./nar_5_pro.jsonl
prodigy data_to_spacy spacy_data_nar5 --lang "en" --textcat nar_5_proration
Am I missing something?
Thanks in advance for your help!