NER tag capitalization question

Hi @BenHolmes,

Thanks for your question and welcome to the Prodigy community :wave:

By "tags", I assume you mean label names.

Do you have the exact commands you ran during annotation?

In future, please include the full Prodigy commands you ran (and ideally your Prodigy version too), as this makes debugging much easier.

Yes, but it sounds like you used lowercase labels during annotation, since the labels in your data are lowercase (i.e., when you ran prodigy ner.manual ... you passed lowercase labels). How did you run training? Can you provide the full command (e.g., prodigy train vs. spacy train)?

I suspect you're running into this issue:

Probably your best bet now is to export your data, write a Python script to change the labels from lowercase to uppercase in your .jsonl file (e.g., try ChatGPT; it works well for generic Python scripting if you give it an example input and an example output), then reload your data with db-in. Sorry for the hassle, but as mentioned in that previous post, it's best to always annotate with capital letters for --label.
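
In case it's useful, here is a minimal sketch of such a script. It assumes your exported file has one JSON object per line and that each NER example stores its annotations under a "spans" list whose entries have a "label" key (the usual ner.manual output). The function names and file names are just placeholders I made up, so adjust them to your setup and check the result on a few lines before reloading.

```python
import json
from pathlib import Path


def uppercase_span_labels(task):
    """Upper-case the "label" of every span in one annotation task.

    Assumes the ner.manual-style layout: task["spans"] is a list of dicts,
    each with a "label" key. Tasks without spans are returned unchanged.
    """
    for span in task.get("spans") or []:
        if isinstance(span.get("label"), str):
            span["label"] = span["label"].upper()
    return task


def convert_file(src, dst):
    """Read JSONL from src, upper-case span labels, write JSONL to dst."""
    with Path(src).open(encoding="utf8") as fin, \
            Path(dst).open("w", encoding="utf8") as fout:
        for line in fin:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            task = uppercase_span_labels(json.loads(line))
            fout.write(json.dumps(task, ensure_ascii=False) + "\n")


# Example call (hypothetical file names):
# convert_file("annotations.jsonl", "annotations_upper.jsonl")
```

The rough workflow would be: export the dataset with db-out, run the script on the exported file, then load the converted file into a new dataset with db-in so the original annotations stay untouched. Note that the script only changes the span labels; if your examples also carry labels elsewhere (for instance a top-level "label" key from another recipe), you'd need to handle those separately.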

Hope this helps!