I am creating some training data. In the real world sometimes the input strings are lower case, sometimes capitals and sometimes a mixture. The case doesnt matter to me, we just need the entities. Whats the best approach when training. Should I make a copy of the input strings in lower case and upper case?
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Keep case in annotation UI, but model case-insensitive | 1 | 423 | December 24, 2019 | |
| Testing ner.batch-train model:case-sensitive issue | 5 | 443 | October 22, 2019 | |
| Can't use upper-case label in patterns for ner.teach | 17 | 1590 | August 1, 2018 | |
| entity linking -- how to search candidates from knowledge base in case insensitive | 1 | 740 | February 12, 2021 | |
| Placing Data in One Dataset | 6 | 1748 | November 6, 2018 |