For reference, I think this might have already been solved here: