For reference, I think this might have already been solved here:
Related topics
| Topic | Replies | Views | Activity | |
|---|---|---|---|---|
| Add tokenization rule | 4 | 750 | May 15, 2020 | |
| Custom Tokenizer help | 1 | 330 | December 23, 2022 | |
| Off-track use of Prodigy/Spacy - Custom Regex Pattern Matching and Modeling | 35 | 7695 | February 4, 2019 | |
| Match patterns without creating huge files | 5 | 1113 | March 21, 2019 | |
|
character-based Matching
|
3 | 678 | July 28, 2020 |