@Sarah Hi! There's not really an easy solution, because it's more of a conceptual problem and there's no easy answer for how to treat HTML markup. I've also posted about this in more detail on this thread, and why it's difficult to render HTML if you're highlighting manually and planning on training a statistical model using the resulting data:
Related topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
NER manual on view id HTML | 1 | 867 | May 16, 2019 | |
Re-use UI elements | 8 | 959 | February 18, 2019 | |
About html ner extraction | 3 | 509 | June 15, 2021 | |
In "textcat" recipes, is it possible to format the to-be-annotated texts? | 7 | 626 | October 7, 2019 | |
Fully manual NER annotations without tokeniser
|
3 | 995 | June 17, 2020 |