@Sarah Hi! There's not really an easy solution, because it's more of a conceptual problem and there's no easy answer for how to treat HTML markup. I've also posted about this in more detail on this thread, and why it's difficult to render HTML if you're highlighting manually and planning on training a statistical model using the resulting data:
Related Topics
Topic | Replies | Views | Activity | |
---|---|---|---|---|
About html ner extraction | 3 | 465 | June 15, 2021 | |
Fully manual NER annotations without tokeniser
|
3 | 928 | June 17, 2020 | |
Bad formatting in gui for manual tagging | 7 | 849 | March 22, 2019 | |
How to train custom NER with preannotations? | 2 | 312 | July 28, 2021 | |
Using ner.manual on HTML Input | 3 | 2679 | October 12, 2018 |