Segmentation and newlines in ner.manual

Probably you could combine the ↵ indicator and a real line break, i.e. render ↵ instead of \n (as now) and add a visual-only line break. This looks like a good trade-off (and could easily be made configurable in prodigy.json, for example).


When I create a custom recipe based on ner.manual, the output explicitly shows newline characters when rendered. For my current task it would be better if the text were just displayed normally without visible newlines, and the rendered document actually just continues on the next line. Is this a simple configuration change I’m missing? Apologies in advance if I’m overlooking the answer in an obvious place in the docs or on the forum (although I did search both, I swear!).

@KMLDS I merged your thread onto this one, because I remembered the newline discussion here – but it was really hidden away in the comments.

See my comments above for some background on why ner.manual in particular needs at least some character-based representation of newlines. However, I’ve been experimenting with the solution I suggested above, which is to add a line break after the ↵ indicator.

How relevant are the newlines or newline tokens to what you’re doing? If you don’t need them in your training data, one solution could be to add a preprocessing step that removes them from your text. (Just keep in mind that you probably want to preprocess your runtime inputs the same way, especially if you’re using spaCy, which will preserve double whitespace as individual tokens. If your model was trained on data that never included whitespace tokens, and it suddenly encounters them at runtime, this might lead to unexpected results.)
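
A minimal sketch of such a preprocessing step could look like this (the file names are placeholders, and the exact normalization is up to you):

import json
import re

def normalize_whitespace(text):
    # collapse newlines and other whitespace runs into single spaces
    return re.sub(r"\s+", " ", text).strip()

# apply the same normalization to training data and runtime inputs
with open("raw_input.jsonl") as f_in, open("preprocessed.jsonl", "w") as f_out:
    for line in f_in:
        task = json.loads(line)
        task["text"] = normalize_whitespace(task["text"])
        f_out.write(json.dumps(task) + "\n")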

Thanks @ines - I assumed that was the reason for displaying whitespace characters. In my case, I will have a couple of subject matter experts labeling documents in a format that’s familiar to them. The important thing I’m missing with the current rendering is the visual cues from paragraph breaks, bulleted lists and the like. If I either just remove the whitespace or break my training examples down into smaller chunks (e.g. just showing the text between ‘\n\n’ tokens), it will take them much longer to go through the documents we want to label.

For later modeling efforts on this task there is no semantic difference between ‘\n’ and ‘ ’, and it doesn’t really matter to me if trailing or preceding ‘\n\n’ tokens are captured (I can just remove them from the training data or model outputs; they have no importance to the task at hand).

@KMLDS Thanks for sharing your use case – that’s pretty interesting and I see your point about formatting the data as a list or adding other visual hints. Something similar also came up on this thread. I’ve shared some of my thoughts and the possible complications around this on that thread as well. I still don’t have a perfect solution in mind, but I’m sure we can come up with something that works across use cases!

Hi @ines, I’m coming across the same issue as @KMLDS - basically that the visual line breaks in the text provide useful cues during the labeling process with a ner.manual recipe. For me, it would be great to have both the ↵ and the visual line break in the web app.

Quick update: I tested the "↵ plus line break" solution and it's been working well – so we will be able to ship this update with the next release :tada:

I've also been experimenting with solutions for use cases like this one and how to allow adding more visual cues to the manual interface:

In the upcoming version, you'll be able to mark individual "tokens" in the input data as "disabled": true. This will render them in grey and prevent the user from selecting those tokens (or any text spanning across them). Disabled tokens can be used for whitespace characters, list bullets and other tokens purely intended for formatting, and they can also help the annotator identify what's important more quickly. You can also use them to prevent highlighting mistakes (e.g. by disabling all newline tokens so that entities can't span two paragraphs). The "disabled" property can also make it easier later on to separate annotator-only markup from the annotated text.
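
For example, a pre-tokenized task with a disabled newline token might look something like this (a hand-constructed sketch, not output from a real recipe; the offsets match the example text):

{"text": "Apples\nOranges", "tokens": [{"text": "Apples", "start": 0, "end": 6, "id": 0}, {"text": "\n", "start": 6, "end": 7, "id": 1, "disabled": true}, {"text": "Oranges", "start": 7, "end": 14, "id": 2}]}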


@blakey @KMLDS @menshikh-iv @bhanu

Just released v1.5.0, which includes the fixes I described above:

  • Newlines in manual mode are now rendered as ↵ plus line break.
  • To disable this behaviour (e.g. if your text contains lots of newlines like in this example), you can set "hide_true_newline_tokens": true (see the config sketch after this list).
  • You can now mark individual tokens as "disabled": true, which will render them in grey and prevent the user from selecting them. This may be a nice solution for use cases similar to the one described by @KMLDS, where the text should be enhanced with formatting markup (lists, line breaks etc.) to help the annotator.
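
For reference, a minimal prodigy.json sketch with that setting could look like this (assuming all other settings keep their defaults):

{
    "hide_true_newline_tokens": true
}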

Hi Ines, I am using Prodigy 1.5.1 and still seeing the ↵ characters without a line break for manual labeling. Is there a configuration I need to set or should this work by default?

Thank you!
Erik

@ecallen7979 Hmm, that’s strange – let me look into this! It should work without requiring additional config, but maybe something isn’t running as expected here.

Thank you!

@ecallen7979 I just tested it and I can’t seem to reproduce this :thinking: Do you have an example text?

This is definitely the expected rendering:

The "hide_true_newline_tokens" settings lets you enable hidden newlines in your config, but it should default to false.

Hi Ines,

I’ve got a similar situation. This JSON example:

{"text": "\nGSR-1-PE-5# show controller fia\n\nFabric configuration: 10Gbps bandwidth (2.4Gbps available), redundant fabric\n\nMaster Scheduler: Slot 17 Backup Scheduler: Slot 16\n\nFab epoch no 0 Halt count 0\n\nFrom Fabric FIA Errors\n\n\-----------------------\n\nredund overflow 0 cell drops 0\n\ncell parity 0\n\nSwitch cards present 0x001F Slots 16 17 18 19 20\n\nSwitch cards monitored 0x001F Slots 16 17 18 19 20\n\nSlot: 16 17 18 19 20\n\nName: csc0 csc1 sfc0 sfc1 sfc2\n\n\-------- \-------- \-------- \-------- \--------\n\nlos 0 0 0 0 0\n\nstate Off Off Off Off Off\n\ncrc16 0 0 0 0 0\n\nTo Fabric FIA Errors\n\n\-----------------------\n\nsca not pres 0 req error 0 uni fifo overflow 0\n\ngrant parity 0 multi req 0 uni fifo undrflow 0\n\ncntrl parity 0 uni req 0\n\nmulti fifo 0 empty dst req 0 handshake error 0\n\ncell parity 0\n\nGSR-1-PE-5# attach 1\n\nEntering Console for Modular SPA Interface Card in Slot: 1\n\nType "exit" to end this session\n\nPress RETURN to get started!\n\nLC-Slot1>en\n\nLC-Slot1# test fab\n\nBFLC diagnostic console program\n\nBFLC (? for help) [?]: qm_sanity_debug\n\nQM Sanity Debug enabled\n\nBFLC (? for help) [qm_sanity_debug]:\n\nSLOT 1:02:54:33: ToFAB BMA information\n\nSLOT 1:02:54:33: Number of FreeQs carved 4\n\nSLOT 1:02:54:33: Pool 1: Carve Size 102001: Current Size 102001\n\nSLOT 1:02:54:33: Pool 2: Carve Size 78462: Current Size 78462\n\nSLOT 1:02:54:33: Pool 3: Carve Size 57539: Current Size 57539\n\nSLOT 1:02:54:33: Pool 4: Carve Size 22870: Current Size 22870\n\nSLOT 1:02:54:33: IPC FreeQ: Carve Size 600: Current Size 600\n\nSLOT 1:02:54:33: Number of LOQs enabled 768\n\nSLOT 1:02:54:33: Number of LOQs disabled 1280\n\nSLOT 1:02:54:33: ToFAB BMA information\n\nSLOT 1:02:54:33: Number of FreeQs carved 4\n\nSLOT 1:02:54:33: Pool 1: Carve Size 102001: Current Size 102001\n\nSLOT 1:02:54:33: Pool 2: Carve Size 78462: Current Size 78462\n\nSLOT 1:02:54:33: Pool 3: Carve Size 57539: Current Size 57539\n\nSLOT 1:02:54:33: Pool 4: Carve Size 22870: Current Size 22870\n\nSLOT 1:02:54:33: IPC FreeQ: Carve Size 600: Current Size 600\n\nSLOT 1:02:54:33: Number of LOQs enabled 768\n\nSLOT 1:02:54:33: Number of LOQs disabled 1280\n\nQM Sanity Debug disabled\n\nBFLC (? for help) [qm_sanity_debug]: qm_sanity_info\n\nToFab QM Sanity level Warning\n\nFrFab QM Sanity level None\n\nSanity Check is triggered every 20 seconds\n\nMin. "}

Renders like this:

Running Prodigy 1.5.1 with "hide_true_newline_tokens" explicitly set to false.

Thank you

Hi there

I am having the very same issue, with newline \n characters not rendering as line breaks. It actually looks the same as @bboris’s example.
@bboris, did you find a solution?

Prodigy version is 1.8.1

It seems like the problem in those cases is that the newlines aren’t separate tokens but rather part of a token. For example, you might have two newlines in one token. One simple thing you could try is to preprocess the text and add a space between the \n\n to ensure they become separate tokens.
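
A minimal sketch of that idea (the file names are placeholders; the lookahead also handles runs of more than two newlines):

import json
import re

with open("input.jsonl") as f_in, open("preprocessed.jsonl", "w") as f_out:
    for line in f_in:
        task = json.loads(line)
        # put a space between consecutive newlines so the tokenizer
        # produces separate \n tokens instead of one \n\n token
        task["text"] = re.sub(r"\n(?=\n)", "\n ", task["text"])
        f_out.write(json.dumps(task) + "\n")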

Hi Ines,

I have come across the same problem mentioned in this thread: Prodigy doesn't display consecutive newlines when I have '\n\n'. A single newline does work.

While preprocessing the text, I added a space between the newlines, so they are now '\n \n', but Prodigy still doesn't show the line break. Could it be that this is still being converted to one single token?

Prodigy version is 1.6, using the recipe ner.manual with "hide_true_newline_tokens": False

Thanks

That's possible, yes! :disappointed:

I just tested it locally and the following works for me:

import spacy
from spacy.util import compile_infix_regex

nlp = spacy.load("en_core_web_sm")
# add \n as an infix pattern so it gets split off within a string
infixes = nlp.Defaults.infixes + (r"\n",)
infixes_regex = compile_infix_regex(infixes)
nlp.tokenizer.infix_finditer = infixes_regex.finditer
doc = nlp("Hello\n\nworld")
print([token.text for token in doc])  # ['Hello', '\n', '\n', 'world']

# save the model, including the modified tokenizer, to a directory
nlp.to_disk("/path/to/updated-model")

This adds a rule to the tokenizer treating \n as an infix, so it will be split off if it occurs within a string. Modifications to the nlp object's tokenizer will be serialized with it when you save it to a directory. You can then use that directory as the input model in Prodigy instead of en_core_web_sm etc., and your custom tokenization will be applied to the incoming text.

(This is btw one of the reasons why the ner.manual recipe takes a model for tokenization – it should make it a bit easier to load in custom models with modified rules.)

Thanks! I should run this Python script once, and then the model "en_core_web_sm" will always tokenize '\n' as a single token, right? And for annotating, I just call the recipe with the same model name "en_core_web_sm"?

Yes, ideally you'd be running this script once to save out a new custom model with updated tokenization. That model will be saved to a directory – in my example, I used a dummy path /path/to/updated-model. Instead of en_core_web_sm, you'd then pass that model directory to Prodigy:

prodigy ner.manual your_dataset /path/to/updated-model your_data.jsonl --label LABEL_ONE,LABEL_TWO

Thanks Ines. I can confirm this solved the issue, and a double \n is now rendered with a line break in the Prodigy interface.

However, I was getting an error extending the infixes with the tuple concatenation above (in my spaCy version, nlp.Defaults.infixes is a list), so I changed it to:

infixes = nlp.Defaults.infixes.copy()
infixes.append(r"\n")