Disable annotations for part of the text

ameliatqy · December 18, 2018, 4:42pm

Hello there,

I just want to ask if there is a way to disable part of the text for NER annotation in the Prodigy interface. For example, I want to annotate an article. I need to display the title (as the information in it is important for annotation) but I don’t want to annotate it. Is there a way to do this?

Thank you and appreciate your help!

ines · December 18, 2018, 7:23pm

You mean in the manual interface, right?

The manual interface supports marking individual tokens as "disabled": true, which will display them greyed out and will make them unselectable. Spans across disabled tokens will also be considered invalid.

So your input data could look like this:

{
    "text": "Hello Apple",
    "tokens": [
        {"text": "Hello", "start": 0, "end": 5, "id": 0, "disabled": true},
        {"text": "Apple", "start": 6, "end": 11, "id": 1}
    ]
}

To add tokens to your data, you can use Prodigy's add_tokens preprocessor (which is also used in the ner.manual recipe):

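import spacy
from prodigy.components.preprocess import add_tokens

# your_model: name or path of the spaCy model used for tokenization
nlp = spacy.load(your_model)
# your_data: iterable of examples with a "text" key; each gets a "tokens" list
stream = add_tokens(nlp, your_data)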
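
For the title case from the question, here is a minimal sketch of how the flag could be set once the tokens are there. It assumes the title sits at the start of "text" and that you know the character offset where it ends; the function name disable_prefix_tokens and the prefix_end argument are made up for illustration and are not part of Prodigy.

```python
def disable_prefix_tokens(task, prefix_end):
    """Return a copy of `task` where every token that ends at or before
    the character offset `prefix_end` is marked as disabled.

    `task` is expected to have a "tokens" list in the format shown above.
    This helper is hypothetical, not a Prodigy API.
    """
    tokens = []
    for token in task["tokens"]:
        token = dict(token)  # copy so the input example is not mutated
        if token["end"] <= prefix_end:
            token["disabled"] = True
        tokens.append(token)
    return {**task, "tokens": tokens}


example = {
    "text": "Hello Apple",
    "tokens": [
        {"text": "Hello", "start": 0, "end": 5, "id": 0},
        {"text": "Apple", "start": 6, "end": 11, "id": 1},
    ],
}

# Treat the first 5 characters ("Hello") as the title.
result = disable_prefix_tokens(example, prefix_end=5)
```

You would apply something like this to each example coming out of add_tokens, before the stream is sent to the manual interface.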

ameliatqy · December 19, 2018, 5:43pm

Thanks Ines for the quick reply and your helpful response! Your solution will work for me - I’ll let you know if I have any more questions.
