Disable automatic selection of full word when using ner.correct recipe

magdaaniol · February 23, 2024, 9:32am

If you skip these words from processing, you won't be able to extract any information from it. If you believe that the information you're after appears correctly tokenized in other parts of text and you would be getting enough training examples despite ignoring mistokenized words then it should OK to ignore them.

This thread is in-depth discussion of such "agglutinations" - it might be of interest to you as well.

Topic		Replies	Views
How to make more specific selection? usage , ner	1	250	January 18, 2023
NER with commas in the word through ner.correct	1	381	September 12, 2022
Multiple issues with character based annotation bug , ner , front-end	3	733	July 22, 2021
ws vs disabled usage , ner , front-end	5	647	November 9, 2020
ner.manual gives ValueError: Mismatched tokenization. usage , ner , solved	9	1415	August 1, 2019

Disable automatic selection of full word when using ner.correct recipe

Related topics