Issues with custom matchers for NER

ljvmiranda921 · October 29, 2021, 5:10am

So what happens here is that the detected tokens are actually ["50", "%"], and a pattern that describes one token with a regex wouldn't match. I'd recommend that you write a pattern that covers both.

It might also be useful to implement your own matching logic using regex over the whole text (Create new entities from regex - #3 by ines) . Just ensure that you won't have any overlapping spans

The advantage of the latter approach is that you have total control of the implementation logic, and write some heuristics where, for example, you only take the longest span if there are any overlaps.

Topic		Replies	Views
ner.match error with exact string patterns enhancement , usage , ner , done	8	762	June 12, 2018
Error while using ner.match for pattern matching usage , ner , solved	8	897	October 13, 2018
ner.match ner , spacy , solved	17	703	January 7, 2020
NER or PhraseMatcher? ner , spacy , best-practices	17	6093	September 20, 2018
Train a new NER entity with multi-word tokens usage , ner , solved	15	9674	September 10, 2019

Issues with custom matchers for NER

Related topics