Path to sense disambiguation for a new NER model

daqieq · October 15, 2021, 7:29pm

My NER model will have spans that apply to multiple NER labels. For example, 'grill' could be labelled as a 'cooking product' or as a 'car part'. This is similar to the "Michael Jordan" or "Washington" problem.

Is there a document that discusses moving towards sense disambiguation from a standard NER model? Is Entity Linking the right move for this situation?

SofieVL · October 19, 2021, 3:49pm

Hi! It sounds like perhaps your NER challenge isn't really a typical "entity recognition" challenge. It might be that the NER approach is still working for you, but just for reference I also want to point you to the new spancat implementation in spaCy, and the Prodigy docs here: https://prodi.gy/docs/span-categorization#ner-vs-spancat

Whether Entity Linking is appropriate, kind of depends on that as well. Entity Linking typically means resolving an ambiguous mention to a unique identifier, and that identifier typically refers to a real-world object/person. There is not a single unique "car part" in the world, so WSD works slightly differently. Though I do think that you could use similar algorithms for EL as for WSD.

Here's a different thought: I don't know how well that fits your data/use-case, but if you have a limited number of senses/domains, you could also work with a textcat component? If you've found the mention "grill" in a sentence that is classified as being in the domain "car" rather than "cooking", then your grill will be further defined by that class/domain.

Topic		Replies	Views
Annotation interface to do both SpanCat and NER ner , spancat	2	563	August 31, 2022
Overlapping Entities ner , solved	2	866	August 20, 2023
spans.correct recipe enhancement , done , spancat	4	919	August 17, 2021
Multi-label NER usage , ner	1	1626	April 25, 2021
Connecting named entities ner , spacy	1	413	October 29, 2020

Path to sense disambiguation for a new NER model

Related topics