Segmenting examples with long spans as NERs

honnibal · June 27, 2018, 11:06pm

As I mentioned in the last thread, I'm suspicious of using the entity recogniser for these long spans. I think you should try applying sentence labels instead, and perhaps also marking the words that are important for the category you're interested in. Then you can use the dependency parse to find the claim boundaries. The dependency parser is documented in the spaCy usage documentation, under Linguistic Features.
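To make the dependency-parse idea concrete, here's a minimal sketch of expanding a marked word to the span covered by its syntactic subtree. It's plain Python so it runs without a model: the sentence and the `heads` list are a hand-written toy parse, and `subtree_span` is a hypothetical helper, not part of spaCy or Prodigy. With a real spaCy parse you'd get the same information from `token.subtree`, or from `token.left_edge` and `token.right_edge`.

```python
# Toy parse (an assumption for illustration): heads[i] is the index of
# token i's syntactic head, and the root points to itself.
words = ["The", "company", "claims", "that", "profits", "rose", "sharply", "."]
heads = [1, 2, 2, 5, 5, 2, 5, 2]


def subtree_span(heads, i):
    """Return (start, end) token indices covering the subtree rooted at i.

    `end` is exclusive. The range is only guaranteed to contain exactly the
    subtree tokens when the parse is projective.
    """
    # Build a head -> children lookup, skipping the root's self-loop.
    children = {}
    for child, head in enumerate(heads):
        if child != head:
            children.setdefault(head, []).append(child)

    # Walk down from token i and collect every descendant.
    stack, seen = [i], []
    while stack:
        node = stack.pop()
        seen.append(node)
        stack.extend(children.get(node, []))
    return min(seen), max(seen) + 1


# Token 5 ("rose") stands in for a word marked as important for the category.
start, end = subtree_span(heads, 5)
claim = " ".join(words[start:end])
print(claim)
```

In practice you'd probably want to choose which head to expand from (for example, walking up from the marked word to the nearest clausal head), which is a judgment call for your data.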

You should get the same number of spans whether or not you set unsegmented, unless some spans cross segmentation boundaries. This sounds like it might be a bug; we'll look into it. At first glance the segmentation function looks correct, and it's passing our tests. But I'll play around with your sample and see if I can find the problem.
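As a rough illustration of why crossing spans matter (this is not Prodigy's actual segmentation code, just a sketch with made-up character offsets and a hypothetical `split_spans` helper): a span can only be carried over if it fits entirely inside one segment, so any span that straddles a boundary has nowhere to go.

```python
def split_spans(segments, spans):
    """Assign each (start, end) span to the segment that fully contains it.

    Returns (kept, dropped): `kept` has one list per segment, with offsets
    made relative to that segment; `dropped` holds spans that cross a
    segment boundary and so fit in no single segment.
    """
    kept = [[] for _ in segments]
    dropped = []
    for span_start, span_end in spans:
        for idx, (seg_start, seg_end) in enumerate(segments):
            if seg_start <= span_start and span_end <= seg_end:
                kept[idx].append((span_start - seg_start, span_end - seg_start))
                break
        else:
            # No segment contained the whole span.
            dropped.append((span_start, span_end))
    return kept, dropped


# Hypothetical offsets: two segments and three annotated spans, where the
# middle span straddles the boundary at character 40.
segments = [(0, 40), (40, 90)]
spans = [(5, 12), (35, 50), (60, 75)]
kept, dropped = split_spans(segments, spans)
print(kept, dropped)
```

If you count the spans before and after segmenting your own data, any difference should be explained by spans like the middle one here. If it isn't, that points to the bug.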
