spancat with really large spans? (Identify sections in text)

I'm working with plain text job posts. These usually consist of several sections, like

  • COMPANY: description of the company and their mission
  • TASK: description of the role to be filled
  • SKILLS: description of the required skills (hard skills, soft skills, educational background, required certifications etc.)

Here's a pretty common example:

[example job post omitted]

Very often, the categories correspond, as in the example above, to large sections of the text (roughly 20% of the total document per section). Sometimes, though, not all of them are present. And sometimes they are a little mixed up -- say, two sentences of SKILLS here, then a paragraph of TASK there.

I figured a span categorizer would work best for this task, because categorization depends strongly on the surrounding context: one has to look at a fairly large window of words around the beginning and end of a span to determine whether it is a proper boundary, as well as which category the span belongs to.

The problem I'm running into, however, is that training consistently crashes. If I train on the CPU, the process gets killed (out of memory). Even using a cloud machine with 200+ GB of RAM does not change this.

And if I train on the GPU, even with 80 GB of GPU RAM, I get

CUDARuntimeError('cudaErrorIllegalAddress: an illegal memory access was encountered')

Based on what I found on this forum, I believe that perhaps my spans are just too large?

Is there a better approach I can take?

The essential problem I am trying to solve is for the model to reliably answer the question:

Give me everything that is being said in this text about TASK (i.e. what the person doing this job is going to be doing). And then give me everything that is being said in this text about SKILLS (i.e. what skills the company believes an applicant should have to perform in this role).

It's not an actual requirement that these be coherent sections of text, or that they don't overlap. I just tried it this way because I thought it was the easiest way to do the annotations, as well as the easiest way for the model to learn. At least with the latter, it seems, I was wrong.

Can you recommend a better approach?

Thank you.


Hey @leobg,
I've also been facing this issue with spancat for a long time, but I think even @ines doesn't have a solution for it :sweat_smile:


I wonder ... if the spans you're trying to detect are sometimes full sentences ... might it be easier to turn the problem into a classification problem instead? Spancat is indeed designed to handle longer spans than NER, but spans the size of multiple sentences are pushing it.
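
For context: spancat's default ngram suggester enumerates every candidate span up to a fixed length, so covering sections that run to hundreds of tokens means every one of those candidates has to be embedded and scored. A rough back-of-the-envelope sketch (the document length and suggester sizes here are made up):

    # Candidates the default ngram suggester would generate for one document:
    # roughly one span per start position per size.
    n_tokens = 500
    sizes = range(1, 201)  # needed to cover spans of up to ~200 tokens
    n_candidates = sum(max(n_tokens - size + 1, 0) for size in sizes)
    print(n_candidates)    # 80100 candidate spans for a single document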

Thanks @koaning.

Yes, I thought about turning the problem into a classification problem as well.

I see two downsides:

  1. It makes annotation harder for me as the human in the loop. Selecting three large sections per job post is easy. Annotating sentences one by one is hard.

  2. Whether something is a skill the employer requires or merely a description of the job sometimes cannot be determined from the structure or content of the sentence itself. That information often comes from where the sentence is located in the overall job post.

That doesn't mean it wouldn't be a viable way.

I've also thought about perhaps training the spancat just on the boundaries between my sections. Essentially asking:

What is the first sentence of TASK, if any?
What is the first sentence of SKILLS, if any?
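
Concretely, I imagine deriving that training data from my existing annotations along these lines -- just a sketch, assuming the docs have sentence boundaries set and the section spans live under the "sc" key:

    from spacy.tokens import Span

    def first_sentence_spans(doc, spans_key="sc"):
        """Shrink each full-section span down to its first sentence,
        so the model only has to learn where a section starts."""
        shrunk = []
        for span in doc.spans[spans_key]:
            sent = next(span.sents)  # first sentence overlapping the span
            start = max(sent.start, span.start)
            end = min(sent.end, span.end)
            shrunk.append(Span(doc, start, end, label=span.label_))
        doc.spans[spans_key] = shrunk
        return doc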

What do you think of that?

I'd be happy to try other approaches. But I wanted to first make sure that the training problems I ran into really are due to the length of my span annotations. And also that there isn't any "best practices" workaround for dealing with large spans.

Perhaps @ines has some thoughts on this?

Thank you all.

I found a dialogue on the forum that might be inspirational here.

It's a different problem, but it highlights another two-step approach to rethinking spans.

That said, reading your reply still makes me think that textcat might be the simplest way forward, albeit on paragraphs instead of sentences. While I like your idea of using NER to detect the start of a section, I wonder if you might be able to leverage the fact that a section always starts on a newline, which suggests a heuristic might be better than an ML model.
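
As a sketch of what I mean (the heading patterns below are made up and would need tuning to your actual job posts):

    import re

    # Hypothetical heading patterns -- tune these to your data.
    SECTION_HEADINGS = {
        "COMPANY": re.compile(r"^(about us|who we are|the company)\b", re.I),
        "TASK": re.compile(r"^(your (tasks|responsibilities)|the role)\b", re.I),
        "SKILLS": re.compile(r"^(your profile|requirements|skills)\b", re.I),
    }

    def split_sections(text):
        """Assign each paragraph to the most recently seen section heading."""
        sections, current = {}, None
        for para in text.split("\n\n"):
            first_line = para.strip().splitlines()[0] if para.strip() else ""
            for label, pattern in SECTION_HEADINGS.items():
                if pattern.match(first_line):
                    current = label
                    break
            if current is not None:
                sections.setdefault(current, []).append(para)
        return sections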

textcat might be the simplest way forward, albeit on paragraphs instead of sentences

Interesting. So not categorize sentences, but paragraphs. I like that idea!

I actually did something like that on a similar project, where I had to segment court rulings into the factual and the legal part. I got pretty good results with it, even though I only used a "dumb" bag-of-words type of fastText classifier.
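
In fastText terms, that kind of setup is only a few lines. A sketch (the file name and label scheme are placeholders):

    import fasttext  # pip install fasttext

    # Training file: one paragraph per line, prefixed with its label, e.g.
    # "__label__FACTS The plaintiff filed suit on ..."
    model = fasttext.train_supervised(input="paragraphs.train.txt", wordNgrams=2)
    print(model.predict("The court holds that the contract is void."))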

Thanks @koaning for looking around and finding this!

BTW, off-topic... but since you are the guys behind spaCy and Prodigy, shouldn't this forum be supercharged with some kind of AI assistant that automatically posts suggestions like you just did?

I was just thinking that you guys probably prefer spending your time coding rather than thinking about other people's problems -- especially when those problems have already been answered in the past.

I know that Discourse already does some crude form of suggesting existing topics. But understanding a question semantically, and fetching not just a matching thread from the past but also the most suitable post and paragraph from that thread as a potential answer, would be one step further.

Where else should this exist, if not here? :rocket:

shouldn't this forum be supercharged with some kind of AI assistant

That's one angle to think about it, but we also really like to be involved. The forum doesn't just offer a way for users to get solutions to their problems; it also gives us meaningful feedback and might even help us understand missing features of our products. Keeping a human in the loop makes for a better experience on both sides. I also think there's a real risk of building a bot that makes the experience much worse.

Related: are you aware of the spaCy discussions board? This forum is mostly meant for Prodigy questions; ever since GitHub released the discussions feature, we've moved some of the spaCy conversations there.

"dumb" bag-of-words type of fastText classifier.

Bag-of-words models are always a good benchmark to have around, so I wouldn't call them "dumb" :wink: . You might also want to have a look at the scikit-learn ecosystem, since it offers tf-idf tricks too. I should admit, though, that the bag-of-words approach might not work for every language out there, and English does seem to be one of the "easier" languages for this.
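
A minimal sketch of such a baseline with scikit-learn (the paragraphs and labels below are toy data):

    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.linear_model import LogisticRegression
    from sklearn.pipeline import make_pipeline

    # Toy examples -- real training data would come from your annotations.
    paragraphs = [
        "We are a fast-growing logistics company with a global mission.",
        "You will design and maintain our data pipelines.",
        "You have a degree in computer science and solid Python skills.",
    ]
    labels = ["COMPANY", "TASK", "SKILLS"]

    model = make_pipeline(TfidfVectorizer(ngram_range=(1, 2)), LogisticRegression())
    model.fit(paragraphs, labels)
    print(model.predict(["Strong communication skills and 3+ years of SQL."]))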


Hi,

Any news on this issue? I want to do more or less the same. Instead of identifying the whole skills or tasks paragraph from a job ad, I’d like to identify every single task. However, my training process gets killed too.

We have already collected 1,000 annotated job ads using the Prodigy spancat recipe. Do you know of any way to transform these annotations into BIO NER annotations, so that I can cast this as a token classification problem? We did this in the past and it worked -- at least it didn't crash.
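
Something like this is what I have in mind -- a rough sketch assuming Prodigy's standard JSONL output with token-aligned spans (the file name is a placeholder, and overlapping spans would need extra handling):

    import srsly  # ships with spaCy/Prodigy

    def spans_to_bio(example):
        """Convert one Prodigy span annotation into per-token BIO tags."""
        tags = ["O"] * len(example["tokens"])
        for span in example.get("spans", []):
            start, end = span["token_start"], span["token_end"]  # inclusive
            tags[start] = f"B-{span['label']}"
            for i in range(start + 1, end + 1):
                tags[i] = f"I-{span['label']}"
        return [tok["text"] for tok in example["tokens"]], tags

    # Export with "prodigy db-out your-dataset > annotations.jsonl" first.
    for eg in srsly.read_jsonl("annotations.jsonl"):
        words, tags = spans_to_bio(eg)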

Best,
Oliver

Hi Oliver.

Could you share more details on your CUDA error? How large are your documents? What base model are you using? I certainly wouldn't mind understanding this issue better. If you have any details on your hardware, that would also help.

I'm not familiar with BIO NER, so I can't provide much help there.

Could you share the train command that you've tried? Related: what happens when you train using a non-transformer model?
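
For example, something along these lines, with and without the GPU flag (paths are placeholders):

    python -m spacy train config.cfg --output ./output --gpu-id 0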