textcat vs textcat_multilabel

ines · September 28, 2021, 8:10am

The problem with this is that in practice, you still only have one label (Sectoral) that either applies or doesn't. And there might be use cases where you want to combine this dataset with other binary datasets for other labels and train an exclusive or non-exclusive classifier on all the data.

The alternative would be for Prodigy to add a label NOT_SECTORAL (or OTHER), but that feels like a very invasive default behaviour because it really modifies the data. So if you only have one label you're predicting, the easier solution would be to use the textcat_multilabel component instead.

Topic		Replies	Views
text classification: binary v. mutually exclusive labels usage , textcat , solved	1	704	March 1, 2022
textcat training with only one label textcat	1	155	January 17, 2024
Multilabel textcat dependecy between labels usage , textcat	1	445	January 31, 2022
Train a textcat model after it has been 'prodigy.teach'ed with 3 labels usage , textcat	5	574	November 16, 2020
textcat.manual seems to be exclusive by default usage , textcat , solved	2	507	March 26, 2020

textcat vs textcat_multilabel

Related topics