textcat vs textcat_multilabel

The problem with this is that, in practice, you still have only one label (Sectoral), which either applies or doesn't. And there might be use cases where you want to combine this dataset with other binary datasets for other labels and train an exclusive or non-exclusive classifier on all the data.

The alternative would be for Prodigy to add a label NOT_SECTORAL (or OTHER) automatically, but that feels like a very invasive default behaviour because it really modifies the data. So if you're only predicting one label, the easier solution is to use the textcat_multilabel component instead, which scores each label on its own rather than against the others.
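To make the difference concrete, here's a rough sketch in plain Python (no spaCy or Prodigy calls). It assumes the usual setup where mutually exclusive classes are normalised with a softmax and non-exclusive labels are each scored with a sigmoid. The raw score and the `to_cats` helper are made up for illustration. The helper only assumes that binary annotations carry an "answer" of "accept" or "reject".

```python
import math


def softmax(scores):
    """Normalise scores so they sum to 1 (mutually exclusive classes)."""
    highest = max(scores)
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(score):
    """Score a single label independently of any other label."""
    return 1.0 / (1.0 + math.exp(-score))


# Hypothetical raw model score for a text that is clearly not SECTORAL.
raw_score = -3.0

# With one exclusive class, the normalised score is 1.0 no matter what
# the raw score is, so the label can never "not apply".
exclusive = softmax([raw_score])

# With an independent score, the same raw value gives a low probability.
independent = sigmoid(raw_score)


def to_cats(example, label="SECTORAL"):
    """Hypothetical helper: binary accept/reject -> single-label cats dict."""
    return {label: 1.0 if example["answer"] == "accept" else 0.0}


print(exclusive)
print(round(independent, 3))
print(to_cats({"text": "Some text", "answer": "reject"}))
```

So with a single label, a rejected example simply becomes SECTORAL: 0.0 and nothing else has to be added to the data.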
