Hi there,
we have a very large number of labels that we'll use for annotations.
However, I cannot find a search functionality in the documentation.
Can you please advise?
Many thanks
Hi there,
we have a very large number of labels that we'll use for annotations.
However, I cannot find a search functionality in the documentation.
Can you please advise?
Many thanks
Hi! I think I may have already answered you via email (assuming you/your colleague emailed us earlier about span categorization and search functionality for thousands of labels). Copying over my reply, also in case it's useful for others
You can change the label view to a dropdown instead of a list of labels (using the label_style
config setting) – however, if your goal is to train a span categorizer and you ended up with this many labels, we'd usually recommend rethinking your label scheme and breaking down the task differently so you're able to annotate and train more efficiently.
With this many labels, the annotators will otherwise have to think about 6k options on every example and search for the correct one, which can get pretty slow and lead to mistakes and inconsistent data. In order to predict a large number of labels, you also need sufficient examples of every label in context (which are naturally unevenly distributed). So you're potentially making your life a lot harder than it should be.
If your labels are hierarchical, one common solution is to start by annotating the top-level categories (which is usually the most important distinction) and then have separate processes for selecting the more fine-grained labels. This also lets you train and iterate a lot quicker. Also see this thread for some background (it's about NER but the considerations for SpanCat are pretty similar):