Hi! A similar(ish) question came up in this thread the other day, and for cases like this, we'd typically recommend making two passes over the annotations, one per objective: first getting the bounding boxes right, then categorising the content labelled by each box.
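For context, after the first (box-drawing) pass, each incoming task would look roughly like this. The span format below is an assumed example based on image_manual-style output, so the exact keys may differ in your setup:

# One image task with two manually drawn boxes (keys assumed,
# roughly what an image_manual-style pass would produce)
eg = {
    "image": "photos/scene_01.jpg",
    "spans": [
        {"label": "PERSON", "points": [[34, 12], [154, 12], [154, 322], [34, 322]]},
        {"label": "PERSON", "points": [[400, 50], [495, 50], [495, 330], [400, 330]]},
    ],
}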
In terms of implementation, the logic in your recipe could look something like this:
import copy
import prodigy

options = [
    {"id": "STAND", "text": "🧍 standing"},
    {"id": "POINT", "text": "👉 pointing"},
]

def get_box_classification_stream(stream):
    for eg in stream:
        for span in eg.get("spans", []):  # the bounding boxes
            task = copy.deepcopy(eg)      # copy so we don't mutate the original task
            task["spans"] = [span]        # only show this one box
            task["options"] = options     # add the choice options
            # re-hash so each box counts as a separate question
            task = prodigy.set_hashes(task, overwrite=True)
            yield task
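To plug this into a custom recipe, you could wrap it up roughly like this. This is just a sketch assuming your data comes in as a JSONL file of image tasks with pre-drawn "spans"; the recipe name image.box-classify is made up for the example:

import prodigy
from prodigy.components.loaders import JSONL

@prodigy.recipe(
    "image.box-classify",
    dataset=("Dataset to save annotations to", "positional", None, str),
    source=("JSONL file of image tasks with pre-drawn boxes", "positional", None, str),
)
def image_box_classify(dataset, source):
    stream = JSONL(source)  # tasks like {"image": ..., "spans": [...]}
    return {
        "dataset": dataset,
        "stream": get_box_classification_stream(stream),
        "view_id": "choice",                   # image plus multiple-choice options
        "config": {"choice_style": "single"},  # one pose per box
    }

You'd then start it with something like prodigy image.box-classify my_dataset ./boxes.jsonl -F recipe.py.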
Another advantage of this approach is that it makes it easy to gradually automate parts of the process. For instance, recognising the actual person (i.e. drawing the box) might be something your model learns to do quite accurately early on, so you can mix in suggestions from the model and focus only on the classification of each box.
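To make that concrete, here's a minimal sketch of what mixing in suggestions could look like. The detect_boxes function is hypothetical, standing in for whatever model you end up training; it's assumed to take an image and return a list of span dicts in the same format as manually drawn boxes:

def add_model_boxes(stream, detect_boxes):
    # detect_boxes: hypothetical callable, image -> list of span dicts
    for eg in stream:
        if not eg.get("spans"):  # only suggest boxes where none exist yet
            eg["spans"] = detect_boxes(eg["image"])
        yield eg

The two wrappers then compose nicely, e.g. get_box_classification_stream(add_model_boxes(stream, detect_boxes)), so boxes get drawn or suggested first and each one is classified afterwards.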