Annotate multimodal dataset

Hi team,
Does prodigy tool support annotation of multimodal dataset?
I have a dataset where each data point consists of image and text. Each data point can be assigned multiple labels. Looking forward to hearing from you. Thanks

Hi @shuklaham and welcome to the Prodigy forum :slight_smile:

We don't have a built-in recipe for multimodal inputs, but you can easily put together a custom UI by combining image, text and choice interfaces via blocks.
If you need a custom layout/style, you can also inject custom html and css, but for a simple combination of images and text blocks should be enough.

Here you can find the (simplified) code for the textcat recipes: https://github.com/explosion/prodigy-recipes/tree/master/textcat which can be helpful while working on your custom version of it.
Let us know if you need any help on the way :slight_smile: