Relations between image objects

Hello, I'm looking for an annotation tool for annotating images of PDF pages with text blocks and their reading order. I can easily annotate the text blocks as image objects and categorise them, but I don't see a way to annotate their reading order. Ideally, I'd like a feature where I could draw an arrow from one object to another. Does Prodigy support anything like this?

I believe the order in which you annotate the objects is also the order in which the annotations are saved. I double-checked to make sure, and I'll demonstrate below.

Demo

I have a folder with images of my cats locally, and I started an annotation interface via:

python -m prodigy image.manual demo-issue-6023 images --label noa,sok --remove-base64

This is what the annotation interface looks like.

First, I'll annotate the white cat, Noa.

Next, I will annotate the black one, Sok.

When I now hit "save", I can retrieve the annotations from the database with this command from the command line:

python -m prodigy db-out demo-issue-6023 | jq -c ".spans"

This command grabs the "spans" key from the annotated example. That's the part of the JSON that has the image annotation. Here's what it contains:

[
    {"id":"2a22b798-607d-4af5-9366-99075835e816","label":"noa","color":"yellow","x":6.5,"y":9.3,"height":264,"width":211,"center":[112,141.3],"type":"rect","points":[[6.5,9.3],[6.5,273.3],[217.5,273.3],[217.5,9.3]]},
    {"id":"b52b737f-6d11-49eb-a906-857d2b5e4eff","label":"sok","color":"cyan","x":181.5,"y":28.3,"height":243,"width":290,"center":[326.5,149.8],"type":"rect","points":[[181.5,28.3],[181.5,271.3],[471.5,271.3],[471.5,28.3]]}
]

As you can see, the span labelled "noa", which I annotated first, also appears first in the list.
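
If you'd rather not rely on the implicit list order downstream, you could also make it explicit. Below is a minimal sketch using Prodigy's Python database API; the "order" field name is just something I picked for illustration, not a built-in Prodigy field.

import json

from prodigy.components.db import connect

# Connect using the database settings from your prodigy.json
db = connect()
examples = db.get_dataset("demo-issue-6023")

for eg in examples:
    # Spans are stored in the order they were drawn in the UI, so
    # enumerating them turns that implicit order into an explicit field.
    for i, span in enumerate(eg.get("spans", [])):
        span["order"] = i  # hypothetical field, purely for illustration
    print([(span["order"], span["label"]) for span in eg.get("spans", [])])

For the example above, this prints [(0, 'noa'), (1, 'sok')].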

Final Thoughts

So you might be able to get what you want if you annotate in the correct order. That said, if you're still unsure and you're acting on behalf of a company, you can also send an email to contact@explosion.ai for a trial. To quote the FAQ page:

Prodigy runs entirely on your own hardware and never phones home or connects to our servers. So we typically do trials by hosting a VM that you can log in to. This gives you the full experience of the tool, including the scriptable back-end, and also makes it easy for us to log in and help if you get stuck. If you’re interested, get in touch! Please note that we’re only able to offer VM trials to companies and organizations, not individuals.

Thanks for the reply, Vincent! We are using model-assisted labelling, where we use our current ML model to pre-annotate the pages. Since there is no way to verify (in the labelling UI) the order in which the text blocks were pre-annotated, I think your approach unfortunately won't work in our case.
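
One possible workaround, untested and assuming the pre-annotations are plain JSONL with the same spans format as shown above, would be to sort each example's spans into a deterministic reading order before loading them into Prodigy. The file names here are placeholders:

import json

with open("preannotated.jsonl") as f_in, open("sorted.jsonl", "w") as f_out:
    for line in f_in:
        eg = json.loads(line)
        # "x" and "y" are the top-left corner of each rectangle, as in the
        # spans above; sorting top-to-bottom, then left-to-right, is a
        # rough reading-order heuristic that assumes single-column pages
        eg["spans"] = sorted(eg.get("spans", []), key=lambda s: (s["y"], s["x"]))
        f_out.write(json.dumps(eg) + "\n")

The sorted file could then be loaded as usual, so the spans should reach the UI, and eventually the database, in that deterministic order.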