Taking a Computer Vision Approach (leveraging image.manual) to build a custom NER model on PDFs

Hi Jetson, here are the answers to your questions:

  1. Yes, correct. Prodigy doesn't natively support PDFs. However, you can choose to write your own custom recipe that is able to use them. That way, you could consider using a Python package that can parse .pdf files. You might be able to consider this if your pdfs follow a very strict structure, but the image OCR path seems like a more common approach.
  2. An annotated image will have bounding boxes with the data format described here.
  3. You may appreciate this answer if you want to auto-tag images.
1 Like