hi @SiteAssist!
Could you take this Prodigy S3 recipe and generalize it to handle multiple buckets? Something like this:
import boto3
import prodigy
import json
from prodigy.util import img_to_b64_uri


@prodigy.recipe("stream-from-s3")
def stream_from_s3(buckets, prefix=None):
    # Create the S3 client once and reuse it for every bucket.
    s3 = boto3.client('s3')
    for bucket in buckets:
        # Use a paginator in case a bucket holds more objects than one listing can return.
        paginator = s3.get_paginator('list_objects')
        paginate_params = {
            'Bucket': bucket
        }
        # Check if only certain images from S3 should be loaded.
        if prefix is not None:
            paginate_params['Prefix'] = prefix
        page_iterator = paginator.paginate(**paginate_params)
        # Iterate through the pages.
        for page in page_iterator:
            # Iterate through the objects on the page; empty results have no 'Contents' key.
            for obj in page.get('Contents', []):
                img_key = obj['Key']
                # Read the image bytes.
                img = s3.get_object(Bucket=bucket, Key=img_key).get('Body').read()
                # Emit the JSON task format that Prodigy expects.
                print(json.dumps({'image': img_to_b64_uri(img, 'image/jpeg')}))


# Example usage
buckets_to_process = ['your_bucket_1', 'your_bucket_2']
stream_from_s3(buckets_to_process, prefix='optional_prefix')
I haven't tried it out yet, but could you check whether it works?
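One thing I'm unsure about: printing JSON to stdout looks more like the custom loader pattern, and I think a function registered with @prodigy.recipe is supposed to return a components dictionary with a stream instead. Here's a rough sketch of the version I had in mind for that, where the argument annotations, the dataset wiring, and the "image" view_id are my guesses from the docs rather than something I've confirmed:

import boto3
import prodigy
from prodigy.util import img_to_b64_uri


@prodigy.recipe(
    "stream-from-s3",
    dataset=("Dataset to save annotations to", "positional", None, str),
    buckets=("Comma-separated S3 bucket names", "positional", None, str),
    prefix=("Only load keys starting with this prefix", "option", "p", str),
)
def stream_from_s3(dataset, buckets, prefix=None):
    def get_stream():
        # One client, reused across all buckets.
        s3 = boto3.client("s3")
        paginator = s3.get_paginator("list_objects")
        for bucket in buckets.split(","):
            params = {"Bucket": bucket}
            if prefix is not None:
                params["Prefix"] = prefix
            for page in paginator.paginate(**params):
                for obj in page.get("Contents", []):
                    img = s3.get_object(Bucket=bucket, Key=obj["Key"]).get("Body").read()
                    # Yield tasks lazily instead of printing them.
                    yield {"image": img_to_b64_uri(img, "image/jpeg")}

    return {
        "dataset": dataset,       # dataset to save annotations to
        "stream": get_stream(),   # generator of annotation tasks
        "view_id": "image",       # assuming the plain image interface is what I want
    }

If that's the right shape, I'd expect to run it with something like prodigy stream-from-s3 my_dataset bucket_1,bucket_2 -p some_prefix -F recipe.py, though I'm not sure I have the flags right either.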