Build a production-ready serverless media processing pipeline with AWS Lambda, S3, DynamoDB, and SQS. Part 1 covers the infrastructure setup and the image processing worker.
In this comprehensive guide, we’ll build a production-ready serverless media processing pipeline that automatically watermarks images uploaded to S3. This is Part 1 of a 4-part series covering the complete infrastructure setup and core processing engine.
What we’ll build:
Architecture Preview:
```
User Upload → S3 → SQS → Lambda Worker → Processed S3
                             ↓
                     DynamoDB (Job Status)
```
Region: ap-south-1 (Mumbai)
Estimated Setup Time: 2-3 hours
Monthly Cost: ~$0.20 for 1000 operations
Before starting, ensure you have:
Required AWS Permissions:
We’ll create two S3 buckets: one for original uploads and another for processed images.
Navigate to S3:
Create the uploads bucket:
- Bucket name: `amodhbh-media-uploads`
- Tags: Key `Project`, Value `serverless-media`; Key `Purpose`, Value `uploads`; Key `Environment`, Value `production`

Verify creation:

- `amodhbh-media-uploads` appears in your bucket list

Create the processed bucket:

- Bucket name: `amodhbh-media-processed`
- Tags: Key `Project`, Value `serverless-media`; Key `Purpose`, Value `processed`; Key `Environment`, Value `production`

Verify creation:

- Both `amodhbh-media-uploads` and `amodhbh-media-processed` appear in your bucket list
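If you prefer scripting to the console, here is a minimal boto3 sketch that creates both buckets with the same tags (bucket names and region are the ones used in this guide; everything else stays at defaults):

```python
import boto3

s3 = boto3.client("s3", region_name="ap-south-1")

buckets = {
    "amodhbh-media-uploads": "uploads",
    "amodhbh-media-processed": "processed",
}

for name, purpose in buckets.items():
    # Outside us-east-1, S3 requires an explicit LocationConstraint
    s3.create_bucket(
        Bucket=name,
        CreateBucketConfiguration={"LocationConstraint": "ap-south-1"},
    )
    s3.put_bucket_tagging(
        Bucket=name,
        Tagging={"TagSet": [
            {"Key": "Project", "Value": "serverless-media"},
            {"Key": "Purpose", "Value": purpose},
            {"Key": "Environment", "Value": "production"},
        ]},
    )
```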
We’ll create a DynamoDB table to track job status throughout the processing pipeline.

Navigate to DynamoDB:
Create table:
- Table name: `media-processing-jobs`
- Partition key: `jobId` (Type: String)

Table settings:
Encryption:
Tags (Optional):
- Key `Project`, Value `serverless-media`
- Key `Environment`, Value `production`

Create table:

- Verify the table appears in your table list with the `jobId` partition key
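The same table can be created with boto3; a sketch assuming on-demand (PAY_PER_REQUEST) billing, which may differ from the capacity mode you picked above:

```python
import boto3

dynamodb = boto3.client("dynamodb", region_name="ap-south-1")

dynamodb.create_table(
    TableName="media-processing-jobs",
    AttributeDefinitions=[{"AttributeName": "jobId", "AttributeType": "S"}],
    KeySchema=[{"AttributeName": "jobId", "KeyType": "HASH"}],
    BillingMode="PAY_PER_REQUEST",  # assumption: on-demand capacity
    Tags=[
        {"Key": "Project", "Value": "serverless-media"},
        {"Key": "Environment", "Value": "production"},
    ],
)

# Wait until the table is ACTIVE before using it
dynamodb.get_waiter("table_exists").wait(TableName="media-processing-jobs")
```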
We’ll create two SQS queues: a main processing queue and a dead letter queue for failed messages. We create the DLQ first, so we can reference it when creating the main queue.
Navigate to SQS:
Create queue:
- Queue name: `media-processing-dlq`

Configuration:
Access policy:
Encryption:
Tags (Optional):
- Key `Project`, Value `serverless-media`
- Key `Environment`, Value `production`

Create queue:

- Note the DLQ ARN, for example: `arn:aws:sqs:ap-south-1:123456789012:media-processing-dlq`

Create queue:
- Queue name: `media-processing-queue`

Configuration:
Dead-letter queue:
- Select `media-processing-dlq` from the dropdown

Access policy:
Encryption:
Tags (Optional):
- Key `Project`, Value `serverless-media`
- Key `Environment`, Value `production`

Create queue:
You should see both queues in the SQS console:

- `media-processing-queue`
- `media-processing-dlq`

Click on `media-processing-queue` and verify:

- The dead-letter queue is set to `media-processing-dlq`
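For reference, a minimal boto3 sketch of the same two-queue setup (the visibility timeout and maxReceiveCount below are assumptions; use the values you configured above):

```python
import json
import boto3

sqs = boto3.client("sqs", region_name="ap-south-1")

# Dead letter queue first, so we can reference its ARN
dlq_url = sqs.create_queue(QueueName="media-processing-dlq")["QueueUrl"]
dlq_arn = sqs.get_queue_attributes(
    QueueUrl=dlq_url, AttributeNames=["QueueArn"]
)["Attributes"]["QueueArn"]

# Main queue with a redrive policy pointing at the DLQ
sqs.create_queue(
    QueueName="media-processing-queue",
    Attributes={
        "VisibilityTimeout": "180",  # assumption: at least 6x the Lambda timeout, per AWS guidance
        "RedrivePolicy": json.dumps({
            "deadLetterTargetArn": dlq_arn,
            "maxReceiveCount": "3",  # assumption: retry 3 times before moving to the DLQ
        }),
    },
)
```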
We’ll attach AWS managed policies first, then add a custom inline policy.

Search and attach these managed policies:
- `AWSLambdaBasicExecutionRole` (for CloudWatch Logs)

Click Next

- Role name: `media-worker-lambda-role`

Now we’ll add permissions for S3, SQS, and DynamoDB.
```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "SQSReceiveAndDelete",
      "Effect": "Allow",
      "Action": [
        "sqs:ReceiveMessage",
        "sqs:DeleteMessage",
        "sqs:GetQueueAttributes"
      ],
      "Resource": "arn:aws:sqs:ap-south-1:*:media-processing-queue"
    },
    {
      "Sid": "S3ReadUploads",
      "Effect": "Allow",
      "Action": ["s3:GetObject"],
      "Resource": "arn:aws:s3:::amodhbh-media-uploads/*"
    },
    {
      "Sid": "S3WriteProcessed",
      "Effect": "Allow",
      "Action": ["s3:PutObject"],
      "Resource": "arn:aws:s3:::amodhbh-media-processed/*"
    },
    {
      "Sid": "DynamoDBUpdateJobs",
      "Effect": "Allow",
      "Action": ["dynamodb:GetItem", "dynamodb:UpdateItem"],
      "Resource": "arn:aws:dynamodb:ap-south-1:*:table/media-processing-jobs"
    }
  ]
}
```
- Policy name: `WorkerLambdaCustomPolicy`
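If you’d rather create the role with a script, here is a boto3 sketch that mirrors the console steps above (it assumes the JSON policy document is saved locally as `worker-policy.json`, a filename chosen here for illustration):

```python
import json
import boto3

iam = boto3.client("iam")
ROLE = "media-worker-lambda-role"

# Trust policy so Lambda can assume the role
iam.create_role(
    RoleName=ROLE,
    AssumeRolePolicyDocument=json.dumps({
        "Version": "2012-10-17",
        "Statement": [{
            "Effect": "Allow",
            "Principal": {"Service": "lambda.amazonaws.com"},
            "Action": "sts:AssumeRole",
        }],
    }),
)

# Managed policy for CloudWatch Logs
iam.attach_role_policy(
    RoleName=ROLE,
    PolicyArn="arn:aws:iam::aws:policy/service-role/AWSLambdaBasicExecutionRole",
)

# Inline policy from the JSON document above
with open("worker-policy.json") as f:
    iam.put_role_policy(
        RoleName=ROLE,
        PolicyName="WorkerLambdaCustomPolicy",
        PolicyDocument=f.read(),
    )
```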
The Worker Lambda needs the Pillow library for image processing. We’ll use a Lambda Layer approach.

Option A: Use AWS-Provided Layer (Recommended - Easiest)
AWS provides pre-built layers for common Python packages. However, if not available, proceed to Option B.
Option B: Create Custom Layer (Most Reliable)
You’ll need to create this on a Linux environment or use AWS CloudShell.
Open AWS CloudShell:
Create the layer directory structure:
```bash
mkdir -p pillow-layer/python
cd pillow-layer
```

Install Pillow:

```bash
pip3 install Pillow -t python/
```

Create the zip file:

```bash
zip -r pillow-layer.zip python/
```
Download to your local machine:
- `pillow-layer/pillow-layer.zip`

Navigate to Lambda:
Create layer:
- Name: `pillow-layer`
- Upload the `pillow-layer.zip` file you downloaded

Note the Layer ARN (you’ll need this when creating the Lambda function), for example:

`arn:aws:lambda:ap-south-1:123456789012:layer:pillow-layer:1`
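Alternatively, you can publish the layer straight from CloudShell with boto3 instead of downloading and re-uploading the zip. A sketch (the compatible-runtimes value is an assumption; match it to your function’s runtime):

```python
import boto3

lambda_client = boto3.client("lambda", region_name="ap-south-1")

with open("pillow-layer.zip", "rb") as f:
    response = lambda_client.publish_layer_version(
        LayerName="pillow-layer",
        Description="Pillow for image processing",
        Content={"ZipFile": f.read()},
        CompatibleRuntimes=["python3.12"],  # assumption: your function's runtime
    )

print(response["LayerVersionArn"])
```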
Navigate to Lambda:

Basic information:

- Function name: `worker-lambda`

Permissions:

- Select `media-worker-lambda-role` from the dropdown

Advanced settings:
Click Create function
In the Configuration tab:
Add the Pillow Layer:
- Select `pillow-layer` and the latest version

Click the Configuration tab
Click Environment variables → Edit
Click Add environment variable for each:
| Key | Value |
|---|---|
| UPLOAD_BUCKET | amodhbh-media-uploads |
| PROCESSED_BUCKET | amodhbh-media-processed |
| DYNAMODB_TABLE | media-processing-jobs |
Click Save
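The same configuration can also be applied with boto3 if you’re scripting the setup (memory, timeout, and the layer ARN below are placeholders/assumptions; use your own values):

```python
import boto3

lambda_client = boto3.client("lambda", region_name="ap-south-1")

lambda_client.update_function_configuration(
    FunctionName="worker-lambda",
    Timeout=60,       # assumption: headroom for large images
    MemorySize=512,   # assumption: Pillow benefits from extra memory/CPU
    Environment={"Variables": {
        "UPLOAD_BUCKET": "amodhbh-media-uploads",
        "PROCESSED_BUCKET": "amodhbh-media-processed",
        "DYNAMODB_TABLE": "media-processing-jobs",
    }},
    Layers=["arn:aws:lambda:ap-south-1:123456789012:layer:pillow-layer:1"],  # your layer ARN
)
```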
Replace the contents of `lambda_function.py` with:

```python
import json
import boto3
import os
from PIL import Image, ImageDraw, ImageFont
from io import BytesIO
from datetime import datetime
import logging

# Configure logging
logger = logging.getLogger()
logger.setLevel(logging.INFO)

# Initialize AWS clients
s3_client = boto3.client('s3')
dynamodb = boto3.resource('dynamodb')
table = dynamodb.Table(os.environ['DYNAMODB_TABLE'])

UPLOAD_BUCKET = os.environ['UPLOAD_BUCKET']
PROCESSED_BUCKET = os.environ['PROCESSED_BUCKET']


def lambda_handler(event, context):
    """
    Main Lambda handler. Processes SQS messages containing S3 upload events.
    """
    logger.info(f"Received event: {json.dumps(event)}")

    for record in event['Records']:
        try:
            # Parse the SQS message body
            message_body = json.loads(record['body'])
            job_id = message_body['jobId']
            s3_key = message_body['s3Key']

            logger.info(f"Processing job {job_id} for file {s3_key}")

            # Update job status to PROCESSING
            update_job_status(job_id, 'PROCESSING')

            # Download the image from S3
            image_data = download_image(UPLOAD_BUCKET, s3_key)

            # Process the image (add watermark)
            processed_image_data = add_watermark(image_data)

            # Generate output key
            output_key = f"processed/{s3_key}"

            # Upload processed image to S3
            upload_image(PROCESSED_BUCKET, output_key, processed_image_data)

            # Update job status to COMPLETED
            update_job_status(
                job_id,
                'COMPLETED',
                processed_url=f"s3://{PROCESSED_BUCKET}/{output_key}"
            )

            logger.info(f"Successfully processed job {job_id}")

        except Exception as e:
            logger.error(f"Error processing record: {str(e)}")

            # Update job status to FAILED
            if 'job_id' in locals():
                update_job_status(job_id, 'FAILED', error=str(e))

            # Re-raise the exception so SQS knows the message failed
            raise

    return {
        'statusCode': 200,
        'body': json.dumps('Processing complete')
    }


def download_image(bucket, key):
    """
    Download an image from S3 and return as bytes.
    """
    logger.info(f"Downloading s3://{bucket}/{key}")
    response = s3_client.get_object(Bucket=bucket, Key=key)
    image_data = response['Body'].read()
    return image_data


def add_watermark(image_data):
    """
    Add a watermark to the image with improved error handling and performance.
    """
    logger.info("Adding watermark to image")

    try:
        # Open the image
        image = Image.open(BytesIO(image_data))

        # Convert to RGBA if not already (for transparency support)
        if image.mode != 'RGBA':
            image = image.convert('RGBA')

        # Create a transparent overlay
        overlay = Image.new('RGBA', image.size, (255, 255, 255, 0))
        draw = ImageDraw.Draw(overlay)

        # Calculate watermark position (bottom-right corner)
        watermark_text = "© Amodhbh Media"

        # Try to use a better font, fall back to default if not available
        try:
            font = ImageFont.truetype("/usr/share/fonts/dejavu/DejaVuSans-Bold.ttf", 36)
        except OSError:
            font = ImageFont.load_default()

        # Get text size using textbbox
        bbox = draw.textbbox((0, 0), watermark_text, font=font)
        text_width = bbox[2] - bbox[0]
        text_height = bbox[3] - bbox[1]

        # Position text in bottom-right with 20px margin
        x = image.width - text_width - 20
        y = image.height - text_height - 20

        # Draw semi-transparent background rectangle
        padding = 10
        draw.rectangle(
            [x - padding, y - padding, x + text_width + padding, y + text_height + padding],
            fill=(0, 0, 0, 128)
        )

        # Draw the watermark text
        draw.text((x, y), watermark_text, fill=(255, 255, 255, 255), font=font)

        # Composite the overlay onto the original image
        watermarked = Image.alpha_composite(image, overlay)

        # Convert back to RGB (removes alpha channel)
        watermarked = watermarked.convert('RGB')

        # Save to bytes with optimized settings
        output = BytesIO()
        watermarked.save(output, format='JPEG', quality=95, optimize=True)
        output.seek(0)

        return output.getvalue()

    except Exception as e:
        logger.error(f"Error adding watermark: {str(e)}")
        raise


def upload_image(bucket, key, image_data):
    """
    Upload processed image to S3 with metadata.
    """
    logger.info(f"Uploading to s3://{bucket}/{key}")
    s3_client.put_object(
        Bucket=bucket,
        Key=key,
        Body=image_data,
        ContentType='image/jpeg',
        Metadata={
            'processed-by': 'serverless-media-pipeline',
            'processing-timestamp': datetime.utcnow().isoformat()
        }
    )


def update_job_status(job_id, status, processed_url=None, error=None):
    """
    Update job status in DynamoDB with improved error handling.
    """
    logger.info(f"Updating job {job_id} to status {status}")

    try:
        update_expression = "SET #status = :status, updatedAt = :timestamp"
        expression_values = {
            ':status': status,
            ':timestamp': datetime.utcnow().isoformat()
        }
        expression_names = {
            '#status': 'status'
        }

        if processed_url:
            update_expression += ", processedUrl = :url"
            expression_values[':url'] = processed_url

        if error:
            update_expression += ", errorMessage = :error"
            expression_values[':error'] = error

        table.update_item(
            Key={'jobId': job_id},
            UpdateExpression=update_expression,
            ExpressionAttributeValues=expression_values,
            ExpressionAttributeNames=expression_names
        )

        logger.info(f"Successfully updated job {job_id} to {status}")

    except Exception as e:
        logger.error(f"Error updating job status: {str(e)}")
        raise
```
Add the SQS trigger:

- Select `media-processing-queue` as the source
- Verify that the function’s Triggers list shows `media-processing-queue` listed
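If you’re scripting, the trigger is an event source mapping; a minimal boto3 sketch (the batch size of 1 is an assumption):

```python
import boto3

lambda_client = boto3.client("lambda", region_name="ap-south-1")

lambda_client.create_event_source_mapping(
    EventSourceArn="arn:aws:sqs:ap-south-1:123456789012:media-processing-queue",  # your queue ARN
    FunctionName="worker-lambda",
    BatchSize=1,  # assumption: process one message per invocation
    Enabled=True,
)
```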
We’ll simulate an SQS message manually to test the worker Lambda. Create a test event named `TestMediaProcessing` with this payload:

```json
{
  "Records": [
    {
      "messageId": "test-message-id",
      "receiptHandle": "test-receipt-handle",
      "body": "{\"jobId\": \"test-job-123\", \"s3Key\": \"test-image.jpg\"}",
      "attributes": {
        "ApproximateReceiveCount": "1",
        "SentTimestamp": "1234567890000",
        "SenderId": "test-sender",
        "ApproximateFirstReceiveTimestamp": "1234567890000"
      },
      "messageAttributes": {},
      "md5OfBody": "test-md5",
      "eventSource": "aws:sqs",
      "eventSourceARN": "arn:aws:sqs:ap-south-1:123456789012:media-processing-queue",
      "awsRegion": "ap-south-1"
    }
  ]
}
```
Note: This test will fail because `test-image.jpg` doesn’t exist in your bucket. We’ll do a proper end-to-end test in Part 2.
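If you want the console test to pass now rather than waiting for Part 2, you can seed the inputs it expects. A hedged sketch (assumes Pillow is installed locally; it uploads a placeholder image and creates a matching job record):

```python
import io
import boto3
from PIL import Image

s3 = boto3.client("s3", region_name="ap-south-1")
dynamodb = boto3.resource("dynamodb", region_name="ap-south-1")

# Upload a small placeholder JPEG as test-image.jpg
buf = io.BytesIO()
Image.new("RGB", (640, 480), (200, 120, 40)).save(buf, format="JPEG")
s3.put_object(
    Bucket="amodhbh-media-uploads",
    Key="test-image.jpg",
    Body=buf.getvalue(),
    ContentType="image/jpeg",
)

# Create the job record the worker will update
dynamodb.Table("media-processing-jobs").put_item(
    Item={"jobId": "test-job-123", "status": "PENDING"}
)
```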
Before proceeding to Part 2, verify:
- `amodhbh-media-uploads` exists in ap-south-1
- `amodhbh-media-processed` exists in ap-south-1
- `media-processing-jobs` table exists with partition key `jobId` (String)
- `media-processing-queue` exists
- `media-processing-dlq` exists
- `media-worker-lambda-role` exists with the `AWSLambdaBasicExecutionRole` managed policy
- `pillow-layer` created
- `worker-lambda` exists

Enable S3 Server-Side Encryption:
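A hedged sketch of enabling default encryption on both buckets with boto3 (SSE-S3/AES256 is an assumption; use SSE-KMS if you need customer-managed keys):

```python
import boto3

s3 = boto3.client("s3", region_name="ap-south-1")

for bucket in ("amodhbh-media-uploads", "amodhbh-media-processed"):
    s3.put_bucket_encryption(
        Bucket=bucket,
        ServerSideEncryptionConfiguration={"Rules": [
            {"ApplyServerSideEncryptionByDefault": {"SSEAlgorithm": "AES256"}}
        ]},
    )
```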
IAM Least Privilege:
VPC Configuration:
CloudWatch Alarms:
X-Ray Tracing:
Custom Metrics:
Lambda Configuration:
S3 Optimization:
DynamoDB Optimization:
Total idle cost: $0.00/month
Example: 1000 image processing jobs per month
Estimated monthly cost for 1000 operations: ~$0.20
S3 Lifecycle Policies:
DynamoDB Optimization:
Lambda Optimization:
- Layer issues: confirm the zip contains the `python/PIL/...` directory structure
- If `amodhbh-media-uploads` is taken, try: `amodhbh-media-uploads-<random-number>`

Congratulations! You’ve successfully set up the core infrastructure and worker Lambda for your serverless media processing pipeline.
What’s Next:
Quick Reference:
| Resource Type | Name | Purpose |
|---|---|---|
| S3 Bucket | amodhbh-media-uploads | Stores original uploaded images |
| S3 Bucket | amodhbh-media-processed | Stores watermarked images |
| DynamoDB Table | media-processing-jobs | Tracks job status (PENDING/PROCESSING/COMPLETED/FAILED) |
| SQS Queue | media-processing-queue | Decouples upload events from processing |
| SQS Queue | media-processing-dlq | Captures failed messages for investigation |
| IAM Role | media-worker-lambda-role | Worker Lambda execution role |
| Lambda Layer | pillow-layer | Provides Pillow library for image processing |
| Lambda Function | worker-lambda | Downloads, watermarks, and uploads images |
| SQS Trigger | media-processing-queue | Triggers worker Lambda when messages arrive |
In this comprehensive guide, we’ve built the foundation of a production-ready serverless media processing pipeline:
✅ Infrastructure Setup:
✅ Worker Lambda:
✅ Security & Monitoring:
Key Benefits:
Ready for Part 2? We’ll add API Gateway, pre-signed URLs, and complete the end-to-end pipeline!
This is Part 1 of a 4-part series on building a production-ready serverless media processing pipeline. Stay tuned for Part 2, where we’ll add API Gateway and complete the user-facing components!