This project implements a production-grade distributed job queue for asynchronous report generation. It uses a decoupled microservices architecture, with Redis as the message broker and AWS S3 for persistent storage, to provide high availability, scalability, and reliability.
The repository is organized into distinct sub-modules to separate infrastructure, frontend, and backend services.
```
.
├── dashboard-api/        # Express.js service providing queue analytics
├── dashboard-frontend/   # React + Vite UI portal
├── producer/             # Entry point for submitting new job requests
├── worker/               # Core processing engine
├── infra/                # Docker Compose files for local/prod environments
├── infra-pulumi/         # Infrastructure as Code (AWS provisioner)
├── docs/assets/          # Architectural diagrams and screenshots
└── assignment_report.md  # Detailed project documentation
```
- Producer: Acts as the gateway. It validates input and pushes a "generate-report" job into the Redis `report-queue`.
- Worker: The "heavy lifter". Multiple replicas can run concurrently to drain the queue.
- Scheduler: A lightweight monitor that ensures jobs don't stay in "active" state forever if a worker crashes.
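The scheduler's core decision can be sketched in a few lines. This is an illustrative sketch, not the project's actual code: the `ActiveJob` shape, `LOCK_TTL_MS` constant, and `isStalled` helper are all assumptions for the example.

```typescript
// Sketch of the scheduler's stalled-job check. A job whose lock has
// outlived its TTL is considered stalled (its worker likely crashed)
// and should be returned to the waiting queue.
interface ActiveJob {
  id: string;
  lockedAtMs: number; // timestamp when a worker claimed the job
}

const LOCK_TTL_MS = 5 * 60 * 1000; // assumed 5-minute lock

function isStalled(job: ActiveJob, nowMs: number): boolean {
  return nowMs - job.lockedAtMs > LOCK_TTL_MS;
}
```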
- BullMQ over plain Redis: BullMQ handles the complexity of atomic operations and job state transitions, reducing the risk of data races.
- S3 vs Local Storage: Local storage is ephemeral in Docker. Using S3 ensures that reports persist across container restarts and scale-outs.
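Because S3 objects outlive any single container, reports need stable, collision-free keys. A minimal sketch of one such naming scheme; the `reports/` prefix and date-based layout are assumptions, not the project's actual convention:

```typescript
// Hypothetical S3 key builder: groups reports by day and keys each
// object by its job ID so restarts and scale-outs never collide.
function reportKey(jobId: string, createdAt: Date): string {
  const day = createdAt.toISOString().slice(0, 10); // YYYY-MM-DD
  return `reports/${day}/${jobId}.pdf`;
}
```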
The Worker Service is the most critical part of the system, handling intensive PDF generation tasks.
- Job Claim: A worker pulls a job from Redis and sets a lock (default 5 mins).
- Rendering: The `report.processor.ts` module generates dynamic HTML based on the job payload.
- PDF Generation: A local Puppeteer instance (Headless Chrome) renders the HTML.
- S3 Persistence: The raw PDF buffer is uploaded to AWS S3.
- Completion: The worker updates the job status in Redis and releases the lock.
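The steps above can be sketched as one orchestration function with its stages injected as dependencies. Every interface and function name here is illustrative (the real processor lives in `report.processor.ts` and calls Puppeteer, the AWS SDK, and BullMQ directly):

```typescript
// Sketch of the worker pipeline: render -> PDF -> upload -> complete.
interface ReportJob {
  id: string;
  payload: Record<string, unknown>;
}

interface PipelineDeps {
  renderHtml: (payload: Record<string, unknown>) => Promise<string>;
  htmlToPdf: (html: string) => Promise<Uint8Array>;              // e.g. Puppeteer
  uploadToS3: (key: string, pdf: Uint8Array) => Promise<string>; // returns object URL
  markCompleted: (jobId: string, url: string) => Promise<void>;  // Redis status update
}

async function processReport(job: ReportJob, deps: PipelineDeps): Promise<string> {
  const html = await deps.renderHtml(job.payload);
  const pdf = await deps.htmlToPdf(html);
  const url = await deps.uploadToS3(`reports/${job.id}.pdf`, pdf);
  await deps.markCompleted(job.id, url);
  return url;
}
```

Injecting the stages keeps the flow testable without Redis, S3, or a browser.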
- Concurrency: Each worker is configured to handle `WORKER_CONCURRENCY=2` jobs simultaneously.
- Automatic Retries: If Puppeteer fails, BullMQ automatically retries the job up to 3 times with exponential backoff.
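With exponential backoff, the wait doubles on each retry. A minimal sketch of that schedule, assuming a 1-second base delay and growth of the form delay × 2^(attempt − 1) (the actual base delay is set in the queue's job options):

```typescript
// Illustrative exponential backoff: attempt 1 waits the base delay,
// and each subsequent attempt doubles the wait.
function retryDelayMs(attemptsMade: number, baseDelayMs = 1000): number {
  return baseDelayMs * 2 ** (attemptsMade - 1);
}
// attempt 1 -> 1000 ms, attempt 2 -> 2000 ms, attempt 3 -> 4000 ms
```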
- POST `:5001/api/v1/reports/generate`: Submit a new report job.
- GET `:5003/api/stats`: Real-time counts of waiting/active/completed/failed jobs.
- GET `:5003/api/workers`: Lists all active workers and their health heartbeats.
- GET `:5003/api/jobs/completed`: Fetches the last successfully generated reports.
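The endpoints above can be called from any HTTP client. A small sketch of a URL helper plus an example request; the `endpoint` helper, `localhost` host, and request body shape are assumptions for illustration:

```typescript
// Hypothetical helper assembling the service URLs listed above.
function endpoint(port: number, path: string, host = "localhost"): string {
  return `http://${host}:${port}${path}`;
}

// Example: submit a report job to the producer (not executed here):
// await fetch(endpoint(5001, "/api/v1/reports/generate"), {
//   method: "POST",
//   headers: { "Content-Type": "application/json" },
//   body: JSON.stringify({ type: "sales-summary" }),
// });
```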
The project uses Pulumi (TypeScript) to manage networking, compute, and storage. Below is a snapshot of the provisioned resources in the AWS console:
The system is deployed via an automated pipeline defined in .github/workflows/aws-ec2-deploy.yml:
- Trigger: On push to the `main` branch.
- Infra Refresh: Pulumi ensures the AWS environment is up-to-date.
- SSH Deployment: Runner connects via SSH to pull code and restart services.
Below is the live application running on an AWS EC2 instance:
This system demonstrates a robust, production-ready distributed job queue using modern engineering practices and cloud-native infrastructure.