A lightning-fast, production-ready API gateway that unifies OpenAI and Google Gemini models into a single endpoint. Built with Rust for maximum performance and reliability.
- Multi-Provider Support - Seamlessly integrate OpenAI and Gemini
- Automatic Failover - Falls back to the secondary provider if the primary fails
- Smart Caching - In-memory cache for improved performance and reduced costs
- Manual Provider Selection - Override automatic routing when needed
- Request Logging - Structured logging with tracing for observability
- Type-Safe - Leverages Rust's type system for reliability
- Async/Await - Non-blocking I/O for high concurrency
```
┌─────────────┐
│   Client    │
└──────┬──────┘
       │
       ▼
┌─────────────────────────────────┐
│        Axum API Gateway         │
│       /generate endpoint        │
└──────┬──────────────────────────┘
       │
       ▼
┌─────────────────────────────────┐
│    Router (model selection)     │
│    - Automatic failover         │
│    - Manual override            │
└──────┬──────────────────────────┘
       │
       ├──────────────┬────────────┐
       ▼              ▼            ▼
┌──────────┐   ┌──────────┐   ┌─────────┐
│  OpenAI  │   │  Gemini  │   │  Cache  │
│  Client  │   │  Client  │   │  Layer  │
└──────────┘   └──────────┘   └─────────┘
```
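The provider clients shown above can sit behind a common abstraction so the router treats them uniformly. A minimal sketch of that idea, assuming a trait along the lines of the project's `AIProvider` (the method names and the synchronous signature here are illustrative, not the actual source):

```rust
// Sketch of a provider abstraction. The real gateway's trait is async
// and lives in src/providers/mod.rs; this simplified version is
// synchronous so it stands alone.
trait AIProvider {
    fn name(&self) -> &'static str;
    fn generate(&self, prompt: &str) -> Result<String, String>;
}

// A stand-in provider used only to demonstrate the trait.
struct MockProvider {
    name: &'static str,
    healthy: bool,
}

impl AIProvider for MockProvider {
    fn name(&self) -> &'static str {
        self.name
    }

    fn generate(&self, prompt: &str) -> Result<String, String> {
        if self.healthy {
            Ok(format!("[{}] reply to: {}", self.name, prompt))
        } else {
            Err(format!("{} is unavailable", self.name))
        }
    }
}

fn main() {
    let provider = MockProvider { name: "OpenAI", healthy: true };
    println!("{}", provider.generate("Hello").unwrap());
}
```

Because both clients implement the same trait, the router can hold them as trait objects and iterate over them without caring which vendor backs each one.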
- Rust 1.70+ (Install Rust)
- OpenAI API Key (Get one here)
- Google Gemini API Key (Get one here)
1. Clone the repository

```bash
git clone https://github.com/ProngsDev/ai-gateway.git
cd ai-gateway
```

2. Set up environment variables

```bash
cp .env.example .env
```

Edit `.env` with your API keys:

```
PORT=8080
OPENAI_API_KEY=sk-your-openai-key-here
GEMINI_API_KEY=your-gemini-key-here
```

3. Build and run

```bash
cargo run --release
```
The server will start on http://localhost:8080
```
GET /health
```

Response:

```
AI Gateway is healthy
```
```
POST /generate
Content-Type: application/json
```

Request Body:

```json
{
  "prompt": "Explain Rust ownership in one sentence",
  "provider": "OpenAI"
}
```

Response:

```json
{
  "provider": "OpenAI",
  "output": "Rust's ownership system ensures memory safety by enforcing that each value has a single owner, and when the owner goes out of scope, the value is automatically deallocated.",
  "cached": false
}
```

Request Fields:

- `prompt` (required): The text prompt to send to the AI model.
- `provider` (optional): Specify `"OpenAI"` or `"Gemini"`. If omitted, automatic failover is used.

Response Fields:

- `provider`: Which provider generated the response
- `output`: The generated text
- `cached`: Whether the response was served from cache
```bash
curl -X POST http://localhost:8080/generate \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "What is Rust?"
  }'
```

The gateway will:

- Try OpenAI first
- If OpenAI fails, automatically fall back to Gemini
- Cache the successful response
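The failover steps above amount to trying each provider in order and returning the first success. A simplified, synchronous sketch of that loop (the real router is async and works with provider clients; plain functions stand in for them here):

```rust
// Simplified failover: call each provider in order and return the
// first successful (provider_name, output) pair. If all fail, return
// the last error.
fn route(
    providers: &[(&str, fn(&str) -> Result<String, String>)],
    prompt: &str,
) -> Result<(String, String), String> {
    let mut last_err = String::from("no providers configured");
    for (name, call) in providers {
        match call(prompt) {
            Ok(output) => return Ok((name.to_string(), output)),
            Err(e) => last_err = e, // fall through to the next provider
        }
    }
    Err(last_err)
}

// Stand-ins: the primary is down, the fallback responds.
fn openai_down(_prompt: &str) -> Result<String, String> {
    Err("OpenAI: 503 Service Unavailable".to_string())
}

fn gemini_ok(prompt: &str) -> Result<String, String> {
    Ok(format!("Gemini says: {}", prompt))
}

fn main() {
    let providers: &[(&str, fn(&str) -> Result<String, String>)] =
        &[("OpenAI", openai_down), ("Gemini", gemini_ok)];
    let (provider, output) = route(providers, "What is Rust?").unwrap();
    // OpenAI failed, so the request fell back to Gemini.
    println!("{}: {}", provider, output);
}
```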
Use OpenAI specifically:

```bash
curl -X POST http://localhost:8080/generate \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Explain async/await",
    "provider": "OpenAI"
  }'
```

Use Gemini specifically:

```bash
curl -X POST http://localhost:8080/generate \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "Explain async/await",
    "provider": "Gemini"
  }'
```

Send the same prompt twice:
```bash
# First request - hits OpenAI
curl -X POST http://localhost:8080/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Hello"}' | jq
# Response: {"provider": "OpenAI", "output": "...", "cached": false}

# Second request - served from cache
curl -X POST http://localhost:8080/generate \
  -H "Content-Type: application/json" \
  -d '{"prompt": "Hello"}' | jq
# Response: {"provider": "OpenAI", "output": "...", "cached": true}
```

Run the tests:

```bash
cargo test
```

Run with debug logging:

```bash
RUST_LOG=debug cargo run
```

Build a release binary:

```bash
cargo build --release
```

The optimized binary will be in `target/release/ai-gateway`.
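The caching behavior demonstrated above can be modeled as an in-memory map keyed by the prompt: the first request misses and stores the generated output, the second is served from memory. A sketch (the real cache layer in `src/cache.rs` may differ in keying, concurrency, and eviction):

```rust
use std::collections::HashMap;

// Minimal prompt-keyed cache sketch.
struct Cache {
    entries: HashMap<String, String>,
}

impl Cache {
    fn new() -> Self {
        Cache { entries: HashMap::new() }
    }

    // Returns (output, cached): `cached` is true when the response
    // was served from memory instead of calling `generate`.
    fn get_or_insert_with<F>(&mut self, prompt: &str, generate: F) -> (String, bool)
    where
        F: FnOnce(&str) -> String,
    {
        if let Some(hit) = self.entries.get(prompt) {
            return (hit.clone(), true);
        }
        let output = generate(prompt);
        self.entries.insert(prompt.to_string(), output.clone());
        (output, false)
    }
}

fn main() {
    let mut cache = Cache::new();
    let (first, cached1) = cache.get_or_insert_with("Hello", |p| format!("echo: {}", p));
    let (second, cached2) = cache.get_or_insert_with("Hello", |p| format!("echo: {}", p));
    assert_eq!(first, second);
    println!("first cached: {}, second cached: {}", cached1, cached2);
    // first cached: false, second cached: true
}
```

A production deployment would also bound the cache size and expire entries; see the Redis suggestion under Contributing for the distributed case.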
Build and run with Docker:

```bash
docker build -t ai-gateway .
docker run -p 8080:8080 \
  -e OPENAI_API_KEY=your-key \
  -e GEMINI_API_KEY=your-key \
  ai-gateway
```

Or with Docker Compose:

```bash
docker-compose up -d
```

See Docker Setup section for details.
```
ai-gateway/
├── src/
│   ├── main.rs            # Server setup & initialization
│   ├── routes.rs          # API endpoint handlers
│   ├── router.rs          # Provider routing & failover logic
│   ├── cache.rs           # In-memory cache implementation
│   ├── error.rs           # Custom error types
│   └── providers/
│       ├── mod.rs         # AIProvider trait definition
│       ├── openai.rs      # OpenAI client implementation
│       └── gemini.rs      # Gemini client implementation
├── Cargo.toml             # Dependencies & metadata
├── .env                   # Environment variables (gitignored)
├── Dockerfile             # Docker build configuration
├── docker-compose.yml     # Docker Compose setup
└── README.md              # This file
```
| Variable | Description | Required | Default |
|---|---|---|---|
| `PORT` | Server port | No | `8080` |
| `OPENAI_API_KEY` | OpenAI API key | Yes | - |
| `GEMINI_API_KEY` | Google Gemini API key | Yes | - |
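Loading this configuration in Rust can look like the sketch below, using only the standard library (the project may load `.env` via a dotenv-style crate instead; `parse_port` and `require_key` are illustrative names, not the actual source):

```rust
use std::env;

// Mirror the table above: PORT is optional with a default of 8080,
// while the API keys are required.
fn parse_port(raw: Option<String>) -> u16 {
    raw.and_then(|p| p.parse().ok()).unwrap_or(8080)
}

fn require_key(name: &str) -> Result<String, String> {
    env::var(name).map_err(|_| format!("{} must be set", name))
}

fn main() {
    let port = parse_port(env::var("PORT").ok());
    println!("listening on port {}", port);
    match require_key("OPENAI_API_KEY") {
        Ok(_) => println!("OPENAI_API_KEY is set"),
        Err(e) => println!("config error: {}", e),
    }
}
```

Failing fast on missing keys at startup, as `require_key` does, is usually preferable to discovering a misconfiguration on the first request.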
The default failover order is:
- OpenAI (primary)
- Gemini (fallback)
To change this, modify the provider order in src/main.rs:
```rust
// Current order (OpenAI first)
ai_router.add_provider(openai_client);
ai_router.add_provider(gemini_client);

// To make Gemini primary, swap the order:
ai_router.add_provider(gemini_client);
ai_router.add_provider(openai_client);
```

- SaaS Applications - Add AI features with built-in resilience
- Cost Optimization - Route to cheaper providers, cache expensive calls
- High Availability - Automatic failover ensures uptime
- Multi-Model Apps - Leverage strengths of different models
- Development/Testing - Single API for multiple providers
- Store API keys in environment variables, never in code
- Use HTTPS in production
- Consider adding rate limiting for public deployments
- Implement authentication/authorization as needed
- Async I/O - Non-blocking requests for high concurrency
- In-Memory Cache - Sub-millisecond cache hits
- Compiled - Native performance with Rust
- Lightweight - Minimal resource footprint
Contributions are welcome! Areas for improvement:
- Add more providers (Claude, Llama, Cohere)
- Implement Redis caching for distributed systems
- Add rate limiting
- Prometheus metrics
- Streaming responses (SSE)
- Configuration via YAML/TOML
MIT License - See LICENSE file for details
Built with:
- Axum - Web framework
- Tokio - Async runtime
- Reqwest - HTTP client
- Serde - JSON serialization
- Tracing - Logging
Made with ❤️ and Rust