Modern, full-stack AI platform for automated log analysis, incident classification, and intelligent remediation.
🚀 Quick Start | 📖 Setup Guide | 🔧 ADK Integration | 🧪 Test Integration
ChaosPilot is an intelligent log analysis platform that uses AI agents to automatically analyze error, warning, and critical logs, detect patterns, classify incidents, and recommend fixes.
We manage the chaos of production errors through intelligent log analysis.
- 🔍 AI-Powered Analysis: LLMs analyze logs for patterns and anomalies
- 📊 Incident Classification: Auto-classifies by severity, impact, urgency
- 📋 Smart Response Planning: Generates actionable response plans
- 🛠️ Automated Fix Recommendations: Suggests and executes safe fixes
- ⚡ Safe Auto-Fixing: Automated fixes with rollback capability
- 📢 Smart Alerting: Intelligent notifications and escalations
- 📈 Real-Time Dashboard: Live metrics, workflow visualization, and insights
| Layer | Tech/Tools |
|---|---|
| Frontend | Angular, TypeScript, TailwindCSS, RxJS |
| Backend | Python, FastAPI, Google ADK, async/await |
| Auth | Supabase (user/session management) |
| Data/AI | Google ADK agents, BigQuery, LLMs |
| DevOps | Docker, GCP, uv, hatch |
- Google Cloud CLI
- Python 3.9+
- Uv - 🚀 A single tool to replace pip, pip-tools, pipx, poetry, pyenv, twine, virtualenv, and more.
- Node.js 16+
- MCP Toolbox for Databases
-
Clone the repository
git clone https://github.com/pmutua/ChaosPilot cd ChaosPilot -
Create and activate a Python virtual environment
-
Create virtual environment
uv venv
-
Activate virtual environment
# Linux/macOS: source .venv/bin/activate # Windows .venv\Scripts\activate
-
-
Install dependencies
uv sync
-
Rename
.env.templateto.envand add relevant environment variables:APP_NAME="agent_manager" VERSION=0.0.0 MODEL="" # If you are using Google Gemini use this GOOGLE_GENAI_USE_VERTEXAI=FALSE GOOGLE_API_KEY=PASTE_YOUR_ACTUAL_API_KEY_HERE # If you are using Azure Open AI use this AZURE_API_KEY="add api key" AZURE_API_BASE=https://example.openai.azure.com/ AZURE_API_VERSION="2025-05-05-preview"
-
Set up Google Cloud, BigQuery, and MCP Toolbox
(See Setup Guide for full details.)
# Start MCP Toolbox
cd mcp-toolbox
toolbox --tools-file="tools.yaml"
# Start ADK API Server (with CORS)
cd ../agent_manager
adk api_server --allow_origins="*"
# or in root directory run:
adk api_server agent_manager --allow_origins="*"
# Start Frontend
cd ../web
npm install
npm start
Simply run the following command:
adk web
# And navigate to port 8000. Now you can interact with the Agents and debug.
NOTE: Make sure that you select "agent_manager" on "Select Agent" option.
- Go to the Log Analysis page
- Select the "Log Analyzer" agent
- Provide error logs or describe the issue
- Get AI-powered analysis with confidence scores
- Use the "Fix Recommender" agent
- Share the analyzed logs and issues
- Get specific fix suggestions with implementation steps
- System detects critical issues
- AI agents generate response plans
- Safe automated fixes are applied
- Teams are notified with detailed reports
See Setup & Deployment Guide for GCP/Cloud Run instructions.
- CORS Errors: Use
adk api_server app --allow_origins="*" - Service Account Issues: Run the IAM role assignment scripts
- Billing Errors: Ensure GCP billing is enabled
- Agent Not Found: Verify you're running from the correct directory
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
MIT License – see the LICENSE file for details.
- Create an issue in the repository
- Check the documentation
- Review the troubleshooting guide
ChaosPilot – Managing the chaos of production errors through AI-powered log analysis.
