Autonomous multi-agent orchestration platform that executes complex goals end-to-end β without hand-holding.
Give it a goal in plain English. It researches, writes code, generates documents, and delivers results to your Telegram, Discord, or Slack β autonomously.
β‘ Quick Start Β· β¨ Features Β· π€ Agent Profiles Β· π Integrations Β· π Architecture Β· πΊοΈ Ecosystem
βΆ Goal β autonomous execution |
βΆ Browser automation & delivery |
AI Partner is a self-hosted, autonomous AI agent platform you run on your own machine with Docker. You describe a goal β it decomposes it into tasks, executes them using real tools (web search, code execution, file generation, messaging), validates outcomes against measurable criteria, and delivers results to you automatically.
"Research the top 10 AI tools launched this week,
write a PDF comparison report, and send it to my Telegram."
AI Partner will: search the web β extract data β analyse results β generate a PDF β send to Telegram β confirm delivery. You don't touch it again.
Requirements: Docker Desktop + one LLM API key (OpenAI, Anthropic, Gemini, Groq, DeepSeek, or local Ollama)
Mac / Linux β one command:
curl -fsSL https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/setup.sh | bashWindows β paste into PowerShell:
iwr -useb https://raw.githubusercontent.com/AmitkrPaiwal/AI-Partner/main/install.ps1 | iexOr manually:
git clone https://github.com/AmitkrPaiwal/AI-Partner
cd AI-Partner
./setup.sh # Mac/Linux
.\install.ps1 # Windows (PowerShell)The installer walks you through choosing an LLM provider, entering your API key, and opens the browser UI automatically. First run takes 2β4 minutes (Docker image build).
Type a goal β AI Partner decomposes it, builds an execution plan, runs it with real tools, validates outcomes against measurable success criteria, and retries or replans on failure.
- β Up to 3 concurrent goals, each with up to 5 parallel sub-agents
- β ReAct loop: Reason β Act β Assess β Retry
- β Self-correcting: if a script errors, it semantically repairs and re-runs
- β Typed success criteria β agent proves completion, doesn't just say "done"
Pre-built agents with enforced tool whitelists, iteration caps, and automatic routing based on keywords.
| Cluster | Agents |
|---|---|
| Research | Web Researcher, Fact Checker, Trend Spotter |
| Dev | Python Developer, Node.js Developer, Debugger, Shell Operator |
| Data | Financial Analyst, Data Analyst, Excel Builder |
| Content | Report Generator, Summarizer, Tech Writer, Prompt Architect, Task Planner |
| Delivery | Telegram Reporter |
Invoke directly: @fin-analyst what is RELIANCE.NS today?
Or let keywords auto-route: typing "trending AI tools" fires @trend-spotter automatically.
Puppeteer-powered browser with live CDP screencasting visible in the UI. When a CAPTCHA appears, the agent pauses and shows a "Solve CAPTCHA β Take Control" button. You solve it, the agent resumes.
Results aren't just saved β they're validated delivery goals. The agent marks a task failed if messaging_send_file doesn't succeed.
Supports: Telegram Β· Discord Β· Slack Β· WhatsApp Β· Signal
- Episodic memory β timestamped event log of every conversation and outcome
- Vector search β semantic similarity across 4 embedding backends
- Persona β biographic facts and preferences injected into every prompt
- Knowledge base β upload PDFs/docs for RAG retrieval
PDF Β· Excel (xlsx) Β· PowerPoint (pptx) Β· Word (docx) Β· HTML β downloadable from the UI or sent via messaging.
After a successful goal, AI Partner generalises the solution into a reusable parameterised skill template. Deduplicated by embedding similarity. Skills can be promoted to first-class MCP tools.
Cron-expression scheduling, webhook triggers, Google Calendar events, Gmail arrival β all fire autonomous goal execution.
Each profile specifies:
| Field | Description |
|---|---|
| Tool whitelist | Enforced β agent cannot use tools outside its list |
| Iteration cap | Prevents runaway loops |
| Auto-select keywords | Fires automatically when matched in chat |
| agentType | Determines exhaustion behaviour (research / execution / delivery / synthesis) |
| Handoff instructions | Baked into every system prompt |
Profiles are editable from the UI (Settings β Agent Profiles) or by editing server/src/agents/seedProfiles.ts.
Add any key to .env β the agent automatically gains those tools.
| Service | Env Var | Tools Unlocked |
|---|---|---|
| GitHub | GITHUB_TOKEN |
search repos, list issues, create issues, get files, list PRs, add comments, search code |
| Notion | NOTION_API_KEY |
search, read page, create page, query database, append blocks |
| Gmail | GMAIL_USER + GMAIL_APP_PASSWORD |
send, search, read, list inbox |
| Google Calendar | GOOGLE_CALENDAR_ACCESS_TOKEN |
list events, create event, check availability, delete event |
| Google Drive | GOOGLE_DRIVE_ACCESS_TOKEN |
search, get file, list folder, create file |
| Twitter/X | TWITTER_BEARER_TOKEN |
search tweets, read timeline (+ OAuth keys for posting) |
| Trello | TRELLO_API_KEY + TRELLO_TOKEN |
list boards/cards, create card, move card, add comment |
| Spotify | SPOTIFY_ACCESS_TOKEN |
search, play, pause, skip, queue, create playlist |
| Apify | APIFY_API_TOKEN |
residential proxy scraping for CAPTCHA-protected sites |
| Image Gen | OPENAI_API_KEY or STABILITY_API_KEY |
DALL-E 3 / Stability AI image generation |
Messaging platforms: Telegram Β· Discord Β· Slack Β· WhatsApp Β· Signal
At least one required. Add the key to .env:
| Provider | Env Var | Notes |
|---|---|---|
| Anthropic | ANTHROPIC_API_KEY |
Claude 3.5 / 4 family |
| OpenAI | OPENAI_API_KEY |
GPT-4o, GPT-4o-mini |
GOOGLE_API_KEY |
Gemini 2.0 Flash | |
| Groq | GROQ_API_KEY |
Free tier, very fast (Llama, Mistral) |
| DeepSeek | DEEPSEEK_API_KEY |
Low cost, strong at coding |
| Mistral | MISTRAL_API_KEY |
European-hosted option |
| Together AI | TOGETHER_API_KEY |
Wide open-source model selection |
| Ollama | OLLAMA_HOST |
Local models, no API key needed |
| Perplexity | PERPLEXITY_API_KEY |
Search-grounded LLM with citations |
Switch models any time from Settings β Models in the UI.
flowchart TD
U([π€ User]) -->|"Natural language goal"| AO[AgentOrchestrator\nChat OODA loop]
AO -->|Goal detected| GE[GoalOrientedExecutor]
GE --> GX[GoalExtractor\nTyped success criteria]
GX --> RR[ReActReasoner\nReason β Act β Assess loop]
RR --> EE[ExecutionEngine]
EE --> MCP[MCP Tool Servers\n17 servers]
EE --> DS[Docker Sandbox\ncode execution]
RR --> SC[SelfCorrector\nSemantic script repair]
SC --> RR
RR --> GV[GoalValidator\nChecks file / content / messaging]
GV -->|All criteria pass| SL[SkillLearner\nGeneralise + store for reuse]
GV -->|Criteria fail| RR
style U fill:#6366f1,color:#fff
style SL fill:#10b981,color:#fff
style GV fill:#f59e0b,color:#000
Concurrency: Up to 3 concurrent goals, each with up to 5 parallel sub-agents via delegate_parallel.
MCP Tool Servers (17): web_search Β· browser_automation Β· code_executor Β· file_system Β· gmail Β· google_calendar Β· google_drive Β· github Β· notion Β· twitter Β· trello Β· spotify Β· apify Β· messaging (Telegram/Discord/Slack/WhatsApp/Signal) Β· image_generator Β· document_builder Β· memory
The open-source self-hosted agent space has several strong projects, each built around a different design philosophy. Here's how they are positioned:
| Project | Primary Design Focus | Best Suited For |
|---|---|---|
| AI Partner | End-to-end goal execution with validated outcomes, specialist agents, document generation, and messaging delivery | Users who want to hand off a complete goal and receive a finished, delivered result β with no babysitting |
| Agent Zero | OS-level autonomy with dynamic tool creation at runtime; runs in an isolated Docker terminal | Power users who want an agent that can build its own tools and interact deeply with the operating system |
| OpenClaw | Personal, always-on AI assistant with a modular skills system; strong messaging integrations | Users who want a self-hosted personal assistant accessible via Telegram, WhatsApp, or iMessage |
| OpenHands | Enterprise-grade autonomous software engineering; multi-agent collaboration and audit trails | Engineering teams automating code review, bug fixing, or large-scale software development workflows |
| OpenManus | Open alternative to Manus; flexible task decomposition and planning with reinforcement learning | Researchers and developers experimenting with agent reasoning and RL-based decision-making |
Where AI Partner fits: It is the only project in this space that treats goal delivery β not just task execution β as a first-class requirement. The agent must prove outcomes via typed success criteria, validates messaging delivery, and self-corrects on failure. The one-command Docker install also makes it the most accessible entry point for non-developer users.
Key files β editable without redeploying:
| File | Purpose |
|---|---|
server/prompts/agent.system.md |
Agent core identity |
server/prompts/profiles/ |
Per-profile LLM prompts |
server/prompts/reasoner-reason.md |
ReAct reasoning prompt |
server/prompts/reasoner-decide.md |
ReAct action-selection prompt |
server/config/blocked-domains.json |
Domains blocked from browser navigation |
server/config/data-api-hints.json |
API fallback hints injected when search fails |
server/templates/workspace/HEARTBEAT.md |
Proactive agenda tasks |
server/templates/workspace/SOUL.md |
Agent persona + quiet hours |
Environment variables β see .env.example for the full annotated list.
# Start
docker compose up -d
# View logs
docker compose logs -f app
# Stop
docker compose down
# Update to latest
./setup.sh --update # Mac/Linux
.\install.ps1 -Update # Windows
# Wipe all data and start fresh
./setup.sh --reset
.\install.ps1 -Reset# Hot-reload dev mode
docker compose -f docker-compose.dev.yml up
# Run unit tests (145 tests)
cd server && npm run test:unit
# TypeScript check
cd server && npx tsc --noEmitContributions are welcome! Here are some good places to start:
- Browse good first issues β beginner-friendly tasks
- Open a Discussion to propose features or ask questions
- Submit a PR β all improvements are reviewed within 48 hours
Ideas for new contributors:
- New MCP tool server integrations (Linear, Jira, Confluence, Airtable)
- Additional LLM provider adapters
- Browser automation improvements
- UI/UX enhancements
- Documentation and tutorials
MIT β free to use, modify, and distribute.
Built with TypeScript Β· Express Β· React Β· Puppeteer Β· Docker Β· SQLite Β· MCP
β If AI Partner saves you time, please star this repo β it helps others find it.
Star β Β· Fork π΄ Β· Issues π Β· Discussions π¬