mini-agent

The AI agent that sees before it acts.

Most agent frameworks are goal-driven: give it a task, get steps back. mini-agent is perception-driven — it observes your environment continuously, then decides whether to act. Goal-driven agents fail when the goal is wrong. Perception-driven agents adapt to what's actually happening.

Shell scripts define what the agent can see. Claude decides what to do. No database, no embeddings — just Markdown files + shell scripts + Claude CLI.

Quick Start

Prerequisites: Node.js 20+ and Claude CLI (npm install -g @anthropic-ai/claude-code)

# Install (pnpm auto-installed if needed)
curl -fsSL https://raw.githubusercontent.com/miles990/mini-agent/main/install.sh | bash

# Interactive chat — auto-creates agent-compose.yaml on first run
mini-agent

# Run autonomously in background
mini-agent up -d        # Start the OODA loop
mini-agent status       # What is it doing?
mini-agent logs -f      # Watch it think

What a Cycle Looks Like

── Perceive ─────────────────────────────────
  <workspace> 2 files changed: src/auth.ts, src/api.ts </workspace>
  <docker> container "redis" unhealthy (OOM) </docker>

── Decide ───────────────────────────────────
  Redis OOM is blocking the API. Fix infrastructure first.

── Act ──────────────────────────────────────
  Restarted redis with --maxmemory 256mb. API responding.
  Notified via Telegram: "Redis was OOM, restarted with memory limit."

Each cycle: perceive → decide → act. No human prompt needed.

What Makes It Different

	Platform Agents	Goal-Driven (AutoGPT)	mini-agent
Core idea	Agents on a platform	Goal in, steps out	See first, then act
Identity	Platform-assigned	None	SOUL.md — personality, growth
Memory	Platform DB	Vector DB	Markdown files (human-readable)
Perception	Platform APIs	Minimal	Shell scripts — anything is a sense
Security	Sandbox	Varies	Transparency > Isolation
Complexity	Heavy	181K lines (AutoGPT)	~29K lines TypeScript

How It Works

Four building blocks:

Perception — Shell scripts that output environment state. Anything scriptable becomes a sense
Skills — Markdown files injected into the prompt. Domain knowledge as instructions
Memory — Markdown + JSON Lines. Hot → warm → cold tiers. FTS5 full-text search, no vector DB
Identity — SOUL.md defines personality, interests, evolving worldview. Not just a task executor

Perception Plugins

Any executable that writes to stdout becomes a sense:

#!/bin/bash
# plugins/my-sensor.sh — output becomes <my-sensor>...</my-sensor> in context
echo "Status: $(systemctl is-active myservice)"
echo "Queue: $(wc -l < /tmp/queue.txt) items"

Register it in agent-compose.yaml:

perception:
  custom:
    - name: my-sensor
      script: ./plugins/my-sensor.sh

34 plugins included out of the box: workspace changes, Docker health, Chrome tabs, Telegram inbox, mobile GPS, GitHub issues/PRs, and more.

Skills

Write domain knowledge in Markdown. The agent follows it as instructions:

skills:
  - ./skills/docker-ops.md      # Container troubleshooting
  - ./skills/web-research.md    # Three-layer web access
  - ./skills/debug-helper.md    # Systematic debugging

25 skills included.

Configuration

One YAML file defines your agent:

# agent-compose.yaml
agents:
  assistant:
    name: My Assistant
    port: 3001
    persona: A helpful personal AI assistant
    loop:
      enabled: true
      interval: "5m"
    cron:
      - schedule: "*/30 * * * *"
        task: Check for pending tasks
    perception:
      custom:
        - name: docker
          script: ./plugins/docker-status.sh
    skills:
      - ./skills/docker-ops.md

Features

Organic Parallelism — Multi-lane architecture inspired by slime mold: main cycle + foreground lane + 6 background tentacles
System 1 Triage — Optional mushi companion uses a small model (~800ms) to filter noise before expensive LLM calls — saves ~40% token cost
Telegram — Bidirectional messaging with notifications and smart batching
Mobile PWA — Phone sensors (GPS, accelerometer, camera) as perception inputs
Web Access — Multi-layer extraction: Readability → trafilatura → VLM vision fallback
Team Chat Room — Multi-party discussion with persistent history and threading
MCP Server — 14 tools for Claude Code integration
CI/CD — Auto-commit → auto-push → GitHub Actions → deploy
Modes — calm (loop off) / reserved (loop on, notifications off) / autonomous (everything on)

Requirements

Node.js 20+
Claude CLI (npm install -g @anthropic-ai/claude-code)
Chrome (optional, for web access via CDP)

Philosophy

"There is no such thing as an empty environment."

A personal AI agent shares your context — your browser sessions, your conversations, your files. Isolating it means isolating yourself. mini-agent chooses transparency over isolation: every action has an audit trail (behavior logs + git history + File=Truth).

The agent's world is defined by its perception plugins — its Umwelt. Add a plugin, expand what it can see. What it sees shapes what it does.

Documentation

CLAUDE.md — Full architecture reference
CONTRIBUTING.md — How to contribute
plugins/ — All perception plugins
skills/ — All skill modules

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 1,844 Commits
.claude		.claude
.github		.github
docker		docker
docs		docs
kuro-portfolio		kuro-portfolio
memory		memory
plugins		plugins
scripts		scripts
skills		skills
src		src
tests		tests
tools/kuro-sense		tools/kuro-sense
.env.example		.env.example
.gitignore		.gitignore
.mcp.json		.mcp.json
AGENTS.md		AGENTS.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
SKILL.md		SKILL.md
agent-compose.yaml		agent-compose.yaml
chat-room.html		chat-room.html
dashboard.html		dashboard.html
install.sh		install.sh
mcp-agent.json		mcp-agent.json
mobile.html		mobile.html
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
tsconfig.json		tsconfig.json
uninstall.sh		uninstall.sh
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mini-agent

Quick Start

What a Cycle Looks Like

What Makes It Different

How It Works

Perception Plugins

Skills

Configuration

Features

Requirements

Philosophy

Documentation

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

mini-agent

Quick Start

What a Cycle Looks Like

What Makes It Different

How It Works

Perception Plugins

Skills

Configuration

Features

Requirements

Philosophy

Documentation

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages