Agentic Engineering Patterns

A Claude Code plugin for building software products with AI agents — from raw idea to shipped MVP.

Why This Exists

Traditional software development bottlenecks on human coding time. Process design optimizes "how to make people write code faster."

When agents can execute dozens of tasks in parallel, that bottleneck vanishes. A new one takes its place:

Agent execution capacity is near-infinite. Specification quality is not.

Vague specs don't slow down a human — they ask a colleague and adjust. Vague specs paralyze agents — they guess, diverge, and produce incompatible code across parallel sessions. The cost of ambiguity scales with parallelism.

This inverts the entire design logic:

Traditional:    plan roughly → adjust as you go → ship
                (optimizes for human coding speed)

Agentic:        invest heavily in spec precision → parallel execution → ship
                (optimizes for agent execution quality)

Every skill in this plugin serves that logic. The time you spend in /envision and /map pays back exponentially when agents build in parallel without asking questions.

The Mental Model

The workflow separates thinking from doing:

┌─────────────────────────────────────────────────────────────────┐
│                                                                 │
│   CONTROL PLANE  (human decides what to build)                  │
│                                                                 │
│   You + AI collaborate on high-leverage decisions:              │
│   goals, decomposition, architecture, priorities, feedback      │
│                                                                 │
│   ┌──────────┐    ┌──────────┐    ┌──────────┐                 │
│   │ /envision │───►│  /map    │───►│ /reflect │──┐              │
│   │          │    │          │    │          │  │              │
│   │ what to  │    │ how to   │    │ what we  │  │              │
│   │ build    │    │ break it │    │ learned  │  │              │
│   │          │    │ down     │    │          │  │              │
│   └──────────┘    └──────────┘    └──────────┘  │              │
│        ▲                │              │         │              │
│        └────────────────┼──────────────┘         │              │
│                         │  feedback loop         │              │
│                         ▼                        │              │
│                  ┌────────────┐                   │              │
│                  │ /dispatch  │  picks stories    │              │
│                  │            │  from the map,    │              │
│                  │ what to    │  creates OpenSpec │              │
│                  │ work on    │  changes          │              │
│                  │ next       │                   │              │
│                  └─────┬──────┘                   │              │
│                        │                         │              │
└────────────────────────┼─────────────────────────┼──────────────┘
                         │                         │
          story specs    │    status + cost flow up │
          flow down      │                         │
                         ▼                         │
┌──────────────────────────────────────────────────┼──────────────┐
│                                                  │              │
│   EXECUTION PLANE  (agents build it)             │              │
│                                                                 │
│   Agents receive precise specs, work in isolation,              │
│   produce PRs. They don't decide what to build.                 │
│                                                                 │
│   ┌──────────┐    ┌──────────┐    ┌──────────┐    ┌─────────┐  │
│   │ /design  │───►│ /launch  │───►│  /build  │───►│  /wrap  │  │
│   │          │    │          │    │          │    │         │  │
│   │ refine   │    │ spawn    │    │ implement│    │ archive │  │
│   │ the spec │    │ agent    │    │ + test   │    │ + update│  │
│   │          │    │          │    │ + PR     │    │ status  │  │
│   └──────────┘    └──────────┘    └──────────┘    └─────────┘  │
│                                                                 │
│   (repeat per story — multiple stories run in parallel)         │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

Agents don't talk to each other. They communicate through structured artifacts — context documents, story specs, interface contracts, signal files. The harness coordinates everything. This is a production system design, not a chatroom-style agent swarm.

The Story Map

AEP organizes all work as a Jeff Patton story map. Read left-to-right for the user journey, top-to-bottom for enrichment. Every AEP term maps to a position on this structure:

                            ACTIVITY BACKBONE (extracted by /envision)
    ─────────────────────────────────────────────────────────────────────────────►
    "The user authenticates, then configures, then monitors, then reviews"

    ┌──────────────┐  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐
    │  Authenticate │  │  Configure   │  │   Monitor    │  │    Review    │
    │  (activity)   │  │  (activity)  │  │  (activity)  │  │  (activity)  │
    └──────┬───────┘  └──────┬───────┘  └──────┬───────┘  └──────┬───────┘
           │                 │                 │                 │
═══════════╪═════════════════╪═════════════════╪═════════════════╪══════════════
 Layer 0   │  WALKING SKELETON — thinnest end-to-end path       │
           │                 │                 │                 │
  Wave 1   │  ┌────────┐    │  ┌────────┐     │  ┌────────┐    │
           │  │ STORY  │    │  │ STORY  │     │  │ STORY  │    │
           │  │ db-    │    │  │ api-   │     │  │ web-   │    │
           │  │ setup  │    │  │ scaff  │     │  │ scaff  │    │
           │  │   S ◆  │    │  │   S    │     │  │   S    │    │
           │  └────────┘    │  └────────┘     │  └────────┘    │
           │                 │                 │                 │
  Wave 2   │  ┌────────┐    │  ┌────────┐     │                │  ┌────────┐
  (needs   │  │ STORY  │    │  │ STORY  │     │                │  │ STORY  │
   wave 1) │  │ auth-  │    │  │ config │     │                │  │ audit- │
           │  │ setup  │    │  │ basic  │     │                │  │ list   │
           │  │   M    │    │  │   S    │     │                │  │   M    │
           │  └────────┘    │  └────────┘     │                │  └────────┘
           │                 │                 │                 │
 ─ ─ ─ ─ ─│─ ─ LAYER GATE ─ "user can complete full journey" ─ │─ ─ ─ ─ ─ ─
           │                 │                 │                 │
═══════════╪═════════════════╪═════════════════╪═════════════════╪══════════════
 Layer 0.5 │  ALIGNMENT LAYER — human calibrates quality        │
           │                 │                 │                 │
  Wave 1   │  ┌────────┐    │                 │  ┌────────┐    │  ┌────────┐
  (visual- │  │ STORY  │    │                 │  │ STORY  │    │  │ STORY  │
   design) │  │ landing│    │                 │  │ dash-  │    │  │ auth-  │
           │  │ polish │    │                 │  │ board  │    │  │ pages  │
           │  │  M ✦   │    │                 │  │  M ✦   │    │  │  S ✦   │
           │  └────────┘    │                 │  └────────┘    │  └────────┘
           │                 │                 │                 │
           │  ✦ = calibration_type: visual-design               │
           │      dispatched with calibration/visual-design.yaml │
           │                 │                 │                 │
 ─ ─ ─ ─ ─│─ ─ RELEASE LINE ─ Layer 0 + 0.5 = first release ─ │─ ─ ─ ─ ─ ─
           │                 │                 │                 │
═══════════╪═════════════════╪═════════════════╪═════════════════╪══════════════
 Layer 1   │  CORE FEATURES — deeper capabilities               │
           │                 │                 │                 │
  Wave 1   │  ┌────────┐    │  ┌────────┐     │  ┌────────┐    │  ┌────────┐
           │  │ STORY  │    │  │ STORY  │     │  │ STORY  │    │  │ STORY  │
           │  │ oauth  │    │  │ guard- │     │  │ live-  │    │  │ audit- │
           │  │ provid │    │  │ rails  │     │  │ status │    │  │ detail │
           │  │   L    │    │  │   M    │     │  │   M    │    │  │   L    │
           │  └────────┘    │  └────────┘     │  └────────┘    │  └────────┘
           │                 │                 │                 │
  Wave 2   │                │  ┌────────┐     │  ┌────────┐    │
           │                │  │ STORY  │     │  │ STORY  │    │
           │                │  │ rule-  │     │  │ alert- │    │
           │                │  │ engine │     │  │ system │    │
           │                │  │   L    │     │  │   M    │    │
           │                │  └────────┘     │  └────────┘    │
           │                 │                 │                 │
 ─ ─ ─ ─ ─│─ ─ LAYER GATE ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ ─ │─ ─ ─ ─ ─ ─
           │                 │                 │                 │
═══════════╪═════════════════╪═════════════════╪═════════════════╪══════════════
 Layer 1.5 │  ALIGNMENT LAYER — multiple calibration types      │
           │                 │                 │                 │
           │  ┌──────────────────────────────────────────────┐  │
           │  │ ✦ visual-design (extension — new patterns)   │  │
           │  │ ✦ copy-tone    (establishment — brand voice) │  │
           │  │ ✦ api-surface  (light — inline YAML update)  │  │
           │  └──────────────────────────────────────────────┘  │
           │                 │                 │                 │
 ─ ─ ─ ─ ─│─ ─ RELEASE LINE ─ Layer 1 + 1.5 = second release ─│─ ─ ─ ─ ─ ─
           │                 │                 │                 │
           ▼                 ▼                 ▼                 ▼

LEGEND

  STRUCTURE                           EXECUTION
  Activity    = column (user verb)    Wave      = parallel batch (← →)
  Layer       = row (enrichment)      Story     = atomic work unit (one PR)
  Layer Gate  = integration test      Dispatch  = pick + lock + launch

  ALIGNMENT                           SYMBOLS
  .5 Layer    = human checkpoint      ◆  critical path story
  Calibration = capture "right"       ✦  calibrated story
  Quality Dim = what to calibrate     S/M/L  complexity

  SKILLS                              READING ORDER
  /envision  → activities + layers    left → right  = user journey
  /map       → stories + waves        top → down    = enrichment
  /calibrate → alignment decisions    ═══           = layer boundary
  /dispatch  → scores + launches      ─ ─           = gate / release line
  /reflect   → feedback → right phase

The Plugins

Each plugin implements one layer of the mental model.

1. Product Context — the persistent map

Captures the "what and why" of the entire product in a single product-context.yaml — committed to git, versioned, and machine-parseable.

/envision                        /map                            /reflect
    │                               │                               │
    ▼                               ▼                               ▼
Opportunity Brief               System Map                      Classify feedback:
"should we build this?"         "modules + interfaces"          bug → fix story
    │                               │                           refinement → next layer
    ▼                               ▼                           discovery → update map
Context Document                Story Graph                     shift → re-envision
"what exactly to build,         "layered work items,                │
 for whom, within               waves + slices"                     │
 what constraints"                  │                               │
    │                               ▼                               │
    │                           Agent Topology                      │
    │                           "roles + contracts"                 │
    │                               │                               │
    └───────────────┬───────────────┘                               │
                    │                                               │
                    ▼                                               │
               /dispatch                                            │
               "pick next story,          ◄─────────────────────────┘
                create OpenSpec change,     (new stories feed back
                route to /design"            into the dispatch queue)
                    │
                    ├─── integer layer ──► /design → /launch → /build → /wrap
                    │
                    └─── .5 alignment layer ──► /calibrate → human aligns
                                                  → /calibrate capture
                                                  → /dispatch → /launch → /build → /wrap

All sections live in one product-context.yaml file — opportunity, product, architecture, stories (with state machine), topology, layer gates, cost tracking, and a semantic changelog.

Why this exists: Without a product-level map, each feature is designed in isolation. Agents build incompatible pieces. Module boundaries are implicit. The YAML makes the whole system visible, machine-readable, and git-versioned before any code is written.

2. Feature Lifecycle — the execution cycle

Takes one story from the map and turns it into a merged PR. /dispatch picks the story; the two-session model executes it:

MAIN SESSION (you + AI)                WORKSPACE SESSION (agent alone)
━━━━━━━━━━━━━━━━━━━━━━                ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━

/dispatch
  pick story from YAML
  create OpenSpec change
         │
/design
  refine the spec
  (or skip if well-specified) ────►   /build
         │                              init tracking + jj change stack
/launch                                 implement each task
  create jj workspace                   code review (+ evaluator loop)
  bootstrap agent             ◄────     create PR, handle review
  optional: spawn evaluator             merge
         │                                     │
/wrap    ◄─────────────────────────────────────┘
  archive OpenSpec change
  update story status in YAML
  check layer gate
  suggest /reflect

Why two sessions: Design needs human judgment — you decide direction, scope, tradeoffs. Implementation is mechanical — the agent follows the spec, implements, tests, publishes. Separating them lets the agent work autonomously for hours while you do other things.

Why jj (not git): Changes are mutable until published. No staging area. Auto-rebase when editing earlier changes. jj workspace add gives each agent an isolated working copy with no extra disk space. The agent generates rough code, then cleans up with split/squash — a natural post-generation step.

3. Project Setup — the one-time foundation

Gets your machine and project ready. Run once.

/onboard                             /scaffold
    │                                    │
    ▼                                    ▼
Verify tools                         Scaffold monorepo
(jj, bun, git, gh,                   (Better-T-Stack: frontend,
 claude, openspec,                    backend, database, auth,
 tmux, cmux)                         API layer, addons)
    │                                    │
    ▼                                    ▼
Install plugins                      Initialize OpenSpec
(superpowers, agent-browser,         (explore/propose/apply/archive
 frontend-design, mgrep)             commands for spec-driven dev)

The Feedback Loop

The workflow is a loop, not a line. After shipping features, /reflect classifies what you learned:

                    ┌──────────────────────────────────┐
                    │                                  │
     ┌──────────── │ ◄── opportunity shift             │
     │              │      (back to /envision)          │
     │              │                                  │
     │  ┌───────── │ ◄── discovery                     │
     │  │           │      (update /envision or /map)   │
     │  │           │                                  │
     │  │  ┌────── │ ◄── refinement                    │
     │  │  │        │      (new story in next layer)    │
     │  │  │        │                                  │
     │  │  │  ┌─── │ ◄── polish                        │
     │  │  │  │     │      (.5 layer → /calibrate)      │
     │  │  │  │     │                                  │
     │  │  │  │  ┌─ │ ◄── bug                           │
     │  │  │  │  │  │      (fix story, back to /design) │
     │  │  │  │  │  │                                  │
     │  │  │  │  │  │ ◄── process                       │
     │  │  │  │  │  │      (workflow improvement)       │
     │  │  │  │  │  │                                  │
     │  │  │  │  │  └──────────────────────────────────┘
     │  │  │  │  │           /reflect
     ▼  ▼  ▼  ▼  ▼
  Each feedback type routes to the right phase.
  "Polish" is now "Calibration" — covers visual design,
  UX flow, API surface, data model, copy/tone, scope,
  and performance quality dimensions.
  The product context evolves. The cycle continues.

Human Alignment Layers

Agents build to spec, but specs are lossy compressions of human intent. After each implementation layer, optional .5 alignment layers let the human recalibrate what "right" means across any quality dimension:

Layer 0 (walking skeleton)
  → /calibrate visual-design → human explores with design tools → capture
  → Layer 0.5 (alignment: implement with calibrated design context)
Layer 1 (core features)
  → /calibrate api-surface   → 30-min conversation → updates product-context.yaml
  → /calibrate copy-tone     → establish brand voice → calibration/copy-tone.yaml
  → Layer 1.5 (alignment: extend design system + apply voice)

The /calibrate skill supports 7 dimensions — visual-design, ux-flow, api-surface, data-model, scope-direction, copy-tone, performance-quality — split into two classes:

Heavy (visual-design, ux-flow, copy-tone): external exploration, standalone YAML artifacts in calibration/
Light (api-surface, data-model, scope-direction, performance-quality): 30-60 min conversation, updates product-context.yaml directly

Quality dimensions are declared during /envision and checked by /reflect after each layer.

Institutional Memory

Workspace agents capture what they learn during builds — solutions discovered, errors encountered, missing docs — in .dev-workflow/lessons.md. When /wrap archives the workspace, substantive lessons are persisted to lessons-learned/ at the repo root. /launch injects relevant prior lessons into bootstrap prompts, so the next agent building in the same module doesn't start from zero.

Design Principles

These aren't rules we invented — they're patterns extracted from Anthropic's engineering research on long-running agent harnesses:

Spec precision over implementation speed. Time invested in unambiguous specs pays back exponentially across parallel agents. A 10-minute conversation in /envision saves hours of agent confusion.

Walking skeleton first. Build the thinnest end-to-end path (Layer 0) before going deep into any module. Validate the architecture at minimum cost. Going deep before proving the skeleton works is the most expensive mistake.

Every harness component earns its place. Sprint contracts, verification JSON, signal files, evaluator agents — each exists because of a specific failure mode observed in practice. As models improve, stress-test each component and remove what's no longer needed.

Generator-evaluator separation. Agents praise their own work even when it's mediocre. A separate evaluator agent, calibrated toward skepticism, catches problems the builder missed. This is the single most durable pattern from Anthropic's research.

Getting Started

New to this plugin?

/onboard

Installs prerequisites, verifies tools, configures recommended plugins.

Have a product idea?

/envision  →  /map  →  /scaffold

Validate the opportunity, decompose into stories, scaffold the project.

Ready to build a feature?

/dispatch  →  /design  →  /launch  →  /build  →  /wrap

Pick a story from the map, spec it, spawn the agent, let it build, archive when merged.

Want hands-free autonomous mode?

/autopilot

One command. Autopilot dispatches, launches, monitors, reviews, merges, and wraps — pausing only when human design input is needed.

Shipped something? Close the loop:

/reflect

Classify feedback, update the product context, plan the next iteration.

Something feels off? Calibrate:

/calibrate visual-design    → design brief → external tools → /calibrate capture
/calibrate api-surface      → conversation → updates product-context.yaml
/calibrate scope-direction  → conversation → updates product-context.yaml

Generate a dimension-specific brief, explore or discuss, capture decisions for agents to follow.

All Skills

Skill	Plugin	Purpose
`/envision`	product-context	Opportunity brief + context document
`/map`	product-context	System map + story graph + agent topology
`/dispatch`	product-context	Pick next story + create OpenSpec change
`/calibrate`	product-context	Human alignment checkpoint for any quality dimension
`/reflect`	product-context	Classify feedback + update context
`/onboard`	project-setup	Verify tools + install plugins
`/scaffold`	project-setup	Scaffold monorepo + initialize OpenSpec
`/design`	agentic-development-workflow	Explore + propose + review a feature
`/launch`	agentic-development-workflow	Spawn workspace + optional evaluator
`/build`	agentic-development-workflow	Implement → test → PR → merge
`/wrap`	agentic-development-workflow	Archive + cleanup + suggest reflect
`/jj-ref`	agentic-development-workflow	jj command reference (on-demand)
`/gen-eval`	patterns	Generator/evaluator separation for honest validation
`/autopilot`	patterns	Autonomous dispatch-launch-monitor-wrap loop via `/loop`

Documentation

Glossary — Ubiquitous Language — precise definitions for every AEP term
Skills Quick Reference — when to use which skill, decision trees, common sequences
Autonomous Loop — how /autopilot orchestrates the full cycle
Generator/Evaluator Data Flow — the three tracking systems and signal protocol
Release Line Adjustments — when and how to re-slice layers
Design Calibration Workflow — the original visual-design /calibrate skill
Generalized Calibration Workflow — multi-dimension /calibrate and .5 alignment layers
AEP v2 Lesson Learning — structural improvements from Layer 0 post-mortem

Syncing Skills to Your Project

A sync script is included to copy AEP skills into any project's .claude/skills/ directory with the aep- prefix.

Setup

Copy scripts/sync.sh to your project's scripts/ directory
Set AEP_REPO to point to your local clone of this repo

Usage

# Sync all skills
AEP_REPO=~/agentic-engineering-patterns bash scripts/sync.sh

# Preview changes without modifying files
bash scripts/sync.sh --dry-run

# Sync only one group (workflow, product, setup, patterns)
bash scripts/sync.sh workflow

# Override target directory
TARGET_DIR=./my-skills bash scripts/sync.sh

The script flattens the nested skill directories and prefixes each with aep- (e.g., skills/product-context/envision/ becomes .claude/skills/aep-envision/). Run it whenever you want to pull the latest skill versions.

Push Mode (sync-downstream)

Push skills from the AEP repo to all registered downstream projects at once.

# One-time setup: create the config
bash scripts/sync-downstream.sh --init

# Edit .aep/config.yaml with your project paths
# Then push to all projects:
bash scripts/sync-downstream.sh

# Preview changes:
bash scripts/sync-downstream.sh --dry-run

# Push to one project (name match):
bash scripts/sync-downstream.sh 91app

The config file (.aep/config.yaml) is gitignored — paths are machine-local. Each entry specifies the project path and optionally which skill groups to sync.

Inspired By

Harness Design for Long-Running Application Development — Anthropic Engineering
Effective Harnesses for Long-Running Agents — Anthropic Engineering
Effective Context Engineering for AI Agents — Anthropic Engineering
Better-T-Stack — Full-stack TypeScript scaffold engine
OpenSpec — Spec-driven development CLI
User Story Mapping — Jeff Patton (walking skeleton, layered delivery)

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
.claude-plugin		.claude-plugin
apps		apps
docs		docs
packages		packages
scripts		scripts
skills		skills
.gitignore		.gitignore
.oxfmtrc.json		.oxfmtrc.json
.oxlintrc.json		.oxlintrc.json
LICENSE		LICENSE
README.md		README.md
bts.jsonc		bts.jsonc
bun.lock		bun.lock
lefthook.yml		lefthook.yml
package.json		package.json
tsconfig.json		tsconfig.json
turbo.json		turbo.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic Engineering Patterns

Why This Exists

The Mental Model

The Story Map

The Plugins

1. Product Context — the persistent map

2. Feature Lifecycle — the execution cycle

3. Project Setup — the one-time foundation

The Feedback Loop

Human Alignment Layers

Institutional Memory

Design Principles

Getting Started

All Skills

Documentation

Syncing Skills to Your Project

Setup

Usage

Push Mode (sync-downstream)

Inspired By

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agentic Engineering Patterns

Why This Exists

The Mental Model

The Story Map

The Plugins

1. Product Context — the persistent map

2. Feature Lifecycle — the execution cycle

3. Project Setup — the one-time foundation

The Feedback Loop

Human Alignment Layers

Institutional Memory

Design Principles

Getting Started

All Skills

Documentation

Syncing Skills to Your Project

Setup

Usage

Push Mode (sync-downstream)

Inspired By

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages