Agentic Quality Engineering Fleet

Release Notes | Changelog | Issues | Discussions

AI-powered quality engineering agents that generate tests, find coverage gaps, detect flaky tests, and learn your codebase patterns — across 11 coding agent platforms.

What AQE Does For You

Generates comprehensive tests automatically — unit, integration, property-based, and BDD scenarios for your codebase with framework-specific output (Jest, Vitest, Playwright, Cypress, pytest, JUnit, Go, Rust, Swift, Flutter, and more)
Finds coverage gaps and prioritizes what to test — risk-weighted analysis identifies the most impactful untested code paths
Detects and fixes flaky tests — ML-powered detection with root cause analysis and stabilization recommendations
Learns your codebase patterns over time — remembered patterns are reused across sessions and projects, improving with every interaction
Coordinates 60 specialized QE agents — from test generation to security scanning to chaos engineering, orchestrated by a central coordinator
Reduces AI costs with intelligent routing — automatically routes tasks to the right model tier (fast/cheap for simple tasks, powerful for complex ones)
Works with your existing tools — integrates with 11 coding agent platforms and your existing CI/CD pipeline

Quick Start

# Install
npm install -g agentic-qe

# Initialize your project (auto-detects tech stack, configures MCP)
cd your-project && aqe init --auto

# That's it — MCP tools are available immediately in Claude Code
# For other clients: aqe-mcp

After init, your coding agent can use AQE tools directly. For example in Claude Code:

"Generate tests for src/services/UserService.ts with 90% coverage target"
"Find coverage gaps in src/ and prioritize by risk"
"Run security scan on the authentication module"
"Analyze why tests in auth/ are flaky and suggest fixes"

Platform Support

AQE works with 11 coding agent platforms through a single MCP server:

Platform	Setup
Claude Code	`aqe init --auto` (built-in)
GitHub Copilot	`aqe init --auto --with-copilot`
Cursor	`aqe init --auto --with-cursor`
Cline	`aqe init --auto --with-cline`
OpenCode	`aqe init --auto --with-opencode`
AWS Kiro	`aqe init --auto --with-kiro`
Kilo Code	`aqe init --auto --with-kilocode`
Roo Code	`aqe init --auto --with-roocode`
OpenAI Codex CLI	`aqe init --auto --with-codex`
Windsurf	`aqe init --auto --with-windsurf`
Continue.dev	`aqe init --auto --with-continuedev`

# Set up all platforms at once
aqe init --auto --with-all-platforms

# Or add a platform later
aqe platform setup cursor
aqe platform list       # show install status
aqe platform verify cursor  # validate config

For detailed per-platform instructions, see Platform Setup Guide.

Usage Examples

Generate Tests

claude "Use qe-test-architect to create tests for PaymentService with 95% coverage target"

Output:

Generated 48 tests across 4 files
- unit/PaymentService.test.ts (32 unit tests)
- property/PaymentValidation.property.test.ts (8 property tests)
- integration/PaymentFlow.integration.test.ts (8 integration tests)
Coverage: 96.2%
Pattern reuse: 78% from learned patterns

Full Quality Pipeline

claude "Use qe-queen-coordinator to run full quality assessment:
1. Generate tests for src/services/*.ts
2. Analyze coverage gaps with risk scoring
3. Run security scan
4. Validate quality gate at 90% threshold
5. Provide deployment recommendation"

The Queen Coordinator spawns domain-specific agents, runs them in parallel, and synthesizes a final recommendation.

TDD Workflow

claude "Use qe-tdd-specialist to implement UserAuthentication with full RED-GREEN-REFACTOR cycle"

Coordinates 5 subagents: write failing tests → implement minimal code → refactor → code review → security review.

Security Audit

claude "Coordinate security audit:
- SAST/DAST scanning with qe-security-scanner
- Dependency vulnerability scanning with qe-dependency-mapper
- API security with qe-contract-validator
- Chaos resilience testing with qe-chaos-engineer"

60 QE Agents

The fleet is organized into 13 domains, coordinated by the qe-queen-coordinator:

Domain	Agents	What They Do
Test Generation	test-architect, tdd-specialist, mutation-tester, property-tester	Generate tests, TDD workflows, validate test effectiveness
Test Execution	parallel-executor, retry-handler, integration-tester	Run tests in parallel, handle retries, integration testing
Coverage Analysis	coverage-specialist, gap-detector	Find untested code, prioritize by risk
Quality Assessment	quality-gate, risk-assessor, deployment-advisor, devils-advocate	Go/no-go decisions, risk scoring, adversarial review
Defect Intelligence	defect-predictor, root-cause-analyzer, flaky-hunter, regression-analyzer	Predict bugs, find root causes, fix flaky tests
Requirements	requirements-validator, bdd-generator	Validate testability, generate BDD scenarios
Code Intelligence	code-intelligence, kg-builder, dependency-mapper, impact-analyzer	Knowledge graphs, semantic search, change impact
Security	security-scanner, security-auditor, pentest-validator	SAST/DAST, compliance audits, exploit validation
Contracts	contract-validator, graphql-tester	API contracts, GraphQL schema testing
Visual & A11y	visual-tester, accessibility-auditor, responsive-tester	Visual regression, WCAG compliance, viewport testing
Chaos & Performance	chaos-engineer, load-tester, performance-tester	Fault injection, load testing, performance validation
Learning	learning-coordinator, pattern-learner, transfer-specialist, metrics-optimizer	Cross-project learning, pattern discovery
Enterprise	soap-tester, sap-rfc-tester, sap-idoc-tester, sod-analyzer, odata-contract-tester, middleware-validator, message-broker-tester	SAP, SOAP, ESB, OData, JMS/AMQP/Kafka

Plus 7 TDD subagents (red, green, refactor, code/integration/performance/security reviewers) and the fleet-commander for large-scale orchestration.

74 QE Skills

Agents automatically apply relevant skills from the skill library. Skills are rated by trust tier:

Tier	Count	Meaning
Tier 3 — Verified	48	Full evaluation test suite, production-ready
Tier 2 — Validated	7	Has executable validator
Tier 1 — Structured	5	Has JSON output schema
Tier 0 — Advisory	5	Guidance only

View all 74 skills

Core Testing (12): agentic-quality-engineering, holistic-testing-pact, context-driven-testing, tdd-london-chicago, xp-practices, risk-based-testing, test-automation-strategy, refactoring-patterns, shift-left-testing, shift-right-testing, regression-testing, verification-quality

Specialized Testing (13): accessibility-testing, mobile-testing, database-testing, contract-testing, chaos-engineering-resilience, visual-testing-advanced, security-visual-testing, compliance-testing, compatibility-testing, localization-testing, mutation-testing, performance-testing, security-testing

Domain Skills (14): qe-test-generation, qe-test-execution, qe-coverage-analysis, qe-quality-assessment, qe-defect-intelligence, qe-requirements-validation, qe-code-intelligence, qe-security-compliance, qe-contract-testing, qe-visual-accessibility, qe-chaos-resilience, qe-learning-optimization, qe-iterative-loop, aqe-v2-v3-migration

Strategic (8): six-thinking-hats, brutal-honesty-review, sherlock-review, cicd-pipeline-qe-orchestrator, bug-reporting-excellence, consultancy-practices, quality-metrics, pair-programming

Testing Techniques (9): exploratory-testing-advanced, test-design-techniques, test-data-management, test-environment-management, test-reporting-analytics, testability-scoring, technical-writing, code-review-quality, api-testing-patterns

n8n Workflow Testing (5): n8n-workflow-testing-fundamentals, n8n-expression-testing, n8n-security-testing, n8n-trigger-testing-strategies, n8n-integration-testing-patterns

QCSD Swarms (5): qcsd-ideation-swarm, qcsd-refinement-swarm, qcsd-development-swarm, qcsd-cicd-swarm, qcsd-production-swarm

Accessibility (2): a11y-ally, accessibility-testing

Enterprise Integration (5): enterprise-integration-testing, middleware-testing-patterns, observability-testing-patterns, wms-testing-patterns, pentest-validation

Validation (1): validation-pipeline

How It Works

Agent Coordination

The Queen Coordinator orchestrates agents across all 13 domains. When you ask for a quality assessment, the Queen decomposes the task, spawns the right agents, coordinates their work in parallel, and synthesizes results. Agents communicate through shared memory namespaces and use consensus protocols for critical quality decisions.

Pattern Learning

AQE learns from every interaction. Successful test patterns, coverage strategies, and defect indicators are stored and indexed for fast retrieval. When generating tests for a new service, AQE searches for similar patterns from past sessions — even across different projects. Patterns improve over time through experience replay and dream cycles (background consolidation).

aqe learning stats      # view learning statistics
aqe learning dream      # trigger pattern consolidation
aqe brain export        # export learned patterns for sharing

Intelligent Model Routing

TinyDancer routes tasks to the right model tier to minimize cost without sacrificing quality:

Task Complexity	Model	Examples
Simple (0-20)	Haiku	Type additions, simple refactors
Moderate (20-70)	Sonnet	Bug fixes, test generation
Critical (70+)	Opus	Architecture, security, complex reasoning

Quality Gates

Anti-sycophancy scoring catches hollow tests. Tautological assertions (expect(true).toBe(true)) are rejected. Edge cases from historical patterns are injected into test generation. See Loki-mode features.

CLI Reference

aqe init [--auto]              # Initialize project
aqe agent list                 # List available agents
aqe fleet status               # Fleet health and coordination
aqe learning stats             # Learning statistics
aqe learning dream             # Trigger dream cycle
aqe brain export/import        # Portable intelligence
aqe platform list/setup/verify # Manage coding agent platforms
aqe health                     # System health check

LLM Providers

Provider	Type	Cost	Best For
Ollama	Local	Free	Privacy, offline
OpenRouter	Cloud	Varies	300+ models
Groq	Cloud	Free	High-speed
Claude API	Cloud	Paid	Highest quality
Google AI	Cloud	Free	Gemini models

export GROQ_API_KEY="gsk_..."  # or any supported provider
aqe init --auto

Documentation

Guide	Description
Platform Setup	Per-platform configuration instructions
Skill Validation	Trust tiers and evaluation system
Learning System	ReasoningBank pattern learning
Code Intelligence	Knowledge graph and semantic search
Loki-mode Features	Anti-sycophancy and quality gates
Release Notes	Version history and changelogs
Architecture Glossary	Technical terms and concepts

Development

git clone https://github.com/proffesor-for-testing/agentic-qe.git
cd agentic-qe
npm install
npm run build
npm test -- --run

Script	Description
`npm run build`	Compile TypeScript + CLI + MCP bundles
`npm test -- --run`	Run all tests
`npm run cli`	Run CLI in dev mode
`npm run mcp`	Start MCP server

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.

Support

Documentation: docs/
Issues: GitHub Issues
Discussions: GitHub Discussions

License

MIT — see LICENSE.

Contributors

@proffesor-for-testing Project Lead	@fndlalit QX Partner, Testability	@shaal Core Development	@mondweep Architecture

View all contributors | Become a contributor

Support the Project

If you find AQE valuable, consider supporting its development:

	Monthly	Annual (Save $10)
Price	$5/month	$50/year
Subscribe	Monthly	Annual

View sponsorship details

Acknowledgments

Claude Flow by @ruvnet — Multi-agent orchestration and MCP integration
Agentic Flow by @ruvnet — Agent patterns and learning systems
Built with TypeScript, Node.js, and better-sqlite3
Compatible with Jest, Cypress, Playwright, Vitest, Mocha, pytest, JUnit, and more

Made with care by the Agentic QE Team

Star us on GitHub | Sponsor | Contributors

Name		Name	Last commit message	Last commit date
Latest commit History 1,164 Commits
.agentic-qe		.agentic-qe
.claude		.claude
.devcontainer		.devcontainer
.github		.github
.husky		.husky
.kiro		.kiro
.opencode		.opencode
.ruvector		.ruvector
assets		assets
benchmarks		benchmarks
docs		docs
examples		examples
packages		packages
reports		reports
scripts		scripts
src		src
tests		tests
.env.example		.env.example
.eslintrc.js		.eslintrc.js
.gitignore		.gitignore
.npmignore		.npmignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
Dockerfile		Dockerfile
FUNDING.md		FUNDING.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
package-lock.json		package-lock.json
package.json		package.json
tsconfig.json		tsconfig.json
vitest.config.ts		vitest.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agentic Quality Engineering Fleet

What AQE Does For You

Quick Start

Platform Support

Usage Examples

Generate Tests

Full Quality Pipeline

TDD Workflow

Security Audit

60 QE Agents

74 QE Skills

How It Works

Agent Coordination

Pattern Learning

Intelligent Model Routing

Quality Gates

CLI Reference

LLM Providers

Documentation

Development

Contributing

Support

License

Contributors

Support the Project

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Agentic Quality Engineering Fleet

What AQE Does For You

Quick Start

Platform Support

Usage Examples

Generate Tests

Full Quality Pipeline

TDD Workflow

Security Audit

60 QE Agents

74 QE Skills

How It Works

Agent Coordination

Pattern Learning

Intelligent Model Routing

Quality Gates

CLI Reference

LLM Providers

Documentation

Development

Contributing

Support

License

Contributors

Support the Project

Acknowledgments

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages