
Thor011/agentic-qe

 
 


Agentic Quality Engineering Fleet


Release Notes | Changelog | Issues | Discussions

AI-powered quality engineering agents that generate tests, find coverage gaps, detect flaky tests, and learn your codebase patterns — across 11 coding agent platforms.


What AQE Does For You

  • Generates comprehensive tests automatically — unit, integration, property-based, and BDD scenarios for your codebase with framework-specific output (Jest, Vitest, Playwright, Cypress, pytest, JUnit, Go, Rust, Swift, Flutter, and more)
  • Finds coverage gaps and prioritizes what to test — risk-weighted analysis identifies the most impactful untested code paths
  • Detects and fixes flaky tests — ML-powered detection with root cause analysis and stabilization recommendations
  • Learns your codebase patterns over time — remembered patterns are reused across sessions and projects, improving with every interaction
  • Coordinates 60 specialized QE agents — from test generation to security scanning to chaos engineering, orchestrated by a central coordinator
  • Reduces AI costs with intelligent routing — automatically routes tasks to the right model tier (fast/cheap for simple tasks, powerful for complex ones)
  • Works with your existing tools — integrates with 11 coding agent platforms and your existing CI/CD pipeline

Quick Start

# Install
npm install -g agentic-qe

# Initialize your project (auto-detects tech stack, configures MCP)
cd your-project && aqe init --auto

# That's it — MCP tools are available immediately in Claude Code
# For other clients: aqe-mcp

After init, your coding agent can use AQE tools directly. For example in Claude Code:

"Generate tests for src/services/UserService.ts with 90% coverage target"
"Find coverage gaps in src/ and prioritize by risk"
"Run security scan on the authentication module"
"Analyze why tests in auth/ are flaky and suggest fixes"

Platform Support

AQE works with 11 coding agent platforms through a single MCP server:

| Platform | Setup |
|---|---|
| Claude Code | aqe init --auto (built-in) |
| GitHub Copilot | aqe init --auto --with-copilot |
| Cursor | aqe init --auto --with-cursor |
| Cline | aqe init --auto --with-cline |
| OpenCode | aqe init --auto --with-opencode |
| AWS Kiro | aqe init --auto --with-kiro |
| Kilo Code | aqe init --auto --with-kilocode |
| Roo Code | aqe init --auto --with-roocode |
| OpenAI Codex CLI | aqe init --auto --with-codex |
| Windsurf | aqe init --auto --with-windsurf |
| Continue.dev | aqe init --auto --with-continuedev |

# Set up all platforms at once
aqe init --auto --with-all-platforms

# Or add a platform later
aqe platform setup cursor
aqe platform list       # show install status
aqe platform verify cursor  # validate config

For detailed per-platform instructions, see Platform Setup Guide.


Usage Examples

Generate Tests

claude "Use qe-test-architect to create tests for PaymentService with 95% coverage target"

Output:

Generated 48 tests across 4 files
- unit/PaymentService.test.ts (32 unit tests)
- property/PaymentValidation.property.test.ts (8 property tests)
- integration/PaymentFlow.integration.test.ts (8 integration tests)
Coverage: 96.2%
Pattern reuse: 78% from learned patterns

Full Quality Pipeline

claude "Use qe-queen-coordinator to run full quality assessment:
1. Generate tests for src/services/*.ts
2. Analyze coverage gaps with risk scoring
3. Run security scan
4. Validate quality gate at 90% threshold
5. Provide deployment recommendation"

The Queen Coordinator spawns domain-specific agents, runs them in parallel, and synthesizes a final recommendation.

TDD Workflow

claude "Use qe-tdd-specialist to implement UserAuthentication with full RED-GREEN-REFACTOR cycle"

The TDD specialist coordinates 5 subagents in sequence: write failing tests → implement minimal code → refactor → code review → security review.

Security Audit

claude "Coordinate security audit:
- SAST/DAST scanning with qe-security-scanner
- Dependency vulnerability scanning with qe-dependency-mapper
- API security with qe-contract-validator
- Chaos resilience testing with qe-chaos-engineer"

60 QE Agents

The fleet is organized into 13 domains, coordinated by the qe-queen-coordinator:

| Domain | Agents | What They Do |
|---|---|---|
| Test Generation | test-architect, tdd-specialist, mutation-tester, property-tester | Generate tests, TDD workflows, validate test effectiveness |
| Test Execution | parallel-executor, retry-handler, integration-tester | Run tests in parallel, handle retries, integration testing |
| Coverage Analysis | coverage-specialist, gap-detector | Find untested code, prioritize by risk |
| Quality Assessment | quality-gate, risk-assessor, deployment-advisor, devils-advocate | Go/no-go decisions, risk scoring, adversarial review |
| Defect Intelligence | defect-predictor, root-cause-analyzer, flaky-hunter, regression-analyzer | Predict bugs, find root causes, fix flaky tests |
| Requirements | requirements-validator, bdd-generator | Validate testability, generate BDD scenarios |
| Code Intelligence | code-intelligence, kg-builder, dependency-mapper, impact-analyzer | Knowledge graphs, semantic search, change impact |
| Security | security-scanner, security-auditor, pentest-validator | SAST/DAST, compliance audits, exploit validation |
| Contracts | contract-validator, graphql-tester | API contracts, GraphQL schema testing |
| Visual & A11y | visual-tester, accessibility-auditor, responsive-tester | Visual regression, WCAG compliance, viewport testing |
| Chaos & Performance | chaos-engineer, load-tester, performance-tester | Fault injection, load testing, performance validation |
| Learning | learning-coordinator, pattern-learner, transfer-specialist, metrics-optimizer | Cross-project learning, pattern discovery |
| Enterprise | soap-tester, sap-rfc-tester, sap-idoc-tester, sod-analyzer, odata-contract-tester, middleware-validator, message-broker-tester | SAP, SOAP, ESB, OData, JMS/AMQP/Kafka |

Plus 7 TDD subagents (red, green, refactor, code/integration/performance/security reviewers) and the fleet-commander for large-scale orchestration.


74 QE Skills

Agents automatically apply relevant skills from the skill library. Skills are rated by trust tier:

| Tier | Count | Meaning |
|---|---|---|
| Tier 3 - Verified | 48 | Full evaluation test suite, production-ready |
| Tier 2 - Validated | 7 | Has executable validator |
| Tier 1 - Structured | 5 | Has JSON output schema |
| Tier 0 - Advisory | 5 | Guidance only |

View all 74 skills

Core Testing (12): agentic-quality-engineering, holistic-testing-pact, context-driven-testing, tdd-london-chicago, xp-practices, risk-based-testing, test-automation-strategy, refactoring-patterns, shift-left-testing, shift-right-testing, regression-testing, verification-quality

Specialized Testing (13): accessibility-testing, mobile-testing, database-testing, contract-testing, chaos-engineering-resilience, visual-testing-advanced, security-visual-testing, compliance-testing, compatibility-testing, localization-testing, mutation-testing, performance-testing, security-testing

Domain Skills (14): qe-test-generation, qe-test-execution, qe-coverage-analysis, qe-quality-assessment, qe-defect-intelligence, qe-requirements-validation, qe-code-intelligence, qe-security-compliance, qe-contract-testing, qe-visual-accessibility, qe-chaos-resilience, qe-learning-optimization, qe-iterative-loop, aqe-v2-v3-migration

Strategic (8): six-thinking-hats, brutal-honesty-review, sherlock-review, cicd-pipeline-qe-orchestrator, bug-reporting-excellence, consultancy-practices, quality-metrics, pair-programming

Testing Techniques (9): exploratory-testing-advanced, test-design-techniques, test-data-management, test-environment-management, test-reporting-analytics, testability-scoring, technical-writing, code-review-quality, api-testing-patterns

n8n Workflow Testing (5): n8n-workflow-testing-fundamentals, n8n-expression-testing, n8n-security-testing, n8n-trigger-testing-strategies, n8n-integration-testing-patterns

QCSD Swarms (5): qcsd-ideation-swarm, qcsd-refinement-swarm, qcsd-development-swarm, qcsd-cicd-swarm, qcsd-production-swarm

Accessibility (2): a11y-ally, accessibility-testing

Enterprise Integration (5): enterprise-integration-testing, middleware-testing-patterns, observability-testing-patterns, wms-testing-patterns, pentest-validation

Validation (1): validation-pipeline


How It Works

Agent Coordination

The Queen Coordinator orchestrates agents across all 13 domains. When you ask for a quality assessment, the Queen decomposes the task, spawns the right agents, coordinates their work in parallel, and synthesizes results. Agents communicate through shared memory namespaces and use consensus protocols for critical quality decisions.
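
The decompose, run-in-parallel, synthesize flow described above can be sketched roughly as follows. This is a minimal illustration only: `DomainAgent`, `Finding`, `Assessment`, and `coordinate` are hypothetical names, not AQE's actual API, and the "every agent must pass" rule is a simple stand-in for whatever consensus protocol the fleet really uses.

```typescript
// Hypothetical sketch: these types and names are illustrative, not AQE's API.
type Finding = { domain: string; passed: boolean; summary: string };
type DomainAgent = { domain: string; run: (task: string) => Promise<Finding> };
type Assessment = { recommendation: "go" | "no-go"; findings: Finding[] };

async function coordinate(task: string, agents: DomainAgent[]): Promise<Assessment> {
  // Start every agent at once and wait until all of them have reported.
  const findings = await Promise.all(agents.map((agent) => agent.run(task)));

  // Assumed stand-in for consensus: recommend "go" only if every agent passed.
  const recommendation = findings.every((f) => f.passed) ? "go" : "no-go";

  return { recommendation, findings };
}
```

Because `Promise.all` rejects as soon as any agent throws, a production coordinator would probably want per-agent error handling; that is left out here to keep the sketch short.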

Pattern Learning

AQE learns from every interaction. Successful test patterns, coverage strategies, and defect indicators are stored and indexed for fast retrieval. When generating tests for a new service, AQE searches for similar patterns from past sessions — even across different projects. Patterns improve over time through experience replay and dream cycles (background consolidation).

aqe learning stats      # view learning statistics
aqe learning dream      # trigger pattern consolidation
aqe brain export        # export learned patterns for sharing
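
To make the idea of storing patterns and retrieving similar ones concrete, here is a small sketch. It assumes patterns are described by tags and compared with Jaccard similarity; the README does not say how AQE actually indexes or scores patterns, so `PatternStore`, `Pattern`, and `similarity` are hypothetical names chosen for illustration.

```typescript
// Hypothetical sketch: these types and names are illustrative, not AQE's API.
type Pattern = { id: string; tags: string[]; uses: number; successes: number };

// Remove duplicate tags so they do not inflate the overlap score.
function unique(tags: string[]): string[] {
  return tags.filter((tag, i) => tags.indexOf(tag) === i);
}

// Jaccard similarity: shared tags divided by all distinct tags.
function similarity(a: string[], b: string[]): number {
  const ua = unique(a);
  const ub = unique(b);
  const shared = ua.filter((tag) => ub.indexOf(tag) !== -1).length;
  const union = ua.length + ub.length - shared;
  return union === 0 ? 0 : shared / union;
}

class PatternStore {
  private patterns: Pattern[] = [];

  // Record one use of a pattern and whether it led to a good outcome.
  record(id: string, tags: string[], succeeded: boolean): void {
    let pattern: Pattern | undefined = this.patterns.filter((p) => p.id === id)[0];
    if (!pattern) {
      pattern = { id, tags, uses: 0, successes: 0 };
      this.patterns.push(pattern);
    }
    pattern.uses += 1;
    if (succeeded) pattern.successes += 1;
  }

  // Return the patterns most similar to the query, best match first.
  // Ties are broken by historical success rate.
  findSimilar(tags: string[], limit = 3): Pattern[] {
    const rate = (p: Pattern) => p.successes / p.uses;
    return this.patterns
      .map((p) => ({ p, score: similarity(tags, p.tags) }))
      .filter((entry) => entry.score > 0)
      .sort((x, y) => y.score - x.score || rate(y.p) - rate(x.p))
      .slice(0, limit)
      .map((entry) => entry.p);
  }
}
```

Tracking `uses` and `successes` per pattern is one simple way to let results "improve over time": patterns that keep working rise in the ranking when similarity scores tie.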

Intelligent Model Routing

TinyDancer routes tasks to the right model tier to minimize cost without sacrificing quality:

| Task Complexity | Model | Examples |
|---|---|---|
| Simple (0-20) | Haiku | Type additions, simple refactors |
| Moderate (20-70) | Sonnet | Bug fixes, test generation |
| Critical (70+) | Opus | Architecture, security, complex reasoning |
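
The tier table can be read as a simple threshold function. The sketch below is an assumption about how such routing might look, not TinyDancer's actual implementation: `routeByComplexity` is a hypothetical name, and because the published ranges overlap at 20, it assumes a boundary score goes to the higher tier (consistent with "70+" mapping to Opus).

```typescript
// Hypothetical sketch: not TinyDancer's real API.
type ModelTier = "Haiku" | "Sonnet" | "Opus";

function routeByComplexity(score: number): ModelTier {
  if (!isFinite(score) || score < 0) {
    throw new RangeError("complexity score must be a non-negative number");
  }
  // Assumption: boundary values (20, 70) are routed to the higher tier.
  if (score < 20) return "Haiku";
  if (score < 70) return "Sonnet";
  return "Opus";
}
```

For example, under these assumptions `routeByComplexity(12)` picks Haiku, while `routeByComplexity(70)` picks Opus.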

Quality Gates

Anti-sycophancy scoring catches hollow tests. Tautological assertions (expect(true).toBe(true)) are rejected. Edge cases from historical patterns are injected into test generation. See Loki-mode features.
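
As a rough illustration of rejecting tautological assertions, the sketch below flags `expect(<literal>)` calls that are compared against the very same literal. It is a regex-based approximation written for this README, not AQE's actual anti-sycophancy scoring; `findTautologies` is a hypothetical helper, and it only handles the `toBe`, `toEqual`, and `toStrictEqual` matchers.

```typescript
// Hypothetical sketch: a regex approximation, not AQE's real quality gate.
// Group 1 captures a literal (boolean, null, undefined, number, or quoted
// string); the backreference \1 requires the matcher argument to be identical.
const TAUTOLOGY =
  /expect\(\s*(true|false|null|undefined|-?\d+(?:\.\d+)?|(["'`])[^"'`]*\2)\s*\)\s*\.\s*(?:toBe|toEqual|toStrictEqual)\(\s*\1\s*\)/g;

// Return every tautological assertion found in the given test source.
function findTautologies(source: string): string[] {
  return source.match(TAUTOLOGY) || [];
}
```

An assertion such as `expect(true).toBe(true)` is reported, while `expect(total).toBe(42)` is not, because its subject is an expression rather than a literal. A real gate would likely work on the parsed syntax tree instead of raw text.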


CLI Reference

aqe init [--auto]              # Initialize project
aqe agent list                 # List available agents
aqe fleet status               # Fleet health and coordination
aqe learning stats             # Learning statistics
aqe learning dream             # Trigger dream cycle
aqe brain export/import        # Portable intelligence
aqe platform list/setup/verify # Manage coding agent platforms
aqe health                     # System health check

LLM Providers

| Provider | Type | Cost | Best For |
|---|---|---|---|
| Ollama | Local | Free | Privacy, offline |
| OpenRouter | Cloud | Varies | 300+ models |
| Groq | Cloud | Free | High-speed |
| Claude API | Cloud | Paid | Highest quality |
| Google AI | Cloud | Free | Gemini models |

export GROQ_API_KEY="gsk_..."  # or any supported provider
aqe init --auto

Documentation

| Guide | Description |
|---|---|
| Platform Setup | Per-platform configuration instructions |
| Skill Validation | Trust tiers and evaluation system |
| Learning System | ReasoningBank pattern learning |
| Code Intelligence | Knowledge graph and semantic search |
| Loki-mode Features | Anti-sycophancy and quality gates |
| Release Notes | Version history and changelogs |
| Architecture Glossary | Technical terms and concepts |

Development

git clone https://github.com/proffesor-for-testing/agentic-qe.git
cd agentic-qe
npm install
npm run build
npm test -- --run

| Script | Description |
|---|---|
| npm run build | Compile TypeScript + CLI + MCP bundles |
| npm test -- --run | Run all tests |
| npm run cli | Run CLI in dev mode |
| npm run mcp | Start MCP server |

Contributing

We welcome contributions! Please see CONTRIBUTING.md for details.




License

MIT — see LICENSE.


Contributors


@proffesor-for-testing
Project Lead

@fndlalit
QX Partner, Testability

@shaal
Core Development

@mondweep
Architecture

View all contributors | Become a contributor


Support the Project

If you find AQE valuable, consider supporting its development:

|  | Monthly | Annual (Save $10) |
|---|---|---|
| Price | $5/month | $50/year |
| Subscribe | Monthly | Annual |

View sponsorship details


Acknowledgments

  • Claude Flow by @ruvnet — Multi-agent orchestration and MCP integration
  • Agentic Flow by @ruvnet — Agent patterns and learning systems
  • Built with TypeScript, Node.js, and better-sqlite3
  • Compatible with Jest, Cypress, Playwright, Vitest, Mocha, pytest, JUnit, and more

Made with care by the Agentic QE Team

Star us on GitHub | Sponsor | Contributors

About

Agentic QE Fleet is an open-source AI-powered quality engineering platform designed for use with Claude Code, featuring specialized agents and skills to support testing activities for a product at any stage of the SDLC. Free to use, fork, build, and contribute. Based on the Agentic QE Framework created by Dragan Spiridonov.
