An AI-powered agent that analyzes knowledge bases, support data, and user queries to identify documentation gaps, auto-generate FAQs, and optimize content coverage. Built with LangChain, MCP (Model Context Protocol) integrations, and Next.js 15.
Organizations struggle with knowledge management:
- Incomplete Documentation: Critical topics missing from knowledge bases
- Outdated Content: Information becomes stale as products evolve
- User Frustration: Customers can't find answers to common questions
- Reactive Approach: Gaps only discovered after customer complaints
- Manual Analysis: Time-consuming to identify what's missing
Knowledge Gap Agent provides intelligent knowledge base optimization:
- Gap Detection: Analyzes search queries, support tickets, and user behavior to find missing content
- FAQ Generation: Automatically creates FAQs from common questions and their answers
- Coverage Analysis: Maps existing content against user needs
- Content Suggestions: Recommends new articles and topics to address gaps
- Quality Scoring: Evaluates content freshness, completeness, and relevance
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β Next.js 15 Frontend β
β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββββββββββ β
β β Gap β β FAQ β β Coverage β β Content β β
β β Analysis β βGenerator β β Map β β Suggestions β β
β ββββββββββββ ββββββββββββ ββββββββββββ ββββββββββββββββββββ β
βββββββββββββββββββββββββββββββ¬ββββββββββββββββββββββββββββββββββββ
β REST API
βββββββββββββββββββββββββββββββΌββββββββββββββββββββββββββββββββββββ
β FastAPI Backend β
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β β LangChain Agent Core ββ
β β βββββββββββββββ βββββββββββββββ βββββββββββββββββββββββ ββ
β β β Gap β β FAQ β β Content β ββ
β β β Detector β β Generator β β Analyzer β ββ
β β βββββββββββββββ βββββββββββββββ βββββββββββββββββββββββ ββ
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β β MCP Integrations ββ
β β ββββββββββ ββββββββββ ββββββββββ ββββββββββββββββββββββ ββ
β β β CMS β β Search β βSupport β β Database β ββ
β β β MCP β β MCP β β MCP β β MCP β ββ
β β ββββββββββ ββββββββββ ββββββββββ ββββββββββββββββββββββ ββ
β ββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β
βββββββββββββββββββββββΌββββββββββββββββββββββ
βΌ βΌ βΌ
ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ
β Remote LLM β β Local LLM β β Vector DB β
β (OpenAI/etc) β β (Ollama) β β (ChromaDB) β
ββββββββββββββββ ββββββββββββββββ ββββββββββββββββ
- Python 3.11+
- FastAPI - High-performance async API framework
- LangChain 0.3+ - Agent orchestration and NLP
- LangGraph - Multi-agent workflow coordination
- ChromaDB - Vector storage for semantic search
- Pydantic - Data validation
- Next.js 15 - React framework with App Router
- TypeScript - Type-safe development
- TailwindCSS - Utility-first styling
- Shadcn/UI - Accessible components
- React Query - Server state management
- CMS MCP - Content management system access
- Search MCP - Search analytics integration
- Support MCP - Support ticket data
- Database MCP - Gap and FAQ storage
- Python 3.11+
- Node.js 20+
- Docker (optional)
git clone https://github.com/VaibhavJeet/knowledge-gap-agent.git
cd knowledge-gap-agentcd backend
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt
cp .env.example .env
uvicorn app.main:app --reloadcd frontend
npm install
cp .env.example .env.local
npm run dev- Frontend: http://localhost:3000
- API Docs: http://localhost:8000/docs
# Remote LLM
LLM_PROVIDER=openai
OPENAI_API_KEY=sk-your-key
OPENAI_MODEL=gpt-4-turbo-preview
# Local LLM (Ollama)
LLM_PROVIDER=ollama
OLLAMA_BASE_URL=http://localhost:11434
OLLAMA_MODEL=llama3.2Configure in config/mcp.yaml:
integrations:
cms:
enabled: true
provider: contentful # or strapi, notion
api_key: ${CMS_API_KEY}
search:
enabled: true
provider: algolia # or elasticsearch
app_id: ${ALGOLIA_APP_ID}
api_key: ${ALGOLIA_API_KEY}
support:
enabled: true
provider: zendesk # or freshdesk, intercom
api_key: ${SUPPORT_API_KEY}- Search query analysis
- Zero-result query tracking
- Support ticket pattern mining
- User feedback analysis
- Competitive content comparison
- Automatic Q&A extraction from tickets
- Similar question clustering
- Answer quality scoring
- Multi-language support
- Approval workflow
- Topic taxonomy mapping
- Content completeness scoring
- User journey coverage
- Search-to-content matching
- Freshness tracking
- Priority-ranked recommendations
- Outline generation
- Source material compilation
- SEO optimization hints
- Related content linking
GET /api/gaps- List identified gapsGET /api/gaps/{id}- Get gap detailsPOST /api/gaps/analyze- Run gap analysisPUT /api/gaps/{id}/status- Update gap status
GET /api/faqs- List generated FAQsPOST /api/faqs/generate- Generate FAQs from dataPUT /api/faqs/{id}- Update FAQPOST /api/faqs/{id}/publish- Publish FAQ
GET /api/content- List knowledge base contentGET /api/content/coverage- Get coverage reportPOST /api/content/suggestions- Get content suggestions
POST /api/analysis/run- Run full analysisGET /api/analysis/reports- List analysis reportsGET /api/analysis/reports/{id}- Get report details
See CONTRIBUTING.md for guidelines.
MIT License - see LICENSE for details.