A powerful, enterprise-grade desktop AI chat application that provides seamless access to multiple AI providers with advanced features including MCP (Model Context Protocol) integration, intelligent tool calling, comprehensive knowledge base management, custom agent creation, memory systems, modern UI architecture, and extensive file processing capabilities.
Now available on Windows, macOS (Intel + Apple Silicon), and Linux with a TypeScript-first architecture and zero compilation errors.
- OpenAI: GPT-4o, GPT-4-turbo, GPT-3.5-turbo, O1-preview, O1-mini β
- Anthropic: Claude 4 Sonnet/Opus, Claude 3.5 Sonnet, Claude 3 Opus/Sonnet/Haiku β
- Google Gemini: Gemini 2.5 Flash/Pro, Gemini 1.5 Pro/Flash β
- Mistral AI: Mistral Large, Medium, Small, Codestral β
- DeepSeek: DeepSeek Chat, DeepSeek Coder β
- DeepInfra: High-performance inference platform β
- LM Studio: Local server for any GGUF model (free) β
- Jan AI: Privacy-first local AI with OpenAI-compatible API β
- Ollama: Local models support with enhanced vision capabilities β
- OpenRouter: 150+ models from multiple providers β
- Requesty: 80+ models with smart routing β
- Replicate: Cloud-hosted models β
- n8n: Custom workflow integration with multipart form data support β
13 providers with broad streaming support, vision capabilities, and tool calling integration (where supported)
- Send images directly to vision-capable models
- Automatic image optimization (resize + compression)
- Enhanced Screenshot Capture: Full-resolution screenshots with one click
- Supports screenshots, photos, and documents
- OpenRouter vision API integration
- macOS screen recording permission handling
- Global keyboard shortcut (Ctrl+Shift+L)
- Prompts Menu: Ctrl+Shift+Space or click prompts button πͺ
- Floating window that stays on top
- Minimizes to system tray
- Auto-resize chat input (1.25β5 lines)
- Esc key closes overlays and dialogs
- Magic UI Components: Polished animated components with Framer Motion β¨
- Sidebar Navigation: Intuitive left sidebar with quick access to all features β¨
- Floating Panels: Draggable settings, prompts, and history overlays β¨
- Real-time Animations: Smooth transitions and visual feedback throughout β¨
- Responsive Design: Auto-resizing windows and adaptive layouts β¨
- Theme System: Predefined themes with live preview and instant switching β¨
- Visual Indicators: Real-time status for tools, knowledge base, and MCP servers β¨
- Unified Interface: Single, cohesive modern interface replacing legacy components β¨
- LanceDB Vector Database: High-performance vector storage with semantic search capabilities β¨
- Multi-Format Document Processing: Intelligent parsing of PDFs, DOCX, XLSX, TXT, RTF, HTML, XML, and more β¨
- Batch Document Upload: Process multiple documents simultaneously with real-time progress tracking β¨
- Google Docs Integration: Direct import from Google Docs URLs with automatic text extraction β¨
- Smart Text Chunking: Advanced text segmentation with configurable overlap for context preservation β¨
- Intelligent RAG Integration: Retrieval-Augmented Generation with relevance scoring and source attribution β¨
- Knowledge Base Registry: Centralized management system for multiple knowledge bases β¨
- Migration & Backup: Complete knowledge base export/import with version control β¨
- Performance Analytics: Real-time search analytics, success rate tracking, and optimization insights β¨
- Metadata-Based Filtering: Advanced search filtering by document type, date, source, and custom tags β¨
- Context Window Management: Intelligent token limit handling for optimal LLM performance β¨
- MCP (Model Context Protocol) Integration: Full support for MCP servers and tools β¨
- Internal Commands: Secure command execution with directory-scoped permissions β¨
- Tool Calling: User-controlled tool execution with native provider support β¨
- Web Search: Integrated web search capabilities with source attribution β¨
- File Operations: Read, write, and manipulate files with safety restrictions β¨
- Process Management: Terminal command execution with timeout controls β¨
- System Monitoring: CPU usage, memory stats, and process information β¨
- Custom Agents: Specialized AI agents with tool-specific configurations β¨
- Encrypted API Storage: Electron safeStorage for all API keys β¨
- Secure Command Execution: Directory-scoped permissions and rate limiting β¨
- Local Data Storage: All data stored locally with no telemetry β¨
- Permission Management: Granular control over tool and command access β¨
- Error Handling: Comprehensive error management with actionable messages β¨
- Debug Logging: Configurable logging with privacy protection β¨
- Settings Persistence: Race condition-free configuration management β¨
- Memory Safety: Automatic cleanup and resource management β¨
- Vision Support: PNG, JPG, GIF, WebP with automatic optimization and multi-model vision support β¨
- Office Suite: Word (DOCX/DOC), Excel (XLSX/XLS/ODS), PowerPoint (PPTX/PPT) with table extraction β¨
- Text & Markup: TXT, MD, RTF, CSV, JSON, HTML, XML with intelligent structure parsing β¨
- Calendar Integration: ICS (iCalendar) with event extraction, scheduling, and timezone support β¨
- Advanced PDF Processing: Native provider support with OCR fallback, table extraction, and metadata preservation β¨
- Knowledge Base Auto-Indexing: Automatic document indexing with vector embedding and semantic search β¨
- High-Performance Batch Processing: Upload and process multiple files simultaneously with progress tracking β¨
- Intelligent Text Chunking: Smart segmentation with configurable overlap and context preservation β¨
- Robust Error Recovery: Comprehensive fallback mechanisms with detailed error reporting and retry logic β¨
- Real-Time Analytics: Live parsing statistics, success rates, and performance monitoring β¨
- Enhanced Clipboard Support: Paste images, text, and files with automatic format detection β¨
- Modern Drag & Drop: Visual file attachment with real-time processing indicators and validation β¨
- Document Migration: Seamless import/export with version control and data integrity checks β¨
- Custom Agent Creation: Build specialized AI agents tailored for specific tasks and workflows β¨
- Comprehensive Template Library: Pre-built templates for Document Analysis, Web Research, Code Assistance, and more β¨
- AI-Powered Prompt Generation: Use LLMs to automatically generate specialized system prompts β¨
- Granular Tool Selection: Choose specific tools and MCP servers for each agent with advanced configuration β¨
- Provider & Model Configuration: Set default LLM provider, model, temperature, and token limits per agent β¨
- Knowledge Base Integration: Link agents to specific knowledge bases for enhanced RAG capabilities β¨
- Agent Import/Export: Share agents via JSON files with comprehensive dependency validation β¨
- Runtime Agent Switching: Select and switch agents directly from chat interface with automatic configuration β¨
- Visual Agent Management: Modern UI for creating, editing, duplicating, and organizing agents β¨
- Agent Lifecycle Management: Complete CRUD operations with version tracking and metadata β¨
- Template System: Reusable agent templates with categorization and customization options β¨
- π Document Analyst: Specialized for document analysis, summarization, and content extraction
- π Web Researcher: Expert at web browsing, information gathering, and source verification
- π» Code Assistant: Focused on software development, code review, and debugging
- π Data Analyst: Expert at data analysis, visualization, and statistical interpretation
- βοΈ Creative Writer: Specialized in creative writing, content creation, and storytelling
- π Business Analyst: Expert at business analysis, market research, and strategic planning
- π Technical Writer: Focused on technical documentation, API docs, and user guides
- π§ Customer Support: Specialized in customer service, support ticket resolution, and FAQ generation
β Cross-platform builds include modern UI, knowledge base, 13 AI providers, MCP integration, tool calling, and enhanced vision support
- Click the settings icon βοΈ
- Select your preferred provider
- Enter your API key:
- OpenAI: Get from platform.openai.com
- Anthropic: Get from console.anthropic.com
- Google Gemini: Get from aistudio.google.com
- Mistral AI: Get from console.mistral.ai
- DeepSeek: Get from platform.deepseek.com
- OpenRouter: Get from openrouter.ai
- Requesty: Get from app.requesty.ai
- LM Studio: No API key needed (local server)
- Jan AI: API key may be required (local AI application)
- Ollama: No API key needed (local models)
- Press Ctrl+Shift+L to open the modern chat interface
- Select a provider and model from the bottom input area
- Upload Documents to Knowledge Base:
- Click the knowledge base icon in the sidebar
- Upload PDFs, Office docs, or enter Google Docs URLs
- Watch real-time processing progress
- Enable RAG toggle for enhanced responses
- Create Custom Agents:
- Access agent management from the sidebar
- Choose from pre-built templates or create custom agents
- Configure tools, knowledge bases, and LLM settings
- Switch agents directly from the chat interface
- Enhanced Chat Experience:
- Type messages and press Enter or click send
- Attach files via paperclip icon or drag & drop
- Take screenshots with the camera icon
- Use tool calling toggle for advanced functionality
- Access prompts with Ctrl+Shift+Space
- Knowledge Base Management:
- Create multiple knowledge bases for different topics
- Monitor search analytics and performance
- Export/import knowledge bases for backup
- Configure RAG settings for optimal results
- Advanced Features:
- Copy text from clipboard, then select prompts with
{content}for automatic insertion - Use MCP servers for extended functionality
- Access chat history and memory management
- Customize themes and transparency settings
- Copy text from clipboard, then select prompts with
| Shortcut | Action |
|---|---|
Ctrl+Shift+L |
Open/Show LittleLLM Modern Interface |
Ctrl+Shift+Space |
Open Action Menu (Prompts) |
Ctrl+Shift+V |
Process Clipboard Content |
Esc |
Close overlays/dialogs |
Enter |
Send message |
Shift+Enter |
New line in chat input |
Ctrl+V |
Paste (including images and files) |
Ctrl+K |
Quick knowledge base search (when enabled) |
Ctrl+A |
Quick agent selection menu |
F11 |
Toggle fullscreen mode |
- Knowledge Base Toggle: Click the brain icon to enable/disable RAG
- Agent Switching: Use the agent dropdown for instant agent selection
- Tool Calling: Toggle tools on/off with the wrench icon
- MCP Servers: Manage MCP server connections from settings
- File Upload: Drag & drop files anywhere or use the paperclip icon
- Screenshot: Camera icon for instant screen capture
LittleLLM includes predefined themes such as:
- Dark mode (default)
- Light mode
- High contrast
- Custom color schemes
- Local storage: All settings stored locally
- No telemetry: No usage data collected
- API keys: Stored securely in encrypted local storage
- Open source: Full source code available
App won't start
- Check if port 3000 is available
- Try running as administrator
API errors
- Verify API key is correct
- Check internet connection
- Ensure sufficient API credits
Models not loading
- Check API key permissions
- Try refreshing the model list
- Use fallback models if API is down
- Create an issue on GitHub
- Check existing issues for solutions
- Node.js 18+
- npm or yarn
- macOS: Xcode Command Line Tools (
xcode-select --install) - Windows: Visual Studio Build Tools or Visual Studio Community
- Linux: build-essential package
# Clone the repository
git clone https://github.com/NickPittas/littlellm.git
cd littlellm
# Install dependencies
npm install
# Start development server
npm run dev
# In another terminal, start Electron
npm run electron-dev# Build for production
npm run build
# Build for current platform
npm run dist# Build both installer and portable
npm run build:windows
# Build specific versions
npm run dist:win-installer # Windows installer (.exe)
npm run dist:win-portable # Windows portable (.exe)# Build all macOS formats
npm run build:mac
# Build specific formats
npm run dist:mac-dmg # macOS DMG installer
npm run dist:mac-zip # macOS ZIP archive
npm run dist:mac-universal # Universal binary (Intel + Apple Silicon)
# Create macOS icon (requires macOS)
npm run create-mac-iconnpm run dist:linux # Linux (AppImage)- After running the macOS build commands (dist:mac / dist:mac-dmg / dist:mac-zip / dist:mac-universal), DMG/ZIP artifacts are created in the
dist/folder - Targets: x64, arm64, or universal (as configured)
- First launch: you may need to allow the app in System Settings > Privacy & Security, or right-click and select "Open"
- After running the Windows build commands (dist:win-installer / dist:win-portable), NSIS installer and Portable executables are created in the
dist/folder - Portable: runs without installation; Installer: recommended for most users
- Windows Defender may flag the app initially for new unsigned builds
- After running the Linux build command (dist:linux), an AppImage is created in the
dist/folder - Make executable:
chmod +x LittleLLM-<version>.AppImage - Run the AppImage directly on most modern Linux distributions
- Knowledge Base Registry: Centralized management system for multiple knowledge bases with metadata tracking
- Advanced Document Processing: Support for Google Docs URLs, batch processing, and improved text extraction
- Migration & Backup System: Complete knowledge base export/import with version control and data integrity
- Performance Optimization: Enhanced chunking strategies, improved search algorithms, and better memory management
- Integration Testing: Comprehensive test suite for knowledge base operations and RAG functionality
- Comprehensive Agent System: Complete lifecycle management with creation, editing, duplication, and deletion
- Knowledge Base Integration: Link agents to specific knowledge bases for enhanced RAG-powered responses
- Agent Templates: Expanded template library with specialized configurations for different use cases
- Enhanced UI Components: Modern interface for agent management with drag-and-drop organization
- Import/Export Functionality: Share agents via JSON with dependency validation and version tracking
- Zero Compilation Errors: Complete TypeScript compliance across the entire codebase
- Enhanced IPC Communication: Improved Electron-Next.js integration with proper type safety
- Service Layer Refactoring: Modular service architecture with better separation of concerns
- Memory Management: Intelligent cleanup and resource optimization for better performance
- Error Handling: Comprehensive error management with detailed logging and user feedback
- Document Parser Service: Dedicated service for handling multiple file formats with improved reliability
- Progress Monitoring: Real-time progress tracking for document processing and knowledge base operations
- Batch Operations: Enhanced batch processing capabilities with parallel document handling
- Error Recovery: Robust fallback mechanisms for failed document processing
- Content Parsing: Improved text extraction with better handling of complex document structures
- Integration Testing: Comprehensive test suite for knowledge base and agent management features
- Migration Scripts: Automated migration system for upgrading knowledge base structures
- Code Quality: Enhanced TypeScript definitions, better error handling, and improved documentation
- Performance Analytics: Real-time monitoring and analytics for system performance
- Debug Capabilities: Enhanced logging and debugging tools for better troubleshooting
- Frontend: Next.js 14 with App Router, React 18, TypeScript 5.8.3
- Desktop: Electron 37.1.0 with secure IPC communication and context isolation
- Build System: Electron Builder 25.1.8 for cross-platform distribution
- Type Safety: Complete TypeScript compliance with zero compilation errors
- Component Library: Magic UI components with Framer Motion 12.23.9 animations
- Design System: Radix UI primitives with custom component extensions
- Styling: Tailwind CSS 3.4.1 with custom design tokens and themes
- Icons: Lucide React with comprehensive icon set
- Responsive Design: Adaptive layouts with drag-and-drop interfaces
- Vector Database: LanceDB 0.21.1 for high-performance semantic search
- Document Processing: Multi-format parsers (PDF, Office, text, calendar)
- Secure Storage: Electron safeStorage for encrypted API key management
- File System: JSON-based configuration with atomic writes
- Memory Management: Intelligent conversation context with automatic cleanup
- Embeddings: @xenova/transformers 2.17.2 for local text embeddings
- MCP Protocol: @modelcontextprotocol/sdk 1.15.1 for tool integration
- Multi-Provider: 13+ LLM providers with unified interface
- RAG System: Advanced retrieval-augmented generation with relevance scoring
- Agent Framework: Custom agent creation with template system
- PDF: pdf-parse 1.1.1 with pdfjs-dist 5.4.54 for comprehensive text extraction
- Office: mammoth 1.9.1 (Word), xlsx 0.18.5 (Excel), node-pptx-parser (PowerPoint)
- Text Formats: RTF parser, XML2JS, HTML parser with intelligent content extraction
- Images: Sharp 0.34.3 for optimization and vision model integration
- Calendar: ical.js 2.2.0 for ICS file processing
- Encrypted Storage: Electron safeStorage with secure key management
- Process Isolation: Sandboxed renderer with secure IPC channels
- Memory Safety: Automatic resource cleanup and leak prevention
- Error Handling: Comprehensive error management with detailed logging
- Performance Monitoring: Real-time analytics and optimization insights
MIT License - see LICENSE file for details.
- Built with Next.js and Electron
- UI components from Radix UI
- Icons from Lucide
- Styling with Tailwind CSS
LittleLLM v4.2.0 - Your modern, enterprise-grade AI companion.
With advanced knowledge base management, custom agent creation, and comprehensive tool ecosystem. π