DocuChat 🏴‍☠️

A modern document chat application that lets users upload documents and ask questions about them using AI. Built with a FastAPI backend and a Next.js frontend.

🌐 Live Demo

Frontend (Vercel): https://docu-chat-mu.vercel.app/
Backend (Render): https://docuchat-xurq.onrender.com/
API Documentation: https://docuchat-xurq.onrender.com/docs

🚀 Features

Document Upload: Upload PDF files; text is extracted with PyPDF2
AI-Powered Chat: Chat with your documents using Google Gemini AI
Session Management: Persistent chat sessions with configurable TTL
Modern UI: Beautiful, responsive interface built with Next.js and Tailwind CSS
Audio Support: Speech-to-text capabilities with Sarvam AI integration
Real-time Processing: Fast document processing and embedding generation
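
To make the "configurable TTL" in Session Management concrete, here is a minimal sketch of a session store whose entries expire after a fixed number of seconds. It is not taken from the DocuChat backend: the `SessionStore` class, its in-memory dictionary, and the sliding-expiry behaviour (each access extends the lifetime) are all assumptions made for illustration.

```python
import time
import uuid
from typing import Callable, Dict, List, Optional, Tuple


class SessionStore:
    """Hypothetical in-memory store; a session expires ttl_seconds after its last use."""

    def __init__(self, ttl_seconds: float = 3600,
                 clock: Callable[[], float] = time.monotonic):
        self._ttl = ttl_seconds
        self._clock = clock  # injectable so expiry can be tested without sleeping
        # session_id -> (expiry timestamp, chat history)
        self._sessions: Dict[str, Tuple[float, List[str]]] = {}

    def create(self) -> str:
        """Create an empty session and return its id."""
        session_id = uuid.uuid4().hex
        self._sessions[session_id] = (self._clock() + self._ttl, [])
        return session_id

    def get(self, session_id: str) -> Optional[List[str]]:
        """Return the chat history, or None if the session is unknown or expired."""
        entry = self._sessions.get(session_id)
        if entry is None:
            return None
        expires_at, history = entry
        if self._clock() >= expires_at:
            # Expired: drop the session lazily on access.
            del self._sessions[session_id]
            return None
        # Sliding expiry (an assumption): each access extends the lifetime.
        self._sessions[session_id] = (self._clock() + self._ttl, history)
        return history
```

With the default of 3600 seconds, a session that is not touched for an hour would be dropped the next time it is looked up.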

🏗️ Architecture

DocuChat/
├── backend/          # FastAPI Python backend
│   ├── app/         # Main application code
│   │   ├── routers/ # API endpoints
│   │   ├── services/ # Business logic
│   │   └── schemas/ # Pydantic models
│   └── requirements.txt
└── frontend/         # Next.js React frontend
    ├── app/         # Next.js app directory
    ├── components/  # React components
    └── package.json

🛠️ Tech Stack

Backend

FastAPI - Modern Python web framework
Google Gemini AI - Large language model for chat
LangChain - AI application framework
PyPDF2 - PDF text extraction
Uvicorn - ASGI server

Frontend

Next.js 14 - React framework
TypeScript - Type-safe JavaScript
Tailwind CSS - Utility-first CSS
Radix UI - Accessible component primitives
React Hook Form - Form handling

🚢 Getting Started

Prerequisites

Python 3.11+ (for backend)
Node.js 18+ (for frontend)
Google Gemini API Key (for AI chat)
Sarvam API Key (optional, for audio features)

Backend Setup

Navigate to the backend directory (from the repository root)
```
cd backend
```
Create virtual environment
```
python -m venv venv
```
Activate virtual environment

Windows:
```
venv\Scripts\activate
```
macOS/Linux:
```
source venv/bin/activate
```
Install dependencies
```
pip install -r requirements.txt
```

Set up environment variables

Create a .env file in the backend directory:

SESSION_TTL_SECONDS=3600
MAX_FILE_MB=100
EMBED_DIM=1024

SARVAM_API_KEY=your_sarvam_api_key_here
GEMINI_API_KEY=your_gemini_api_key_here

Start the backend server
```
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
```
The API will be available at http://localhost:8000

Frontend Setup

Navigate to the frontend directory (from the repository root, in a separate terminal so the backend keeps running)
```
cd frontend
```
Install dependencies
```
npm install
```
Start the development server
```
npm run dev
```
The frontend will be available at http://localhost:3000

🔧 API Endpoints

Core Endpoints

POST /upload - Upload and process documents
POST /chat - Send chat messages
GET /sessions - List chat sessions
POST /sessions - Create new session
GET /sessions/{session_id} - Get session details
POST /summarize - Generate document summaries
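
As a rough illustration of calling one of these endpoints from a script, the sketch below builds a JSON request for `POST /chat` using only the Python standard library. The request body fields (`session_id`, `message`) and the helper names are assumptions, not the documented schema; check the interactive docs at `/docs` for the real request and response models.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # local backend address from the setup steps


def build_chat_request(session_id: str, message: str,
                       base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for the /chat endpoint."""
    # Assumed body shape; the real field names may differ.
    payload = {"session_id": session_id, "message": message}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/chat",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request, timeout: float = 30.0) -> dict:
    """Send a prepared request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Building the request separately from sending it makes it easy to inspect the URL and payload before the backend is running.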

Health Check

GET /health - API health status
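
A quick way to confirm the backend is up is to poll the health endpoint from a script. The sketch below assumes only that `GET /health` answers with HTTP 200 when the API is running; the response body is not inspected because its format is not documented here, and the function names are illustrative.

```python
import urllib.error
import urllib.request


def health_url(base_url: str) -> str:
    """Join the base URL and the /health path without doubling slashes."""
    return f"{base_url.rstrip('/')}/health"


def is_healthy(base_url: str = "http://localhost:8000", timeout: float = 5.0) -> bool:
    """Return True if GET /health answers with HTTP 200, False otherwise."""
    try:
        with urllib.request.urlopen(health_url(base_url), timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or an HTTP error status.
        return False
```

Calling `is_healthy()` with no arguments checks the local development server; pass the Render URL to check the deployed backend.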

🐳 Docker Support

The project includes Docker configuration for easy deployment:

Build and run with Docker Compose:
docker-compose up --build

📝 Environment Variables

Backend (.env)

| Variable | Description | Default |
|---|---|---|
| `SESSION_TTL_SECONDS` | Session timeout in seconds | `3600` |
| `MAX_FILE_MB` | Maximum file upload size in MB | `100` |
| `EMBED_DIM` | Embedding dimensions | `1024` |
| `SARVAM_API_KEY` | Sarvam AI API key | Required for audio |
| `GEMINI_API_KEY` | Google Gemini API key | Required |
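
The defaults and requirements listed above could be applied in code roughly as follows. This is a sketch, not the backend's actual configuration module: the `Settings` class and `load_settings` function are hypothetical, and it assumes the values from `.env` have already been loaded into the process environment.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    session_ttl_seconds: int
    max_file_mb: int
    embed_dim: int
    gemini_api_key: str
    sarvam_api_key: Optional[str]  # only needed for audio features


def load_settings(env: Mapping[str, str] = os.environ) -> Settings:
    """Read settings from an environment mapping, applying the documented defaults."""
    gemini_key = env.get("GEMINI_API_KEY", "")
    if not gemini_key:
        # The Gemini key is the only variable marked as required.
        raise RuntimeError("GEMINI_API_KEY is required")
    return Settings(
        session_ttl_seconds=int(env.get("SESSION_TTL_SECONDS", "3600")),
        max_file_mb=int(env.get("MAX_FILE_MB", "100")),
        embed_dim=int(env.get("EMBED_DIM", "1024")),
        gemini_api_key=gemini_key,
        sarvam_api_key=env.get("SARVAM_API_KEY") or None,
    )
```

Failing fast on a missing `GEMINI_API_KEY` surfaces configuration mistakes at startup rather than on the first chat request.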

🧪 Development

Backend Development

The backend uses FastAPI with automatic API documentation
Visit http://localhost:8000/docs for interactive API docs
Code is organized with routers, services, and schemas

Frontend Development

Built with Next.js App Router
Uses TypeScript for type safety
Tailwind CSS for styling
Radix UI components for accessibility

📚 API Documentation

Once the backend is running, visit:

Swagger UI: http://localhost:8000/docs
ReDoc: http://localhost:8000/redoc

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🆘 Troubleshooting

Common Issues

Virtual environment not activating
- Ensure Python 3.11+ is installed
- Check the activation script path
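API key errors
- Verify your Google Gemini API key is valid
- Check that the variable names in .env match `GEMINI_API_KEY` and `SARVAM_API_KEY` exactly
Port conflicts
- Backend runs on port 8000
- Frontend runs on port 3000
- If a port is already in use, change it (for the backend, pass a different `--port` value to uvicorn)
File upload issues
- Check that the file is within the `MAX_FILE_MB` limit (default 100)
- Ensure PDF files are not corrupted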
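
For file upload issues, a small local pre-check can rule out the two causes listed above before a file is sent. The sketch below is an illustration rather than the backend's validation logic: the `check_pdf` helper is hypothetical, and it assumes the size limit is measured in units of 1024 × 1024 bytes.

```python
from pathlib import Path
from typing import List

MAX_FILE_MB = 100  # default from the environment variable table


def check_pdf(path: str, max_file_mb: int = MAX_FILE_MB) -> List[str]:
    """Return a list of problems found; an empty list means the file looks uploadable."""
    file = Path(path)
    if not file.is_file():
        return [f"{path} does not exist or is not a file"]

    problems = []
    size_mb = file.stat().st_size / (1024 * 1024)
    if size_mb > max_file_mb:
        problems.append(f"file is {size_mb:.1f} MB, limit is {max_file_mb} MB")

    with file.open("rb") as handle:
        # PDF files normally begin with the "%PDF-" signature.
        if handle.read(5) != b"%PDF-":
            problems.append("file does not start with the %PDF- header")
    return problems
```

A file that passes this check can still be damaged further in, so treat an empty result as a first filter, not a guarantee.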

Ahoy! Welcome aboard the DocuChat ship! 🏴‍☠️

For more help, check the API documentation or open an issue on GitHub.
