A small web app that lets a user save notes or URLs and ask questions over that content using a simple RAG (Retrieval Augmented Generation) setup.
Currently: single user, no auth, runs locally.
- Save plain text notes
- Save URLs (content is fetched server-side)
- Chunk and embed saved content
- Ask questions over saved data
- Get an answer along with source snippets
- FastAPI
- SQLite
- sentence-transformers (local embeddings)
- requests + BeautifulSoup (URL content extraction)
- Vite
- React
- TypeScript
- Tailwind CSS
- Python 3.11+
cd back
python -m venv venv
source venv/bin/activate # Windows: venv\Scripts\activate
pip install -r requirements.txt
uvicorn app.main:app --reload
Backend runs at: http://127.0.0.1:8000
- Node.js 16+
- npm or yarn
cd front
npm install
npm run dev
Frontend runs at: http://localhost:5173
- Modern UI/UX: Clean, responsive design with Tailwind CSS
- Mode-Independent: Consistent appearance regardless of system theme
- Single Page Application: All functionality on one page with smooth navigation
- Real-time Updates: Instant feedback when adding content or querying
- Component-Based: Modular React components for maintainability
- React 18 with TypeScript for type safety
- Vite for fast development and building
- Tailwind CSS for utility-first styling
- React Icons for consistent iconography
- Axios for API communication
- Frontend: http://localhost:5173
- Backend API: http://127.0.0.1:8000
- API Docs: http://127.0.0.1:8000/docs
POST /ingest
Note:
{
"type": "note",
"content": "FastAPI is a modern Python web framework"
}
URL:
{
"type": "url",
"source_url": "https://fastapi.tiangolo.com/"
}
GET /items
POST /query
{
"question": "What is FastAPI?",
"top_k": 3
}
Returns:
- answer text
- source chunks with similarity scores
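A response might look like this (field names are illustrative; the actual schema is visible at /docs):

```json
{
  "answer": "FastAPI is a modern Python web framework...",
  "sources": [
    {"content": "FastAPI is a modern Python web framework", "score": 0.91}
  ]
}
```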
- Content is split into chunks
- Each chunk is embedded using a local model
- Embeddings are stored in SQLite as JSON
- Queries are embedded the same way
- Cosine similarity is used to find relevant chunks
- Retrieved chunks are passed as context to the LLM
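The retrieval steps above can be sketched in pure Python. This is a minimal illustration, not the app's actual code: the real pipeline embeds chunks with all-MiniLM-L6-v2, while the chunking parameters and plain-list vectors here are placeholders:

```python
import math

def chunk_text(text: str, size: int = 200, overlap: int = 50) -> list[str]:
    # Split text into overlapping character windows (parameters are illustrative).
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

def cosine(a: list[float], b: list[float]) -> float:
    # Cosine similarity between two embedding vectors.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def top_k(query_vec: list[float], stored: list[tuple[str, list[float]]], k: int = 3):
    # stored: (chunk_text, embedding) pairs; full scan, then sort by similarity.
    scored = [(cosine(query_vec, emb), text) for text, emb in stored]
    return sorted(scored, reverse=True)[:k]
```

The returned chunks would then be concatenated into the prompt passed to the LLM.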
- Uses a local sentence-transformers model: all-MiniLM-L6-v2
- No external API calls by default
- Model is downloaded once and cached locally
The project supports using a real LLM for answer generation via Google Gemini (free tier).
By default, the app runs with a stubbed LLM so it works without any external API keys. To enable Gemini locally:
- Create a Gemini API key:
  - https://ai.google.dev/
  - Go to Google AI Studio → Get API key
- Create a .env file in the back/ directory:
USE_GEMINI=true
GEMINI_API_KEY=your_api_key_here
- URLs are fetched server-side using requests and BeautifulSoup
- HTML content is extracted, cleaned, and processed
- Scripts and styles are removed during text extraction
- Extracted text follows the same chunk → embed → store pipeline
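The project does this step with requests and BeautifulSoup; as a rough stdlib-only sketch of the same idea (skip script/style contents, keep visible text), using Python's built-in html.parser:

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""
    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())

def extract_text(html: str) -> str:
    parser = TextExtractor()
    parser.feed(html)
    return " ".join(parser.parts)
```

BeautifulSoup handles malformed HTML far more robustly, which is why the project uses it instead.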
- SQLite chosen for simplicity and zero setup requirements
- Raw SQL used instead of ORM for explicit behavior control
- Similarity search uses full scan (suitable for current data scale)
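A minimal sketch of this storage scheme, assuming a simple chunks table (the actual schema may differ):

```python
import json
import sqlite3

conn = sqlite3.connect(":memory:")  # the app uses a file-backed database
conn.execute("""
    CREATE TABLE chunks (
        id INTEGER PRIMARY KEY,
        content TEXT NOT NULL,
        embedding TEXT NOT NULL  -- JSON-encoded list of floats
    )
""")

def save_chunk(content: str, embedding: list[float]) -> None:
    # Serialize the embedding to JSON so SQLite can store it as plain text.
    conn.execute(
        "INSERT INTO chunks (content, embedding) VALUES (?, ?)",
        (content, json.dumps(embedding)),
    )

def load_chunks() -> list[tuple[str, list[float]]]:
    # Full scan: every embedding is decoded on each query (fine at small scale).
    rows = conn.execute("SELECT content, embedding FROM chunks").fetchall()
    return [(content, json.loads(emb)) for content, emb in rows]
```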
For larger scale:
- ANN indexes or a vector database would be needed
- Background ingestion and better concurrency handling
- FastAPI: https://fastapi.tiangolo.com/
- SQLite: https://sqlite.org/
- Python sqlite3: https://docs.python.org/3/library/sqlite3.html
- Sentence-Transformers: https://www.sbert.net/
- RAG overview: https://www.pinecone.io/learn/retrieval-augmented-generation/