This project demonstrates a Retrieval-Augmented Generation (RAG) system using Ollama for document-based question answering. The goal of the project is to ingest documents, retrieve relevant information, and generate concise answers using the Ollama Gemma 2B model.
- Loads and processes news articles from a specified directory (`news_articles/`).
- Splits documents into manageable chunks for efficient retrieval.
- Uses ChromaDB for vector storage and retrieval of document embeddings.
- Generates embeddings for documents using the Ollama Gemma 2B model.
- Queries the document database and generates answers using Ollama's conversational model.
- Clone the repository:

  ```bash
  git clone https://github.com/your-username/RAG-Based-Data-Retrieval.git
  cd RAG-Based-Data-Retrieval
  ```

- Create a virtual environment:

  ```bash
  python -m venv venv
  ```

- Activate the virtual environment:
  - On Windows:

    ```bash
    venv\Scripts\activate
    ```

  - On macOS/Linux:

    ```bash
    source venv/bin/activate
    ```

- Install the required packages:

  ```bash
  pip install -r requirements.txt
  ```

- Set up environment variables by creating a `.env` file in the root directory:

  ```
  CHROMA_PERSISTENT_STORAGE_PATH="chroma_persistent_storage"
  ```

- Make sure you have access to Ollama for the embeddings and model queries.
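Once the `.env` file is in place, the application can resolve the storage path from the environment. The helper below is a minimal sketch — the function name `get_chroma_path` and the fallback default are illustrative, not taken from the repository:

```python
import os

# Hypothetical helper: resolve the ChromaDB storage directory from the
# environment, falling back to the same default shown in the .env example.
def get_chroma_path() -> str:
    return os.environ.get("CHROMA_PERSISTENT_STORAGE_PATH", "chroma_persistent_storage")
```

In practice a loader such as python-dotenv would typically populate the environment from `.env` before this is called.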
- Prepare a folder `news_articles/` containing the `.txt` files of the articles you wish to process.
- Run the application:

  ```bash
  python app.py
  ```

- Query the system by providing a question. The system will fetch the relevant document chunks, generate embeddings, and produce a response based on the retrieved context.
- Loading Documents: The `load_documents_from_directory` function loads all `.txt` files from a specified directory and stores their content as documents.
- Text Chunking: The `split_text` function splits long documents into smaller, manageable chunks to allow for more accurate retrieval.
- Embedding Generation: The `get_ollama_embedding` function generates embeddings for document chunks using the Gemma 2B model from Ollama.
- Storage in ChromaDB: The embeddings are stored in ChromaDB for fast retrieval during queries.
- Query Processing: The `query_documents` function generates an embedding for the user's query and retrieves the most relevant document chunks. The system uses the retrieved chunks to generate an answer.
- Answer Generation: The `generate_response` function combines the relevant context and generates concise answers using Ollama's conversational model.
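The loading and chunking steps above can be sketched as follows. The actual implementations live in `app.py`; the exact signatures, chunk size, and overlap here are assumptions for illustration:

```python
import os

def load_documents_from_directory(directory: str) -> list[dict]:
    """Read every .txt file in `directory` into an id/text record."""
    documents = []
    for filename in sorted(os.listdir(directory)):
        if filename.endswith(".txt"):
            path = os.path.join(directory, filename)
            with open(path, encoding="utf-8") as f:
                documents.append({"id": filename, "text": f.read()})
    return documents

def split_text(text: str, chunk_size: int = 1000, chunk_overlap: int = 200) -> list[str]:
    """Split a long document into fixed-size chunks that overlap,
    so context is not lost at chunk boundaries."""
    chunks = []
    start = 0
    while start < len(text):
        end = min(start + chunk_size, len(text))
        chunks.append(text[start:end])
        if end == len(text):
            break
        start = end - chunk_overlap  # step back to create the overlap
    return chunks
```

Each chunk would then be embedded with `get_ollama_embedding` and stored in ChromaDB alongside its document id.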
You can use this example to query the system:

```python
question = "What is human life expectancy in the US and Bangladesh?"
relevant_chunks = query_documents(question)
answer = generate_response(question, relevant_chunks)
print(answer)
```
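One plausible way `generate_response` combines the retrieved chunks with the question is to assemble a single grounded prompt before calling the model. The `build_prompt` helper and its wording below are illustrative assumptions, with the final call to Ollama's conversational model omitted:

```python
def build_prompt(question: str, relevant_chunks: list[str]) -> str:
    # Join the retrieved chunks into one context block, then instruct the
    # model to answer strictly from that context (wording is an assumption).
    context = "\n\n".join(relevant_chunks)
    return (
        "Use only the following context to answer the question concisely.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
```

`generate_response` would then send this prompt to the Gemma 2B model (e.g. via the Ollama Python client) and return the model's reply as the answer.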