KlastroKnowledge-CUDA

A CUDA extension for Mahalanobis distance-based top-K matching, outperforming cosine similarity on complex multi-attribute queries, RAG retrieval pipelines, and GenAI applications

First Public Release: 2026-03-16
Last Updated: 2026-03-17

Motivation

This library proposes Mahalanobis distance as a principled alternative to cosine similarity for embedding-based matching. Fields such as medicine, economics, and social statistics abandoned cosine similarity a long time ago, yet engineering disciplines continue to rely on it as a default. This habit is what motivated us to formalize the approach in a patent and, after validating strong performance in GenAI applications, to release it as open source.

Mahalanobis distance is named after the Indian statistician Prasanta Chandra Mahalanobis.
For more details, see Wikipedia (https://en.wikipedia.org/wiki/Prasanta_Chandra_Mahalanobis)

A highly recommended lecture on Mahalanobis distance by Professor Gary King of Harvard University is also available on YouTube (https://youtu.be/rBv39pK1iEs?si=JhVeCs2nUTK-8Gdg&t=1346).

Quick Start

1. Conda Environment

cd anaconda  
conda env create -f environment.yml  
conda activate <env_name>

2. Build from Source

bash make_package.sh cuda12   # or cuda11, cuda13

3. Install

# After the build, cd into one of the generated folders, e.g. deploy_exercise (shown here) or klastroknowledge_cuda12_release
cd deploy_exercise   
pip install .

Benchmarks

Navigate to the benchmarks folder and run the provided Jupyter notebooks step by step.

CLIP Embeddings (COCO val2017)

Margin (rank-1 vs rank-2 probability): 18x larger than with cosine similarity
Softmax entropy: 3x lower than with cosine similarity
Mahalanobis suppresses off-target images that cosine retains near the top, because cosine score gaps are too small to discriminate effectively.
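
The two metrics above can be computed from any ranked score list. The sketch below is a minimal illustration, assuming the top-k scores are turned into probabilities with a plain softmax and that the margin is the probability gap between rank 1 and rank 2. The function names and the example scores are hypothetical and are not taken from the benchmark notebooks.

```python
import math

def softmax(scores):
    """Convert similarity scores (higher = better) to probabilities."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def margin_and_entropy(scores):
    """Return (rank-1 minus rank-2 probability, Shannon entropy in nats)."""
    probs = sorted(softmax(scores), reverse=True)
    margin = probs[0] - probs[1]
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return margin, entropy

# Hypothetical score lists: nearly tied scores vs. one clear winner.
flat = [0.31, 0.30, 0.29, 0.28]
peaked = [4.0, 1.0, 0.5, 0.2]

flat_margin, flat_entropy = margin_and_entropy(flat)
peaked_margin, peaked_entropy = margin_and_entropy(peaked)
print(f"flat:   margin={flat_margin:.4f} entropy={flat_entropy:.4f}")
print(f"peaked: margin={peaked_margin:.4f} entropy={peaked_entropy:.4f}")
```

Nearly tied scores give a small margin and an entropy close to the maximum of log(k), while a clear winner gives a large margin and low entropy, which is the pattern the benchmark reports for Mahalanobis relative to cosine.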

Tabular Data (UCI Adult Dataset, k=25)

CKA (centered kernel alignment) improved from 0.910373 → 0.928622 (+0.018249 absolute, ~2.00% relative)
Mean covariance trace reduced from 59.435885 → 56.828646 (4.39% average reduction)
Consistent advantage observed across k=15 and k=25

Context Engineering Demo

Through context engineering, KlastroKnowledge was used to identify the optimal match without retraining the AI.

License

AGPL v3 — free for research and non-commercial use.
Commercial use requires a separate agreement.

This code is released to encourage collaboration across AI systems — not competition.
The goal is shared solutions, not shared resources.

For commercial licensing: leave a message on Discussions

Patent Information

Korean Patent No. 10-2937626 (registered March 6, 2026), titled "Method and Device for Determining the Reliability of AI Model Outputs."
DOI: https://doi.org/10.8080/1020250102273

Core claims covered by this patent:

Mapping AI model outputs to embedding space to obtain embedding vectors
Computing a covariance matrix from a reference vector pool
Calculating Mahalanobis distance between reference vectors and embedding vectors based on the covariance matrix
Selecting top-k reference vectors sorted by Mahalanobis distance
Assigning ranks to selected k vectors in ascending order
Computing softmax entropy and probability margin (rank-1 vs rank-2) from the resulting probability distribution
Determining output reliability based on entropy and probability margin
Applicable to text, image, and multimodal domains (CLIP-based embeddings supported)
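
The steps listed above can be sketched end to end in NumPy. This is an illustrative CPU sketch under stated assumptions, not the library's CUDA kernel or its API: the function name `mahalanobis_topk_reliability`, the ridge term `eps`, the use of negative distance as the softmax logit, and both thresholds are hypothetical choices made for this example.

```python
import numpy as np

def mahalanobis_topk_reliability(query, refs, k=5, eps=1e-6,
                                 max_entropy=1.0, min_margin=0.2):
    """Hypothetical helper: rank refs by Mahalanobis distance to query."""
    # Covariance of the reference pool (rows are vectors); the small
    # ridge term eps keeps the matrix invertible (an assumption).
    cov = np.cov(refs, rowvar=False) + eps * np.eye(refs.shape[1])
    diff = refs - query                       # shape (n, d)
    # Solve cov @ x = diff^T rather than forming the explicit inverse.
    solved = np.linalg.solve(cov, diff.T).T   # shape (n, d)
    dist = np.sqrt(np.einsum("nd,nd->n", diff, solved))
    # Top-k indices in ascending distance (rank 1 = closest).
    top = np.argsort(dist)[:k]
    # Assumption: softmax over negative distances gives the probabilities.
    logits = -dist[top]
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    nonzero = probs > 0
    entropy = float(-(probs[nonzero] * np.log(probs[nonzero])).sum())
    margin = float(probs[0] - probs[1])
    # Assumption: reliable when entropy is low and the margin is large.
    reliable = entropy <= max_entropy and margin >= min_margin
    return top, dist[top], entropy, margin, reliable

# Synthetic reference pool with one high-variance dimension.
rng = np.random.default_rng(0)
refs = rng.normal(size=(200, 8)) * np.array([5.0, 1, 1, 1, 1, 1, 1, 1])
query = refs[17] + 0.01 * rng.normal(size=8)  # slightly perturbed copy of ref 17

top, top_dist, entropy, margin, reliable = mahalanobis_topk_reliability(query, refs)
print("top-k indices:", top)
print("entropy:", entropy, "margin:", margin, "reliable:", reliable)
```

Using `np.linalg.solve` avoids computing an explicit inverse covariance, which tends to be more stable when the reference pool is small relative to the embedding dimension. The thresholds would need tuning per dataset.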

Any commercial implementation of the above pipeline without a license agreement constitutes patent infringement.

