I wanted to learn Rust and machine learning, so I thought: why not do ML in Rust? The notebooks, while in Python, call into Rust bindings to the core ML/DL algorithms.
I have also written my own CUDA matrix library, which I have painstakingly optimized with batched operations, chunked memory management, asynchronous memory transfers, memory pooling, and more.
Why not use cuBLAS, you ask?
- It was a good learning exercise; I don't pretend my library is faster than existing frameworks.
- cuBLAS is still there, just commented out (I used it for performance comparison, and I am usually within a factor of 2-3x of it).
Python is used only for preprocessing and loading data, visualization, and verification of results.
That means all the ML algorithms are written using good old if/else statements, for loops, etc., with no significant help from libraries.
The following are the only libraries used for the actual ML logic:
- Rust's itertools, for iterating over data more easily
- Rust's statrs and Python's NumPy, for random number generation
- Rust's image crate, to decode thousands of images of varying formats into a raw float array efficiently
PyO3 is used to create bindings from Python to the Rust binaries. The Python code is located under notebooks, the core Rust ML functions under src, and any data under the data folder.
| Algorithm | Status |
|---|---|
| K-Means | β |
| K-Nearest Neighbors | β |
| Naive Bayes | β |
| Decision Trees/Random Forest | β |
| Regression Tree | β |
| Gradient Descent | β |
| ADA Boost | β |
| Gradient Boost | β |
| XGBoost | β |
| Neural Network with backpropagation | β |
| Convolutional Neural Networks | β |
| Recurrent Neural Networks | β |
| Generative Adversarial Networks | β |
| Large Language Models | β |
| CUDA Acceleration with Rust FFI | β |
- Install Python 3
- Install Rust
- Install the NVIDIA CUDA Toolkit
- Create a virtual environment: `python -m venv .venv`
- Activate the virtual environment (platform dependent):
  - Windows: `.\.venv\Scripts\activate.bat`
  - Mac/Linux: `source ./.venv/bin/activate`
- Install dependencies: `pip install -r requirements_{platform}.txt`
- Compile the Rust code: `maturin develop` or `maturin develop --release`
- Open the notebooks in Jupyter Notebook, JupyterLab, VS Code, etc.
Make sure to run tests using `cargo test -- --test-threads=1`.
Running the tests in parallel may fail because the CUDA matrix library is not thread-safe (yet).
The MNIST handwritten digit database is available from https://yann.lecun.com/exdb/mnist/
All other datasets are publicly available from the University of California, Irvine, here: https://archive.ics.uci.edu/ml/index.php