Kugman/CudaMatrixEngine

About

High-performance GPU engine for matrix multiplication, a core operation in AI/ML. Implemented in C++/CUDA, with memory access and computation optimized for maximum throughput, achieving significant speedups over a CPU implementation. Demonstrates expertise in GPU programming, parallel computing, and high-performance computing.
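
The description does not say which memory-access optimizations the engine uses. One common technique for matrix multiplication is tiling (blocking), where small square blocks of the inputs are reused many times before moving on; in a CUDA kernel each tile would usually be staged in shared memory by one thread block. The sketch below is a plain C++ CPU analogue of that loop structure, not the repository's code. The names `matmul_naive`, `matmul_tiled`, and `kTile` are hypothetical and chosen only for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical tile edge. In a CUDA kernel this would typically equal the
// thread-block edge so that one block loads one tile into shared memory.
constexpr std::size_t kTile = 16;

// Reference implementation: row-major C = A * B for n x n matrices.
std::vector<float> matmul_naive(const std::vector<float>& a,
                                const std::vector<float>& b,
                                std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    std::vector<float> c(n * n, 0.0f);
    for (std::size_t i = 0; i < n; ++i)
        for (std::size_t j = 0; j < n; ++j)
            for (std::size_t k = 0; k < n; ++k)
                c[i * n + j] += a[i * n + k] * b[k * n + j];
    return c;
}

// Tiled implementation: the same arithmetic, but the loops walk
// kTile x kTile blocks so each block of A and B is reused while it is hot.
std::vector<float> matmul_tiled(const std::vector<float>& a,
                                const std::vector<float>& b,
                                std::size_t n) {
    assert(a.size() == n * n && b.size() == n * n);
    std::vector<float> c(n * n, 0.0f);
    for (std::size_t ii = 0; ii < n; ii += kTile) {
        const std::size_t i_end = std::min(ii + kTile, n);
        for (std::size_t kk = 0; kk < n; kk += kTile) {
            const std::size_t k_end = std::min(kk + kTile, n);
            for (std::size_t jj = 0; jj < n; jj += kTile) {
                const std::size_t j_end = std::min(jj + kTile, n);
                // Multiply one tile of A by one tile of B and accumulate
                // the partial products into the matching tile of C.
                for (std::size_t i = ii; i < i_end; ++i) {
                    for (std::size_t k = kk; k < k_end; ++k) {
                        const float aik = a[i * n + k];
                        for (std::size_t j = jj; j < j_end; ++j)
                            c[i * n + j] += aik * b[k * n + j];
                    }
                }
            }
        }
    }
    return c;
}
```

Both functions take row-major `std::vector<float>` inputs of size `n * n` and return the product. The `std::min` bounds handle sizes that are not a multiple of `kTile`, which corresponds to the edge-tile guard a GPU kernel would need.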
