I have an application that runs on an embedded device, for performance reasons
I would like to use 32bit float instead of double.
I also have to use several other libraries that only accept 32bit float.
Iterating over a matrix data array and casting each 64bit float to 32bit float
is hurting performance.
Is it possible to create an 32bit float matrix & operations?