Add support for cuda
Is this possible to split the matrix multiplication algorithms to use processor and gpgpu all at the same time?
It would be an amazing feature to be added.
Nathan Allan commented
Better to use an open standard like OpenCL rather than nVidia's proprietary API.