Commit graph

9 commits

Author SHA1 Message Date
Seunghoon Lee d7714d84c0
Add support of ROCm 6. (#27)
* Add support of ROCm 6.1.2 for Windows.

* Fix CI.

* Use llvm.sqrt.f64.
2024-07-13 13:47:35 +09:00
Seunghoon Lee 7c3891e6b3
Fix cusparseDnMatGet. 2024-03-21 22:07:36 +09:00
Seunghoon Lee 87d2e3e163
Add match order fallback. 2024-03-20 14:59:10 +09:00
Seunghoon Lee 2b52d0a040
Fix typo. 2024-03-20 14:38:49 +09:00
Seunghoon Lee 6b2488395d
Implement cusparseCreateDnMat, cusparseDestroyDnMat, cusparseDnMat*. 2024-03-20 14:12:00 +09:00
Seunghoon Lee 4fab2af0f4
Implement cusparseXcoo2csr. 2024-03-20 10:30:31 +09:00
Seunghoon Lee f52edbd132
Implement cusparseXcoo2csr. 2024-03-20 10:29:09 +09:00
Seunghoon Lee 1ef7ef3938
Add support of cuBLAS, cuSPARSE for Windows. 2024-02-15 06:54:21 +09:00
Andrzej Janik 1b9ba2b233 Nobody expects the Red Team
Too many changes to list, but broadly:
* Remove Intel GPU support from the compiler
* Add AMD GPU support to the compiler
* Remove Intel GPU host code
* Add AMD GPU host code
* More device instructions. From 40 to 68
* More host functions. From 48 to 184
* Add proof of concept implementation of OptiX framework
* Add minimal support of cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
* Improve ZLUDA launcher for Windows
2024-02-11 20:45:51 +01:00