Commit graph

4 commits

Author SHA1 Message Date
Seunghoon Lee d7714d84c0
Add support of ROCm 6. (#27)
* Add support of ROCm 6.1.2 for Windows.

* Fix CI.

* Use llvm.sqrt.f64.
2024-07-13 13:47:35 +09:00
Seunghoon Lee 2ad9ad6851
[Fix] Clean up Runtime API.
2024-05-21 10:49:19 +09:00
Seunghoon Lee 11cc584451
Implement cuda runtime api. (cudart) (#17)
* [WIP] Implement cudart.

* wip

* wip

* Implement cudart.

* wip

* Ready to merge.
2024-05-17 13:15:16 +09:00
Andrzej Janik 1b9ba2b233
Nobody expects the Red Team
Too many changes to list, but broadly:
* Remove Intel GPU support from the compiler
* Add AMD GPU support to the compiler
* Remove Intel GPU host code
* Add AMD GPU host code
* More device instructions. From 40 to 68
* More host functions. From 48 to 184
* Add proof of concept implementation of OptiX framework
* Add minimal support of cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
* Improve ZLUDA launcher for Windows
2024-02-11 20:45:51 +01:00