Commit graph

16 commits

Author SHA1 Message Date
Seunghoon Lee 86cdab3b14
Follow-up changes of #27 (#32)
* Update README.

* Drop dead codes.

* Use addr_of.

* Update Dockerfiles.

* Disable warnings for nvrtc.

* nvrtc.
2024-07-14 16:37:39 +09:00
Seunghoon Lee d7714d84c0
Add support of ROCm 6. (#27)
* Add support of ROCm 6.1.2 for Windows.

* Fix CI.

* Use llvm.sqrt.f64.
2024-07-13 13:47:35 +09:00
Andrzej Janik 2d8c47f147
Support Meshroom (#153) 2024-05-17 00:35:38 +02:00
Andrzej Janik bdc652f9eb
Correctly report emulated wave32 CUDA device (#216) 2024-04-29 15:09:14 +02:00
Andrzej Janik 5d5f7cca75
Rewrite surface implementation to more accurately support unofficial CUDA semantics (#203)
This fixes black screen in some CompuBench tests (TV-L1 Optical Flow) and other apps that use CUDA surfaces incorrectly
2024-04-14 02:39:34 +02:00
Andrzej Janik 7d4147c8b2
Add Blender 4.2 support (#184)
Redo primary context and fix various long-standing bugs around this API
2024-03-28 17:12:10 +01:00
Andrzej Janik 4a81dbffb5
Update llama.cpp support (#102)
Add sign extension support to prmt, allow set.<op>.f16x2.f16x2, add more BLAS mappings
2024-02-16 00:01:21 +01:00
Andrzej Janik 1b9ba2b233 Nobody expects the Red Team
Too many changes to list, but broadly:
* Remove Intel GPU support from the compiler
* Add AMD GPU support to the compiler
* Remove Intel GPU host code
* Add AMD GPU host code
* More device instructions. From 40 to 68
* More host functions. From 48 to 184
* Add proof of concept implementation of OptiX framework
* Add minimal support of cuDNN, cuBLAS, cuSPARSE, cuFFT, NCCL, NVML
* Improve ZLUDA launcher for Windows
2024-02-11 20:45:51 +01:00
Andrzej Janik a906c350f2
Make misc fixes (#41)
* Update ze_loader.lib to the newest version
* Export _ptsz/_ptds for which we have a legacy stream implementations
* Stop producing build logs if we are not looking at them anyway
2021-02-22 01:29:03 +01:00
Andrzej Janik 3e2e73ac33 Add script for replaying dumped kernel (#34)
zluda_dump can already create traces of GPU execution, this script can replay those traces.
Additionally, changed added just enough code in core ZLUDA to support simple PyCUDAexecution
2021-01-23 16:57:07 +01:00
Andrzej Janik ff8135e8a3
Add a library for dumping kernels arguments before and after launch (#18) 2021-01-16 22:28:48 +01:00
Andrzej Janik 09f679693b
Prevent linker from stripping exports on Linux (#33) 2021-01-15 01:17:44 +01:00
Andrzej Janik 5cd9a5fbc4
Add empty implementation of cuDeviceGetLuid (#30)
This function is required by recent versions of CUDA runtime on Windows
2021-01-08 19:43:46 +01:00
Andrzej Janik 078ae20c2c
Improve build procedure and instructions (#28)
Fixes issues pointed out in #27:
* spirv_tools-sys was build in non-test profiles
* By default ZLUDA dll has a wrong name
* We relied on third-party OpenCL installation on Windows
* We encouraged building debug configuration
* We didn't provide build information for developers (cmake, python, submodules)
2021-01-08 17:17:46 +01:00
Andrzej Janik 690f4f3ad2 Append short project name to the device if there's not enough space for long name 2020-11-23 22:24:35 +01:00
Andrzej Janik eb7c9aeeee Rename everything 2020-11-23 20:01:10 +01:00