squiid/llvm

Author	SHA1	Message	Date
Aart Bik	b9f87e24f2	[mlir] add missing include, fix broken build Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108873	2021-08-28 09:36:38 -07:00
Uday Bondhugula	4edc9e2acf	[MLIR][GPU] Drop mgpuMemHostRegisterMemRef's dependence on LLVM Support Drop mgpuMemHostRegisterMemRef's dependence on LLVM Support. This method is the only one in CUDA runtime wrappers library that creates a dependence on libLLVMSupport due to its use of SmallVector and ArrayRef. The code can be as easily/compactly written without those ADT. The dependence on LLVMSupport adds a significant amount of additional complexity for external things that want to link this library in (both statically or as a shared object) since libLLVMSupport includes numerous other objects that are sensitive to C++ compiler version and ABI. Differential Revision: https://reviews.llvm.org/D108684	2021-08-28 11:37:55 +05:30
Aart Bik	6b26857dbf	[mlir][sparse] add asCOO() functionality to sparse tensor object This prepares general sparse to sparse conversions. The code that needs to be generated using this new feature is now simply: (1) coo = sparse_tensor_1->asCOO(); // source format1 (2) sparse_tensor_2 = newSparseTensor(coo); // destination format2 By using COO as an intermediate, we can do all conversions without having to implement the full O(N^2) conversion matrix. Note that we can always improve particular conversions individually if a faster solution is required. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108681	2021-08-25 21:50:39 -07:00
wren romano	90e0c657b7	[mlir][sparse] Correcting the use of emplace_back The emplace commands are variadic and should take all the constructor arguments directly, since they implicitly call the constructor themselves in order to avoid the cost of constructing and then moving/copying temporaries. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108670	2021-08-24 18:32:13 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
Aart Bik	05c7f450df	[mlir][sparse] add dense to sparse conversion implementation Implements lowering dense to sparse conversion, for static tensor types only. First step towards general sparse_tensor.convert support. Reviewed By: ThomasRaoux Differential Revision: https://reviews.llvm.org/D107681	2021-08-09 12:12:39 -07:00
Dimitry Andric	ab4b4684a2	[mlir] Avoid including <alloca.h> on FreeBSD and NetBSD Instead, include `<cstdlib>` which is the canonical header containing the declaration of `alloca()`. Reviewed By: bondhugula Differential Revision: https://reviews.llvm.org/D107699	2021-08-08 13:32:35 +02:00
Adrian Kuegel	76fd3d4410	[mlir][CPURunner] Avoid a crash in memrefCopy when called with empty shapes. Differential Revision: https://reviews.llvm.org/D107346	2021-08-03 16:02:01 +02:00
Aart Bik	52c87e0437	[mlir][sparse] use consistent type for COO object and sparse tensor storage There was a slightly mismatch between the double COO and actual numerical type in the final sparse tensor storage (due to external formats always using double). This minor revision removes that inconsistency by using a properly typed COO and casting during the "add" method instead. This also prepares alternative ways of initializing the COO object. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D107310	2021-08-02 15:24:43 -07:00
Aart Bik	1d77bb9e1b	[mlir][sparse] template the memory resident coordinate scheme storage Rationale: External file formats always store the values as doubles, so this was hard coded in the memory resident COO scheme used to pass data into the final sparse storage scheme during setup. However, with alternative methods on the horizon of setting up these temporary COO schemes, it is time to properly template this data structure. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D107001	2021-07-30 11:21:05 -07:00
Nikita Popov	ffe94738ed	[ExecutionEngine] Fix GEP type Fix bug introduced in `2c68ecccc9`, the GEP type was off-by-ptr. Apparently I didn't run the MLIR tests.	2021-07-17 23:45:00 +02:00
Nikita Popov	2c68ecccc9	[OpaquePtr] Remove uses of CreateGEP() without element type Remove uses of to-be-deprecated API. In cases where the correct element type was not immediately obvious to me, fall back to explicit getPointerElementType().	2021-07-17 22:56:27 +02:00
Aart Bik	afc760ef35	[mlir][sparse] add int64 storage type to sparse tensor runtime support library This format was missing from the support library. Although there are some subtleties reading in an external format for int64 as double, there is no good reason to omit support for this data type form the support library. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D106016	2021-07-15 12:14:31 -07:00
Aart Bik	04bddb6cc7	[mlir][crunner] fix bug in memref copy for rank 0 While replacing linalg.copy with the more desired memref.copy I found a bug in the support library for rank 0 memref copying. The code would loop for something like the following, since there is code for no-rank and rank > 0, but rank == 0 was unexpected. memref.copy %0, %1: memref<f32> to memref<f32> Note that a "regression test" for this will follow using the sparse compiler migration to memref.copy which exercises this case many times. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D106036	2021-07-15 01:15:07 -07:00
Stephan Herhut	88d5eba139	Revert "Revert "[mlir][memref] Implement lowering of memref.copy to llvm"" This reverts commit `7d6e589fc8`. Windows build was unbroken.	2021-06-28 18:48:00 +02:00
Stephan Herhut	e6450d88e2	[mlir][llvm] Fix windows build Gate the include of alloca.h behind _WIN32 guard. Differential Revision: https://reviews.llvm.org/D105036	2021-06-28 18:22:21 +02:00
Jacques Pienaar	7d6e589fc8	Revert "[mlir][memref] Implement lowering of memref.copy to llvm" This reverts commit `e939644977`. Breaks Windows build.	2021-06-28 07:50:11 -07:00
Stephan Herhut	e939644977	[mlir][memref] Implement lowering of memref.copy to llvm This lowering uses a library call to implement copying in the general case, i.e., supporting arbitrary rank and strided layouts.	2021-06-28 14:52:07 +02:00
Eugene Zhulenev	d43b23608a	[mlir:Async] Add the size parameter to the async.group Specify the `!async.group` size (the number of tokens that will be added to it) at construction time. `async.await_all` operation can potentially race with `async.execute` operations that keep updating the group, for this reason it is required to know upfront how many tokens will be added to the group. Reviewed By: ftynse, herhut Differential Revision: https://reviews.llvm.org/D104780	2021-06-25 10:26:50 -07:00
Aart Bik	12db09d7f3	[mlir][sparse] add more type combinations to sparse storage scheme Useful for "exhaustively" testing and benchmarking annotation combinations to verify correctness and perform state space search for best performing. Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D103566	2021-06-03 08:34:10 -07:00
Eugene Zhulenev	d8c84d2a4e	[mlir] Async: Add error propagation support to async groups Depends On D103109 If any of the tokens/values added to the `!async.group` switches to the error state, than the group itself switches to the error state. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103203	2021-05-27 09:35:11 -07:00
Eugene Zhulenev	39957aa424	[mlir] Add error state and error propagation to async runtime values Depends On D103102 Not yet implemented: 1. Error handling after synchronous await 2. Error handling for async groups Will be addressed in the followup PRs Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D103109	2021-05-27 09:28:47 -07:00
Aart Bik	ca446e58c8	[sparse][mlir] simplify sparse runtime support library Removed some of the older raw "MLIRized" versions that are no longer needed now that the sparse runtime support library can focus on the proper sparse tensor types rather than the opague pointer approach of the past. This avoids legacy... Reviewed By: penpornk Differential Revision: https://reviews.llvm.org/D102960	2021-05-25 09:39:14 -07:00
Uday Bondhugula	9c21ddb70a	[MLIR] Make MLIR cmake variable names consistent Fix inconsistent MLIR CMake variable names. Consistently name them as MLIR_ENABLE_<feature>. Eg: MLIR_CUDA_RUNNER_ENABLED -> MLIR_ENABLE_CUDA_RUNNER MLIR follows (or has mostly followed) the convention of naming cmake enabling variables in the from MLIR_ENABLE_... etc. Using a convention here is easy and also important for convenience. A counter pattern was started with variables named MLIR_..._ENABLED. This led to a sequence of related counter patterns: MLIR_CUDA_RUNNER_ENABLED, MLIR_ROCM_RUNNER_ENABLED, etc.. From a naming standpoint, the imperative form is more meaningful. Additional discussion at: https://llvm.discourse.group/t/mlir-cmake-enable-variable-naming-convention/3520 Switch all inconsistent ones to the ENABLE form. Keep the couple of old mappings needed until buildbot config is migrated. Differential Revision: https://reviews.llvm.org/D102976	2021-05-24 08:43:10 +05:30
Aart Bik	c194b49c9c	[mlir][sparse] add full dimension ordering support This revision completes the "dimension ordering" feature of sparse tensor types that enables the programmer to define a preferred order on dimension access (other than the default left-to-right order). This enables e.g. selection of column-major over row-major storage for sparse matrices, but generalized to any rank, as in: dimOrdering = affine_map<(i,j,k,l,m,n,o,p) -> (p,o,j,k,i,l,m,n)> Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102856	2021-05-21 12:35:13 -07:00
Aart Bik	64ab997ff4	[mlir][sparse] remove accidental debug code Differential Revision: https://reviews.llvm.org/D102545	2021-05-14 19:28:25 -07:00
Aart Bik	56fd4c1cf8	[mlir][sparse] prepare runtime support lib for multiple dim level types We are moving from just dense/compressed to more general dim level types, so we need more than just an "i1" array for annotations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102520	2021-05-14 19:12:07 -07:00
Aart Bik	96a23911f6	[mlir][sparse] complete migration to sparse tensor type A very elaborate, but also very fun revision because all puzzle pieces are finally "falling in place". 1. replaces lingalg annotations + flags with proper sparse tensor types 2. add rigorous verification on sparse tensor type and sparse primitives 3. removes glue and clutter on opaque pointers in favor of sparse tensor types 4. migrates all tests to use sparse tensor types NOTE: next CL will remove all obsoleted sparse code in Linalg Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D102095	2021-05-10 12:55:22 -07:00
Aart Bik	3acf49829c	[mlir][sparse] support integral types i32,i16,i8 for numerical values Some sparse matrices operate on integral values (in contrast with the common f32 and f64 values). This CL expands the compiler and runtime support to deal with several common type combinations. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99999	2021-04-07 10:01:37 -07:00
Aart Bik	a0c5b7e3b5	[mlir][sparse] support for very narrow index and pointer types Rationale: Small indices and values, when allowed by the required range of the input tensors, can reduce the memory footprint of sparse tensors even more. Note, however, that we must be careful zero extending the values (since sparse tensors never use negatives for indexing), but LLVM treats the index type as signed in most memory operations (like the scatter and gather). This CL dots all the i's in this regard. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D99777	2021-04-01 18:21:27 -07:00
Christian Sigg	a825fb2c07	[mlir] Remove mlir-rocm-runner This change combines for ROCm what was done for CUDA in D97463, D98203, D98360, and D98396. I did not try to compile SerializeToHsaco.cpp or test mlir/test/Integration/GPU/ROCM because I don't have an AMD card. I fixed the things that had obvious bit-rot though. Reviewed By: whchung Differential Revision: https://reviews.llvm.org/D98447	2021-03-19 00:24:10 -07:00
Nikita Popov	f3f0c6cd47	[mlir] Remove uses of type-less CreateLoad() APIs (NFC) For the use in LLVMOps.td I used the getPointerElementType() escape hatch, as it's not obvious to me how the load type should be properly obtained here.	2021-03-11 18:39:20 +01:00
Nicolas Vasilache	4f4f3f1e59	[mlir] NFC - Add runner util functions to only print MemRef metadata. These are useful to debug execution, without having to print the whole content of a memref.	2021-03-04 12:35:45 +00:00
Christian Sigg	f69d5a7fc7	[mlir] Initialize CUDA context lazily. So we can remove the ignore-warning pragma again. Reviewed By: herhut Differential Revision: https://reviews.llvm.org/D97864	2021-03-04 13:07:56 +01:00
Alex Zinenko	19db802e7b	[mlir] make implementations of translation to LLVM IR interfaces private There is no need for the interface implementations to be exposed, opaque registration functions are sufficient for all users, similarly to passes. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97852	2021-03-04 09:16:32 +01:00
Christian Sigg	b6ac26fce5	[mlir] Silence -Wglobal-constructors error in CudaRuntimeWrapper.cpp Until I have a better solution with dynamic initialization, to get the nvidia build bot green again.	2021-03-03 13:48:03 +01:00
Christian Sigg	9d7be77bf9	[mlir] Move cuda tests Move test inputs to test/Integration directory. Move runtime wrappers to ExecutionEngine. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97463	2021-03-03 13:16:51 +01:00
Kern Handa	3c4cdd0b6a	[mlir] ExecutionEngine needs special handling for COFF binaries Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D97141	2021-02-23 17:34:19 -08:00
Aart Bik	2556d62282	[mlir][sparse] assert fail on mismatch between rank and annotations array Rationale: Providing the wrong number of sparse/dense annotations was silently ignored or caused unrelated crashes. This minor change verifies that the provided number matches the rank. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D97034	2021-02-18 23:22:14 -08:00
Christian Sigg	c86c96a710	[mlir] Load dynamic libraries in JitRunner from absolute paths so that GDB can find the symbol tables. Reviewed By: mehdi_amini, ftynse Differential Revision: https://reviews.llvm.org/D96759	2021-02-19 07:33:35 +01:00
Aart Bik	ff6c84b803	[mlir][sparse] generalize sparse storage format to many more types Rationale: Narrower types for overhead storage yield a smaller memory footprint for sparse tensors and thus needs to be supported. Also, more value types need to be supported to deal with all kinds of kernels. Since the "one-size-fits-all" sparse storage scheme implementation is used instead of actual codegen, the library needs to be able to support all combinations of desired types. With some crafty templating and overloading, the actual code for this is kept reasonably sized though. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D96819	2021-02-17 18:20:23 -08:00
Alex Zinenko	ce8f10d6cb	[mlir] Simplify ModuleTranslation for LLVM IR A series of preceding patches changed the mechanism for translating MLIR to LLVM IR to use dialect interface with delayed registration. It is no longer necessary for specific dialects to derive from ModuleTranslation. Remove all virtual methods from ModuleTranslation and factor out the entry point to be a free function. Also perform some cleanups in ModuleTranslation internals. Depends On D96774 Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96775	2021-02-16 18:42:52 +01:00
Tobias Gysi	99f3510b41	Reland "[mlir] add support for verification in integration tests" The patch extends the runner utils by verification methods that compare two memrefs. The methods compare the content of the two memrefs and print success if the data is identical up to a small numerical error. The methods are meant to simplify the development of integration tests that compare the results against a reference implementation (cf. the updates to the linalg matmul integration tests). Originally landed in `5fa893c` (https://reviews.llvm.org/D96326) and reverted in `dd719fd` due to a Windows build failure. Changes: - Remove the max function that requires the "algorithm" header on Windows - Eliminate the truncation warning in the float specialization of verifyElem by using a float constant Reviewed By: Kayjukh Differential Revision: https://reviews.llvm.org/D96593	2021-02-14 20:30:05 +01:00
Alex Zinenko	9a08f760fe	[mlir] Make JitRunnerMain main take a DialectRegistry Historically, JitRunner has been registering all available dialects with the context and depending on them without the real need. Make it take a registry that contains only the dialects that are expected in the input and stop linking in all dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96436	2021-02-11 14:50:48 +01:00
Aart Bik	0b1764a3d7	[mlir][sparse] sparse tensor storage implementation This revision connects the generated sparse code with an actual sparse storage scheme, which can be initialized from a test file. Lacking a first-class citizen SparseTensor type (with buffer), the storage is hidden behind an opaque pointer with some "glue" to bring the pointer back to tensor land. Rather than generating sparse setup code for each different annotated tensor (viz. the "pack" methods in TACO), a single "one-size-fits-all" implementation has been added to the runtime support library. Many details and abstractions need to be refined in the future, but this revision allows full end-to-end integration testing and performance benchmarking (with on one end, an annotated Lingalg op and, on the other end, a JIT/AOT executable). Reviewed By: nicolasvasilache, bixia Differential Revision: https://reviews.llvm.org/D95847	2021-02-10 11:57:24 -08:00
Alex Zinenko	2996a8d675	[mlir] avoid exposing mutable DialectRegistry from MLIRContext MLIRContext allows its users to access directly to the DialectRegistry it contains. While sometimes useful for registering additional dialects on an already existing context, this breaks the encapsulation by essentially giving raw accesses to a part of the context's internal state. Remove this mutable access and instead provide a method to append a given DialectRegistry to the one already contained in the context. Also provide a shortcut mechanism to construct a context from an already existing registry, which seems to be a common use case in the wild. Keep read-only access to the registry contained in the context in case it needs to be copied or used for constructing another context. With this change, DialectRegistry is no longer concerned with loading the dialects and deciding whether to invoke delayed interface registration. Loading is concentrated in the MLIRContext, and the functionality of the registry better reflects its name. Depends On D96137 Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96331	2021-02-10 12:07:34 +01:00
Tobias Gysi	dd719fda76	Revert "[mlir] add support for verification in integration tests" This reverts commit `5fa893c`. Windows build bot fails due to missing header https://reviews.llvm.org/D96326	2021-02-09 19:16:02 +01:00
Tobias Gysi	5fa893cc38	[mlir] add support for verification in integration tests The patch extends the runner utils by verification methods that compare two memrefs. The methods compare the content of the two memrefs and print success if the data is identical up to a small numerical error. The methods are meant to simplify the development of integration tests that for example compare optimized and unoptimized code paths (cf. the updates to the linalg matmul integration tests). Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D96326	2021-02-09 17:43:11 +01:00
Mehdi Amini	d6efb6fc86	Rework ExecutionEngine::invoke() to make it more friendly to use from C++ This new invoke will pack a list of argument before calling the `invokePacked` method. It accepts returned value as output argument wrapped in `ExecutionEngine::Result<T>`, and delegate the packing of arguments to a trait to allow for customization for some types. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D95961	2021-02-06 01:32:50 +00:00
Matthew Parkinson	dd2dac2fd0	Fix MLIR Async Runtime DLL on Windows The AsyncRuntime declares prototypes for extern "C" functions inside a namespace in the header, but not inside that namespace in the definition. This causes Visual Studio to treat them as different entities and thus the dllexport is ignored for the definitions. Using the same namespace fixes this issue. Secondly, this commit moves the dllexport to be consistent with the JITs expectation. This is an update to https://reviews.llvm.org/D95386 that fixes the compile issues in old versions of Visual studio. Differential Revision: https://reviews.llvm.org/D95933	2021-02-03 12:23:41 +00:00

1 2 3 4 5

218 commits