squiid/llvm

Author	SHA1	Message	Date
Mehdi Amini	ce702fc8da	Add a `getUsedValuesDefinedAbove()` overload that takes an `Operation` pointer (NFC) This is a convenient utility around the existing `getUsedValuesDefinedAbove()` that take two regions. PiperOrigin-RevId: 266686854	2019-09-01 16:32:10 -07:00
Mehdi Amini	8ce2274d0d	Add a convenient `clone()` method on the `Op` class that forward to the underlying `Operation` (NFC) PiperOrigin-RevId: 266685852	2019-09-01 16:15:45 -07:00
Mehdi Amini	765d60fd4d	Add missing lowering to CFG in mlir-cpu-runner + related cleanup - the list of passes run by mlir-cpu-runner included -lower-affine and -lower-to-llvm but was missing -lower-to-cfg (because -lower-affine at some point used to lower straight to CFG); add -lower-to-cfg in between. IR with affine ops can now be run by mlir-cpu-runner. - update -lower-to-cfg to be consistent with other passes (create*Pass methods were changed to return unique ptrs, but -lower-to-cfg appears to have been missed). - mlir-cpu-runner was unable to parse custom form of affine op's - fix link options - drop unnecessary run options from test/mlir-cpu-runner/simple.mlir (none of the test cases had loops) - -convert-to-llvmir was changed to -lower-to-llvm at some point, but the create pass method name wasn't updated (this pass converts/lowers to LLVM dialect as opposed to LLVM IR). Fix this. (If we prefer "convert", the cmd-line options could be changed to "-convert-to-llvm/cfg" then.) Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#115 PiperOrigin-RevId: 266666909	2019-09-01 11:33:22 -07:00
Mehdi Amini	2634273c59	Add a link to the rational on lack of const for IR units in the developer guide PiperOrigin-RevId: 266583374	2019-08-31 15:24:02 -07:00
Mehdi Amini	d4d32f4cb8	Document that non-IR units are passed by non-const reference instead of pointer in general PiperOrigin-RevId: 266583029	2019-08-31 15:18:11 -07:00
Mahesh Ravishankar	49c3e4a508	Add floating-point comparison operations to SPIR-V dialect. Use the existing SPV_LogicalOp specification to add the floating-point comparison operations (both ordered and unordered versions). To make it easier to import the op-definitions automatically modify the dialect generation script to update the different .td files based on whether the operation is an arithmetic op, logical op, etc. Also allow specification of multiple opcodes with define_inst.sh. Since this reuses the SPV_LogicalOp framework, no tests specific to the floating point comparison ops are added with this CL. PiperOrigin-RevId: 266561634	2019-08-31 09:52:44 -07:00
Lei Zhang	5a7014c390	Add missing link dependency to MLIRTableGenTests PiperOrigin-RevId: 266561495	2019-08-31 09:50:33 -07:00
Uday Bondhugula	20d3e71c3f	update vim syntax file - more highlighting: numbers, elemental types inside shaped types - add some more keywords Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#110 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/110 from bondhugula:vim 029777db0ecb95bfc6453c0869af1c233d84d521 PiperOrigin-RevId: 266487768	2019-08-30 17:39:19 -07:00
River Riddle	9c8a8a7d0d	Add a canonicalization to erase empty AffineForOps. AffineForOp themselves are pure and can be removed if there are no internal operations. PiperOrigin-RevId: 266481293	2019-08-30 16:49:32 -07:00
MLIR Team	3d3f62323c	Splits DictionaryAttr into DictionaryAttrBase and DictionaryAttr. This maintains consistency with other *AttrBase/Attr splits. PiperOrigin-RevId: 266469869	2019-08-30 15:41:19 -07:00
Logan Chien	6b1d7f51ef	Add TensorRankOf for ranked tensor types with specific ranks This commit adds `TensorRankOf<types, typeNames, ranks>` to specify ranked tensor types with the specified types and ranks. For example, `TensorRankOf<[I32, F32], ["i32", "F32"], [0, 1]>` matches `tensor<i32>`, `tensor<?xi32>`, `tensor<f32>`, or `tensor<?xf32>`. PiperOrigin-RevId: 266461256	2019-08-30 14:54:14 -07:00
Rob Suderman	140757050b	Fix StructsGenTest.cpp CMakeFile build error PiperOrigin-RevId: 266452719	2019-08-30 14:10:17 -07:00
River Riddle	1dd9bf4739	Generalize the pass hierarchy by adding a general OpPass<PassT, OpT>. This pass class generalizes the current functionality between FunctionPass and ModulePass, and allows for operating on any operation type. The pass manager currently only supports OpPasses operating on FuncOp and ModuleOp, but this restriction will be relaxed in follow-up changes. A utility class OpPassBase<OpT> allows for generically referring to operation specific passes: e.g. FunctionPassBase == OpPassBase<FuncOp>. PiperOrigin-RevId: 266442239	2019-08-30 13:16:37 -07:00
Jacques Pienaar	06e8101034	Add mechanism to dump JIT-compiled objects to files This commit introduces the bits to be able to dump JIT-compile objects to external files by passing an object cache to OrcJit. The new functionality is tested in mlir-cpu-runner under the flag `dump-object-file`. Closes tensorflow/mlir#95 PiperOrigin-RevId: 266439265	2019-08-30 13:02:10 -07:00
Rob Suderman	8f90a442c3	Added a TableGen generator for structured data Similar to enum, added a generator for structured data. This provide Dictionary that stores a fixed set of values and guarantees the values are valid. It is intended to store a fixed number of values by a given name. PiperOrigin-RevId: 266437460	2019-08-30 12:52:13 -07:00
River Riddle	037742cdf2	Add support for early exit walk methods. This is done by providing a walk callback that returns a WalkResult. This result is either `advance` or `interrupt`. `advance` means that the walk should continue, whereas `interrupt` signals that the walk should stop immediately. An example is shown below: auto result = op->walk([](Operation *op) { if (some_invariant) return WalkResult::interrupt(); return WalkResult::advance(); }); if (result.wasInterrupted()) ...; PiperOrigin-RevId: 266436700	2019-08-30 12:47:53 -07:00
Lei Zhang	4f6c29223e	Add spv.Branch and spv.BranchConditional This CL just covers the op definition, its parsing, printing, and verification. (De)serialization is to be implemented in a subsequent CL. PiperOrigin-RevId: 266431077	2019-08-30 12:17:53 -07:00
River Riddle	3ee3710fd1	Change the parseSource* methods to return OwningModuleRef instead of ModuleOp. This avoids potential memory leaks from misuse of the API. PiperOrigin-RevId: 266305750	2019-08-29 22:20:10 -07:00
River Riddle	4bfae66d70	Refactor the 'walk' methods for operations. This change refactors and cleans up the implementation of the operation walk methods. After this refactoring is that the explicit template parameter for the operation type is no longer needed for the explicit op walks. For example: op->walk<AffineForOp>([](AffineForOp op) { ... }); is now accomplished via: op->walk([](AffineForOp op) { ... }); PiperOrigin-RevId: 266209552	2019-08-29 13:04:50 -07:00
Jacques Pienaar	a085700311	Make dumping using generic form more robust when IR ill-formed PiperOrigin-RevId: 266198057	2019-08-29 12:14:30 -07:00
Feng Liu	6de6c2c138	Add tests to verify 0.0 is quantized correctly We should consider both signed and narrow_range cases. PiperOrigin-RevId: 266167366	2019-08-29 10:09:22 -07:00
Uday Bondhugula	4bb6f8ecdb	Extend map canonicalization to propagate constant operands - extend canonicalizeMapAndOperands to propagate constant operands into the map's expressions (and thus drop those operands). - canonicalizeMapAndOperands previously only dropped duplicate and unused operands; however, operands that were constants were retained. This change makes IR maps/expressions generated by various utilities/passes even simpler; also makes some of the test checks more accurate and simpler -- for eg., 0' instead of symbol(%{{.*}}). Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#107 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/107 from bondhugula:canonicalize-maps c889a51486d14fbf7db489f224f881e7e1ff7d72 PiperOrigin-RevId: 266085289	2019-08-29 01:13:29 -07:00
Uday Bondhugula	bc2a543225	fix loop unroll and jam - operand mapping - imperfect nest case - fix operand mapping while cloning sub-blocks to jam - was incorrect for imperfect nests where def/use was across sub-blocks - strengthen/generalize the first test case to cover the previously missed scenario - clean up the other cases while on this. Previously, unroll-jamming the following nest ``` affine.for %arg0 = 0 to 2048 { %0 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %1 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> } ``` would yield ``` %0 = alloc() : memref<512x10xf32> %1 = affine.apply #map0(%arg0) %2 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %4 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> %5 = affine.apply #map0(%arg0) %6 = affine.load %0[%5, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> %3 = affine.apply #map0(%arg0) dealloc %0 : memref<512x10xf32> ``` instead of ``` module { affine.for %arg0 = 0 to 2048 step 2 { %0 = alloc() : memref<512x10xf32> %1 = affine.apply #map0(%arg0) %2 = alloc() : memref<512x10xf32> affine.for %arg1 = 0 to 10 { %4 = affine.load %0[%arg0, %arg1] : memref<512x10xf32> %5 = affine.apply #map0(%arg0) %6 = affine.load %2[%5, %arg1] : memref<512x10xf32> } dealloc %0 : memref<512x10xf32> %3 = affine.apply #map0(%arg0) dealloc %2 : memref<512x10xf32> } ``` Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#98 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/98 from bondhugula:ujam ddbc853f69b5608b3e8ff9b5ac1f6a5a0bb315a4 PiperOrigin-RevId: 266073460	2019-08-28 23:42:50 -07:00
Stephan Herhut	e90542c03b	Add verification for dimension attribute on GPUDialect index operations. PiperOrigin-RevId: 266073204	2019-08-28 23:39:57 -07:00
Uday Bondhugula	e8d43cafe6	Add vim scripts for indent/syntax - some of it has been adapted from LLVM's vim utils Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#90 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/90 from bondhugula:vim 22b1c958818c4b09de0ec8e1d7a4893171a03dbf PiperOrigin-RevId: 266071752	2019-08-28 23:23:19 -07:00
Feng Liu	7dd5efdf2c	Fix the equality check of two floating point values PiperOrigin-RevId: 266022088	2019-08-28 16:39:48 -07:00
River Riddle	29099e03ce	Generalize the analysis manager framework to work on any operation at any nesting. The pass manager is moving towards being able to run on operations at arbitrary nesting. An operation may have both parent and child operations, and the AnalysisManager must be able to handle this generalization. The AnalysisManager class now contains generic 'getCachedParentAnalysis' and 'getChildAnalysis/getCachedChildAnalysis' functions to query analyses on parent/child operations. This removes the hard coded nesting relationship between Module/Function. PiperOrigin-RevId: 266003636	2019-08-28 15:11:17 -07:00
Eric Schweitz	2225411690	Tweak to the pretty type parser to recognize that `->` is a special token. Tweak to the pretty type parser to recognize that `->` is a special token that shouldn't be split into two characters. This change allows dialect types to wrap function types as in `!my.ptr_type<(i32) -> i32>`. Closes tensorflow/mlir#105 COPYBARA_INTEGRATE_REVIEW=https://github.com/tensorflow/mlir/pull/105 from schweitzpgi:parse-arrow 8b2d768053f419daae5a1a864121a44c4319acbe PiperOrigin-RevId: 265986240	2019-08-28 13:55:42 -07:00
Stephan Herhut	c60c490356	Add implementation for tensor_load and tensor_store operations. This change adds definitions, parsing and verification for both ops. PiperOrigin-RevId: 265954051	2019-08-28 11:25:52 -07:00
Stephan Herhut	545c3e489f	Port mlir-cuda-runner to use dialect conversion framework. Instead of lowering the program in two steps (Standard->LLVM followed by GPU->NVVM), leading to invalid IR inbetween, the runner now uses one pattern based rewrite step to go directly from Standard+GPU to LLVM+NVVM. PiperOrigin-RevId: 265861934	2019-08-28 01:50:57 -07:00
Uday Bondhugula	aa2cee9cf5	Refactor / improve replaceAllMemRefUsesWith Refactor replaceAllMemRefUsesWith to split it into two methods: the new method does the replacement on a single op, and is used by the existing one. - make the methods return LogicalResult instead of bool - Earlier, when replacement failed (due to non-deferencing uses of the memref), the set of ops that had already been processed would have been replaced leaving the IR in an inconsistent state. Now, a pass is made over all ops to first check for non-deferencing uses, and then replacement is performed. No test cases were affected because all clients of this method were first checking for non-deferencing uses before calling this method (for other reasons). This isn't true for a use case in another upcoming PR (scalar replacement); clients can now bail out with consistent IR on failure of replaceAllMemRefUsesWith. Add test case. - multiple deferencing uses of the same memref in a single op is possible (we have no such use cases/scenarios), and this has always remained unsupported. Add an assertion for this. - minor fix to another test pipeline-data-transfer case. Signed-off-by: Uday Bondhugula <uday@polymagelabs.com> Closes tensorflow/mlir#87 PiperOrigin-RevId: 265808183	2019-08-27 17:56:56 -07:00
River Riddle	6f68def852	Update Ch.2 of the Toy tutorial. The code and documentation for this chapter of the tutorial have been updated to follow the new flow. The toy 'array' type has been replaced by usages of the MLIR tensor type. The code has also been cleaned up and modernized. Closes tensorflow/mlir#101 PiperOrigin-RevId: 265744086	2019-08-27 12:44:27 -07:00
MLIR Team	696fcb7520	Add 3 additional intrinsic ops to NVVM dialect, in preparation to implement block-wide reduce. PiperOrigin-RevId: 265720077	2019-08-27 10:56:18 -07:00
Lei Zhang	3af6b53381	[spirv] Fix the entry block to start with OpLabel Each basic block in SPIR-V must start with an OpLabel instruction. We don't support control flow yet, so this CL just makes sure that the entry block follows this rule and is valid. PiperOrigin-RevId: 265718841	2019-08-27 10:51:26 -07:00
Mahesh Ravishankar	4ced99c085	Enhance GPU To SPIR-V conversion to support builtins and load/store ops. To support a conversion of a simple load-compute-store kernel from GPU dialect to SPIR-V dialect, the conversion of operations like "gpu.block_dim", "gpu.thread_id" which allow threads to get the launch conversion is needed. In SPIR-V these are specified as global variables with builin attributes. This CL adds support to specify builtin variables in SPIR-V conversion framework. This is used to convert the relevant operations from GPU dialect to SPIR-V dialect. Also add support for conversion of load/store operation in Standard dialect to SPIR-V dialect. To simplify the conversion add a method to build a spv.AccessChain operation that automatically determines the return type based on the base pointer type and the indices provided. PiperOrigin-RevId: 265718525	2019-08-27 10:50:23 -07:00
Denis Khalikov	8f2dfb51d4	[spirv] Add Block decoration for spv.struct. Add Block decoration for top-level spv.struct. Closes tensorflow/mlir#102 PiperOrigin-RevId: 265716241	2019-08-27 10:41:42 -07:00
River Riddle	2f59f76876	NFC: Remove the explicit context from Operation::create and OperationState. The context can easily be recovered from the Location in these situations. PiperOrigin-RevId: 265578574	2019-08-26 17:34:48 -07:00
Eric Schweitz	761d57d187	Add FPToSI/FPExt/FPTrunc cast ops to the LLVM dialect. Closes tensorflow/mlir#99 PiperOrigin-RevId: 265538731	2019-08-26 14:19:19 -07:00
River Riddle	23251f9f3a	NFC: Remove unnecessary context parameters from several Location getters. The context can be recovered by other means in these methods and doesn't need to be passed explicitly. PiperOrigin-RevId: 265532956	2019-08-26 13:54:01 -07:00
MLIR Team	cb8c451541	Update documentation for custom rewrite specs. PiperOrigin-RevId: 265485862	2019-08-26 10:28:09 -07:00
Andy Ly	6a501e3d1b	Support folding of ops with inner ops in GreedyPatternRewriteDriver. This fixes a bug when folding ops with inner ops and inner ops are still being visited. PiperOrigin-RevId: 265475780	2019-08-26 09:44:39 -07:00
Alina Sbirlea	96e96e0243	NFC: Add doc for id-punct PiperOrigin-RevId: 265190168	2019-08-23 21:01:12 -07:00
River Riddle	4da37417ad	NFC: Update Ch.1 of the Toy tutorial. Change the use of 'array' to 'tensor' to reflect the new flow that the tutorial will follow. Also tidy up some of the documentation, code comments, and fix a few out-dated links. PiperOrigin-RevId: 265174676	2019-08-23 18:11:56 -07:00
Nicolas Vasilache	46084c03f4	Lower linalg.copy to LLVM dialect in the presence of transposes. Add an extra RewritePattern that does not convert types to rewrite a CopyOp that has non-identity permutations into a sequence of TransposeOp followed by a CopyOp without such permutations. This RewitePattern is made to fail in the non-permutation case so that the conversion pattern can kick in to lower to LLVM. This is an instance of A->A->B lowering where A->A is done by a RewritePattern in case_1 and A->B is done by a ConversionPatternRewriter when not(case_1). PiperOrigin-RevId: 265171380	2019-08-23 17:45:19 -07:00
Nicolas Vasilache	28e28e5d65	Lower linalg.transpose to LLVM dialect Add a conversion pattern that transforms a linalg.transpose op into: 1. A function entry `alloca` operation to allocate a ViewDescriptor. 2. A load of the ViewDescriptor from the pointer allocated in 1. 3. Updates to the ViewDescriptor to introduce the data ptr, offset, size and stride. Size and stride are permutations of the original values. 4. A store of the resulting ViewDescriptor to the alloca'ed pointer. The linalg.transpose op is replaced by the alloca'ed pointer. PiperOrigin-RevId: 265169112	2019-08-23 17:29:26 -07:00
Nicolas Vasilache	2c2c9ffd80	Add a linalg.transpose op A linalg.transpose op is a pure metadata operation that takes a view + permutation map and produces another view of the same underlying data, with a different reindexing. This is a pure metadata operation that does not touch the underlying data. Example: ``` %t = linalg.transpose %v (i, j) -> (j, i) : !linalg.view<?x?xf32> ``` PiperOrigin-RevId: 265139429	2019-08-23 14:48:13 -07:00
River Riddle	32052c8417	NFC: Add a note to 'applyPatternsGreedily' that it also performs folding/dce. Fixes tensorflow/mlir#72 PiperOrigin-RevId: 265097597	2019-08-23 11:28:45 -07:00
Nicolas Vasilache	7f42b3d721	Add lowering of linalg.copy to an external C++ library and a test. This CL extends support for lowering of linalg to external C++ libraries with CopyOp. Currently this can only work when the permutation maps in the copies are identity. Future support for permutations will be added later. PiperOrigin-RevId: 265093025	2019-08-23 11:09:53 -07:00
Nicolas Vasilache	3d409386a0	Update Linalg slice and subview documentation - NFC PiperOrigin-RevId: 265092922	2019-08-23 11:09:02 -07:00
Lei Zhang	0e131d83fe	[spirv] NFC: move SPIR-V control flow ops to a separate file This CL is also purely moving code around for better file organization. PiperOrigin-RevId: 265092566	2019-08-23 11:07:52 -07:00

... 3 4 5 6 7 ...

2691 commits