squiid/llvm

Author	SHA1	Message	Date
wren romano	90e0c657b7	[mlir][sparse] Correcting the use of emplace_back The emplace commands are variadic and should take all the constructor arguments directly, since they implicitly call the constructor themselves in order to avoid the cost of constructing and then moving/copying temporaries. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108670	2021-08-24 18:32:13 -07:00
Rob Suderman	a7bf93807b	[mlir][tosa] Fix conv/depthwise conv padding for quantized values When padding quantized operations, the padding needs to equal the zero point of the input value. Corrected the pass to change the padding value if quantized. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108440	2021-08-24 18:13:22 -07:00
Chenggang Zhao	2b2c13e672	[mlir][docs] A friendlier improvement for the Toy tutorial chapter 4. Add notes for discarding private-visible functions in the Toy tutorial chapter 4. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108026	2021-08-25 00:44:51 +00:00
Matthias Springer	2de2dbef2a	[mlir][linalg] Replace AffineMinSCFCanonicalizationPattern with SCF reimplementation Use the new canonicalization pattern in the SCF dialect. Differential Revision: https://reviews.llvm.org/D107732	2021-08-25 08:52:56 +09:00
Aart Bik	c5735fada4	[mlir][sparse] enable a few vectorized runs in integration tests Recent changes outside sparse compiler exposed the requirement of running a new pass (lower-affine) but this only became apparent with private testing. By adding some vectorized runs to integration test, we will detect the need for such changes earlier and also widen codegen coverage of course. Reviewed By: gussmith23 Differential Revision: https://reviews.llvm.org/D108667	2021-08-24 16:08:01 -07:00
Matthias Springer	98aa694d0d	[mlir][scf] Add general affine.min canonicalization pattern This canonicalization simplifies affine.min operations inside "for loop"-like operations (e.g., scf.for and scf.parallel) based on two invariants: * iv >= lb * iv < lb + step * ((ub - lb - 1) floorDiv step) + 1 This commit adds a new pass `canonicalize-scf-affine-min` (instead of being a canonicalization pattern) to avoid dependencies between the Affine dialect and the SCF dialect. Differential Revision: https://reviews.llvm.org/D107731	2021-08-25 07:32:30 +09:00
Logan Chien	88125e8af1	[mlir] Fix attachInterface typo This commit fixes the documentation typo regarding `attachInterface`. Differential Revision: https://reviews.llvm.org/D108666	2021-08-24 15:17:52 -07:00
Tyler Augustine	d25e91d7f6	Support alias.scope and noalias metadata Introduces new Ops to represent 1. alias.scope metadata in LLVM, and 2. domains for these scopes. These correspond to the metadata described in https://llvm.org/docs/LangRef.html#noalias-and-alias-scope-metadata. Lists of scopes are modeled the same way as access groups - as an ArrayAttr on the Op (added in https://reviews.llvm.org/D97944). Lowering 'noalias' attributes on function parameters is already supported. However, lowering `noalias` metadata on individual Ops is not, which is added in this change. LLVM uses the same keyword for these, but this change introduces a separate attribute name 'noalias_scopes' to represent this distinct concept. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D107870	2021-08-24 20:42:59 +02:00
Aart Bik	fda176892e	[mlir][sparse] use new permutation utility to avoid codedup Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D108636	2021-08-24 08:48:17 -07:00
Aart Bik	a643bd3189	[mlir] add permutation utility I found myself typing this code several times at different places by now, so time to make this a general utility instead. Given a permutation, it returns the permuted position of the input, for example (i,j,k) -> (k,i,j) yields position 1 for input 0. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D108347	2021-08-24 08:07:40 -07:00
Matthias Springer	ebf35370ff	[mlir][tensor] Insert explicit tensor.cast ops for insert_slice src If additional static type information can be deduced from a insert_slice's size operands, insert an explicit cast of the op's source operand. This enables other canonicalization patterns that are matching for tensor_cast ops such as `ForOpTensorCastFolder` in SCF. Differential Revision: https://reviews.llvm.org/D108617	2021-08-24 19:45:04 +09:00
Matthias Springer	0c36082963	[mlir][SCF] Use symbols in loop peeling rewrite Use symbols in the affine map instead of dims. Dims should not be divided. Differential Revision: https://reviews.llvm.org/D108431	2021-08-24 19:39:19 +09:00
MaheshRavishankar	b546f4347b	[mlir]Linalg] Allow controlling fusion of linalg.generic -> linalg.tensor_expand_shape. Differential Revision: https://reviews.llvm.org/D108565	2021-08-23 16:28:10 -07:00
Aart Bik	236a90802d	[mlir][sparse] replace support lib conversion with actual MLIR codegen Rationale: Passing in a pointer to the memref data in order to implement the dense to sparse conversion was a bit too low-level. This revision improves upon that approach with a cleaner solution of generating a loop nest in MLIR code itself that prepares the COO object before passing it to our "swiss army knife" setup. This is much more intuitive and now also allows for dynamic shapes. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108491	2021-08-23 14:26:05 -07:00
River Riddle	4e103a12d9	[mlir] Add support for VariadicOfVariadic operands This revision adds native ODS support for VariadicOfVariadic operand groups. An example of this is the SwitchOp, which has a variadic number of nested operand ranges for each of the case statements, where the number of case statements is variadic. Builtin ODS support allows for generating proper accessors for the nested operand ranges, builder support, and declarative format support. VariadicOfVariadic operands are supported by providing a segment attribute to use to store the operand groups, mapping similarly to the AttrSizedOperand trait (but with a user defined attribute name). `build` methods for VariadicOfVariadic operand expect inputs of the form `ArrayRef<ValueRange>`. Accessors for the variadic ranges return a new `OperandRangeRange` type, which represents a contiguous range of `OperandRange`. In the declarative assembly format, VariadicOfVariadic operands and types are by default formatted as a comma delimited list of value lists: `(<value>, <value>), (), (<value>)`. Differential Revision: https://reviews.llvm.org/D107774	2021-08-23 20:32:31 +00:00
MaheshRavishankar	4aeeb91a92	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-23 13:06:34 -07:00
River Riddle	da12d88b1c	[mlir][NFC] Add inlineRegion overloads that take a block iterator insert position This allows for inlining into an empty block or to the beginning of a block. NFC as the existing implementations now foward to this overload. Differential Revision: https://reviews.llvm.org/D108572	2021-08-23 19:49:53 +00:00
River Riddle	e4635e6328	[mlir][FoldUtils] Ensure the created constant dominates the replaced op This revision fixes a bug where an operation would get replaced with a pre-existing constant that didn't dominate it. This can occur when a pattern inserts operations to be folded at the beginning of the constants insertion block. This revision fixes the bug by moving the existing constant before the replaced operation in such cases. This is fine because if a constant didn't already exist, a new one would have been inserted before this operation anyways. Differential Revision: https://reviews.llvm.org/D108498	2021-08-23 18:48:24 +00:00
Krzysztof Drewniak	469172f3f4	[MLIR][Docs] Fix broken link to tuple type rationale Reviewed By: rriddle Differential Revision: https://reviews.llvm.org/D108135	2021-08-23 18:35:36 +00:00
Matthias Springer	bc194a5bb5	[mlir][SCF] Do not peel loops inside partial iterations Do not apply loop peeling to loops that are contained in the partial iteration of an already peeled loop. This is to avoid code explosion when dealing with large loop nests. Can be controlled with a new pass option `skip-partial`. Differential Revision: https://reviews.llvm.org/D108542	2021-08-23 21:35:46 +09:00
Stella Laurenzo	a8de667af0	[mlir] Add op for NCHW conv2d. * This is the native data layout for PyTorch and npcomp was using the prior version before cleanup. Differential Revision: https://reviews.llvm.org/D108527	2021-08-22 17:27:33 -07:00
Stella Laurenzo	64e74e9d7c	[mlir][linalg] Add script to update the LinalgNamedStructuredOps.yaml. nfc Also adds banners to the files with update instructions. Differential Revision: https://reviews.llvm.org/D108529	2021-08-22 16:54:51 -07:00
Stella Laurenzo	e78b745cf2	[mlir][python] Makes C++ extension code relocatable by way of a macro. * Resolves a TODO by making this configurable by downstreams. * This seems to be the last thing allowing full use of the Python bindings as a library within another project (i.e. be embedding them). Differential Revision: https://reviews.llvm.org/D108523	2021-08-22 13:46:14 -07:00
William S. Moses	973cb2c326	[MLIR][OMP] Ensure nested scf.parallel execute all iterations Presently, the lowering of nested scf.parallel loops to OpenMP creates one omp.parallel region, with two (nested) OpenMP worksharing loops on the inside. When lowered to LLVM and executed, this results in incorrect results. The reason for this is as follows: An OpenMP parallel region results in the code being run with whatever number of threads available to OpenMP. Within a parallel region a worksharing loop divides up the total number of requested iterations by the available number of threads, and distributes accordingly. For a single ws loop in a parallel region, this works as intended. Now consider nested ws loops as follows: omp.parallel { A: omp.ws %i = 0...10 { B: omp.ws %j = 0...10 { code(%i, %j) } } } Suppose we ran this on two threads. The first workshare loop would decide to execute iterations 0, 1, 2, 3, 4 on thread 0, and iterations 5, 6, 7, 8, 9 on thread 1. The second workshare loop would decide the same for its iteration. This means thread 0 would execute i \in [0, 5) and j \in [0, 5). Thread 1 would execute i \in [5, 10) and j \in [5, 10). This means that iterations i in [5, 10), j in [0, 5) and i in [0, 5), j in [5, 10) never get executed, which is clearly wrong. This permits two options for a remedy: 1) Change the semantics of the omp.wsloop to be distinct from that of the OpenMP runtime call or equivalently #pragma omp for. This could then allow some lowering transformation to remedy the aforementioned issue. I don't think this is desirable for an abstraction standpoint. 2) When lowering an scf.parallel always surround the wsloop with a new parallel region (thereby causing the innermost wsloop to use the number of threads available only to it). This PR implements the latter change. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D108426	2021-08-20 19:06:28 -04:00
Rob Suderman	871c812483	[mlir][linalg] Finish refactor of TC ops to YAML Multiple operations were still defined as TC ops that had equivalent versions as YAML operations. Reducing to a single compilation path guarantees that frontends can lower to their equivalent operations without missing the optimized fastpath. Some operations are maintained purely for testing purposes (mainly conv{1,2,3}D as they are included as sole tests in the vectorizaiton transforms. Differential Revision: https://reviews.llvm.org/D108169	2021-08-20 12:35:04 -07:00
Aart Bik	758ccf8506	[mlir][sparse] add test for DimOp folding Folding in the MLIR uses the order of the type directly but folding in the underlying implementation must take the dim ordering into account. These tests clarify that behavior and verify it is done right. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108474	2021-08-20 11:24:09 -07:00
Aart Bik	24ea94ad0c	[mlir][sparse][python] migrate more code from boilerplate into proper numpy land The boilerplate was setting up some arrays for testing. To fully illustrate python - MLIR potential, however, this data should also come from numpy land. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108336	2021-08-20 09:18:17 -07:00
Jacques Pienaar	a232a48dca	[mlir][ods] Skip adding TOC in doc gen when present Enables adding a TOC in the description to be able to interleave documentation before and after the TOC.	2021-08-20 07:01:54 -07:00
Denys Shabalin	1631d9a7ea	[mlir][linalg] Fix __repr__ implementation in const from opdsl Reviewed By: gysit Differential Revision: https://reviews.llvm.org/D108369	2021-08-20 12:39:57 +02:00
Vladislav Vinogradov	9775c0c9f0	[mlir] Fix ControlFlowInterfaces implementation for Async dialect * Add `RegionBranchTerminatorOpInterface` to `YieldOp`. * Implement `getSuccessorEntryOperands` in `ExecuteOp`. * Fix `getSuccessorRegions` implementation in `ExecuteOp`. Reviewed By: ezhulenev Differential Revision: https://reviews.llvm.org/D108373	2021-08-20 12:14:45 +03:00
Vladislav Vinogradov	d1883bc322	[mlir][NFC] Use explicit ::mlir namespace in mlir-tblgen generated code Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D108376	2021-08-20 11:52:25 +03:00
Rob Suderman	3205ee7e81	[mlir][tosa] Support UInt8 inputs and outputs for tosa.rescale Tosa rescale can contain uint8 types. Added support for these types using an unrealized conversion cast. Optimistically it would be better to use bitcast however it does not support unsigned integers. Differential Revision: https://reviews.llvm.org/D108427	2021-08-19 18:58:44 -07:00
Morten Borup Petersen	6c1436a9b0	[MLIR][SCF] Parenthesize multiple return types in scf.execute_region asm op Previously, ExecuteRegionOps with multiple return values would fail a round-trip test due to missing parenthesis around the types. Differential Revision: https://reviews.llvm.org/D108402	2021-08-19 21:31:51 +01:00
MaheshRavishankar	16ffb283c5	Revert "[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes." This reverts commit `95ddc8341a`. Differential Revision: https://reviews.llvm.org/D108396	2021-08-19 11:53:41 -07:00
MaheshRavishankar	95ddc8341a	[mlir][Linalg] Allow all build methods of Structured ops to specify additional attributes. Differential Revision: https://reviews.llvm.org/D108338	2021-08-19 11:14:35 -07:00
Matthias Springer	76a1861816	[mlir][SparseTensor] Split scf.for loop into masked/unmasked parts Apply the "for loop peeling" pattern from SCF dialect transforms. This pattern splits scf.for loops into full and partial iterations. In the full iteration, all masked loads/stores are canonicalized to unmasked loads/stores. Differential Revision: https://reviews.llvm.org/D107733	2021-08-19 21:53:11 +09:00
Matthias Springer	8e8b70aa84	[mlir][scf] Simplify affine.min ops after loop peeling Simplify affine.min ops, enabling various other canonicalizations inside the peeled loop body. affine.min ops such as: ``` map = affine_map<(d0)[s0, s1] -> (s0, -d0 + s1)> %r = affine.min #affine.min #map(%iv)[%step, %ub] ``` are rewritten them into (in the case the peeled loop): ``` %r = %step ``` To determine how an affine.min op should be rewritten and to prove its correctness, FlatAffineConstraints is utilized. Differential Revision: https://reviews.llvm.org/D107222	2021-08-19 17:24:53 +09:00
John Demme	96fbd5cd5e	[MLIR] [Python] Add `owner` to `mlir.ir.Block` Provides a way for python users to access the owning Operation from a Block.	2021-08-19 00:02:09 -07:00
Tobias Gysi	234c4d2362	[mlir][linalg] Set result types in all builders. Add code to set the result types in all yaml op builders. Reviewed By: nicolasvasilache Differential Revision: https://reviews.llvm.org/D108273	2021-08-19 06:19:12 +00:00
Matthias Springer	08dbed8a57	[mlir][linalg] Canonicalize dim ops of tiled_loop block args E.g.: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %x, %c0 : tensor<...> } ``` is rewritten to: ``` %y = ... : tensor<...> linalg.tiled_loop ... ins(%x = %y : tensor<...>) { tensor.dim %y, %c0 : tensor<...> } ``` Differential Revision: https://reviews.llvm.org/D108272	2021-08-19 11:24:33 +09:00
Matthias Springer	9329438244	[mlir][linalg] Remove ConstraintsSet class The same functionality can be implemented with FlatAffineValueConstraints. Differential Revision: https://reviews.llvm.org/D108179	2021-08-19 10:57:35 +09:00
Matthias Springer	c777e51468	[mlir][Analysis][NFC] FlatAffineConstraints: Use BoundType enum in functions Differential Revision: https://reviews.llvm.org/D108185	2021-08-19 10:33:42 +09:00
Aart Bik	d37d72eaf8	[mlir][sparse] use shared util for DimOp generation This shares more code with existing utilities. Also, to be consistent, we moved dimension permutation on the DimOp to the tensor lowering phase. This way, both pre-existing DimOps on sparse tensors (not likely but possible) as well as compiler generated DimOps are handled consistently. Reviewed By: bixia Differential Revision: https://reviews.llvm.org/D108309	2021-08-18 17:12:32 -07:00
Diego Caballero	b7cac864b2	[mlir] Fix typo in SuperVectorizer NFC. Reviewed By: aartbik Differential Revision: https://reviews.llvm.org/D108334	2021-08-18 22:55:12 +00:00
Chia-hung Duan	41e5dbe0fa	Enables inferring return types for Shape op if possible Reviewed By: jpienaar Differential Revision: https://reviews.llvm.org/D102565	2021-08-18 21:36:55 +00:00
Robert Suderman	76c9712196	[mlir][tosa] Fix clamp to restrict only within valid bitwidth range Its possible for the clamp to have invalid min/max values on its range. To fix this we validate the range of the min/max and clamp to a valid range. Reviewed By: NatashaKnk Differential Revision: https://reviews.llvm.org/D108256	2021-08-18 12:14:01 -07:00
William S. Moses	8c2ff7b69e	[MLIR] Correct linkage of lowered globalop LLVM considers global variables marked as externals to be defined within the module if it is initialized (including to an undef). Other external globals are considered as being defined externally and imported into the current translation unit. Lowering of MLIR Global Ops does not properly propagate undefined initializers, resulting in a global which is expected to be defined within the current TU, not being defined. Differential Revision: https://reviews.llvm.org/D108252	2021-08-18 11:09:43 -04:00
Butygin	ddc3d51d58	[mlir][spirv] Add (InBounds)PtrAccessChain ops Differential Revision: https://reviews.llvm.org/D108070	2021-08-18 17:59:21 +03:00
Jacques Pienaar	b41bfb819d	[mlir][ods] Fix packing in OperandOrAttribute Wrong combiner was used which led to information loss.	2021-08-17 20:55:48 -07:00
Lei Zhang	4c15ad2321	[mlir][linalg] Don't drop existing attributes when creating ops Reviewed By: mravishankar Differential Revision: https://reviews.llvm.org/D108219	2021-08-17 15:44:56 -04:00

1 2 3 4 5 ...

8429 commits