squiid/llvm

Author	SHA1	Message	Date
Alex Zinenko	9a08f760fe	[mlir] Make JitRunnerMain main take a DialectRegistry Historically, JitRunner has been registering all available dialects with the context and depending on them without the real need. Make it take a registry that contains only the dialects that are expected in the input and stop linking in all dialects. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D96436	2021-02-11 14:50:48 +01:00
Valeriy Savchenko	81a9707723	[Attr] Apply GNU-style attributes to expression statements Before this commit, expression statements could not be annotated with statement attributes. Whenever parser found attribute, it unconditionally assumed that it was followed by a declaration. This not only doesn't allow expression attributes to have attributes, but also produces spurious error diagnostics. In order to maintain all previously compiled code, we still assume that GNU attributes are followed by declarations unless ALL of those are statement attributes. And even in this case we are not forcing the parser to think that it should parse a statement, but rather let it proceed as if no attributes were found. Differential Revision: https://reviews.llvm.org/D93630	2021-02-11 16:44:41 +03:00
Pavel Labath	7df4eaaa93	[lldb/test] Automatically find debug servers to test Our test configuration logic assumes that the tests can be run either with debugserver or with lldb-server. This is not entirely correct, since lldb server has two "personalities" (platform server and debug server) and debugserver is only a replacement for the latter. A consequence of this is that it's not possible to test the platform behavior of lldb-server on macos, as it is not possible to get a hold of the lldb-server binary. One solution to that would be to duplicate the server configuration logic to be able to specify both executables. However, that seems excessively redundant. A well-behaved lldb should be able to find the debug server on its own, and testing lldb with a different (lldb-\|debug)server does not seem very useful (even in the out-of-tree debugserver setup, we copy the server into the build tree to make it appear "real"). Therefore, this patch deletes the configuration altogether and changes the low-level server retrieval functions to be able to both lldb-server and debugserver paths. They do this by consulting the "support executable" directory of the lldb under test. Differential Revision: https://reviews.llvm.org/D96202	2021-02-11 14:43:53 +01:00
Simon Tatham	69f1a7ad82	[ARM] Copy-paste error in ARMv87a architecture definition. In the tablegen architecture definition, the Name field for the ARMv87a record read "ARMv86a". All the other records contain their own names. Corrected it to "ARMv87a", and added the necessary value in ARMArchEnum for that to refer to. Reviewed By: pratlucas Differential Revision: https://reviews.llvm.org/D96493	2021-02-11 13:35:56 +00:00
Sven van Haastregt	3a29ac2a61	[OpenCL] Fix missing const attributes for get_image_ builtins Various get_image builtin function declarations did not have the const attribute. Bring the const attributes of `-fdeclare-opencl-builtins` more in sync with `opencl-c.h`.	2021-02-11 13:05:26 +00:00
Max Kazantsev	418c218efa	Return "[Codegenprepare][X86] Use usub with overflow opt for IV increment" The patch did not account for one corner case where cmp does not dominate the loop latch. This patch adds this check, hopefully it's cheap because the CFG does not change during the transform, so DT queries should be executed quickly. If you see compile time slowness from this, please revert. Differential Revision: https://reviews.llvm.org/D96119	2021-02-11 19:49:23 +07:00
Nico Weber	78717f56ba	[gn build] Port `b4993cf54d`	2021-02-11 07:20:21 -05:00
Max Kazantsev	af1cccfa12	[Test] Add test that exposed failure on reverted patch in codegen	2021-02-11 19:16:55 +07:00
Aaron Ballman	81bc1365d8	Correct swift_bridge duplicate attribute warning logic The swift_bridge attribute warns when the attribute is applied multiple times to the same declaration. However, it warns about the arguments being different to the attribute without ever checking if the arguments actually are different. If the arguments are different, diagnose, otherwise silently accept the code. Either way, drop the duplicated attribute.	2021-02-11 07:11:27 -05:00
David Green	e771614bae	[ARM] Change getScalarizationOverhead overload used in gather costs. NFC This changes which of the getScalarizationOverhead overloads is used in the gather/scatter cost to use the base variant directly, not relying on the version using heuristics on the number of args with no args provided. It should still produce the same costs for scalarized gathers/scatters.	2021-02-11 11:58:55 +00:00
James Henderson	a31eae8405	[test][Dexter] Fix test failure if space in python path The '%dexter_regression_test' substitution was missing quotes around the python executable, unlike other substitutions of a similar nature in the file. This changes fixes the issue. Differential Revision: https://reviews.llvm.org/D96420 Reviewed by: jmorse, aganea	2021-02-11 11:46:39 +00:00
Andrzej Warzynski	0feff71eab	[flang][driver] Move standard macro predefs to a dedicated method (nfc) This patch just addresses one of the outstanding TODOs. More specifically, it moves all the outstanding standard macro predefinitions from `SetDefaultFortranOpts` to `setDefaultPredefinitions`. This dedicated method for standard macro predefs was introduced in: * https://reviews.llvm.org/D96032	2021-02-11 11:42:57 +00:00
Carl Ritson	c16f776028	[AMDGPU] Move kill lowering to WQM pass and add live mask tracking Move implementation of kill intrinsics to WQM pass. Add live lane tracking by updating a stored exec mask when lanes are killed. Use live lane tracking to enable early termination of shader at any point in control flow. Reviewed By: piotr Differential Revision: https://reviews.llvm.org/D94746	2021-02-11 20:31:29 +09:00
Sander de Smalen	41500836b0	NFC: Migrate CodeMetrics to work on InstructionCost This patch migrates cost values and arithmetic to work on InstructionCost. When the interfaces to TargetTransformInfo are changed, any InstructionCost state will propagate naturally. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D96030	2021-02-11 11:08:41 +00:00
Max Kazantsev	90081f3020	Revert "[Codegenprepare][X86] Use usub with overflow opt for IV increment" This reverts commit `3d15b7e7df`. We've found an internal failure, need to analyze.	2021-02-11 17:52:11 +07:00
David Green	7786ac8377	[ARM] Remove dead mov's in preheader of tail predicated loops With t2DoLoopDec we can be left with some extra MOV's in the preheaders of tail predicated loops. This removes them, in the same way we remove other dead variables. Differential Revision: https://reviews.llvm.org/D91857	2021-02-11 10:48:20 +00:00
Haojian Wu	6c47eafb39	[clang][index] report references from unreslovedLookupExpr. Fix https://github.com/clangd/clangd/issues/675 Differential Revision: https://reviews.llvm.org/D96262	2021-02-11 11:08:26 +01:00
Sam McCall	5c55d3747b	[CodeComplete] Member completion: heuristically resolve some dependent base exprs Today, inside a template, you can get completion for: Foo<T> t; t.^ t has dependent type Foo<T>, and we use the primary template to find its members. However we also want this to work: t.foo.bar().^ The type of t.foo.bar() is DependentTy, so we attempt to resolve using similar heuristics (e.g. primary template). Differential Revision: https://reviews.llvm.org/D96376	2021-02-11 11:03:40 +01:00
Raphael Isemann	a874d182c6	[DebugInfo] Prevent inlining in NRVO-string test cases Since the new pass manager has been enabled by default these tests had their -O1 variations failing due to the tested functions being inlined. This just adds no_inline to the respective code similar to what we did in other tests (e.g. `aa56b30014` ).	2021-02-11 10:33:30 +01:00
Sven van Haastregt	0b448854da	[OpenCL] Add cl_khr_subgroup_extended_types to TableGen BIFs Add the builtin functions brought by the cl_khr_subgroup_extended_types extension to `-fdeclare-opencl-builtins`. Differential Revision: https://reviews.llvm.org/D96279	2021-02-11 09:32:42 +00:00
Sander de Smalen	703130fb01	[TTI] Change TargetTransformInfo::getMinimumVF to return ElementCount This will be needed in the loop-vectorizer where the minimum VF requested may be a scalable VF. getMinimumVF now takes an additional operand 'IsScalableVF' that indicates whether a scalable VF is required. Reviewed By: kparzysz, rampitec Differential Revision: https://reviews.llvm.org/D96020	2021-02-11 09:08:48 +00:00
Stephan Herhut	33a58c1c5c	[mlir][gpu] Allow all dialects in SCF to GPU conversion. With the standard dialect being split up, the set of dialects that are used when converting to GPU is growing. This change modifies the SCFToGpu pass to allow all operations inside launch bodies. Differential Revision: https://reviews.llvm.org/D96480	2021-02-11 10:02:26 +01:00
Markus Lavin	9498315c9b	Expand masked mem intrinsics correctly wrt big-endian Need to take endianness into account when doing vector to scalar casts such as %bc = bitcast <8 x i1> %v to i8 Companion commit for https://reviews.llvm.org/D94867 Upload in response to https://lists.llvm.org/pipermail/llvm-dev/2021-January/147862.html Attempting to document the actual memory layout rules for vectors in https://reviews.llvm.org/D94964 Differential Revision: https://reviews.llvm.org/D94765	2021-02-11 08:59:52 +00:00
David Green	1db7b9ceaa	[ARM] Make a BE predicate bitcast consistent with the rest of llvm We were storing predicate registers, such as a <8 x i1>, in the opposite order to how the rest of llvm expects. This actually turns out to be correct for the one place that usually uses it - the ScalarizeMaskedMemIntrin pass, but only because the pass was incorrect itself. This fixes the order so that bits are stored in the opposite order and bitcasts work as expected. This allows the Scalarization pass to be fixed, as in https://reviews.llvm.org/D94765. Differential Revision: https://reviews.llvm.org/D94867	2021-02-11 08:59:52 +00:00
Haojian Wu	e159a3ced4	[Syntax] Remove a strict valid source location assertion for TypeLoc. The EndLoc of a type loc can be invalid for broken code. Also extend the existing test to support error code with `error-ok` annotation. Differential Revision: https://reviews.llvm.org/D96261	2021-02-11 09:53:52 +01:00
Haojian Wu	35a5e88390	[Syntax] NFC, Simplify a test with annotations	2021-02-11 09:49:06 +01:00
Sander de Smalen	be9bbb57f4	[LoopVectorize] NFC: Change selectVectorizationFactor to work on ElementCount. This patch is NFC and changes occurrences of `unsigned Width` and `unsigned i` to work on type ElementCount instead. This patch is a preparatory patch with the ultimate goal of making `computeMaxVF()` return both a max fixed VF and a max scalable VF, so that `selectVectorizationFactor()` can pick the most cost-effective vectorization factor. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D96019	2021-02-11 08:47:59 +00:00
Haojian Wu	df1a17c219	[clang-check] Add tokens-dump in clang-check. It is useful for syntax-tree developement. Reviewed By: kbobyrev Differential Revision: https://reviews.llvm.org/D96017	2021-02-11 09:40:47 +01:00
Sander de Smalen	3b4f706ae1	[AArch64][SVE] Asm: Fix supported immediates for DUP/CPY This patch fixes an issue in the implementation of DUP/CPY where certain immediates were not accepted. Immediates should be interpreted as a two's complement encoding of a value that fits the number of bits of the element type. mov z0.b, p0/z, #127 <=> mov z0.b, p0/z, #-129 <=> mov z0.b, p0/z, #0xffffffffffffff7f This behaviour is in line with the GNU assembler. Reviewed By: c-rhodes Differential Revision: https://reviews.llvm.org/D94776	2021-02-11 08:14:15 +00:00
Hanhan Wang	9325b8da17	[mlir][Linalg] Add conv ops with TF definition. The dimension order of a filter in tensorflow is [filter_height, filter_width, in_channels, out_channels], which is different from current definition. The current definition follows TOSA spec. Add TF version conv ops to .tc, so we do not have to insert a transpose op around a conv op. Reviewed By: antiagainst Differential Revision: https://reviews.llvm.org/D96038	2021-02-10 22:59:38 -08:00
Arthur Eubanks	8334cdde2e	[NFC] Don't pass redundant arguments Some parameters were already part of the Config passed in.	2021-02-10 22:06:49 -08:00
Sanjoy Das	bac1f12727	NFC; fix typo in comment This should have gone in with `a76761cf0d`.	2021-02-10 21:34:29 -08:00
Sanjoy Das	a76761cf0d	NFC comment-only cleanups - Remove leftover comment from `de2568aab8` - Fix a typo in a comment	2021-02-10 21:30:52 -08:00
Max Kazantsev	3d15b7e7df	[Codegenprepare][X86] Use usub with overflow opt for IV increment Function `replaceMathCmpWithIntrinsic` artificially limits the scope of the optimization, setting a requirement of two instructions be in the same block, due to two reasons: - usage of DT for more general check is costly in terms of compile time; - risk of creating a new value that lives through multiple blocks. Because of this, two semantically equivalent tests may be or not be the subject of this opt depending on where the binary operation is located. See `test/CodeGen/X86/usub_inc_iv.ll` for motivation There is one important particular case where this limitation is too strict: it is when the binary operation is the increment of the induction variable. As result, the application of this opt becomes fragile and highly reliant on where other passes decide to place IV increment. In most cases, they place it in the end of the latch block, killing the opt opportunity (when in fact it does not matter where to insert the actual instruction). This patch handles this particular case separately. - The detector does not use dom tree and has constant cost; - The value of IV or IV.next lives through all loop in any case, so this should not create a new unexpected long-living value. As result, the transform becomes more robust. It also seems to lead to better code generation in some cases (see `test/CodeGen/X86/lsr-loop-exit-cond.ll`). Differential Revision: https://reviews.llvm.org/D96119 Reviewed By: spatel, reames	2021-02-11 11:59:45 +07:00
Max Kazantsev	6efcc2fd3f	[Test] Add negative tests where usub optimization should not apply	2021-02-11 11:59:44 +07:00
Yang Fan	984cfdc6ee	[clang][cli] Fix gcc warning (NFC) GCC warning: ``` /llvm-project/clang/lib/Frontend/TestModuleFileExtension.cpp:131:20: warning: ‘llvm::raw_ostream& clang::operator<<(llvm::raw_ostream&, const clang::TestModuleFileExtension&)’ has not been declared within ‘clang’ 131 \| llvm::raw_ostream &clang::operator<<(llvm::raw_ostream &OS, \| ^~~~~ In file included from /llvm-project/clang/lib/Frontend/TestModuleFileExtension.cpp:8: /llvm-project/clang/lib/Frontend/TestModuleFileExtension.h:75:3: note: only here as a ‘friend’ 75 \| operator<<(llvm::raw_ostream &OS, const TestModuleFileExtension &Extension); \| ^~~~~~~~ ```	2021-02-11 12:38:09 +08:00
Carl Ritson	e5b0b434f6	[AMDGPU] Refactor MIMG tables to better handle hardware variants Add mimgopc object to represent the opcode allowing different opcodes for different hardware variants. This enables image_atomic_fcmpswap, image_atomic_fmin, and image_atomic_fmax on GFX10 Reviewed By: foad, rampitec Differential Revision: https://reviews.llvm.org/D96309	2021-02-11 13:22:41 +09:00
Michael Kruse	23753c6088	[Polly] Hide Simplify implementation from header. NFC. Move SimplifiyVisitor from Simplify.h to Simplify.cpp. It is not relevant for applying the pass in either the NewPM or the legacyPM. Rename it to SimplifyImpl to account for that. This is possible due its state not being necessary to be preserved between runs and thefore SimplifyImpl not needed to be held in the pass object. Instead, SimplifyImpl is only instatiated for the current Scop. In the NewPM as a function-local variable, and in the legacy PM inside a llvm::Optional object because the state must be preserved between the printScop (invoked by opt -analyze) and the most recent runOnScop calls.	2021-02-10 22:11:52 -06:00
Kazu Hirata	c5e90a8857	[AsmPrinter] Use range-based for loops (NFC)	2021-02-10 20:01:22 -08:00
Kazu Hirata	b16c6b2a83	[TableGen] Use ListSeparator (NFC)	2021-02-10 20:01:20 -08:00
Kazu Hirata	d12a0f4fc0	[GCOV] Drop unnecessary const from return types (NFC) Identified with readability-const-return-type.	2021-02-10 20:01:18 -08:00
Craig Topper	5189c5b940	[X86] Simplify patterns for avx512 vpcmp. NFC This removes the commuted PatFrags that only existed to carry an SDNodeXForm in its OperandTransform field. We know all the places that need to use the commuted SDNodeXForm and there is one transform shared by signed and unsigned compares. So just hardcode the the SDNodeXForm where it is needed and use the non commuted PatFrag in the pattern. I think when I wrote this I thought the SDNodeXForm name had to match what is in the PatFrag that is being used. But that's not true. The OperandTransform is only used when the PatFrag is used in an instruction pattern and not a separate Pat pattern. All the commuted cases are Pat patterns.	2021-02-10 19:24:27 -08:00
Michael Kruse	91ca9adc9e	[Polly] Avoid "using namespace llvm" in public headers. NFC. "using namespace" pollutes the namespace of every file that includes such a header and universally considered a bad thing. Even the variant namespace polly { using namespace llvm; } (previously used by LoopGenerators.h) imports more symbols than the file is in control of. The header may include a fixed set of files from LLVM, but the header itself may by be included together with other headers from LLVM. For instance, LLVM's MemorySSA.h and Polly's ScopInfo.h both declare a class 'MemoryAccess' which may conflict. Instead of prefixing everything in Polly's header files, this patch adds 'using' statements to import only the symbols that are actually referenced in Polly. This approach is also used by MLIR to import commonly used symbols into the mlir namespace. This patch also puts the symbols declared in IslNodeBuilder.h into the Polly namespace to also be able to use the imported symbols.	2021-02-10 20:58:33 -06:00
Valentin Clement	5ad416ca78	[flang][fir] Fix Werror build failure after D96422	2021-02-10 21:44:16 -05:00
Daniel Hwang	2407eb08a5	[analyzer] Update static analyzer to be support sarif-html Updates static analyzer to be able to generate both sarif and html output in a single run similar to plist-html. Differential Revision: https://reviews.llvm.org/D96389	2021-02-10 18:34:53 -08:00
Jessica Clarke	ca606dc988	[RISCV] More whitespace and comment typo fixes in RISCVInstrInfoC.td	2021-02-11 02:32:36 +00:00
Jessica Clarke	0973ce8596	[RISCV] Fix whitespace in RISCVInstrInfoC.td	2021-02-11 02:23:09 +00:00
Aart Bik	11bec2a81c	[mlir][sparse] reduce tensor dimensions in sparse test Rationale: BuiltinTypes.cpp observed overflow when computing size of tensor<100x200x300x400x500x600x700x800xf32>. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D96475	2021-02-10 17:59:19 -08:00
Craig Topper	350ab4e617	[RISCV] Use OperandTransform field of ImmLeaf to slightly simplify a couple bitmanip patterns. NFC This binds the SDNodeXForm to the ImmLeaf so we only need to mention the ImmLeaf in both the input and output pattern.	2021-02-10 17:52:07 -08:00
Mehdi Amini	b1aaed023e	Enable `Pass::initialize()` to fail by returning a LogicalResult Differential Revision: https://reviews.llvm.org/D96474	2021-02-11 01:51:53 +00:00

1 2 3 4 5 ...

379651 commits