squiid/llvm

Author	SHA1	Message	Date
Stella Laurenzo	9f37775472	[cmake] Prefix gtest and gtest_main with "llvm_". The upstream project ships CMake rules for building vanilla gtest/gmock which conflict with the names chosen by LLVM. Since LLVM's build rules here are quite specific to LLVM, prefixing them to avoid collision is the right thing (i.e. there does not appear to be a path to letting someone replace LLVM's googletest with one they bring, so co-existence should be the goal). This allows LLVM to be included with testing enabled within projects that themselves have a dependency on an official gtest release. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D120789	2022-03-02 10:53:32 -08:00
Philip Reames	738042711b	Reapply "[SLP] Schedule only sub-graph of vectorizable instructions"" Root issue which triggered the revert was fixed in 689bab. No changes in the reapplied patch. Original commit message follows: SLP currently schedules all instructions within a scheduling window which stretches from the first instr uction potentially vectorized to the last. This window can include a very large number of unrelated instruct ions which are not being considered for vectorization. This change switches the code to only schedule the su b-graph consisting of the instructions being vectorized and their transitive users. This has the effect of greatly reducing the amount of work performed in large basic blocks, and thus greatly improves compile time on degenerate examples. To understand the effects, I added some statistics (not planned for upstream contribution). Here's an illustration from my motivating example: Before this patch: 704357 SLP - Number of calcDeps actions 699021 SLP - Number of schedule calls 5598 SLP - Number of ReSchedule actions 59 SLP - Number of ReScheduleOnFail actions 10084 SLP - Number of schedule resets 8523 SLP - Number of vector instructions generated After this patch: 102895 SLP - Number of calcDeps actions 161916 SLP - Number of schedule calls 5637 SLP - Number of ReSchedule actions 55 SLP - Number of ReScheduleOnFail actions 10083 SLP - Number of schedule resets 8403 SLP - Number of vector instructions generated I do want to highlight that there is a small difference in number of generated vector instructions. This example is hitting the bailout due to maximum window size, and the change in scheduling is slightly perturbing when and how we hit it. This can be seen in the RescheduleOnFail counter change. Given that, I think we can safely ignore. The downside of this change can be seen in the large test diff. We group all vectorizable instructions together at the bottom of the scheduling region. This means that vector instructions can move quite far from their original point in code. While maybe undesirable, I don't see this as being a major problem as this pass is not intended to be a general scheduling pass. For context, it's worth noting that the pre-scheduling that SLP does while building the vector tree is exactly the sub-graph scheduling implemented by this patch. Differential Revision: https://reviews.llvm.org/D118538	2022-03-02 10:47:20 -08:00
Louis Dionne	17e53983b8	[NFC] Fix typo in CMake comment	2022-03-02 13:28:34 -05:00
Philip Reames	689babdf68	[SLP] Don't try to vectorize allocas While a collection of allocas are technically vectorizeable - by forming a wider alloca - this was not a transform SLP actually knows how to do. Instead, we were forming a bundle with missing dependencies, and then relying on the scheduling code to preserve program order if multiple instructions were scheduleable at once. I haven't been able to write a test case, but I'm 99% sure this was wrong in some edge case. The unknown op case was flowing down the shufflevector path. This did result in some splat handling being lost with this change, but the same lack of splat handling is visible in a whole bunch of simple examples for the gather path. I didn't consider this interesting to fix given how narrow the splat of allocas case is.	2022-03-02 10:08:43 -08:00
David Green	97e0366d67	[AArch64] Add some fp16 conversion cost tests. NFC	2022-03-02 18:07:14 +00:00
Joseph Huber	3f7c3ff90e	[OpenMP] Handle sysroot option in offloading linker wrapper Summary: This patch correctly handles the `--sysroot=` option when passed to the linker wrapper. This allows users to correctly find libraries that may contain offloading code if using this option.	2022-03-02 13:02:41 -05:00
William S. Moses	758ddba381	[MLIR] Use Datalayout defaults when importing LLVM LLVM defines several default datalayouts for integer and floating point types that are not being considered when importing into MLIR. This patch remedies this. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120832	2022-03-02 13:00:53 -05:00
Craig Topper	ab7a7cc1dd	Revert "[LegalizeTypes][VP] Add splitting and widening support for VP_FNEG." This reverts commit `ac93f95861`. Committed by accident.	2022-03-02 10:00:22 -08:00
Stephen Long	2f6c14816a	[LoopPeel] Add EXPENSIVE_CHECKS ifdef guard around domtree verify call The verify call was taking 50% of the compile time in our internal LLVM fork when trying to unroll many loops. Differential Revision: https://reviews.llvm.org/D113028	2022-03-02 09:56:20 -08:00
Craig Topper	324c0a7206	[SelectionDAG][RISCV] Emit a canonical sign bit test from ExpandIntRes_ABS. Instead of emitting 0 > Hi, emit Hi < 0. If Hi needs to be expanded again this will allow the special case for sign bit tests in ExpandIntOp_SETCC to trigger. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D120761	2022-03-02 09:47:26 -08:00
Craig Topper	a1f8349d77	[RISCV] Don't combine ROTR ((GREV x, 24), 16)->(GREV x, 8) on RV64. This miscompile was introduced in D119527. This was a special pattern for rotate+bswap on RV32. It doesn't work for RV64 since the rotate needs to be half the bitwidth. The equivalent pattern for RV64 is ROTR ((GREV x, 56), 32) so match that instead. This could be generalized further as noted in the new FIXME. Reviewed By: Chenbing.Zheng Differential Revision: https://reviews.llvm.org/D120686	2022-03-02 09:47:06 -08:00
Craig Topper	ac93f95861	[LegalizeTypes][VP] Add splitting and widening support for VP_FNEG. Differential Revision: https://reviews.llvm.org/D120785	2022-03-02 09:47:05 -08:00
William S. Moses	bf6477ebeb	[MLIR][OpenMP] Place alloca scope within wsloop in scf.parallel to omp lowering https://reviews.llvm.org/D120423 replaced the use of stacksave/restore with memref.alloca_scope, but kept the save/restore at the same location. This PR places the allocation scope within the wsloop, thus keeping the same allocation scope as the original scf.parallel (e.g. no longer over stack allocating). Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120772	2022-03-02 12:46:58 -05:00
Philip Reames	29028e47bd	[slp] Add tests for cause of D118538 revert	2022-03-02 09:45:17 -08:00
Nikolas Klauser	b324798fc8	[libc++] Check clang-tidy version Reviewed By: ldionne, #libc Spies: libcxx-commits, arichardson Differential Revision: https://reviews.llvm.org/D120087	2022-03-02 18:42:04 +01:00
Sander de Smalen	ef9816e43c	[AArch64][SME] Don't infer -neon from +streaming-sve. In Streaming SVE mode full NEON is not available, even though this is implied from armv8-a. LLVM previously inferred that NEON needed to be disabled when setting +streaming-sve, but there is no need to infer this from +streaming-sve, because we can explicitly disable NEON using LLVM's attribute mechanism. This is specifically relevant because +streaming-sve is not a user-facing feature, but rather an LLVM internal feature. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D120809	2022-03-02 17:33:06 +00:00
Simon Pilgrim	75c4a92706	[X86] Enable v32i16 FSHL/FSHR support Now that we've improved splat detection we no longer see regressions in the funnel-shift-by-splat-amount test cases	2022-03-02 17:32:38 +00:00
William S. Moses	2af81c6978	[MLIR][Arith] Canonicalize cmpi of extui/extsi Canonicalize cmpi(eq, ext a, ext b) and cmpi(ne, ext a, ext b) Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120620	2022-03-02 12:30:03 -05:00
Valentin Clement	17d71347b2	[flang] Handle module in lowering pass This patch enables the lowering of basic modules and functions/subroutines in modules. This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: PeteSteinfeld Differential Revision: https://reviews.llvm.org/D120819 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-03-02 18:26:43 +01:00
Arthur O'Dwyer	e0e7bd15b9	[libc++] Add missing std:: qualification to __synth_three_way. This might be unobservable, since __synth_three_way is only ever called as a result of using an (ADL) operator on std::pair or std::tuple.	2022-03-02 12:15:19 -05:00
Valentin Clement	7e32cada01	[flang] Lower inquire statement This patch adds the lowering of the `inquire` statement. This patch is part of the upstreaming effort from fir-dev branch. Depends on D120822 Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D120823 Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-03-02 18:03:29 +01:00
Valentin Clement	46f46a3763	[flang] Lower basic IO file statements This patches adds lowering for couple of basic io statements such as `flush`, `endfile`, `backspace` and `rewind` This patch is part of the upstreaming effort from fir-dev branch. Depends on D120821 Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D120822 Co-authored-by: Eric Schweitz <eschweitz@nvidia.com> Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-03-02 18:01:23 +01:00
William S. Moses	db31da279f	[MLIR][Arith] Add constant folder for left shift Add constant folder for left shift Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D120661	2022-03-02 12:00:23 -05:00
Akira Hatanaka	d112cc2756	[NFC][Clang][OpaquePtr] Remove the call to Address::deprecated in CreatePointerBitCastOrAddrSpaceCast Differential Revision: https://reviews.llvm.org/D120757	2022-03-02 08:58:00 -08:00
Valentin Clement	db48f7b2f7	[flang] Lower IO open and close statements This patch adds the lowering of open and close statements This patch is part of the upstreaming effort from fir-dev branch. Reviewed By: schweitz Differential Revision: https://reviews.llvm.org/D120821 Co-authored-by: Jean Perier <jperier@nvidia.com>	2022-03-02 17:57:08 +01:00
Marek Kurdej	13351fdf8c	[clang-format] Recognize "if consteval". Fixes https://github.com/llvm/llvm-project/issues/54140. Reviewed By: MyDeveloperDay, JohelEGP Differential Revision: https://reviews.llvm.org/D120806	2022-03-02 17:46:45 +01:00
Daniel McIntosh	d636b76eca	[CodeGen] Use AdjustStackOffset for Callee Saved Registers in PEI::calculateFrameObjectOffsets Also, changes how the CSR loop is indexed, which should avoid bugs like the one fixed by rG4a57bb5a3b74bdad9b0518009a7d7ac7ca2ac650 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D120668	2022-03-02 11:41:12 -05:00
Nikita Popov	98cfcae4e9	Revert "[RISCV] Add cost modelling for masked memory op" This reverts commit `76f243b53b`. The newly added test fails.	2022-03-02 17:32:10 +01:00
Simon Pilgrim	3c568ee659	[X86] Add XOP coverage for vector-popcnt tests	2022-03-02 16:25:26 +00:00
Florian Hahn	8777cb66a8	[VPlan] Remove reliance on underlying instr for ScalarIVSteps (NFCI). Instead of relying on underlying instructions, this patch updates VPScalarIVStepsRecipe to only store the required type information. This removes access to unrelated information, as well as avoiding issues with the same underlying instruction being shared by multiple recipes. This change should only change the debug output and not cause any codegen changes, hence NFCI.	2022-03-02 16:23:19 +00:00
Jay Foad	5ddfedc956	[AMDGPU] Fix deleting of move-immediate instructions after folding SIInstrInfo::FoldImmediate tried to delete move-immediate instructions after folding them into their only use. This did not work because it was checking hasOneNonDBGUse after doing the fold, at which point there should be no uses. This seems to have no effect on codegen, it just means less stuff for DCE to clean up later. Differential Revision: https://reviews.llvm.org/D120815	2022-03-02 16:11:16 +00:00
Simon Pilgrim	7848bf16fe	[ObjectYAML] WasmWriter::writeSectionContent - use llvm::enumerate to fix 'side effect in assert' warning	2022-03-02 16:09:09 +00:00
Simon Pilgrim	ca94f28d15	[clang] ExprEngine::VisitCXXNewExpr - remove superfluous nullptr tests FD has already been dereferenced	2022-03-02 15:59:10 +00:00
Nikita Popov	6fde043951	[MachineSink] Disable if there are any irreducible cycles This is an alternative to D120330, which disables MachineSink for functions with irreducible cycles entirely. This avoids both the correctness problem, and ensures we don't perform non-profitable sinks into cycles. At the same time, it may also disable profitable sinks in the same function. This can be made more precise by using MachineCycleInfo in the future. Fixes https://github.com/llvm/llvm-project/issues/53990. Differential Revision: https://reviews.llvm.org/D120800	2022-03-02 16:57:29 +01:00
Alex Zinenko	eb27da7dec	[mlir] Ignore index data layout in translation to LLVM It can be present, but is irrelevant for the translation.	2022-03-02 16:56:21 +01:00
Nikita Popov	61580d0949	Reapply [InstCombine] Remove one-use limitation from X-Y==0 fold This is a recommit without changes. I originally reverted this due to a significant code-size regression on tramp3d-v4, however further investigation showed that in the tramp3d-v4 case this change enables additional optimizations (in particular more jump threading), which happens to reduce the size of a function just enough to be eligible for inlining at hot callsites, which results in the code size increase. As such, this was just bad luck. ----- This one-use limitation is artificial, we do not increase instruction count if we perform the fold with multiple uses. The motivating case is shown in @sub_eq_zero_select, where the one-use limitation causes us to miss a subsequent select fold. I believe the backend is pretty good about reusing flag-producing subs for cmps with same operands, so I think doing this is fine. Differential Revision: https://reviews.llvm.org/D120337	2022-03-02 16:43:33 +01:00
Simon Pilgrim	5cce97d61e	[DAG] isSplatValue - improve ISD::VECTOR_SHUFFLE splat detection Currently we only check for splat shuffles, this extends it to see if the source operand is a splat across the demanded elts based upon the shuffle mask	2022-03-02 15:32:24 +00:00
Arthur O'Dwyer	7624552ead	[libc++] Explicitly reject URNG types with signed result_types. Fixes #48965. Differential Revision: https://reviews.llvm.org/D120630	2022-03-02 10:28:48 -05:00
spupyrev	bcdc047731	speeding up ext-tsp for huge instances Differential Revision: https://reviews.llvm.org/D120780	2022-03-02 07:17:48 -08:00
Alex Zinenko	59814a8c99	[mlir] more Bazel changes for `23aa5a7446`	2022-03-02 16:16:14 +01:00
Chuanqi Xu	3eb2da76d7	[NFC] [C++20] [Modules] Simplify ActOnModuleImport by merging Path and Parition Reviewed By: iains Differential Revision: https://reviews.llvm.org/D120793	2022-03-02 23:06:36 +08:00
Momchil Velikov	63c9aca12a	Revert "[AArch64] Async unwind - function epilogues" This reverts commit `74319d6794`. It causes test failures that look like infinite loop in asan/hwasan unwinding.	2022-03-02 15:01:57 +00:00
Florian Hahn	9e46866c0c	[LV] Remove dead EntryVal argument from buildScalarSteps (NFC). The EntryVal argument is not needed after recent refactoring. Remove it.	2022-03-02 14:59:22 +00:00
Alex Tsao	76f243b53b	[RISCV] Add cost modelling for masked memory op The patch adds very basic cost model for masked memory op on scalable vector. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D117884	2022-03-02 22:48:41 +08:00
Pavel Labath	e8784289c0	Revert "Remove a top-level "using namespace" from TargetTransformInfoImpl.h" Causing failures on many bots. This reverts commit `31efecfde9`.	2022-03-02 15:47:41 +01:00
David Green	02de975259	[AArch64] Add some tests for the cost of extending an extract. NFC	2022-03-02 14:47:32 +00:00
Groverkss	bb9013555f	[MLIR][Presburger] Move functionality from IntegerPolyhedron to IntegerRelation This patch moves all functionality from IntegerPolyhedron to IntegerRelation. IntegerPolyhedron is now implemented as a relation with no domain. All existing functionality is extended to work on relations. This patch does not affect external users like FlatAffineConstraints as they can still continue to use IntegerPolyhedron abstraction. This patch is part of a series of patches to support relations in Presburger library. Reviewed By: arjunp Differential Revision: https://reviews.llvm.org/D120652	2022-03-02 20:10:44 +05:30
Pavel Labath	31efecfde9	Remove a top-level "using namespace" from TargetTransformInfoImpl.h Move it into the implementation of the function that needs it. Avoids polluting the namespace of all files including the header.	2022-03-02 15:38:20 +01:00
Pavel Labath	11511e9357	Remove "using namespace llvm" from ReleaseModeModelRunner.h A using directive in a header pollutes the namespace of all files which include that header. It seems this snuck in in D115764 by moving some code from a cpp file.	2022-03-02 15:29:12 +01:00
Pavel Labath	d2edca6276	[lldb/Platform] Prepare decouple instance and plugin names This patch changes the return value of Platform::GetName() to a StringRef, and uses the opportunity (compile errors) to change some callsites to use GetPluginName() instead. The two methods still remain hardwired to return the same thing, but this will change once the ideas in <https://discourse.llvm.org/t/multiple-platforms-with-the-same-name/59594> are implemented. Differential Revision: https://reviews.llvm.org/D119146	2022-03-02 14:57:01 +01:00

1 2 3 4 5 ...

416700 commits