squiid/llvm

Author	SHA1	Message	Date
Joseph Huber	260c5df2d5	[OpenMP] Add better testing for the linker wrapper The linker wrapper is used to perform linking and wrapping of embedded device object files. Currently its internals are not able to be tested easily. This patch adds the `--dry-run` and `--print-wrapped-module` options to investigate the link jobs that will be run along with the wrapped code that will be created to register the binaries. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D124039	2022-04-19 18:37:09 -04:00
Peter Klausler	497a5f0415	[BPF] Fix a bug in BPFMISimplifyPatchable pass LLVM BPF pass SimplifyPatchable is used to do necessary code conversion for CO-RE operations. When studying bpf selftest 'exhandler', I found a corner case not handled properly. The following is the C code, modified from original 'exhandler' code. int g; int test(struct t1 p) { struct t2 q = p->q; if (q) return 0; struct t3 f = q->f; if (!f) g = 5; return 0; } For code: struct t3 f = q->f; if (!f) ... The IR before BPFMISimplifyPatchable pass looks like: %5:gpr = LD_imm64 @"llvm.t2:0:8$0:1" %6:gpr = LDD killed %5:gpr, 0 %7:gpr = LDD killed %6:gpr, 0 JNE_ri killed %7:gpr, 0, %bb.3 JMP %bb.2 Note that compiler knows q = 0 based dataflow and value analysis. The correct generated code after the pass should be %5:gpr = LD_imm64 @"llvm.t2:0:8$0:1" %7:gpr = LDD killed %5:gpr, 0 JNE_ri killed %7:gpr, 0, %bb.3 JMP %bb.2 But the current implementation did further optimization for the above code and generates %5:gpr = LD_imm64 @"llvm.t2:0:8$0:1" JNE_ri killed %5:gpr, 0, %bb.3 JMP %bb.2 which is incorrect. This patch added a cache to remember those load insns not associated with CO-RE offset value and will skip these load insns during transformation. Differential Revision: https://reviews.llvm.org/D123883	2022-04-19 15:24:26 -07:00
John Demme	6b0bed7ea5	[MLIR] [Python] Add a method to clear live operations map Introduce a method on PyMlirContext (and plumb it through to Python) to invalidate all of the operations in the live operations map and clear it. Since Python has no notion of private data, an end-developer could reach into some 3rd party API which uses the MLIR Python API (that is behaving correctly with regard to holding references) and grab a reference to an MLIR Python Operation, preventing it from being deconstructed out of the live operations map. This allows the API developer to clear the map when it calls C++ code which could delete operations, protecting itself from its users. Reviewed By: ftynse Differential Revision: https://reviews.llvm.org/D123895	2022-04-19 15:14:09 -07:00
Craig Topper	6db0afb44e	[RISCV] Fold (xor (sllw 1, x), -1) -> (rolw ~1, x). There's an existing generic combine that does this for legal types. This patch adds a RISCV specific combine for W instructions. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D123983	2022-04-19 15:03:43 -07:00
Fangrui Song	e3cf153522	[Driver][test] Remove unneeded clang from -cc1 CHECK lines The convention is to omit "clang" for -cc1 CHECK lines and test that -triple is adjacent to -cc1.	2022-04-19 14:58:48 -07:00
Richard Smith	63814be4fa	[modules] Merge variable template specializations.	2022-04-19 14:48:42 -07:00
Yonghong Song	954ba6045d	[BPF] Emit fatal error if out of range for FK_PCRel_2 branch target Currently for the branch insn like "if $dst "#OpcodeStr#" $imm goto $BrDst" The $BrDst range needs to be in the range of [INT16_MIN, INT16_MAX]. When running bpf selftest with latest llvm, I found pyperf600.o generated insn with range outside of [INT16_MIN, INT16_MAX], which caused verifier failure. See below insn #12. 0000000000000000 <on_event>: ; { 0: 7b 1a 00 ff 00 00 00 00 (u64 )(r10 - 256) = r1 ; uint64_t pid_tgid = bpf_get_current_pid_tgid(); 1: 85 00 00 00 0e 00 00 00 call 14 2: bf 06 00 00 00 00 00 00 r6 = r0 ; pid_t pid = (pid_t)(pid_tgid >> 32); 3: bf 61 00 00 00 00 00 00 r1 = r6 4: 77 01 00 00 20 00 00 00 r1 >>= 32 5: 63 1a fc ff 00 00 00 00 (u32 )(r10 - 4) = r1 6: bf a2 00 00 00 00 00 00 r2 = r10 7: 07 02 00 00 fc ff ff ff r2 += -4 ; PidData* pidData = bpf_map_lookup_elem(&pidmap, &pid); 8: 18 01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 r1 = 0 ll 10: 85 00 00 00 01 00 00 00 call 1 11: bf 08 00 00 00 00 00 00 r8 = r0 ; if (!pidData) 12: 15 08 15 e8 00 00 00 00 if r8 == 0 goto -6123 <LBB0_27588+0xffffffffffdae100> 13: b4 01 00 00 00 00 00 00 w1 = 0 We may need to add new insn to extend the range of $BrDst. This patch added a fatal error if out of range so compiler can warn the otherwise incorrect code generation. Differential Revision: https://reviews.llvm.org/D123877	2022-04-19 14:41:08 -07:00
LLVM GN Syncbot	858e40d8f3	[gn build] Port `bac6cd5bf8`	2022-04-19 21:23:58 +00:00
Paul Kirth	bac6cd5bf8	[misexpect] Re-implement MisExpect Diagnostics Reimplements MisExpect diagnostics from D66324 to reconstruct its original checking methodology only using MD_prof branch_weights metadata. New checks rely on 2 invariants: 1) For frontend instrumentation, MD_prof branch_weights will always be populated before llvm.expect intrinsics are lowered. 2) for IR and sample profiling, llvm.expect intrinsics will always be lowered before branch_weights are populated from the IR profiles. These invariants allow the checking to assume how the existing branch weights are populated depending on the profiling method used, and emit the correct diagnostics. If these invariants are ever invalidated, the MisExpect related checks would need to be updated, potentially by re-introducing MD_misexpect metadata, and ensuring it always will be transformed the same way as branch_weights in other optimization passes. Frontend based profiling is now enabled without using LLVM Args, by introducing a new CodeGen option, and checking if the -Wmisexpect flag has been passed on the command line. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D115907	2022-04-19 21:23:48 +00:00
Yitzhak Mandelbaum	eb2131bdba	[clang][dataflow] Do not crash on missing `Value` for struct-typed variable init. Remove constraint that an initializing expression of struct type must have an associated `Value`. This invariant is not and will not be guaranteed by the framework, because of potentially uninitialized fields. Differential Revision: https://reviews.llvm.org/D123961	2022-04-19 20:52:29 +00:00
Atmn Patel	489894f363	[Libomptarget][remote] Fix compile-time error This fixes a compile-time error recently introduced within the remote offloading plugin. This patch also removes some extra linker flags that are unnecessary, and adds an explicit abseil linker flag without which we occasionally get problems. Differential Revision: https://reviews.llvm.org/D119984	2022-04-19 16:46:01 -04:00
LLVM GN Syncbot	b5e962a510	[gn build] Port `c57f03415f`	2022-04-19 20:13:49 +00:00
Alex Langford	c57f03415f	[clang][Sema] Add flag to LookupName to force C/ObjC codepath Motivation: The intent here is for use in Swift. When building a clang module for swift consumption, swift adds an extension block to the module for name lookup purposes. Swift calls this a SwiftLookupTable. One purpose that this serves is to handle conflicting names between ObjC classes and ObjC protocols. They exist in different namespaces in ObjC programs, but in Swift they would exist in the same namespace. Swift handles this by appending a suffix to a protocol name if it shares a name with a class. For example, if you have an ObjC class named "Foo" and a protocol with the same name, the protocol would be renamed to "FooProtocol" when imported into swift. When constructing the previously mentioned SwiftLookupTable, we use Sema::LookupName to look up name conflicts for the previous problem. By this time, the Parser has long finished its job so the call to LookupName gets nullptr for its Scope (TUScope will be nullptr by this point). The C/ObjC path does not have this problem because it only uses the Scope in specific scenarios. The C++ codepath uses the Scope quite extensively and will fail early on if the Scope it gets is null. In our very specific case of looking up ObjC classes with a specific name, we want to force sema::LookupName to take the C/ObjC codepath even if C++ or ObjC++ is enabled.	2022-04-19 12:57:14 -07:00
Krzysztof Drewniak	ddc2eb0ada	[mlir] Adds getUpperBound() to LoopLikeInterface. getUpperBound is analogous to getLowerBound(), except for the upper bound, and is used in range analysis. Reviewed By: Mogball Differential Revision: https://reviews.llvm.org/D124020	2022-04-19 19:56:44 +00:00
Alex Zinenko	0eb403ad1b	[mlir][transform] Introduce transform.sequence op Sequence is an important transform combination primitive that just indicates transform ops being applied in a row. The simplest version requires fails immediately if any transformation in the sequence fails. Introducing this operation allows one to start placing transform IR within other IR. Depends On D123135 Reviewed By: Mogball, rriddle Differential Revision: https://reviews.llvm.org/D123664	2022-04-19 21:41:02 +02:00
Denys Petrov	e37726beb2	[analyzer] Implemented RangeSet::Factory::castTo function to perform promotions, truncations and conversions. Summary: Handle casts for ranges working similarly to APSIntType::apply function but for the whole range set. Support promotions, truncations and conversions. Example: promotion: char [0, 42] -> short [0, 42] -> int [0, 42] -> llong [0, 42] truncation: llong [4295033088, 4295033130] -> int [65792, 65834] -> short [256, 298] -> char [0, 42] conversion: char [-42, 42] -> uint [0, 42]U[4294967254, 4294967295] -> short[-42, 42] Differential Revision: https://reviews.llvm.org/D103094	2022-04-19 22:34:03 +03:00
Ashay Rane	25c218be36	[MLIR] Add function to create BFloat16 array attribute This patch adds a new function `mlirDenseElementsAttrBFloat16Get()`, which accepts the shaped type, the number of BFloat16 values, and a pointer to an array of BFloat16 values, each of which is a `uint16_t` value. Reviewed By: stellaraccident Differential Revision: https://reviews.llvm.org/D123981	2022-04-19 19:27:06 +00:00
Jonas Paulsson	0f8c626723	[BuildLibCalls] Introduce getOrInsertLibFunc() for use when building libcalls. A new set of overloaded functions named getOrInsertLibFunc() are now supposed to be used instead of getOrInsertFunction() when building a libcall from within an LLVM optimizer(). The idea is that this new function also makes sure that any mandatory argument attributes are added to the function prototype (after calling getOrInsertFunction()). inferLibFuncAttributes() is renamed to inferNonMandatoryLibFuncAttrs() as it only adds attributes that are not necessary for correctness but merely helping with later optimizations. Generally, the front end is responsible for building a correct function prototype with the needed argument attributes. If the middle end however is the one creating the call, e.g. when replacing one libcall with another, it then must take this responsibility. This continues the work of properly handling argument extension if required by the target ABI when building a lib call. getOrInsertLibFunc() now does this for all libcalls currently built by any LLVM optimizer. It is expected that when in the future a new optimization builds a new libcall with an integer argument it is to be added to getOrInsertLibFunc() with the proper handling. Note that not all targets have it in their ABI to sign/zero extend integer arguments to the full register width, but this will be done selectively as determined by getExtAttrForI32Param(). Review: Eli Friedman, Nikita Popov, Dávid Bolvanský Differential Revision: https://reviews.llvm.org/D123198	2022-04-19 21:22:07 +02:00
Sanjay Patel	8a9c70fc01	[InstCombine] C0 shift (X add nuw C) --> (C0 shift C) shift X With 'nuw' we can convert the increment of the shift amount into a pre-shift (constant fold) of the shifted constant: https://alive2.llvm.org/ce/z/FkTyR2 Fixes issue #41976	2022-04-19 15:21:34 -04:00
Sanjay Patel	a9aa14e0cb	[InstCombine] add tests for shift-of-add with constants; NFC	2022-04-19 15:21:34 -04:00
Kirill Stoimenov	ab99a414ef	[ASan] Removed checks if the tested functions were emitted. Reviewed By: eugenis Differential Revision: https://reviews.llvm.org/D124030	2022-04-19 19:20:52 +00:00
Vasileios Porpodas	8d4b5e0833	[NFC][SLP] Improved description of getShallowScore() and getScoreAtLevelRec() Differential Revision: https://reviews.llvm.org/D124027	2022-04-19 12:15:36 -07:00
Yaxun (Sam) Liu	800f26386c	[CUDA][HIP] Fix delete operator for -fopenmp When new operator is called in OpenMP parallel region, delete operator is resolved and checked. Due to similar issue fixed by https://reviews.llvm.org/D121765, when resolving delete operator, the caller was not determined correctly, which results in error as shown in https://godbolt.org/z/jKhd8qKos. This patch fixes the issue in a similar way as https://reviews.llvm.org/D121765 Reviewed by: Artem Belevich Differential Revision: https://reviews.llvm.org/D123976	2022-04-19 14:28:03 -04:00
Andrew Litteken	3de29ad209	[IRSim] Ignore debug instructions when creating canonical numbering When constructing canonical relationships between two regions, the first instruction of a basic block from the first region is used to find the corresponding basic block from the second region. However, debug instructions are not included in similarity matching, and therefore do not have a canonical numbering. This patch makes sure to ignore the debug instructions when finding the first instruction in a basic block. Reviewer: paquette Differential Revision: https://reviews.llvm.org/D123903	2022-04-19 13:18:28 -05:00
Fangrui Song	06cafd045e	[Go] Remove PopulateLTOPassManager binding after D123882	2022-04-19 11:16:27 -07:00
Nico Weber	f6b2ddbf38	[compiler-rt] Use ld64 flag -lto_library instead of DYLD_LIBRARY_PATH Makes bin/llvm-lit \ projects/compiler-rt/test/profile/Profile-arm64/instrprof-darwin-dead-strip.c pass on my machine. Without this change, ld64 complains that the bitcode was generated by LLVM 15 while the reader is 13.1 -- the version of Xcode on my machine. Looks like the DYLD_LIBRARY_PATH technique isn't working. -lto_library was added back in ld64-136, which was in Xcode 4.6, which was released over 10 years ago. So relying on it should be safe by now. Differential Revision: https://reviews.llvm.org/D124018	2022-04-19 13:54:57 -04:00
Mehdi Amini	83892d76f4	Print custom assembly on pass failure by default The printer is now resilient to invalid IR and will already automatically fallback to the generic form on invalid IR. Using the generic printer on pass failure was a conservative option before the printer was made failsafe. Reviewed By: lattner, rriddle, jpienaar, bondhugula Differential Revision: https://reviews.llvm.org/D123915	2022-04-19 17:29:08 +00:00
Kadir Cetinkaya	1aa3a54921	[clangd] Dont include version string in update tasks This increases cardinality of span latency metrics. Currently this was being shown to the user via file status updates as `Running Update (x)` after this change we'll only display `Running Update`. This also affects logs in case of a crash, but contents and version number for inputs are printed separately in that case already. Differential Revision: https://reviews.llvm.org/D124013	2022-04-19 19:27:04 +02:00
Mehdi Amini	2d6335421f	Apply clang-tidy fixes for llvm-qualified-auto in OpenMPToLLVMIRTranslation.cpp (NFC)	2022-04-19 17:20:57 +00:00
Mehdi Amini	f9735be7e2	Apply clang-tidy fixes for performance-unnecessary-value-param in ControlFlowInterfaces.cpp (NFC)	2022-04-19 17:20:57 +00:00
Sanjay Patel	5f7c385498	[InstCombine] add tests for freeze of partial undef vector constants; NFC	2022-04-19 12:41:50 -04:00
Nikita Popov	f2d955a8a4	[OCaml] Fix pass builder test The LTO API has been removed.	2022-04-19 18:34:53 +02:00
Dmitry Makogon	084ad1ebee	[Test] Add more tests showing duplicate PHIs generated by RS4GC (NFC) This adds more tests with derived pointers.	2022-04-19 23:05:50 +07:00
Nikita Popov	dbe6d85b8b	[PPCGCodeGeneration] Look for function instead of function pointer type What this code is actually interested in are references to functions. Use of a function pointer type is being used as an imprecise proxy for that.	2022-04-19 17:59:34 +02:00
Nikita Popov	880014b593	[PPCGCodeGeneration] Avoid another pointer element type access Use an API that returns both the address and the element type, and use that for the load type.	2022-04-19 17:26:33 +02:00
David Green	cc03414125	[PerfectShuffle] Remove unused variables from D123386. NFC	2022-04-19 16:22:04 +01:00
Florian Hahn	4026b718b8	[VPlan] Remove unused SCEV forward declaration (NFC).	2022-04-19 17:16:17 +02:00
Nikita Popov	ee6bd28f23	[PPCGCodeGeneration] Avoid pointer element type access Pass through the ArrayTy instead.	2022-04-19 17:09:34 +02:00
Kirill Stoimenov	64c929ec09	[ASan] Fixed a reporting bug in (load\|store)N functions which would print unknown-crash instead of the proper error message when a the data access is unaligned. Reviewed By: kda, eugenis Differential Revision: https://reviews.llvm.org/D123643	2022-04-19 15:07:17 +00:00
Jonas Paulsson	4aa5dc15f0	[SystemZ] Handle SystemZ specific inline assembly address operands. Handle ZQ, ZR, ZS and ZT inline assembly operand constraints. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D110267	2022-04-19 16:55:45 +02:00
Tom Ritter	82f3ed9904	[analyzer] Expose Taint.h to plugins Reviewed By: NoQ, xazax.hun, steakhal Differential Revision: https://reviews.llvm.org/D123155	2022-04-19 16:55:01 +02:00
gbreynoo	42865819b2	[llvm-ar][test] Rename two tests and use correct thin command Two tests used the term "full archive" rather than "regular", these have been updated including the test names. They now also use --thin rather than the deprecated T. This change was made in preparation of D123142. Differential Revision: https://reviews.llvm.org/D123778	2022-04-19 15:13:37 +01:00
Qiongsi Wu	2512a875cc	[clang] Adding Platform/Architecture Specific Resource Header Installation Targets The goal of this patch is to improve distribution build's flexibility to include only applicable header files. Currently, the clang-resource-headers target contains nearly all the files in clang/lib/Headers. Most of these files are platform specific (e.g. immintrin.h is x86 specific). A distribution build will have to either include all the headers for all the platforms, or not include any headers. For example, if a distribution build for powerpc includes the clang-resource-headers target, it will include all the x86 specific headers, even-though the x86 specific headers cannot be used. This patch breaks up the clang-resource-headers list to a core list and platform specific lists. With the patch, a distribution build can now include the ppc-resource-headers to include the headers applicable to the powerpc platform. Specifically, one can now have cmake ... LLVM_DISTRIBUTION_COMPONENTS="clang;ppc-resource-headers" ... ../llvm ninja install-distribution then installs the powerpc headers. Similarly, one can do cmake ... LLVM_DISTRIBUTION_COMPONENTS="clang;x86-resource-headers" ... ../llvm to include headers applicable to the x86 platform in a distribution installation. To implement this behaviour, the patch does two things: * It breaks up the long files header file list to a core list and platform specific lists. * It adds numerous platform specific installation targets. Differential Revision: https://reviews.llvm.org/D123498	2022-04-19 10:10:07 -04:00
David Spickett	218b5c8394	[clang][AArch64] Remove BTI after setjmp from release notes This is now going into 14.0.2 as `571c7d8f6d` so will not be new in clang-15.	2022-04-19 13:49:55 +00:00
David Green	73dc996428	[AArch64] Add lane moves to PerfectShuffle tables This teaches the perfect shuffle tables about lane inserts, that can help reduce the cost of many entries. Many of the shuffle masks are one-away from being correct, and a simple lane move can be a lot simpler than trying to use ext/zip/etc. Because they are not exactly like the other masks handled in the perfect shuffle tables, they require special casing to generate them, with a special InsOp Operator. The lane to insert into is encoded as the RHSID, and the move from is grabbed from the original mask. This helps reduce the maximum perfect shuffle entry cost to 3, with many more shuffles being generatable in a single instruction. Differential Revision: https://reviews.llvm.org/D123386	2022-04-19 14:49:50 +01:00
Alexey Bataev	7adfa31bc6	[SLP][NFC]Add a test for reducing same values, NFC.	2022-04-19 06:48:21 -07:00
Alexey Bataev	883571928c	Revert "[SLP]Improve reductions analysis and emission, part 1." This reverts commit `0e1f4d4d3c` to fix a crash reported in PR54976	2022-04-19 06:17:03 -07:00
Kirill Bobyrev	bdf0b757d5	[clangd] IncludeCleaner: Add filtering mechanism This introduces filtering out inclusions based on the resolved path. This mechanism will be important for disabling warnings for headers that we can not diagnose correctly yet. Reviewed By: sammccall Differential Revision: https://reviews.llvm.org/D123488	2022-04-19 14:56:27 +02:00
Joseph Huber	0f8b8d79af	[OpenMP][Docs] Remove old 14.0 release information Summary: This patch removes the OpenMP sections in the release notes. These will be filled once the release is close and implementations are finalized.	2022-04-19 08:45:51 -04:00
Joseph Huber	944b25aee3	[OpenMP] Make Xopenmp-target args compile-only to silence warnings Summary: Previously we needed the `Xopenmp-target=` option during the linking phase so the old offloading driver knew which items to extract and link for the device. Now that the new driver has become the default this is no longer necessary and will cause a warning to be emitted for the unused argument. This should be silenced to avoid noise.	2022-04-19 08:42:43 -04:00

1 2 3 4 5 ...

421565 commits