wasmtime

Author	SHA1	Message	Date
Sam Parker	acfeda4d80	[AArch64] Port IaddPairwise to ISLE (#4201 ) Copyright (c) 2022, Arm Limited.	2022-06-06 15:37:13 +01:00
Anton Kirilov	edf07a8da6	Cranelift AArch64: Migrate Bitselect and Vselect to ISLE (#4139 ) Copyright (c) 2022, Arm Limited.	2022-05-16 09:39:28 -07:00
Sam Parker	12b4374cd5	[AArch64] Port atomic rmw to ISLE (#4021 ) Also fix and extend the current implementation: - AtomicRMWOp::Clr != AtomicRmwOp::And, as the input needs to be inverted first. - Inputs to the cmp for the RMWLoop case are sign-extended when needed. - Lower Xchg to Swp. - Lower Sub to Add with a negated input. - Added more runtests. Copyright (c) 2022, Arm Limited.	2022-04-27 13:13:59 -07:00
Chris Fallin	dd45f44511	x64 backend: add lowerings with load-op-store fusion. (#4071 ) x64 backend: add lowerings with load-op-store fusion. These lowerings use the `OP [mem], reg` forms (or in AT&T syntax, `OP %reg, (mem)`) -- i.e., x86 instructions that load from memory, perform an ALU operation, and store the result, all in one instruction. Using these instruction forms, we can merge three CLIF ops together: a load, an arithmetic operation, and a store.	2022-04-26 18:58:26 -07:00
Sam Parker	e142f587a7	[AArch64] Refactor ALUOp3 (#3950 ) As well as adding generic pattern for msub along with runtests for madd and msub. Copyright (c) 2022, Arm Limited.	2022-04-14 12:16:56 -07:00
Andrew Brown	7a55779c6b	x64: fix miscompilation of `select.i128` (#4017 ) Issue #3963 identified a miscompilation with select in which the second in the pair of `CMOV`s (one pair per `i128` register) used the wrong flag. This change fixes the error in the x64 ISLE helper function emitting these `CMOV` instructions.	2022-04-12 09:56:57 -07:00
FreddieLiardet	13b9396931	Add vector compare to 0 optims (#3887 ) Signed-off-by: Freddie Liardet <frederick.liardet@arm.com>	2022-03-09 16:20:06 -08:00
Chris Fallin	26ce9a3853	Fix uextend on x64 for non-i32-source cases. (#3906 ) In #3849, I moved uextend over to ISLE in the x64 backend. Unfortunately, the lowering patterns had a bug in the i32-to-i64 special case (when we know the generating instruction zeroes the upper 32 bits): it wasn't actually special casing for an i32 source! This meant that e.g. zero extends of the results of i8 adds did not work properly. This PR fixes the bug and updates the runtest for extends significantly to cover the narrow-value cases. No security impact to Wasm as Wasm does not use narrow integer types. Thanks @bjorn3 for reporting!	2022-03-09 11:10:59 -08:00
Andrew Brown	f87c61176a	x64: port select to ISLE (#3682 ) * x64: port `select` using an FP comparison to ISLE This change includes quite a few interlocking parts, required mainly by the current x64 conventions in ISLE: - it adds a way to emit a `cmove` with multiple OR-ing conditions; because x64 ISLE cannot currently safely emit a comparison followed by several jumps, this adds `MachInst::CmoveOr` and `MachInst::XmmCmoveOr` macro instructions. Unfortunately, these macro instructions hide the multi-instruction sequence in `lower.isle` - to properly keep track of what instructions consume and produce flags, @cfallin added a way to pass around variants of `ConsumesFlags` and `ProducesFlags`--these changes affect all backends - then, to lower the `fcmp + select` CLIF, this change adds several `cmove*_from_values` helpers that perform all of the awkward conversions between `Value`, `ValueReg`, `Reg`, and `Gpr/Xmm`; one upside is that now these lowerings have much-improved documentation explaining why the various `FloatCC` and `CC` choices are made the the way they are. Co-authored-by: Chris Fallin <chris@cfallin.org>	2022-02-23 10:03:16 -08:00
Chris Fallin	1c014d129a	Cranelift: ensure ISA level needed for SIMD is present when SIMD is enabled. (#3816 ) Addresses #3809: when we are asked to create a Cranelift backend with shared flags that indicate support for SIMD, we should check that the ISA level needed for our SIMD lowerings is present.	2022-02-16 17:29:30 -08:00
Mrmaxmeier	84b9c7bb8a	cranelift/x64: lower min and max for <= `i64` (#3748 ) * cranelift/x64: lower min and max for <= `i64` * cranelift: add runtests for integer min/max	2022-02-14 10:21:19 -08:00
Ulrich Weigand	9c5c872b3b	s390x: Add support for all remaining atomic operations (#3746 ) This adds support for all atomic operations that were unimplemented so far in the s390x back end: - atomic_rmw operations xchg, nand, smin, smax, umin, umax - $I8 and $I16 versions of atomic_rmw and atomic_cas - little endian versions of atomic_rmw and atomic_cas All of these have to be implemented by a compare-and-swap loop; and for the $I8 and $I16 versions the actual atomic instruction needs to operate on the surrounding aligned 32-bit word. Since we cannot emit new control flow during ISLE instruction selection, these compare-and-swap loops are emitted as a single meta-instruction to be expanded at emit time. However, since there is a large number of different versions of the loop required to implement all the above operations, I've implemented a facility to allow specifying the loop bodies from within ISLE after all, by creating a vector of MInst structures that will be emitted as part of the meta-instruction. There are still restrictions, in particular instructions that are part of the loop body may not modify any virtual register. But even so, this approach looks preferable to doing everything in emit.rs. A few instructions needed in those compare-and-swap loop bodies were added as well, in particular the RxSBG family of instructions as well as the LOAD REVERSED in-register byte-swap instructions. This patch also adds filetest runtests to verify the semantics of all operations, in particular the subword and little-endian variants (those are currently only executed on s390x).	2022-02-08 13:48:44 -08:00
Nick Fitzgerald	95d8dd1424	cranelift: Re-add some tests that were accidentally removed	2022-01-07 11:00:58 -08:00
Nick Fitzgerald	6b5e9d8732	Merge pull request #3659 from fitzgen/vselect-isle cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:51:33 -08:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Alex Crichton	72e2b7fe80	aarch64: Migrate bitrev/clz/cls/ctz to ISLE (#3658 ) This commit migrates these existing instructions to ISLE from the manual lowerings implemented today. This was mostly straightforward but while I was at it I fixed what appeared to be broken translations for I{8,16} for `clz`, `cls`, and `ctz`. Previously the lowerings would produce results as-if the input was 32-bits, but now I believe they all correctly account for the bit-width.	2022-01-06 15:18:32 -06:00
Nick Fitzgerald	b60a4df2af	cranelift: Move `bitselect` runtest file to shared runtests directory	2022-01-06 11:25:27 -08:00
bjorn3	32c3afe4b3	Add regression runtests	2021-12-17 20:58:32 +01:00
Chris Fallin	1323ae417e	Fix some 16- and 8-bit behavior in x64 backend related to rotates. Uncovered by @bjorn3 (thanks!): 8- and 16-bit rotates were not working properly in recent versions of Cranelift with part of the lowering migrated to ISLE. This PR fixes a few issues: - 8- and 16-bit rotate-left needs to mask a constant amount, if any, because we use a 32-bit rotate instruction and so don't get the appropriate shift-amount masking for free from x86 semantics. - `operand_size_from_type` was incorrect: it only handled 32- and 64-bit types and silently returned `OperandSize::Size32` for everything else. Now uses the `OperandSize::from_ty(ty)` helper as the pre-ISLE code did. Our test coverage for narrow value types is not great; this PR adds some runtests for rotl/rotr but more would always be better!	2021-12-16 11:34:24 -08:00
Alex Crichton	d29b7c8a59	Fix a simd shuffle test (#3607 ) Cranelift shuffles require indices to be in-bounds, which the avx512-using backend also requires via a debug assert, so this commit fixes a test with simd shuffles to only use in-bounds indices. This is motivated by another failure on CI where the machine we were running on presumably had avx512 things enabled. This should fix those failures. Closes #3581	2021-12-16 10:36:52 -08:00
bjorn3	1fd491dadd	Remove fallthrough instruction	2021-10-12 14:22:07 +02:00
bjorn3	3f87b768d5	Update filetests	2021-10-11 17:44:21 +02:00
Afonso Bordado	fc33700071	cranelift: Enable `umulhi` tests for s390x	2021-10-06 20:59:53 +01:00
bjorn3	4b6d20d03f	Fix extend test for AArch64	2021-09-29 19:45:49 +02:00
bjorn3	a646f68553	Remove legacy x86_64 backend tests	2021-09-29 17:37:23 +02:00
Chris Fallin	344a219245	Merge pull request #3383 from akirilov-arm/vany_true Cranelift AArch64: Fix the VanyTrue implementation for 64-bit elements	2021-09-24 09:26:36 -07:00
Anton Kirilov	0fb3acfb94	Cranelift AArch64: Fix the VanyTrue implementation for 64-bit elements Copyright (c) 2021, Arm Limited.	2021-09-23 20:39:46 +01:00
Anton Kirilov	930b1f17f0	Cranelift AArch64: Implement scalar FmaxPseudo and FminPseudo Copyright (c) 2021, Arm Limited.	2021-09-23 15:11:01 +01:00
Chris Fallin	65fde3a86b	Merge pull request #3380 from dheaton-arm/implement-iabs Implement `Iabs` for the interpreter	2021-09-22 10:00:53 -07:00
Chris Fallin	b076c99af9	Merge pull request #3379 from dheaton-arm/implement-sqmulroundsat Implement `SqmulRoundSat` for interpreter	2021-09-22 09:59:13 -07:00
Chris Fallin	dd7310df04	Merge pull request #3361 from dheaton-arm/implement-vecops Implement `VhighBits` & `Vselect` for interpreter	2021-09-22 09:22:52 -07:00
Chris Fallin	76f9cfd79c	Merge pull request #3354 from afonso360/interp-b Add `bextend`,`breduce` and `bmask` to interpreter	2021-09-22 09:22:04 -07:00
Chris Fallin	3474965ca6	Merge pull request #3322 from sparker-arm/aarch64-lse-ops AArch64 LSE atomic_rmw support	2021-09-22 09:21:28 -07:00
dheaton-arm	faaf6b537a	Prevent running tests on legacy backend. Copyright (c) 2021, Arm Limited	2021-09-22 13:50:31 +01:00
dheaton-arm	539b1de5f4	Prevent test running on legacy backend. Copyright (c) 2021, Arm Limited	2021-09-22 13:48:59 +01:00
dheaton-arm	cb30ecc7bc	Implement `Iabs` for the interpreter Implemented `Iabs` to return the absolute integer value with wrapping. Copyright (c) 2021, Arm Limited	2021-09-22 12:59:30 +01:00
dheaton-arm	02ff19f2fc	Implement `SqmulRoundSat` for interpreter Implemented `SqmulRoundSat` for the Cranelift interpreter, performing QN-format fixed point multiplication for 16 and 32-bit integers in SIMD vectors. Copyright (c) 2021, Arm Limited	2021-09-22 12:58:41 +01:00
dheaton-arm	63d85e1dc3	Prevent running `simd-vhighbits.clif` on legacy backend. Copyright (c) 2021, Arm Limited.	2021-09-22 11:43:57 +01:00
dheaton-arm	335177a97e	Remove legacy backend from test Copyright (c) 2021, Arm Limited	2021-09-22 09:42:18 +01:00
Afonso Bordado	9a95ce75f1	cranelift: Add `bmask` to interpreter	2021-09-21 18:43:53 +01:00
Afonso Bordado	3ee180420e	cranelift: Add `breduce` tests to interpreter	2021-09-21 18:21:48 +01:00
Afonso Bordado	c7d595ae46	cranelift: Add `bextend` tests to interpreter	2021-09-21 18:21:48 +01:00
Chris Fallin	38728c5746	Merge pull request #3362 from dheaton-arm/implement-unarrow Implement `Unarrow`, `Uunarrow`, and `Snarrow` for the interpreter	2021-09-21 10:06:46 -07:00
Chris Fallin	e0bd4bd007	Merge pull request #3363 from dheaton-arm/implement-widening-pairwise-dotprod Implement `WideningPairwiseDotProductS` for interpreter	2021-09-21 10:05:07 -07:00
Chris Fallin	ebe2af6eaa	Merge pull request #3351 from afonso360/parser-i128 cranelift: Add support for parsing i128 data values	2021-09-21 10:04:27 -07:00
Ulrich Weigand	51131a3acc	Fix s390x regressions (#3330 ) - Add relocation handling needed after PR #3275 - Fix incorrect handling of signed constants detected by PR #3056 test - Fix LabelUse max pos/neg ranges; fix overflow in buffers.rs - Disable fuzzing tests that require pre-built v8 binaries - Disable cranelift test that depends on i128 - Temporarily disable memory64 tests	2021-09-20 09:12:36 -05:00
Afonso Bordado	eae1b2d246	cranelift: Update i128 tests to use i128 values in functions	2021-09-19 15:02:06 +01:00
Chris Fallin	6a98fe2104	Merge pull request #3332 from afonso360/interp-icmp cranelift: Add SIMD `icmp` to interpreter	2021-09-17 15:13:44 -07:00
dheaton-arm	2f0ce4c86c	Implement `Smulhi` for interpreter Implemented `Smulhi` for the Cranelift interpreter, performing signed integer multiplication and producing the high half of a double-length result. Copyright (c) 2021, Arm Limited	2021-09-17 16:49:38 +01:00
dheaton-arm	3b9bfc8187	Implement `WideningPairwiseDotProductS` for interpreter Implemented `WideningPairwiseDotProductS` to perform sign-extending length-doubling multiplication on corresponding elements from two `i16x8` SIMD vectors, performing a pairwise add on the results (thus returning `i32x4`). Copyright (c) 2021, Arm Limited	2021-09-17 13:31:16 +01:00

1 2 3

124 Commits