wasmtime

Author	SHA1	Message	Date
Joey Gouly	5355c3e3d5	arm64: Implement Vselect opcode This is implemented the same as Bitselect, as the controlling vector is a boolean vector. A boolean vector in cranelift has elements that are either 0 or all 1s, so it can be used to select elements lane wise. Copyright (c) 2020, Arm Limited.	2020-07-22 12:50:29 +01:00
Anton Kirilov	420c4f06b8	AArch64: Improve code generation for Extractlane + Sextend / Uextend Copyright (c) 2020, Arm Limited.	2020-07-22 11:47:51 +01:00
Chris Fallin	96ef2f1a1b	Fix `u8::MAX` -> `std::u8::MAX`. (#2047 ) As per Carlo Kok on Zulip #cranelift, this breaks builds with stable Rust pre-1.43, as `core::u8::MAX` was only stabilized then. We'd like to support older versions if we can easily do so. This PR also adds `cranelift-tools` to the crates checked on CI with Rust 1.41.0, which pulls in all backends (including `aarch64`).	2020-07-20 14:59:15 -05:00
Chris Fallin	784e2f1480	Merge pull request #2038 from jgouly/arith2 arm64: Enable arith2 tests	2020-07-20 09:00:10 -07:00
Chris Fallin	1b3b2dbfd0	Merge pull request #2043 from cfallin/csel-opt Aarch64: handle csel with icmp/fcmp source without materializing the bool.	2020-07-18 19:33:47 -07:00
Chris Fallin	ea894c0eeb	Merge pull request #2042 from cfallin/aarch64-fix-regshift-mask Aarch64: mask shift-amounts incorporated into reg-reg-shift ALU insts.	2020-07-18 19:33:35 -07:00
Chris Fallin	21dac670f0	Aarch64: handle csel with icmp/fcmp source without materializing the bool. Previously, we simply compared the input bool to 0, which forced the value into a register (usually via a cmp and cset), zero-extended it, etc. This patch performs the same pattern-matching that branches do to directly perform the cmp and use its flag results with the csel. On the `bz2` benchmark, the runtime is affected as follows (measuring with `perf stat`, using wasmtime with its cache enabled, and taking the second run after the first compiles and populates the cache): pre: 1117.232000 task-clock (msec) # 1.000 CPUs utilized 133 context-switches # 0.119 K/sec 1 cpu-migrations # 0.001 K/sec 5,041 page-faults # 0.005 M/sec 3,511,615,100 cycles # 3.143 GHz 4,272,427,772 instructions # 1.22 insn per cycle <not supported> branches 27,980,906 branch-misses 1.117299838 seconds time elapsed post: 1003.738075 task-clock (msec) # 1.000 CPUs utilized 121 context-switches # 0.121 K/sec 0 cpu-migrations # 0.000 K/sec 5,052 page-faults # 0.005 M/sec 3,224,875,393 cycles # 3.213 GHz 4,000,838,686 instructions # 1.24 insn per cycle <not supported> branches 27,928,232 branch-misses 1.003440004 seconds time elapsed In other words, with this change, on `bz2`, we see a 6.3% reduction in executed instructions.	2020-07-17 21:10:21 -07:00
Chris Fallin	9bd9c628aa	Aarch64: mask shift-amounts incorporated into reg-reg-shift ALU insts. We had previously fixed a bug in which constant shift amounts should be masked to modulo the number of bits in the operand; however, we did not fix the analogous case for shifts incorporated into the second register argument of ALU instructions that support integrated shifts. This failure to mask resulted in illegal instructions being generated, e.g. in https://bugzilla.mozilla.org/show_bug.cgi?id=1653502. This PR fixes the issue by masking the amount, as the shift semantics require.	2020-07-17 14:55:23 -07:00
Johnnie Birch	a7cedf3100	Add support for 32 bit and 64 bit fcmp for the new backend Implements commiss and commisd.	2020-07-17 13:46:54 -07:00
Joey Gouly	40473dffed	arm64: Enable arith2 tests Copyright (c) 2020, Arm Limited.	2020-07-17 15:58:16 +01:00
Benjamin Bouvier	ead8a835c4	machinst x64: add more FP support	2020-07-17 15:56:44 +02:00
bjorn3	7b7b1f4997	Rename sarg__ to sarg_t	2020-07-17 12:03:17 +02:00
bjorn3	4431ac1108	Implement SystemV struct argument passing	2020-07-17 12:03:17 +02:00
Benjamin Bouvier	bab337fc32	Address review comments;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	5a55646fc3	machinst x64: support out-of-bounds memory accesses;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	ea33ce9116	machinst x64: basic support for baldrdash + fix multi-value support	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	00b38c91f6	machinst x64: fix generation of RegMemImm immediate operands;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	1430c5e436	machinst x64: fix index handling of jump table; The index should be truncated to 32 bits before being used for the jump table entry computation.	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	55b9059954	machinst x64: remove spurious assertion about FP offset requiring to be 16-bytes aligned	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	3905a1b17b	machinst x64: implement SymbolValue and FuncAddr with a movabsq+reloc;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	cfa0a0c4e8	machinst x64: lower resumable_trap as trap;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	311027869b	machinst x64: implement popcnt.i64	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	d9310e8d90	machinst x64: fix checked div sequence - it should mark as clobbering (def) rdx, not modifying it - the signed-div check requires a temporary to compare against int64_min	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	f932bccaf8	machinst x64: fix sign-extension at boundary	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	6f5403a94b	machinst x64: lower Ctz using the Bsf x86 instruction	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	33e0d05645	machinst x64: have cmov modify its destination operand; This is tricky: the control flow implicitly implied by the operand makes it so that the output register may be undefined, if we mark it only as a "def". Make it a "mod" instead, which matches our usage in the codebase, and will make it crash if the output operand isn't unconditionally defined before the instruction.	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	aa7db7fd7b	machinst x64: fix JmpUnknown register mapping;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	fe7dd41435	machinst x64: fix iconst emission	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	ec2209665a	machinst x64: implement bsr and lower Clz;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	eda2d143ed	machinst x64: add support for umulhi/smulhi;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	571061fe4c	machinst x64: add support for rotations;	2020-07-16 18:21:06 +02:00
Benjamin Bouvier	22892466e7	machinst x64: fix implementation of *reduce; They should just generate a plain move, since the high bits are then ignored, and not an extended move.	2020-07-16 18:21:06 +02:00
Andrew Brown	c8ddf8a34c	Encode `[u\|s]widen_low` for x86	2020-07-15 11:32:08 -07:00
Andrew Brown	fafef7db77	Add `x86_palignr` instructions This instruction is necessary for implementing `[s\|u]widen_high`.	2020-07-15 11:32:08 -07:00
Chris Fallin	12a31c88d7	Merge pull request #2021 from akirilov-arm/VectorSize AArch64: Introduce an enum to specify vector instruction operand sizes	2020-07-15 09:43:18 -07:00
Benjamin Bouvier	abf157bd69	machinst x64: Only use the feature flag to enable the x64 new backend; Before this patch, running the x64 new backend would require both compiling with --features experimental_x64 and running with `use_new_backend`. This patches changes this behavior so that the runtime flag is not needed anymore: using the feature flag will enforce usage of the new backend everywhere, making using and testing it much simpler: cargo run --features experimental_x64 ;; other CLI options/flags This also gives a hint at what the meta language generation would look like after switching to the new backend. Compiling only with the x64 codegen flag gives a nice compile time speedup.	2020-07-15 13:11:28 +02:00
Anton Kirilov	95b0b05af2	AArch64: Introduce an enum to specify vector instruction operand sizes Copyright (c) 2020, Arm Limited.	2020-07-14 21:37:44 +01:00
Anton Kirilov	400639245c	AArch64: Remove show_freg_sized() It provides the same functionality as show_vreg_scalar(). Copyright (c) 2020, Arm Limited.	2020-07-14 11:27:46 -07:00
Chris Fallin	4ba3ee3368	Merge pull request #2016 from jgouly/saturating-math arm64: Implement saturating SIMD arithmetic	2020-07-14 11:24:10 -07:00
Joey Gouly	aa84a4173c	arm64: Implement saturating SIMD arithmetic Copyright (c) 2020, Arm Limited.	2020-07-14 18:19:11 +01:00
Chris Fallin	26529006e0	Address review comments.	2020-07-14 10:17:29 -07:00
Chris Fallin	08353fcc14	Reftypes part two: add support for stackmaps. This commit adds support for generating stackmaps at safepoints to the new backend framework and to the AArch64 backend in particular. It has been tested to work with SpiderMonkey.	2020-07-14 10:17:27 -07:00
Chris Fallin	b93e8c296d	Initial reftype support in aarch64, modulo safepoints. This commit adds the inital support to allow reftypes to flow through the program when targetting aarch64. It also adds a fix to the `ModuleTranslationState` needed to send R32/R64 types over from the SpiderMonkey embedding. This commit does not include any support for safepoints in aarch64 or the `MachInst` infrastructure; that is in the next commit. This commit also makes a drive-by improvement to `Bint`, avoiding an unneeded zero-extension op when the extended value comes directly from a conditional-set (which produces a full-width 0 or 1).	2020-07-14 10:14:18 -07:00
Anton Kirilov	79dfac5514	Refactor the InstSize enum in the AArch64 backend The main issue with the InstSize enum was that it was used both for GPR and SIMD & FP operands, even though machine instructions do not mix them in general (as in a destination register is either a GPR or not). As a result it had methods such as sf_bit() that made sense only for one type of operand. Another issue was that the enum name was not reflecting its purpose accurately - it was meant to represent an instruction operand size, not an instruction size, which is fixed in A64 (always 4 bytes). Now the enum is split into one for GPR operands and another for scalar SIMD & FP operands. Copyright (c) 2020, Arm Limited.	2020-07-14 15:04:35 +01:00
Alex Crichton	85ffc8f595	Switch CI back to nightly channel (#2014 ) * Switch CI back to nightly channel I think all upstream issues are now fixed so we should be good to switch back to nightly from our previously pinned version. * Fix doc warnings	2020-07-13 18:40:47 -05:00
Chris Fallin	4b6ebc0c27	Merge pull request #1999 from bnjbvr/fix-aarch64-ishl-by-zero machinst aarch64: fix encoding generation of left-shift by 0	2020-07-13 09:25:21 -07:00
Chris Fallin	f4f5eed254	Merge pull request #1977 from jgouly/simd_shift arm64: Implement SIMD shift instructions	2020-07-13 09:24:19 -07:00
Benjamin Bouvier	f316bef763	machinst aarch64: fix encoding generation of left-shift by 0; The ARM book says that the immr field should contain (-count % 64); the existing code was approximating this with (64 - count), which is not correct for a zero count.	2020-07-09 15:43:42 +02:00
Andrew Brown	c5a69cee9f	Add x86 legalization for fcvt_to_uint_sat.i32x4 This converts an `f32x4` into an `i32x4` (unsigned) with rounding by using a long sequence of SSE4.1 compatible instructions.	2020-07-08 10:20:01 -07:00
Peter Huene	d6ae72abe6	Merge pull request #1983 from peterhuene/fix-unwind-info Remove 'set frame pointer' unwind code from Windows x64 unwind.	2020-07-06 22:26:41 -07:00

... 7 8 9 10 11 ...

698 Commits