wasmtime

Author	SHA1	Message	Date
Ulrich Weigand	071d3a68d0	ISLE: Fix clif.isle InstructionData entries Attempt to match a Jump instruction in ISLE will currently lead to the generated files not compiling. This is because the definition of the InstructionData enum in clif.isle does not match the actual type used in Rust code. Specifically, clif.isle erroneously omits the ValueList variable-length argument entry if the format does not use a typevar operand. This is the case for Jump and a few other formats. The problem is caused by a bug in the gen_isle routine in meta/src/gen_inst.rs.	2022-01-24 12:54:16 +01:00
Chris Fallin	ef1b2d2fa8	Cranelift: Fix cold-blocks-related lowering bug. If a block is marked cold but has side-effect-free code that is only used by side-effectful code in non-cold blocks, we will erroneously fail to emit it, causing a regalloc failure. This is due to the interaction of block ordering and lowering: we rely on block ordering to visit uses before defs (except for backedges) so that we can effectively do an inline liveness analysis and skip lowering operations that are not used anywhere. This "inline DCE" is needed because instruction lowering can pattern-match and merge one instruction into another, removing the need to generate the source instruction. Unfortunately, the way that I added cold-block support in #3698 was oblivious to this -- it just changed the block sort order. For efficiency reasons, we generate code in its final order directly, so it would not be tenable to generate it in e.g. RPO first and then reorder cold blocks to the bottom; we really do want to visit in the same order as the final code. This PR fixes the bug by moving the point at which cold blocks are sunk to emission-time instead. This is cheaper than either trying to visit blocks during lowering in RPO but add to VCode out-of-order, or trying to do some expensive analysis to recover proper liveness. It's not clear that the latter would be possible anyway -- the need to lower some instructions depends on other instructions' isel results/merging success, so we really do need to visit in RPO, and we can't simply lower all instructions as side-effecting roots (some can't be toplevel nodes). The one downside of this approach is that the VCode itself still has cold blocks inline; so in the text format (and hence compile-tests) it's not possible to see the sinking. This PR adds a test for cold-block sinking that actually verifies the machine code. (The test also includes an add-instruction in the cold path that would have been incorrectly skipped prior to this fix.) 
Fortunately this bug would not have been triggered by the one current use of cold blocks in #3699, because there the only operation in the cold block was an (always effectful) call instruction. The worst-case effect of the bug in other code would be a regalloc panic; no silent miscompilations could result.	2022-01-21 10:47:49 -08:00
Ulrich Weigand	be60a19623	ISLE standard prelude: Additional types and helpers In preparing to move the s390x back-end to ISLE, I noticed a few missing pieces in the common prelude code. This patch: - Defines the reference types $R32 / $R64. - Provides a trap_code_bad_conversion_to_integer helper. - Provides an avoid_div_traps helper. This requires passing the generic flags in addition to the ISA-specific flags into the ISLE lowering context.	2022-01-20 17:23:31 +01:00
Anton Kirilov	89919f4b1f	Pass the ISA-specific compilation flags to the ABI implementations Copyright (c) 2021, Arm Limited.	2022-01-14 14:18:01 +00:00
Nick Fitzgerald	a052285340	Fix typo: s/sentinals/sentinels/	2022-01-13 16:50:15 -08:00
Nick Fitzgerald	658c5d33c1	cranelift: Port `trap` and `resumable_trap` lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5bb3645bd4	cranelift: Port `ineg` SIMD lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5917f1d2c2	cranelift: Port `ineg` scalar lowering to ISLE on x64	2022-01-13 15:08:01 -08:00
Nick Fitzgerald	b78731839b	cranelift: Use `x64_` prefix to disambiguate with clif in ISLE Instead of using `m_` like we used to, which was short for "mach inst" but not obvious or clear at all.	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	a41fdb0303	cranelift: Port `rotr` lowering to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4120e40318	cranelift: Update assertions to indicate that `rotl` is fully ported to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4e34dd8239	cranelift: Port `ushr` SIMD lowerings to ISLE on x64	2022-01-13 14:39:06 -08:00
Nick Fitzgerald	a7dba81c1d	cranelift: Port `ishl` SIMD lowerings to ISLE (#3686)	2022-01-13 09:34:37 -06:00
Chris Fallin	13f17db297	Merge pull request #3680 from bjorn3/remove_code_sink Remove the CodeSink interface in favor of MachBufferFinalized	2022-01-12 10:47:23 -08:00
Nick Fitzgerald	7454f1f3af	cranelift: port `sshr` to ISLE on x64 (#3681)	2022-01-12 09:13:58 -06:00
bjorn3	55d722db05	Remove CodeSink	2022-01-11 17:10:37 +01:00
bjorn3	88baac4ca6	Move the TestCodeSink functionality to MachBufferFinalized	2022-01-11 14:40:53 +01:00
Nick Fitzgerald	6b5e9d8732	Merge pull request #3659 from fitzgen/vselect-isle cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:51:33 -08:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Chris Fallin	a98f9982fd	Merge pull request #3655 from bjorn3/machinst_cleanups2 Remove MachBackend	2022-01-06 13:32:36 -08:00
Nick Fitzgerald	23efaf2196	cranelift: Remove unused x64 instruction helpers	2022-01-06 11:22:54 -08:00
Nick Fitzgerald	09aa09fd76	cranelift: Port `bitselect` over to ISLE on x64	2022-01-06 11:22:54 -08:00
bjorn3	376c93bda0	Remove MachBackend It is identical to TargetIsa	2022-01-06 15:08:12 +01:00
bjorn3	d50f27e8f9	Remove reg_universe method from MachBackend and MachInst	2022-01-06 14:39:50 +01:00
bjorn3	96b8879e4b	Take reg_universe as argument to machinst::compile	2022-01-06 14:39:50 +01:00
Chris Fallin	e2b37a57dc	Merge pull request #3639 from bjorn3/machinst_cleanups Various cleanups around machinst	2022-01-05 10:01:27 -08:00
Chris Fallin	833ebeed76	Fix spillslot size bug in SIMD by removing type-dependent spillslot allocation. This patch makes spillslot allocation, spilling and reloading all based on register class only. Hence when we have a 32- or 64-bit value in a 128-bit XMM register on x86-64 or vector register on aarch64, this results in larger spillslots and spills/restores. Why make this change, if it results in less efficient stack-frame usage? Simply put, it is safer: there is always a risk when allocating spillslots or spilling/reloading that we get the wrong type and make the spillslot or the store/load too small. This was one contributing factor to CVE-2021-32629, and is now the source of a fuzzbug in SIMD code that puns an arbitrary user-controlled vector constant over another stackslot. (If this were a pointer, that could result in RCE. SIMD is not yet on by default in a release, fortunately.) In particular, we have not been particularly careful about using moves between values of different types, for example with `raw_bitcast` or with certain SIMD operations, and such moves indicate to regalloc.rs that vregs are in equivalence classes and some arbitrary vreg in the class is provided when allocating the spillslot or spilling/reloading. Since regalloc.rs does not track actual type, and since we haven't been careful about moves, we can't really trust this "arbitrary vreg in equivalence class" to provide accurate type information. In the fix to CVE-2021-32629 we fixed this for integer registers by always spilling/reloading 64 bits; this fix can be seen as the analogous change for FP/vector regs.	2022-01-04 13:24:40 -08:00
bjorn3	17c3c1813f	Remove MachInstEmitInfo	2022-01-04 18:06:01 +01:00
bjorn3	8d1fc75b6b	Make MachBackend::triple return &Triple This avoids an unnecessary clone	2022-01-04 18:06:01 +01:00
Chris Fallin	e10171b7ea	Merge pull request #3619 from cfallin/isle-manifest-siphash Use SipHasher rather than SHA-512 for ISLE manifest.	2021-12-17 13:15:28 -08:00
Chris Fallin	5233175b06	Use SipHasher rather than SHA-512 for ISLE manifest. Fixes #3609. It turns out that `sha2` is a nontrivial dependency for Cranelift in many contexts, partly because it pulls in a number of other crates as well. One option is to remove the hash check under certain circumstances, as implemented in #3616. However, this is undesirable for other reasons: having different dependency options in Wasmtime in particular for crates.io vs. local builds is not really possible, and so either we still have the higher build cost in Wasmtime, or we turn off the checks by default, which goes against the original intent of ensuring developer safety (no mysterious stale-source bugs). This PR uses `SipHash` instead, which is built into the standard library. `SipHash` is deprecated, but it's fixed and deterministic (across runs and across Rust versions), which is what we need, unlike the suggested replacement `std::collections::hash_map::DefaultHasher`. The result is only 64 bits, and is not cryptographically secure, but we never needed that; we just need a simple check to indicate when we forget a `rebuild-isle`.	2021-12-17 12:11:05 -08:00
bjorn3	1dbb747d59	Fix popcnt for small integers	2021-12-17 20:39:53 +01:00
Alex Crichton	d8974ce6bc	aarch64: Migrate ishl/ushr/sshr to ISLE (#3608) * aarch64: Migrate ishl/ushr/sshr to ISLE This commit migrates the `ishl`, `ushr`, and `sshr` instructions to ISLE. These involve special cases for almost all types of integers (including vectors) and helper functions for the i128 lowerings since the i128 lowerings look to be used for other instructions as well. This doesn't delete the i128 lowerings in the Rust code just yet because they're still used by Rust lowerings, but they should be deletable in due time once those lowerings are translated to ISLE. * Use more descriptive names for i128 lowerings * Use a with_flags-lookalike for csel * Use existing `with_flags_` Comment backwards order * Update generated code	2021-12-16 17:37:53 -06:00
Chris Fallin	fd171ca063	Fix OperandSize: need clamp-to-32-bit behavior in most cases, but true-width for shifts.	2021-12-16 12:32:28 -08:00
Chris Fallin	1323ae417e	Fix some 16- and 8-bit behavior in x64 backend related to rotates. Uncovered by @bjorn3 (thanks!): 8- and 16-bit rotates were not working properly in recent versions of Cranelift with part of the lowering migrated to ISLE. This PR fixes a few issues: - 8- and 16-bit rotate-left needs to mask a constant amount, if any, because we use a 32-bit rotate instruction and so don't get the appropriate shift-amount masking for free from x86 semantics. - `operand_size_from_type` was incorrect: it only handled 32- and 64-bit types and silently returned `OperandSize::Size32` for everything else. Now uses the `OperandSize::from_ty(ty)` helper as the pre-ISLE code did. Our test coverage for narrow value types is not great; this PR adds some runtests for rotl/rotr but more would always be better!	2021-12-16 11:34:24 -08:00
Alex Crichton	d89410ec4e	aarch64: Migrate `uextend`/`sextend` to ISLE This commit migrates the sign/zero extension instructions from `lower_inst.rs` to ISLE. There's actually a fair amount going on in this migration since a few other pieces needed touching up along the way as well: * First is the actual migration of `uextend` and `sextend`. These instructions are relatively simple but end up having a number of special cases. I've attempted to replicate all the cases here but double-checks would be good. * This commit actually fixes a few issues where if the result of a vector extraction is sign/zero-extended into i128 that actually results in panics in the current backend. * This commit adds exhaustive testing that extension-of-a-vector-extraction is a noop wrt extraction. * A bugfix around ISLE glue was required to get this commit working, notably the case where the `RegMapper` implementation was trying to map an input to an output (meaning ISLE was passing through an input unmodified to the output) wasn't working. This requires a `mov` instruction to be generated and this commit updates the glue to do this. At the same time this commit updates the ISLE glue to share more infrastructure between x64 and aarch64 so both backends get this fix instead of just aarch64. Overall I think that the translation to ISLE was a net benefit for these instructions. It's relatively obvious what all the cases are now unlike before where it took a few reads of the code and some boolean switches to figure out which path was taken for each flavor of input. I think there are still possible improvements here where, for example, the `put_in_reg_{s,z}ext64` helper doesn't use this logic so technically those helpers could also pattern match the "well atomic loads and vector extractions automatically do this for us" but that's a possible future improvement for later (and shouldn't be too too hard with some ISLE refactoring).	2021-12-14 07:01:37 -08:00
Alex Crichton	20e090b114	aarch64: Migrate {s,u}{div,rem} to ISLE (#3572) * aarch64: Migrate {s,u}{div,rem} to ISLE This commit migrates four different instructions at once to ISLE: * `sdiv` * `udiv` * `srem` * `urem` These all share similar codegen and center around the `div` instruction to use internally. The main feature of these was to model the manual traps since the `div` instruction doesn't trap on overflow, instead requiring manual checks to adhere to the semantics of the instruction itself. While I was here I went ahead and implemented an optimization for these instructions when the right-hand-side is a constant with a known value. For `udiv`, `srem`, and `urem` if the right-hand-side is a nonzero constant then the checks for traps can be skipped entirely. For `sdiv` if the constant is not 0 and not -1 then additionally all checks can be elided. Finally if the right-hand-side of `sdiv` is -1 the zero-check is elided, but it still needs a check for `i64::MIN` on the left-hand-side and currently there's a TODO where `-1` is still checked too. * Rebasing and review conflicts	2021-12-13 17:27:11 -06:00
Andrew Brown	86611d3bbc	isle: expand enums in ISLE (#3586) * x64: expand FloatCC enum in ISLE * isle: regenerate manifests * isle: generate all enum fields in `clif.isle` This expands the `gen_isle` function to write all of the immediate `enum`s out explicitly in `clif.isle`. Non-`enum` immediates are still `extern primitive`. * Only compile `enum_values` with `rebuild-isle` feature * Only compile `gen_enum_isle` with `rebuild-isle` feature	2021-12-12 18:31:42 -08:00
Nick Fitzgerald	da73952021	cranelift-codegen: port `bnot` lowering to ISLE in x64	2021-12-09 08:38:10 -08:00
Nick Fitzgerald	cded0989aa	cranelift-codegen: Port lowering of `{i,u}{max,min}` to ISLE for x64	2021-12-08 15:50:42 -08:00
Nick Fitzgerald	471c1e32a4	Consolidate `XmmRmR`-based instructions together	2021-12-08 15:37:24 -08:00
Chris Fallin	7bc17fda39	Fix iadd_ifcout lowering in ISLE to return a register corresponding to the iflags. This register is not initialized, but we protect against its being used by never allowing an iflags/fflags-typed value to be used with `put_value_in_regs`. All `iflags`/`fflags` usages should be handled by pattern-matching: e.g., `trapif` explicitly matches an `iadd_ifcout` input. Eventually (#3249) we need to simplify this by removing iflags/fflags-typed values and using bool flags instead, pattern-matching to get the same efficient lowerings as today. For now, this allows the ISLE assertions to pass.	2021-12-08 11:59:38 -08:00
Andrew Brown	594509b734	x64: assert that temporary and destination registers match during renaming	2021-12-08 11:58:45 -08:00
Alex Crichton	25b380d5fc	aarch64: Migrate `{s,u}mulhi` to ISLE This starts moving over some sign/zero-extend helpers also present in lowering in Rust. Otherwise this is a relatively unsurprising transition with the various cases of the instructions mapping well to ISLE utilities.	2021-11-29 18:11:42 -08:00
Alex Crichton	33dba07e6b	aarch64: Migrate `imul` to ISLE This commit migrates the `imul` clif instruction lowering for AArch64 to ISLE. This is a relatively complicated instruction with lots of special cases due to the simd proposal for wasm. Like x64, however, the special casing lends itself to ISLE quite well and the lowerings here in theory are pretty straightforward. The main gotcha of this commit is that this encounters a unique situation which hasn't been encountered yet with other lowerings, namely the `Umlal32` instruction used in the implementation of `i64x2.mul` is unique in the `VecRRRLongOp` class of instructions in that it both reads and writes the destination register (`use_mod` instead of simply `use_def`). This meant that I needed to add another helper in ISLE for creating a `vec_rrrr_long` instruction (despite this enum variant not actually existing) which implicitly moves the first operand into the destination before issuing the actual `VecRRRLong` instruction.	2021-11-29 16:05:57 -08:00
Nick Fitzgerald	21bce8071e	Merge pull request #3554 from fitzgen/isle-iabs ISLE: port `iabs` to ISLE for x64	2021-11-19 12:09:22 -08:00
Andrew Brown	994fe41daf	x64: allow vector types in `select` move As reported in #3173, the `select` instruction fails an assertion when it is given `v128` types as operands. This change relaxes the assertion to allow the same type of XMM move that occurs for the f32 and f64 types. This fixes #3173 in the old `lower.rs` code temporarily until the relatively complex `select` lowering can be ported to ISLE.	2021-11-19 11:24:30 -08:00
Nick Fitzgerald	94e0de45ed	ISLE: port `iabs` to ISLE for x64	2021-11-19 11:03:44 -08:00
Alex Crichton	ef8ea644f4	aarch64: Migrate {s,u}{sub,add}_sat to ISLE (#3551) These were pretty straightforward! Only needed a single `rule` per instruction with a new 128-bit vector type matcher.	2021-11-19 12:59:06 -06:00
Alex Crichton	cbf539abb8	Use `with_flags` for 128-bit arith in aarch64 Also move the `with_flags` bits and pieces to `prelude.isle` so it can be shared between backends if necessary.	2021-11-19 07:08:08 -08:00
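
The constant-divisor reasoning in commit 20e090b114 (which trap checks can be dropped when the right-hand side is a known constant) can be illustrated in plain Rust. This is only a sketch of the decision table described in that message, not the ISLE rules themselves: `SdivChecks`, `sdiv_checks`, and `unsigned_or_rem_needs_zero_check` are hypothetical names, and the handling of a constant `0` (keep every check) is a conservative assumption, since the message does not say what happens in that case.

```rust
/// Which runtime guards a signed division still needs.
/// Hypothetical type, used only for illustration.
#[derive(Debug, PartialEq, Eq)]
pub struct SdivChecks {
    /// Guard against a zero divisor.
    pub divide_by_zero: bool,
    /// Guard against `i64::MIN / -1`, which overflows.
    pub overflow: bool,
}

/// Decide which guards `sdiv` needs, given the divisor if it is a known constant.
pub fn sdiv_checks(rhs_const: Option<i64>) -> SdivChecks {
    match rhs_const {
        // Divisor is -1: it cannot be zero, but the left-hand side may be i64::MIN.
        Some(-1) => SdivChecks { divide_by_zero: false, overflow: true },
        // Any other nonzero constant: neither trap condition can occur.
        Some(c) if c != 0 => SdivChecks { divide_by_zero: false, overflow: false },
        // Unknown divisor, or constant 0 (assumption: stay conservative).
        _ => SdivChecks { divide_by_zero: true, overflow: true },
    }
}

/// For `udiv`, `srem`, and `urem`, a nonzero constant divisor removes the zero check.
pub fn unsigned_or_rem_needs_zero_check(rhs_const: Option<i64>) -> bool {
    !matches!(rhs_const, Some(c) if c != 0)
}
```

For a divisor of `7`, `sdiv_checks(Some(7))` reports that no guard is needed, while `sdiv_checks(None)` keeps both.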

386 Commits
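
The staleness check motivated in commit 5233175b06 (a fixed, deterministic 64-bit hash from the standard library in place of SHA-512) might look roughly like the following. This is a minimal sketch under stated assumptions: `source_hash` and `is_stale` are hypothetical helpers, and the actual manifest format used by Cranelift is not shown in the log above. `std::hash::SipHasher` is deprecated, so the sketch silences that lint explicitly.

```rust
#[allow(deprecated)]
use std::hash::SipHasher;
use std::hash::Hasher;

/// Hash the raw bytes of a source file with the fixed-key SipHasher.
/// Hypothetical helper: not cryptographically secure, only a change detector.
#[allow(deprecated)]
pub fn source_hash(contents: &[u8]) -> u64 {
    let mut hasher = SipHasher::new();
    hasher.write(contents);
    hasher.finish()
}

/// Report whether the current contents no longer match a previously recorded hash.
pub fn is_stale(recorded: u64, contents: &[u8]) -> bool {
    source_hash(contents) != recorded
}
```

A build step could record `source_hash` of each input when generated code is rebuilt, then call `is_stale` later to flag a forgotten rebuild.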