wasmtime

Author	SHA1	Message	Date
Andrew Brown	5d8dd648d7	x64: port `fcmp` to ISLE (#3967 ) * x64: port scalar `fcmp` to ISLE Implement the CLIF lowering for the `fcmp` to ISLE. This adds a new type-matcher, `ty_scalar_float`, for detecting uses of `F32` and `F64`. * isle: rename `vec128` to `ty_vec12` This refactoring changes the name of the `vec128` matcher function to follow the `ty_` convention of the other type matchers. It also makes the helper an inline function call. x64: port vector `fcmp` to ISLE	2022-03-29 15:41:49 -07:00
Andrew Brown	4d5bd5f90e	x64: fix register allocation panic due to load-coalesced value (#3954 ) Fuzz testing identified a lowering case for CLIF's `icmp` in which the double use of a loaded operand resulted in a register allocation error. This change manually adds `put_in_xmm` to avoid load-coalescing these values and includes a CLIF filetest to trigger this issue. Closes #3951. I opened #3953 to discuss a way in which this kind of mistake (i.e., forgetting to add `put_in_*` in certain situations) could be avoided.	2022-03-21 18:46:27 -07:00
Andrew Brown	3bfbb3226e	x64: prefix all machine instructions with `x64_` (#3947 ) This change is refactoring only--it should have no logic changes. As discussed previously, prefixing all machine code instructions with `x64_` will make it easier to identify what parts of the ISLE code correspond to single instructions and what parts rely on helpers that may emit more than one instruction.	2022-03-18 17:53:15 -07:00
Andrew Brown	5fa104205d	x64: improve generation of i128 `icmp` (#3946 ) Previously, we used the flags of `AND` for `SETcc`. This change uses `TEST` instead, which discards the AND result but sets the flags needed for `SETcc`. This reduces register pressure slightly for this sequence.	2022-03-18 16:36:31 -07:00
Andrew Brown	e92cbfb283	x64: port `icmp` to ISLE (#3886 ) * x64: port GPR-held `icmp` to ISLE * x64: port equality `icmp` for i128 type * x64: port `icmp` for vector types * x64: rename from_intcc to intcc_to_cc	2022-03-18 11:22:09 -07:00
Chris Fallin	26ce9a3853	Fix uextend on x64 for non-i32-source cases. (#3906 ) In #3849, I moved uextend over to ISLE in the x64 backend. Unfortunately, the lowering patterns had a bug in the i32-to-i64 special case (when we know the generating instruction zeroes the upper 32 bits): it wasn't actually special casing for an i32 source! This meant that e.g. zero extends of the results of i8 adds did not work properly. This PR fixes the bug and updates the runtest for extends significantly to cover the narrow-value cases. No security impact to Wasm as Wasm does not use narrow integer types. Thanks @bjorn3 for reporting!	2022-03-09 11:10:59 -08:00
Chris Fallin	cd173cfe8e	ISLE: port fmin, fmax, fmin_pseudo, fmax_pseudo on x64. (#3856 )	2022-02-28 14:40:26 -08:00
Chris Fallin	d9dfc44c32	ISLE: port more ops on x64 to lowering patterns. (#3855 )	2022-02-28 13:28:42 -08:00
Chris Fallin	90a081a731	ISLE: port extend/reduce opcodes on x64. (#3849 )	2022-02-28 11:49:28 -08:00
Chris Fallin	24f145cd1e	Migrate clz, ctz, popcnt, bitrev, is_null, is_invalid on x64 to ISLE. (#3848 )	2022-02-28 09:45:13 -08:00
Ulrich Weigand	07d615d3f7	ISLE: Lowering of multi-output instructions (#3783 ) This changes the output of the `lower` constructor from a `ValueRegs` to a new `InstOutput` type, which is a vector of `ValueRegs`. Code in `lower_common` is updated to use this new type to handle instructions with multiple outputs. All back-ends are updated to use the new type.	2022-02-24 14:03:06 -08:00
Chris Fallin	e8881b2cc0	ISLE lowering rules: make use of implicit conversions. (#3847 ) This PR makes use of the new implicit-conversion feature of the ISLE DSL that was introduced in #3807 in order to make the lowering rules significantly simpler and more concise. The basic idea is to eliminate the repetitive and mechanical use of terms that convert from one type to another when there is only one real way to do the conversion -- for example, to go from a `WritableReg` to a `Reg`, the only sensible way is to use `writable_reg_to_reg`. This PR generally takes any term of the form "A_to_B" and makes it an automatic conversion, as well as some others that are similar in spirit. The notable exception to the pure-value-convsion category is the `put_in_reg` family of operations, which actually do have side-effects. However, as noted in the doc additions in #3807, this is fine as long as the side-effects are idempotent. And on balance, making `put_in_reg` automatic is a significant clarity win -- together with other operand converters, it enables rules like: ``` ;; Add two registers. (rule (lower (has_type (fits_in_64 ty) (iadd x y))) (add ty x y)) ``` There may be other converters that we could define to make the rules even simpler; we can make such improvements as we think of them, but this should be a good start!	2022-02-23 16:14:38 -08:00
Andrew Brown	f87c61176a	x64: port select to ISLE (#3682 ) * x64: port `select` using an FP comparison to ISLE This change includes quite a few interlocking parts, required mainly by the current x64 conventions in ISLE: - it adds a way to emit a `cmove` with multiple OR-ing conditions; because x64 ISLE cannot currently safely emit a comparison followed by several jumps, this adds `MachInst::CmoveOr` and `MachInst::XmmCmoveOr` macro instructions. Unfortunately, these macro instructions hide the multi-instruction sequence in `lower.isle` - to properly keep track of what instructions consume and produce flags, @cfallin added a way to pass around variants of `ConsumesFlags` and `ProducesFlags`--these changes affect all backends - then, to lower the `fcmp + select` CLIF, this change adds several `cmove*_from_values` helpers that perform all of the awkward conversions between `Value`, `ValueReg`, `Reg`, and `Gpr/Xmm`; one upside is that now these lowerings have much-improved documentation explaining why the various `FloatCC` and `CC` choices are made the the way they are. Co-authored-by: Chris Fallin <chris@cfallin.org>	2022-02-23 10:03:16 -08:00
Nick Fitzgerald	dc86e7a6dc	cranelift: Use GPR newtypes extensively in x64 lowering (#3798 ) We already defined the `Gpr` newtype and used it in a few places, and we already defined the `Xmm` newtype and used it extensively. This finishes the transition to using the newtypes extensively in lowering by making use of `Gpr` in more places. Fixes #3685	2022-02-14 12:54:41 -08:00
Mrmaxmeier	84b9c7bb8a	cranelift/x64: lower min and max for <= `i64` (#3748 ) * cranelift/x64: lower min and max for <= `i64` * cranelift: add runtests for integer min/max	2022-02-14 10:21:19 -08:00
Nick Fitzgerald	bb7ae46ecd	ISLE: emit traps as safepoints on x64	2022-02-07 10:01:23 -08:00
Nick Fitzgerald	2c77cf866a	ISLE: Rename `{gpr,xmm}_mem_new` constructors to `reg_mem_to_{gpr,xmm}_mem`	2022-02-03 14:08:08 -08:00
Nick Fitzgerald	795b0aaf9a	cranelift: Add newtype wrappers for x64 register classes This primary motivation of this large commit (apologies for its size!) is to introduce `Gpr` and `Xmm` newtypes over `Reg`. This should help catch difficult-to-diagnose register class mixup bugs in x64 lowerings. But having a newtype for `Gpr` and `Xmm` themselves isn't enough to catch all of our operand-with-wrong-register-class bugs, because about 50% of operands on x64 aren't just a register, but a register or memory address or even an immediate! So we have `{Gpr,Xmm}Mem[Imm]` newtypes as well. Unfortunately, `GprMem` et al can't be `enum`s and are therefore a little bit noisier to work with from ISLE. They need to maintain the invariant that their registers really are of the claimed register class, so they need to encapsulate the inner data. If they exposed the underlying `enum` variants, then anyone could just change register classes or construct a `GprMem` that holds an XMM register, defeating the whole point of these newtypes. So when working with these newtypes from ISLE, we rely on external constructors like `(gpr_to_gpr_mem my_gpr)` instead of `(GprMem.Gpr my_gpr)`. A bit of extra lines of code are included to add support for register mapping for all of these newtypes as well. Ultimately this is all a bit wordier than I'd hoped it would be when I first started authoring this commit, but I think it is all worth it nonetheless! In the process of adding these newtypes, I didn't want to have to update both the ISLE `extern` type definition of `MInst` and the Rust definition, so I move the definition fully into ISLE, similar as aarch64. Finally, this process isn't complete. I've introduced the newtypes here, and I've made most XMM-using instructions switch from `Reg` to `Xmm`, as well as register class-converting instructions, but I haven't moved all of the GPR-using instructions over to the newtypes yet. I figured this commit was big enough as it was, and I can continue the adoption of these newtypes in follow up commits. Part of #3685.	2022-02-03 14:08:08 -08:00
Nick Fitzgerald	658c5d33c1	cranelift: Port `trap` and `resumable_trap` lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5bb3645bd4	cranelift: Port `ineg` SIMD lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5917f1d2c2	cranelift: Port `ineg` scalar lowering to ISLE on x64	2022-01-13 15:08:01 -08:00
Nick Fitzgerald	b78731839b	cranelift: Use `x64_` prefix to disambiguate with clif in ISLE Instead of using `m_` like we used to, which was short for "mach inst" but not obvious or clear at all.	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	a41fdb0303	cranelift: Port `rotr` lowering to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4e34dd8239	cranelift: Port `ushr` SIMD lowerings to ISLE on x64	2022-01-13 14:39:06 -08:00
Nick Fitzgerald	a7dba81c1d	cranelift: Port `ishl` SIMD lowerings to ISLE (#3686 )	2022-01-13 09:34:37 -06:00
Nick Fitzgerald	7454f1f3af	cranelift: port `sshr` to ISLE on x64 (#3681 )	2022-01-12 09:13:58 -06:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Nick Fitzgerald	09aa09fd76	cranelift: Port `bitselect` over to ISLE on x64	2022-01-06 11:22:54 -08:00
Chris Fallin	1323ae417e	Fix some 16- and 8-bit behavior in x64 backend related to rotates. Uncovered by @bjorn3 (thanks!): 8- and 16-bit rotates were not working properly in recent versions of Cranelift with part of the lowering migrated to ISLE. This PR fixes a few issues: - 8- and 16-bit rotate-left needs to mask a constant amount, if any, because we use a 32-bit rotate instruction and so don't get the appropriate shift-amount masking for free from x86 semantics. - `operand_size_from_type` was incorrect: it only handled 32- and 64-bit types and silently returned `OperandSize::Size32` for everything else. Now uses the `OperandSize::from_ty(ty)` helper as the pre-ISLE code did. Our test coverage for narrow value types is not great; this PR adds some runtests for rotl/rotr but more would always be better!	2021-12-16 11:34:24 -08:00
Nick Fitzgerald	da73952021	cranelift-codegen: port `bnot` lowering to ISLE in x64	2021-12-09 08:38:10 -08:00
Nick Fitzgerald	cded0989aa	cranelift-codegen: Port lowering of {i,u}{max,min}` to ISLE for x64	2021-12-08 15:50:42 -08:00
Chris Fallin	7bc17fda39	Fix iadd_ifcout lowering in ISLE to return a register corresponding to the iflags. This register is not initialized, but we protect against its being used by never allowing an iflags/fflags-typed value to be used with `put_value_in_regs`. All `iflags`/`fflags` usages should be handled by pattern-matching: e.g., `trapif` explicitly matches an `iadd_ifcout` input. Eventually (#3249) we need to simplify this by removing iflags/fflags-tyepd values and using bool flags instead, pattern-matching to get the same efficient lowerings as today. For now, this allows the ISLE assertions to pass.	2021-12-08 11:59:38 -08:00
Nick Fitzgerald	94e0de45ed	ISLE: port `iabs` to ISLE for x64	2021-11-19 11:03:44 -08:00
Alex Crichton	352ee2b186	Move `insertlane` to ISLE (#3544 ) This also fixes a bug where `movsd` was incorrectly used with a memory operand for `insertlane`, causing it to actually zero the upper bits instead of preserving them. Note that the insertlane logic still exists in `lower.rs` because it's used as a helper for a few other instruction lowerings which aren't migrated to ISLE yet. This commit also adds a helper in ISLE itself for those other lowerings to use when they get implemented. Closes #3216	2021-11-18 13:48:11 -06:00
Alex Crichton	f30c8eb464	isle: Migrate f32const/f64const to ISLE (#3537 ) This moves the `f32const` and `f64const` instructions from `lower.rs` to ISLE. I was originally going to add something else but due to the `isle.rs`'s manual use of `constructor_imm(..)` it necessitated filling out the `imm` cases for f32/f64 constants, so I figured I'd go ahead and move these instructions as well. The special case for 0 constants which use `xorp{s,d}` is preserved from `lower.rs` today, but a special case isn't added for the all-ones constant. The comment says to use `cmpeqp{s,d}` but as discovered on other recent PRs that's not quite sufficient because comparing a register against itself which happens to be NaN wouldn't work, so something else will be required (perhaps `pcmpeq` or similar? I figured I'd defer to later)	2021-11-16 14:19:53 -06:00
Nick Fitzgerald	306c96b461	Merge pull request #3536 from alexcrichton/swap-isle-order isle: Move immediates to the end of extractors	2021-11-16 10:13:17 -08:00
Alex Crichton	1c13f62189	isle: Move immediates to the end of extractors Otherwise I was getting type errors trying to match `insertlane` instructions, so I think that this was the intended order.	2021-11-16 09:07:24 -08:00
Alex Crichton	92394566fc	x64: Migrate `fabs` and `bnot` vector operations to ISLE This was my first attempt at transitioning code to ISLE to originally fix #3327 but that fix has since landed on `main`, so this is instead now just porting a few operations to ISLE. Closes #3336	2021-11-16 07:36:49 -08:00
Nick Fitzgerald	bfbf2f2f49	ISLE: implement x64 lowering for `band_not` in ISLE	2021-11-11 15:19:28 -08:00
Nick Fitzgerald	b8494822dc	ISLE: finish porting `imul` lowering to ISLE	2021-11-05 15:41:24 -07:00
Nick Fitzgerald	d377b665c6	Initial ISLE integration with the x64 backend On the build side, this commit introduces two things: 1. The automatic generation of various ISLE definitions for working with CLIF. Specifically, it generates extern type definitions for clif opcodes and the clif instruction data `enum`, as well as extractors for matching each clif instructions. This happens inside the `cranelift-codegen-meta` crate. 2. The compilation of ISLE DSL sources to Rust code, that can be included in the main `cranelift-codegen` compilation. Next, this commit introduces the integration glue code required to get ISLE-generated Rust code hooked up in clif-to-x64 lowering. When lowering a clif instruction, we first try to use the ISLE code path. If it succeeds, then we are done lowering this instruction. If it fails, then we proceed along the existing hand-written code path for lowering. Finally, this commit ports many lowering rules over from hand-written, open-coded Rust to ISLE. In the process of supporting ISLE, this commit also makes the x64 `Inst` capable of expressing SSA by supporting 3-operand forms for all of the existing instructions that only have a 2-operand form encoding: dst = src1 op src2 Rather than only the typical x86-64 2-operand form: dst = dst op src This allows `MachInst` to be in SSA form, since `dst` and `src1` are disentangled. ("3-operand" and "2-operand" are a little bit of a misnomer since not all operations are binary operations, but we do the same thing for, e.g., unary operations by disentangling the sole operand from the result.) There are two motivations for this change: 1. To allow ISLE lowering code to have value-equivalence semantics. We want ISLE lowering to translate a CLIF expression that evaluates to some value into a `MachInst` expression that evaluates to the same value. We want both the lowering itself and the resulting `MachInst` to be pure and referentially transparent. This is both a nice paradigm for compiler writers that are authoring and maintaining lowering rules and is a prerequisite to any sort of formal verification of our lowering rules in the future. 2. Better align `MachInst` with `regalloc2`'s API, which requires that the input be in SSA form.	2021-10-12 17:11:58 -07:00

41 Commits