wasmtime

Author	SHA1	Message	Date
Andrew Brown	630cb3ee62	Add x86 encoding for SIMD imul Only i16x8 and i32x4 are encoded in this commit mainly because i8x16 and i64x2 do not have simple encodings in x86. i64x2 is not required by the SIMD spec and there is discussion (https://github.com/WebAssembly/simd/pull/98#issuecomment-530092217) about removing i8x16.	2019-09-30 13:54:30 -07:00
Andrew Brown	168ad7fda3	Fix 16-bit x86_pextr encoding The x86 ISA has (at least) two encodings for PEXTRW: 1. in the SSE2 opcode (66 0f c5) the XMM operand uses r/m and the GPR operand uses reg 2. in the SSE4.1 opcode (66 0f 3a 15) the XMM operand uses reg and the GPR operand uses r/m This changes the 16-bit x86_pextr encoding from 1 to 2 to match the other PEXTR* implementations (all #2 style).	2019-09-30 13:54:30 -07:00
Andrew Brown	ca1df499a0	Add x86 encoding for isub	2019-09-30 13:54:30 -07:00
Sean Stangl	3d5346a90b	Name opcodes statically in isa/x86. Closes #1051 (#1079)	2019-09-25 19:59:49 -06:00
Andrew Brown	636ef98024	Use existing `is_equal` predicate with the newly-shared condition codes This removes the `HasConditionCode(&'static str)` predicate and the associated issues with that.	2019-09-24 09:33:07 -07:00
Andrew Brown	a3db30d97e	Add x86 encoding for SIMD `icmp eq` Also adds a predicate for matching the `eq` IntCC code (TODO this should be replaced by something more general)	2019-09-24 09:33:07 -07:00
Andrew Brown	702155b19b	Optimize vconst for x86 when immediate contains all zeroes or ones Instead of using MOVUPS to expensively load bits from memory, this change uses a predicate to optimize vconst without a memory access: - when the 128-bit immediate is all zeroes in all bits, use PXOR to zero out an XMM register - when the 128-bit immediate is all ones in all bits, use PCMPEQB to set an XMM register to all ones This leaves the constant data in the constant pool, which may increase code size (TODO)	2019-09-24 09:33:07 -07:00
Benjamin Bouvier	f0244516c5	[meta] Make more things pub(crate) instead of pub; This could help the compiler find unused fields/methods. It didn't find any during this migration.	2019-09-23 14:42:20 +02:00
Andrew Brown	2330ca7e2c	Fix incorrect regmove and fill encodings for SIMD types - `fill` attempted to use a GPR recipe, `fillSib32`, instead of its FPR equivalent, `ffillSib32` (code compiled without error but incorrect instructions were allowed, e.g. `v1 = regmove v0, %rdi -> %xmm0`) - `regmove` could be encoded with a GPR recipe, `rmov`, which hid the above incorrectness; now only FPR-to-FPR regmoves are allowed using the `frmov` recipe	2019-09-20 14:02:03 -07:00
Andrew Brown	fe25abeb0d	Add x86 encodings for vector copy, copy_nop, fill_nop	2019-09-19 12:04:14 -07:00
Andrew Brown	766cf8ddfd	Add x86 implementation for SIMD iadd	2019-09-19 12:04:14 -07:00
Andrew Brown	7e6913e362	Add x86 encodings for vector store, load, fill, spill, and regmove	2019-09-19 12:04:14 -07:00
Andrew Brown	e72434e58f	Add boolean encodings for x86 Includes and, or, xor, not, and regmove; TODO re-factor PerCpuModeEncodings to avoid code duplication	2019-09-19 10:53:40 -07:00
Andrew Brown	af1499ce99	Add x86 implementation of shuffle	2019-09-19 10:53:40 -07:00
Andy Wortman	99380fad1a	Use 'xor r, r' to set registers to 0 instead of mov (#766)	2019-09-16 16:35:55 +02:00
Ujjwal Sharma	3418fb6e18	[codegen] reintroduce support for carry and borrow instructions in RI… (#1005) Reintroduce support for iadd carry variants and isub borrow variants for RISC ISAs which had been removed in https://github.com/CraneStation/cranelift/pull/961 and https://github.com/CraneStation/cranelift/pull/962 because of the lack of a proper flags register in RISC architectures.	2019-09-13 17:27:49 +02:00
Andrew Brown	6f1ed94e82	Fix documentation	2019-09-10 10:45:12 -07:00
Andrew Brown	295b2ef614	Avoid extra register movement when lowering an x86 insertlane to a float vector	2019-09-10 10:45:12 -07:00
Andrew Brown	3dfc68afb1	Avoid extra register movement when lowering the x86 scalar_to_vector of a float value	2019-09-10 10:45:12 -07:00
Andrew Brown	00bedca274	Avoid extra register movement when lowering the x86 extractlane of a float vector This commit is based on the assumption that floats are already stored in XMM registers in x86. When extracting a lane, cranelift was moving the float to a regular register and back to an XMM register; this change avoids this by shuffling the float value to the lowest bits of the XMM register. It also assumes that the upper bits can be left as is (instead of zeroing them out).	2019-09-10 10:45:12 -07:00
Andrew Brown	ebc783e49b	Use raw_bitcast when legalizing splat raw_bitcast matches the intent of this legalization more clearly (to simply change the CLIF type without changing any bits) and the additional null encodings added are necessary for later instructions	2019-09-10 10:45:12 -07:00
Ujjwal Sharma	345b2dc0cc	[codegen] add new recipe "rout" (#1014) * [codegen] add new recipe "rout" Add a new recipe "rout" intended to be used by arithmetic operations that output flags, currently being used for `iadd_cout` and `isub_bout`. Fixes: https://github.com/CraneStation/cranelift/issues/1009	2019-09-10 12:55:24 +02:00
bjorn3	e2b2b520eb	Fix compilation	2019-09-07 09:55:09 -07:00
bjorn3	ffa1e946a7	Fix compilation	2019-09-07 09:55:09 -07:00
Benjamin Bouvier	660b8b28b8	[codegen] Add a pinned register that's entirely under the control of the user;	2019-09-06 16:18:27 +02:00
Benjamin Bouvier	d1d2e790b9	[meta] Morph a few pub into pub(crate), and remove dead code;	2019-09-06 15:47:20 +02:00
Ujjwal Sharma	dce8ad8229	[codegen] add encodings for isub borrow variants Add encodings for isub borrow variants (isub_bout, isub_bin, isub_borrow) for x86_32, enabling the legalization for isub.i64 to work. Bug: https://bugzilla.mozilla.org/show_bug.cgi?id=1576675 Bug: https://github.com/CraneStation/cranelift/issues/765	2019-09-05 19:28:33 +02:00
Benjamin Bouvier	8a9384f869	Tweak comments;	2019-09-05 17:55:03 +02:00
Ujjwal Sharma	ea919489ee	[codegen] add encodings for iadd carry variants (#961) * [codegen] add encodings for iadd carry variants Add encodings for iadd carry variants (iadd_cout, iadd_cin, iadd_carry) for x86_32, enabling the legalization for iadd.i64 to work. * [codegen] remove support for iadd carry variants on riscv Previously, the carry variants of iadd (iadd_cin, iadd_cout and iadd_carry) were being legalized for isa/riscv since RISC architectures lack a flags register. This forced us to return and accept booleans for these operations, which proved to be problematic and inconvenient, especially for x86. This commit removes support for said statements and all dependent statements for isa/riscv so that we can work on a better legalization strategy in the future. * [codegen] change operand type from bool to iflags for iadd carry variants The type of the carry operands for the carry variants of the iadd instruction (iadd_cin, iadd_cout, iadd_carry) was bool for compatibility reasons for isa/riscv. Since support for these instructions on RISC architectures has been temporarily suspended, we can safely change the type to iflags.	2019-09-05 15:03:13 +02:00
Benjamin Bouvier	49a37e48fb	[codegen] Make scalar_to_vector's output type a lane of its input type;	2019-09-04 19:09:54 +02:00
Julian Seward	98056aa05d	Don't incorrectly omit a REX prefix for some encodings of `copy_to_ssa`. Mozilla bug #1576969. Also, as a ridealong fix, removes R32 encodings for x86_64 in `enc_r32_r64`, since the type `rXX` by definition only exists for targets with word size `XX` bits.	2019-09-04 13:59:01 +02:00
Andrew Brown	8d812b24cc	Add x86 encoding for vconst	2019-08-26 16:12:06 -07:00
Ujjwal Sharma	ec8f72bf20	Use roundss/roundsd when available for Ceil/Floor/Trunc/Nearest (#931) Don't tie the preexisting SIMD ISA predicates to the shared enable_simd setting but make new ones instead. Fixes: https://github.com/CraneStation/cranelift/issues/908	2019-08-26 13:37:27 +02:00
julian-seward1	b8fb52446c	Cranelift: implement redundant fill removal on tree-shaped CFG regions. Mozilla bug 1570584. (#906)	2019-08-25 19:37:34 +02:00
Andrew Brown	cc57e84cbd	Fix segfault due to b64 encoding (#919) * Fix segfault due to b64 encoding Prior to this patch, bconst.b64 encoded its instruction with a 32-bit immediate that caused improper decoding of the MOV instruction; instead, use a REX prefix and rely on zero-extension of the immediate. Fixes #911.	2019-08-23 18:04:34 +02:00
Andrew Brown	b4ef90cfcd	Remove SSE2 setting for x86 In talking to @sunfishcode, he preferred to avoid the confusion of more ISA predicates by eliminating SSE2. SSE2 was released with the Pentium 4 in 2000 so it is unlikely that current CPUs would have SIMD enabled and not have this feature. I tried to note the SSE2-specific instructions with comments in the code.	2019-08-20 10:21:12 -07:00
Andrew Brown	d492cf7e0e	Avoid unnecessary lane calculations in codegen code This refactor moves the calculation of the number of lanes to code closer to where the Instruction/BoundInstruction is bound.	2019-08-20 10:21:12 -07:00
Andrew Brown	3fdc78174f	Add x86 implementation of extractlane instruction	2019-08-20 10:21:12 -07:00
Carmen Kwan	19257f80c1	Add reference types R32 and R64 -Add resumable_trap, safepoint, isnull, and null instructions -Add Stackmap struct and StackmapSink trait Co-authored-by: Mir Ahmed <mirahmed753@gmail.com> Co-authored-by: Dan Gohman <sunfish@mozilla.com>	2019-08-16 11:35:16 -07:00
David Lattimore	383ce584ae	Fix an assertion that wasn't doing what it said	2019-08-05 15:22:10 +02:00
Benjamin Bouvier	627ba24b59	Simplify jump table instructions and add missing conversion; This makes non-legalized jump table instructions operate on operands with pointer-sized types. This means we need to extend smaller types into the pointer-sized operand, when the two don't match.	2019-08-02 18:39:39 +02:00
Andrew Brown	3b36a1d1d8	Add x86 implementation of insertlane instruction	2019-07-16 17:07:44 -07:00
Andrew Brown	683e7c75a3	Add x86-specific shuffle instructions This includes both PSHUFD and PSHUFB; these are necessary to legalize future SIMD instructions.	2019-07-16 17:07:44 -07:00
Andrew Brown	61772e9775	Add raw_bitcast instruction Casts bits as a different type of the same width with no change to the data (unlike bitcast)	2019-07-16 17:07:44 -07:00
Andrew Brown	5f0e5567c1	Add scalar_to_vector instruction Moves scalar values in a GPR register to an FPR register	2019-07-16 17:07:44 -07:00
Benjamin Bouvier	fd03677292	[meta] Recipes and encodings descriptions for x86;	2019-07-05 11:38:51 +02:00
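
Note on 702155b19b (vconst optimization): the selection that commit message describes can be sketched roughly as below. This is only an illustration under stated assumptions, not the code from the commit: the names `VconstLowering` and `choose_vconst_lowering` are hypothetical, and the real change is expressed as encoding predicates in Cranelift's meta crate rather than as a standalone function.

```rust
/// Hypothetical enum naming the three lowering choices the commit
/// message mentions; not a type from Cranelift itself.
#[derive(Debug, PartialEq, Eq)]
pub enum VconstLowering {
    /// Immediate is all zeroes: zero the XMM register with PXOR.
    Pxor,
    /// Immediate is all ones: set every bit with PCMPEQB.
    Pcmpeqb,
    /// Anything else: load the bits from the constant pool (MOVUPS).
    LoadFromConstantPool,
}

/// Hypothetical helper: inspect the 128-bit immediate (as 16 bytes)
/// and pick a lowering that avoids a memory access when possible.
pub fn choose_vconst_lowering(imm: &[u8; 16]) -> VconstLowering {
    if imm.iter().all(|&b| b == 0x00) {
        VconstLowering::Pxor
    } else if imm.iter().all(|&b| b == 0xff) {
        VconstLowering::Pcmpeqb
    } else {
        VconstLowering::LoadFromConstantPool
    }
}
```

Passing `&[0u8; 16]` selects the PXOR path and `&[0xffu8; 16]` the PCMPEQB path; any mixed immediate falls back to the memory load.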

96 Commits