wasmtime

Author	SHA1	Message	Date
Sean Stangl	cf9e762f16	Add a DynRex recipe type for x86, decreasing the number of recipes (#1298 ) This patch adds a third mode for templates: REX inference is requestable at template instantiation time. This reduces the number of recipes by removing rex()/nonrex() redundancy for many instructions.	2019-12-19 15:49:34 -07:00
Philip Craig	86b66e8ede	Fix build failure in cranelift-codegen (#1294 ) error[E0425]: cannot find value `ones` in this scope --> cranelift-codegen/meta/src/isa/x86/legalize.rs:564:33 \| 564 \| def!(c = vconst(ones)), \| ^^^^ not found in this scope	2019-12-16 19:38:09 -08:00
Andrew Brown	4433ad2858	Fix legalization of `icmp ugt` (#1278 ) Previously, the same pattern (pmax + pcmpeq) as `uge` was used but this logic was incorrect for operands with equal values.	2019-12-16 14:14:51 -07:00
Andrew Brown	6181f20326	Fix legalization of SIMD `fneg` (#1286 ) Previously `fsub` was used but this fails when negating -0.0 and +0.0 in the SIMD spec tests; using more instructions, this change uses shifts to create a constant for flipping the most significant bit of each lane with `bxor`.	2019-12-16 10:32:08 -08:00
Andrew Brown	0604ec480c	Fix `scalar_to_vector`: move not wide enough for 64-bit values (#1287 ) Previously, the use of `enc_x86_64` emitted two 64-bit mode encodings for `scalar_to_vector.i64`, neither of which contained the REX.W bit telling `MOVD/MOVQ` to move 64 bits of data instead of 32 bits. Now, `scalar_to_vector.i64` will always use a sole 64-bit mode REX.W encoding and `scalar_to_vector` with other widths will have three encodings: a 32-bit mode move, a 64-bit mode move with no REX, and a 64-bit mode move with REX (but not REX.W).	2019-12-16 10:17:08 -08:00
Andrew Brown	d4df756acf	Remove packed_struct dependency; closes #1271 and #1284 (#1282 )	2019-12-12 17:01:31 -08:00
llogiq	0d8f8bc71f	Fix some clippy warnings (#1277 )	2019-12-07 09:47:43 -08:00
Andrew Brown	91d29c09d0	Add x86 SIMD floating-point absolute value	2019-11-15 13:45:25 -08:00
Andrew Brown	1f17e35e95	Add x86 SIMD immediate shifts	2019-11-15 13:45:25 -08:00
Andrew Brown	6519a43b08	Add x86 SIMD floating-point negation	2019-11-15 13:45:25 -08:00
Sean Stangl	f8ae622003	Use a struct interface for creating and reading encoding bits on x86. #1156 (#1212 )	2019-11-13 18:01:13 -07:00
Andrew Brown	215884e907	Simplify variable name: change `inst_` to `inst`	2019-11-12 17:05:39 -08:00
Andrew Brown	c8eb4e9612	Add x86 SIMD floating-point arithmetic	2019-11-12 17:05:39 -08:00
Andrew Brown	04db2a9f39	Bind constant vectors to vconst; fixes #1052 (#1217 )	2019-11-12 15:57:59 -08:00
Benjamin Bouvier	9080a02e10	Replace CraneStation by bytecodealliance everywhere; (#1221 )	2019-11-12 10:09:31 -08:00
Andrew Brown	d32301854d	Add x86 SIMD implementation of float comparison	2019-11-08 14:06:53 -08:00
Andrew Brown	af4637aff6	Add x86 SIMD legalizations for icmp less-than	2019-11-05 16:42:34 -08:00
Andrew Brown	feffed85d2	Add x86 SIMD legalizations for integer greater-than This includes `icmp ugt`, `icmp sge`, and `icmp uge` for vectors with lanes of I8, I16, and I32.	2019-11-05 16:42:34 -08:00
Andrew Brown	0ab5760fd7	Add x86 SIMD instructions for min and max Only the I8, I16, and I32 versions are included since Cranelift lacks support for AVX.	2019-11-05 16:42:34 -08:00
Andrew Brown	c454c3c771	Add x86 SIMD encoding for `icmp sgt`	2019-11-05 16:42:34 -08:00
Andrew Brown	e3a20d67b2	Add x86 SIMD legalization of `icmp ne`	2019-11-05 16:42:34 -08:00
Nick Fitzgerald	a49483408c	Many multi-value returns (#1147 ) * Add x86 encodings for `bint` converting to `i8` and `i16` * Introduce tests for many multi-value returns * Support arbitrary numbers of return values This commit implements support for returning an arbitrary number of return values from a function. During legalization we transform multi-value signatures to take a struct return ("sret") return pointer, instead of returning its values in registers. Callers allocate the sret space in their stack frame and pass a pointer to it into the caller, and once the caller returns to them, they load the return values back out of the sret stack slot. The callee's return operations are legalized to store the return values through the given sret pointer. * Keep track of old, pre-legalized signatures When legalizing a call or return for its new legalized signature, we may need to look at the old signature in order to figure out how to legalize the call or return. * Add test for multi-value returns and `call_indirect` * Encode bool -> int x86 instructions in a loop * Rename `Signature::uses_sret` to `Signature::uses_struct_return_param` * Rename `p` to `param` * Add a clarifiying comment in `num_registers_required` * Rename `num_registers_required` to `num_return_registers_required` * Re-add newline * Handle already-assigned parameters in `num_return_registers_required` * Document what some debug assertions are checking for * Make "illegalizing" closure's control flow simpler * Add unit tests and comments for our rounding-up-to-the-next-multiple-of-a-power-of-2 function * Use `append_isnt_arg` instead of doing the same thing manually * Fix grammar in comment * Add `Signature::uses_special_{param,return}` helper functions * Inline the definition of `legalize_type_for_sret_load` for readability * Move sret legalization debug assertions out into their own function * Add `round_up_to_multiple_of_type_align` helper for readability * Add a debug assertion that we aren't removing the wrong return value * Rename `RetPtr` stack slots to `StructReturnSlot` * Make `legalize_type_for_sret_store` more symmetrical to `legalized_type_for_sret` * rustfmt * Remove unnecessary loop labels * Do not pre-assign offsets to struct return stack slots Instead, let the existing frame layout algorithm decide where they should go. * Expand "sret" into explicit "struct return" in doc comment * typo: "than" -> "then" in comment * Fold test's debug message into the assertion itself	2019-11-05 14:36:03 -08:00
Andrew Brown	f19456640c	Add documentation for top-level items in cranelift-codegen/meta	2019-10-31 09:35:08 -07:00
Benjamin Bouvier	4632d35196	[meta] Remove the OperandBuilder, replace it with Operand ctors;	2019-10-30 18:39:20 +01:00
Benjamin Bouvier	5889dd2c64	[meta] Add more pub(crate) definitions.	2019-10-29 14:23:10 +01:00
Andrew Brown	f37d1c7ecc	Simplify binding of IntCC::Equals to SIMD `icmp`; fixes #1150	2019-10-28 11:09:37 -07:00
Peter Huene	9f506692c2	Fix clippy warnings. This commit fixes the current set of (stable) clippy warnings in the repo.	2019-10-24 17:20:12 -07:00
Andrew Brown	879ccf871a	Add x86 SIMD vall_true In order to implement SIMD's all_true (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#all-lanes-true), we must legalize some instruction (I chose `vall_true`) to a comparison against 0 and a similar reduction as vany_true using `PTEST` and `SETNZ`. Since `icmp` only allows integers but `vall_true` could allow more vector types, `raw_bitcast` is used to convert the lane types into integers, e.g. b32x4 to i32x4. To do so without runtime type-checking, the `raw_bitcast` instruction (which emits no instruction) can now bitcast from any vector type to the same type, e.g. i32x4 to i32x4.	2019-10-22 11:01:05 -07:00
Andrew Brown	186effc420	Add x86 SIMD vany_true and x86_ptest In order to implement SIMD's any_true (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#any-lane-true), we must legalize some instruction (I chose `vany_true`) to a sequence of `PTEST` and `SETNZ`. To emit `PTEST` I added the new CLIF instruction `x86_ptest` and used CLIF's `trueif ne` for `SETNZ`.	2019-10-22 11:01:05 -07:00
Benjamin Bouvier	0243b642e3	[meta] Remove name lookups in formats; This does a lot at once, since there was no clear way to split the three commits: - Instruction need to be passed an explicit InstructionFormat, - InstructionFormat deduplication is checked once all entities have been defined;	2019-10-22 14:05:12 +02:00
Benjamin Bouvier	9e9a7626d7	[meta] Use a ref-counted pointer to an InstructionFormat in instructions; This avoids a lot of dereferences, and InstructionFormat are immutable once they're created. It removes a lot of code that was keeping the FormatRegistry around, just in case we needed the format. This is more in line with the way we create Instructions, and make it easy to reference InstructionFormats in general.	2019-10-22 14:05:12 +02:00
Benjamin Bouvier	d3e694fbe7	[meta] Remove unused InstructionGroup::{name, doc};	2019-10-22 14:05:12 +02:00
Andrew Brown	b927c55511	Add SIMD bitselect instruction and x86 legalization This new instructions matches the `bitselect` behavior described in the WASM SIMD spec (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#bitwise-select)	2019-10-17 15:49:29 -07:00
Andrew Brown	8f74333662	Add x86 SIMD band_not	2019-10-17 15:49:29 -07:00
Andrew Brown	f1904bffea	Add x86 SIMD sshr and ushr Only the shifts with applicable SSE2 instructions are implemented here: PSRL* (for ushr) only has 16-64 bit instructions and PSRA* (for sshr) only has 16-32 bit instructions.	2019-10-15 15:51:50 -07:00
Andrew Brown	6460fe705f	Add x86 SIMD ishl Only the shifts with applicable SSE2 instructions (i.e. 16-64 bit width) are implemented here.	2019-10-15 15:51:50 -07:00
Andrew Brown	1600dba634	Make ConstantData a container for any-size constant values Previously, ConstantData was a type alias for `Vec<u8>` which prevented it from having an implementation; this meant that `V128Imm` and `&[u8; 16]` were used in places that otherwise could have accepted types of different byte lengths.	2019-10-15 15:19:00 -07:00
Benjamin Bouvier	566a143634	[meta] Add pub(crate) to more types; This caught one unused method, allowing us to remove it.	2019-10-15 11:37:48 +02:00
Benjamin Bouvier	350b3b2406	[meta] Avoid unwrapping instructions several times during legalization; This avoids doing multiple unpacking of the InstructionData for a single legalization, improving readability and reducing size of the generated code. For instance, icmp had to unpack the format once per IntCC condition code.	2019-10-15 11:37:48 +02:00
Andrew Brown	1f728c1797	Add x86 legalization for SIMD bnot	2019-10-11 11:05:24 -07:00
Andrew Brown	dbe7dd59da	Add x86 SIMD bxor	2019-10-11 11:05:24 -07:00
Andrew Brown	4cdc1e76a4	Add x86 SIMD band	2019-10-11 11:05:24 -07:00
Andrew Brown	96d51cb1e8	Switch x86 SIMD bor from ORPS to POR encoding There are two reasons for this change: 1. it reduces confusion; using the `POR` encoding will match the future encodings of `band` and `bxor` and the `ORPS` encoding may be confusing as it is intended for floating-point operations 2. `POR` has slightly more throughput: it only has to wait 0.33 cycles to execute again on all Intel architectures above Core whereas `ORPS` must wait 1 cycle on architectures older than Skylake (Intel Optimization Reference Manual, C.3) `POR` does add one additional byte to the encoding and requires SSE2 so the `ORPS` opcode is left in for future use.	2019-10-11 11:05:24 -07:00
Andrew Brown	6d690e5275	Allow binding immediates to instructions (#1012 ) This change should make the code more clear (and less code) when adding encodings for instructions with specific immediates; e.g., a constant with a 0 immediate could be encoded as an XOR with something like `const.bind(...)` without explicitly creating the necessary predicates. It has several parts: * Introduce Bindable trait to instructions * Convert all instruction bindings to use Bindable::bind() * Add ability to bind immediates to BoundInstruction This is an attempt to reduce some of the issues in #955.	2019-10-10 08:54:46 -07:00
Benjamin Bouvier	0d50462a93	Fixes #1091 : Use match statements instead of HashMaps in x86 encodings;	2019-10-01 09:01:37 -07:00
Andrew Brown	90c49a2f7c	Add saturating subtraction with a SIMD encoding This includes the new instructions `ssub_sat` and `usub_sat` and only encodes the i8x16 and i16x8 types; these are what is needed for implementing the SIMD spec (see https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#saturating-integer-subtraction).	2019-09-30 13:54:30 -07:00
Andrew Brown	21144068d4	Add saturating addition with a SIMD encoding This includes the new instructions `sadd_sat` and `uadd_sat` and only encodes the i8x16 and i16x8 types; these are what is needed for implementing the SIMD spec (see https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#saturating-integer-addition).	2019-09-30 13:54:30 -07:00
Andrew Brown	630cb3ee62	Add x86 encoding for SIMD imul Only i16x8 and i32x4 are encoded in this commit mainly because i8x16 and i64x2 do not have simple encodings in x86. i64x2 is not required by the SIMD spec and there is discussion (https://github.com/WebAssembly/simd/pull/98#issuecomment-530092217) about removing i8x16.	2019-09-30 13:54:30 -07:00
Andrew Brown	168ad7fda3	Fix 16-bit x86_pextr encoding The x86 ISA has (at least) two encodings for PEXTRW: 1. in the SSE2 opcode (66 0f c5) the XMM operand uses r/m and the GPR operand uses reg 2. in the SSE4.1 opcode (66 0f 3a 15) the XMM operand uses reg and the GPR operand uses r/m This changes the 16-bit x86_pextr encoding from 1 to 2 to match the other PEXTR* implementations (all #2 style).	2019-09-30 13:54:30 -07:00
Andrew Brown	ba393afd4d	Add x86 legalization for SIMD ineg	2019-09-30 13:54:30 -07:00

1 2 3

124 Commits