wasmtime

Author	SHA1	Message	Date
Ryan Hunt	c360007b19	Drop 'basic-blocks' feature (#1363 ) * All: Drop 'basic-blocks' feature This makes it so that 'basic-blocks' cannot be disabled and we can start assuming it everywhere. * Tests: Replace non-bb filetests with bb version * Tests: Adapt solver-fixedconflict filetests to use basic blocks	2020-01-23 22:36:06 -07:00
Ryan Hunt	fc58dd6aff	Tests: Add basic test for r32, r64 This commit adds a basic test for reference types on 32/64bit systems. * Storing a ref type in a table * Loading a ref type from a table * Passing ref types to a function * Returning ref types from a function * `is_null` instruction * `is_invalid` instruction	2020-01-23 13:37:11 -06:00
Ryan Hunt	946251e655	Codegen: Align representation of stackmap with SpiderMonkey This commit aligns the representation of stackmaps to be the same as Spidermonkey's by: * Reversing the order of the bitmap from low addresses to high addresses * Including incoming stack arguments * Excluding outgoing stack arguments Additionally, some accessor functions were added to allow Spidermonkey to access the internals of the bitmap.	2020-01-23 13:37:11 -06:00
Benjamin Bouvier	3125431ece	Address nits from #1325	2020-01-23 09:39:49 +01:00
bjorn3	cc50e65f31	Update gimli to 0.20 (#1361 )	2020-01-22 11:25:35 -08:00
Sean Stangl	b4c6bfd371	When splitting a const, insert prior to the terminal branch group. (#1325 ) * When splitting a const, insert prior to the terminal branch group. Closes #1159 Given code like the following, on x86_64, which does not have i128 registers: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 brnz v2, ebb1 jump ebb2(v1) It would be split to: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 brnz v2, ebb1 v3, v4 = isplit.i128 v1 jump ebb2(v3, v4) But that fails basic-block invariants. This patch changes that to: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 v3, v4 = isplit.i128 v1 brnz v2, ebb1 jump ebb2(v3, v4) * Add isplit-bb.clif testcase	2020-01-22 17:14:41 +01:00
jmkrauz	ae6ba1e58c	Fix narrow_icmp_imm (#1343 )	2020-01-21 15:20:44 +01:00
Dan Gohman	80d11e3f8d	Bump version to 0.56.0 (#1356 )	2020-01-17 14:33:52 -08:00
Andrew Brown	e1d513ab4b	Fix remaining clippy warnings (#1340 ) * clippy: allow complex encoding function * clippy: remove unnecessary main() function in doctest * clippy: remove redundant `Type` suffix on LaneType enum variants * clippy: ignore incorrect debug_assert_with_mut_call warning * clippy: fix FDE clippy warnings	2020-01-17 14:03:30 -06:00
Dan Gohman	b00a181824	Bump version to 0.55.0 (#1345 )	2020-01-14 13:04:11 -08:00
Benjamin Bouvier	dd497c19e1	Renames Settings ⚠️ (fixes #976 ) (#1321 ) This is a breaking API change: the following settings have been renamed: - jump_tables_enabled -> enable_jump_tables - colocated_libcalls -> use_colocated_libcalls - probestack_enabled -> enable_probestack - allones_funcaddrs -> emit_all_ones_funcaddrs	2020-01-13 14:42:49 -07:00
Yury Delendik	bd88155483	Refactor unwind; add FDE support. (#1320 ) * Refactor unwind * add FDE support * use sink directly in emit functions * pref off all unwinding generation with feature	2020-01-13 10:32:55 -06:00
Dan Gohman	582e7942f8	Bump version to 0.54.0 (#1333 )	2020-01-10 14:08:07 -08:00
Andrew Brown	6fe86bcb61	Fix SIMD float comparison encoding (#1285 ) The Intel manual uses `CMPNLT` and `CMPNLE` to denote not-less-than and not-less-than-or-equals. These were translated previously to `FloatCC::GreaterThan` and `FloatCC::GreaterThanOrEqual` but should be correctly translated to `FloatCC::UnorderedOrGreaterThanOrEqual` and `FloatCC::UnorderedOrGreaterThan`. This change adds the necessary legalizations to make use of these new encodings.	2020-01-08 09:28:05 -08:00
bjorn3	9bbe378d41	Fix master tests (#1316 ) Co-authored-by: Benjamin Bouvier <public@benj.me>	2020-01-06 14:39:04 +01:00
Sean Stangl	cf9e762f16	Add a DynRex recipe type for x86, decreasing the number of recipes (#1298 ) This patch adds a third mode for templates: REX inference is requestable at template instantiation time. This reduces the number of recipes by removing rex()/nonrex() redundancy for many instructions.	2019-12-19 15:49:34 -07:00
Benjamin Bouvier	ac8a952a6b	Bump version to 0.52.0	2019-12-18 12:37:08 +01:00
Y-Nak	8db7349712	Report when output register annotation is missing (#1289 )	2019-12-17 11:28:45 +01:00
Andrew Brown	4433ad2858	Fix legalization of `icmp ugt` (#1278 ) Previously, the same pattern (pmax + pcmpeq) as `uge` was used but this logic was incorrect for operands with equal values.	2019-12-16 14:14:51 -07:00
Andrew Brown	6181f20326	Fix legalization of SIMD `fneg` (#1286 ) Previously `fsub` was used but this fails when negating -0.0 and +0.0 in the SIMD spec tests; using more instructions, this change uses shifts to create a constant for flipping the most significant bit of each lane with `bxor`.	2019-12-16 10:32:08 -08:00
Andrew Brown	0604ec480c	Fix `scalar_to_vector`: move not wide enough for 64-bit values (#1287 ) Previously, the use of `enc_x86_64` emitted two 64-bit mode encodings for `scalar_to_vector.i64`, neither of which contained the REX.W bit telling `MOVD/MOVQ` to move 64 bits of data instead of 32 bits. Now, `scalar_to_vector.i64` will always use a sole 64-bit mode REX.W encoding and `scalar_to_vector` with other widths will have three encodings: a 32-bit mode move, a 64-bit mode move with no REX, and a 64-bit mode move with REX (but not REX.W).	2019-12-16 10:17:08 -08:00
bjorn3	4cc0241f37	More i8 legalizations (#1253 ) * Legalize stack_{load,store}.i8, fixes #433 * Legalize select.i8, cc: #466 * Legalize brz.i8 and brnz.i8, cc: #1117	2019-12-04 09:16:22 -08:00
krk	bc9f05e5e2	Add legalization for bitrev.i128 via narrowing, fixes #1116 (#1229 )	2019-11-22 12:38:04 +01:00
Dan Gohman	b4528beaf5	Bump version to 0.51.0 (#1250 )	2019-11-20 22:40:55 -08:00
bjorn3	c5e74986e1	Legalize uextend and sextend to 128bit ints	2019-11-19 14:42:21 -08:00
Andrew Brown	91d29c09d0	Add x86 SIMD floating-point absolute value	2019-11-15 13:45:25 -08:00
Andrew Brown	1f17e35e95	Add x86 SIMD immediate shifts	2019-11-15 13:45:25 -08:00
Andrew Brown	6519a43b08	Add x86 SIMD floating-point negation	2019-11-15 13:45:25 -08:00
krk	f7c7245b06	Add legalization for popcnt.i128 via narrowing, fixes #1116	2019-11-15 14:03:49 +01:00
Peter Huene	b578fd5396	Fix the cranelift-filetests build. (#1227 ) This commit removes the dependency on `byteorder::ReadBytesExt`, which can't be used from `no_std`, which is how we're building the `byteorder` crate. The fix is to simply offset into the slice as needed rather than using a `std::io::Cursor`. Fixes #1225.	2019-11-13 16:22:26 -08:00
Andrew Brown	c8eb4e9612	Add x86 SIMD floating-point arithmetic	2019-11-12 17:05:39 -08:00
Andrew Brown	04db2a9f39	Bind constant vectors to vconst; fixes #1052 (#1217 )	2019-11-12 15:57:59 -08:00
Benjamin Bouvier	9080a02e10	Replace CraneStation by bytecodealliance everywhere; (#1221 )	2019-11-12 10:09:31 -08:00
Andrew Brown	d32301854d	Add x86 SIMD implementation of float comparison	2019-11-08 14:06:53 -08:00
Benjamin Bouvier	143cb01489	Do not align the stack frame for leaf functions not using the stack.	2019-11-08 17:20:20 +01:00
Dan Gohman	cf82863ea9	Bump version to 0.49.0 (#1208 )	2019-11-06 14:38:46 -08:00
Andrew Brown	af4637aff6	Add x86 SIMD legalizations for icmp less-than	2019-11-05 16:42:34 -08:00
Andrew Brown	feffed85d2	Add x86 SIMD legalizations for integer greater-than This includes `icmp ugt`, `icmp sge`, and `icmp uge` for vectors with lanes of I8, I16, and I32.	2019-11-05 16:42:34 -08:00
Andrew Brown	0ab5760fd7	Add x86 SIMD instructions for min and max Only the I8, I16, and I32 versions are included since Cranelift lacks support for AVX.	2019-11-05 16:42:34 -08:00
Andrew Brown	c454c3c771	Add x86 SIMD encoding for `icmp sgt`	2019-11-05 16:42:34 -08:00
Andrew Brown	e3a20d67b2	Add x86 SIMD legalization of `icmp ne`	2019-11-05 16:42:34 -08:00
Nick Fitzgerald	a49483408c	Many multi-value returns (#1147 ) * Add x86 encodings for `bint` converting to `i8` and `i16` * Introduce tests for many multi-value returns * Support arbitrary numbers of return values This commit implements support for returning an arbitrary number of return values from a function. During legalization we transform multi-value signatures to take a struct return ("sret") return pointer, instead of returning its values in registers. Callers allocate the sret space in their stack frame and pass a pointer to it into the caller, and once the caller returns to them, they load the return values back out of the sret stack slot. The callee's return operations are legalized to store the return values through the given sret pointer. * Keep track of old, pre-legalized signatures When legalizing a call or return for its new legalized signature, we may need to look at the old signature in order to figure out how to legalize the call or return. * Add test for multi-value returns and `call_indirect` * Encode bool -> int x86 instructions in a loop * Rename `Signature::uses_sret` to `Signature::uses_struct_return_param` * Rename `p` to `param` * Add a clarifiying comment in `num_registers_required` * Rename `num_registers_required` to `num_return_registers_required` * Re-add newline * Handle already-assigned parameters in `num_return_registers_required` * Document what some debug assertions are checking for * Make "illegalizing" closure's control flow simpler * Add unit tests and comments for our rounding-up-to-the-next-multiple-of-a-power-of-2 function * Use `append_isnt_arg` instead of doing the same thing manually * Fix grammar in comment * Add `Signature::uses_special_{param,return}` helper functions * Inline the definition of `legalize_type_for_sret_load` for readability * Move sret legalization debug assertions out into their own function * Add `round_up_to_multiple_of_type_align` helper for readability * Add a debug assertion that we aren't removing the wrong return value * Rename `RetPtr` stack slots to `StructReturnSlot` * Make `legalize_type_for_sret_store` more symmetrical to `legalized_type_for_sret` * rustfmt * Remove unnecessary loop labels * Do not pre-assign offsets to struct return stack slots Instead, let the existing frame layout algorithm decide where they should go. * Expand "sret" into explicit "struct return" in doc comment * typo: "than" -> "then" in comment * Fold test's debug message into the assertion itself	2019-11-05 14:36:03 -08:00
Dan Gohman	45fb377457	Bump version to 0.48.0 (#1202 ) * Bump version to 0.48.0 * Re-enable `byteorder`'s default features. The code uses `WriteBytesExt` which depends on the `std` feature being enabled. So for now, just enable `std`.	2019-11-05 13:59:19 -08:00
Peter Huene	8923bac7e8	Implement emitting Windows unwind information for fastcall functions. (#1155 ) * Implement emitting Windows unwind information for fastcall functions. This commit implements emitting Windows unwind information for x64 fastcall calling convention functions. The unwind information can be used to construct a Windows function table at runtime for JIT'd code, enabling stack walking and unwinding by the operating system. * Address code review feedback. This commit addresses code review feedback: * Remove unnecessary unsafe code. * Emit the unwind information always as little endian. * Fix comments. A dependency from cranelift-codegen to the byteorder crate was added. The byteorder crate is a no-dependencies crate with a reasonable abstraction for writing binary data for a specific endianness. * Address code review feedback. * Disable default features for the `byteorder` crate. * Add a comment regarding the Windows ABI unwind code numerical values. * Panic if we encounter a Windows function with a prologue greater than 256 bytes in size.	2019-11-05 13:14:30 -08:00
Dan Gohman	a9868de3d8	Bump version to 0.47.0	2019-11-04 15:43:27 -08:00
Peter Huene	9f506692c2	Fix clippy warnings. This commit fixes the current set of (stable) clippy warnings in the repo.	2019-10-24 17:20:12 -07:00
Andrew Brown	879ccf871a	Add x86 SIMD vall_true In order to implement SIMD's all_true (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#all-lanes-true), we must legalize some instruction (I chose `vall_true`) to a comparison against 0 and a similar reduction as vany_true using `PTEST` and `SETNZ`. Since `icmp` only allows integers but `vall_true` could allow more vector types, `raw_bitcast` is used to convert the lane types into integers, e.g. b32x4 to i32x4. To do so without runtime type-checking, the `raw_bitcast` instruction (which emits no instruction) can now bitcast from any vector type to the same type, e.g. i32x4 to i32x4.	2019-10-22 11:01:05 -07:00
Andrew Brown	186effc420	Add x86 SIMD vany_true and x86_ptest In order to implement SIMD's any_true (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#any-lane-true), we must legalize some instruction (I chose `vany_true`) to a sequence of `PTEST` and `SETNZ`. To emit `PTEST` I added the new CLIF instruction `x86_ptest` and used CLIF's `trueif ne` for `SETNZ`.	2019-10-22 11:01:05 -07:00
Andrew Brown	b927c55511	Add SIMD bitselect instruction and x86 legalization This new instructions matches the `bitselect` behavior described in the WASM SIMD spec (https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#bitwise-select)	2019-10-17 15:49:29 -07:00
Andrew Brown	8f74333662	Add x86 SIMD band_not	2019-10-17 15:49:29 -07:00

... 3 4 5 6 7 ...

775 Commits