wasmtime

Author	SHA1	Message	Date
wasmtime-publish	354bc48015	Bump Wasmtime to 8.0.0 (#5932 ) Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com>	2023-03-06 15:08:16 +00:00
yuyang	20198d94c6	Codegen fix atomic_rmw_loop missing move result to `dst` register On riscv64. (#5898 ) * fix issue5884. * fix issue5884 * fix test failure * fix atomic rmw missing move result to dst register. * specify little endian some s390x can pass test.	2023-03-06 11:27:46 +00:00
Andrew Brown	ad584f428a	wasi-threads: run test suite (#5907 ) * wasi-threads: run test suite This change enables the running of the wasi-threads [test suite]. It relies on a Wasmtime CLI binary being available and runs all `.wasm` and `.wat` files present in the test suite directory. The results of each execution are compared against a JSON spec file with the same base name as the WebAssembly module. The spec file defines the expected exit code, e.g. This commit does not yet build any `.c` or `.s` files from the test suite. That could be done later, perhaps upstream; in the meantime, this work is still valuable as it lays the foundation for running other WASI tests from the in-progress [wasi-testsuite] which share the same JSON spec infrastructure. [test suite]: https://github.com/WebAssembly/wasi-threads/tree/main/test/testsuite [wasi-testsuite]: https://github.com/WebAssembly/wasi-testsuite * review: move testsuite to top-level tests * fix: remove now-unnecessary wasi-threads test * fix: update testsuite submodule name * fix: ignore tests on Windows prtest:full * fix: `cfg_attr` syntax prtest:full	2023-03-04 21:50:15 +00:00
Afonso Bordado	c24d4101ae	fuzzgen: Add Invalid inputs counter (#5928 )	2023-03-04 21:23:19 +00:00
Afonso Bordado	e96214968c	fuzzgen: Move `Arbitrary` structs into the fuzzers (#5820 ) * fuzzgen: Move `FunctionWithIsa` to icache fuzzer * fuzzgen: Move `Testcase` to fuzzgen fuzzer * fuzzgen: Move allowed libcalls to fuzzers * fuzzgen: Centralize printing of testcases	2023-03-04 19:17:28 +00:00
Alex Crichton	3ff3994a12	Add egraph optimization for fneg's cancelling out (#5910 ) This implements comments from #5895 to cancel out `fneg` operations in `fma` instructions. Additional support for `fmul` is added as well.	2023-03-02 18:28:32 +00:00
Tristan de Cacqueray	87672f7059	doc: fix WASI-api link (#5912 )	2023-03-02 13:22:33 +00:00
Jan-Justin van Tonder	db8fe0108f	cranelift: Add big and little endian memory accesses to interpreter (#5893 ) * Added `mem_flags` parameter to `State::checked_{load,store}` as the means for determining the endianness, typically derived from an instruction. * Added `native_endianness` property to `InterpreterState` as fallback when determining endianness, such as in cases where there are no memory flags avaiable or set. * Added `to_be` and `to_le` methods to `DataValue`. * Added `AtomicCas` and `AtomicRmw` to list of instructions with retrievable memory flags for `InstructionData::memflags`. * Enabled `atomic-{cas,rmw}-subword-{big,little}.clif` for interpreter run tests.	2023-03-02 11:57:01 +00:00
Alex Crichton	9984e959cd	aarch64: Add support for the `fmls` instruction (#5895 ) This commit adds lowerings to the AArch64 backend for the `fmls` instruction which is intended to be leveraged in the relaxed-simd proposal for WebAssembly. This should hopefully allow for a teeny-bit-more efficient codegen for this operator instead of using the `fmla` instruction plus a negation instruction.	2023-03-02 05:45:58 +00:00
Alex Crichton	52b4c48a1b	x64: Improve codegen for i8x16.shr_u (#5906 ) This catches a case that wasn't handled previously by #5880 to allow a constant load to be folded into an instruction rather than forcing it to be loaded into a temporary register.	2023-03-02 05:43:42 +00:00
Chris Fallin	7b8854f803	egraphs: fix handling of effectful-but-idempotent ops and GVN. (#5800 ) * Revert "egraphs: disable GVN of effectful idempotent ops (temporarily). (#5808)" This reverts commit `c7e2571866`. * egraphs: fix handling of effectful-but-idempotent ops and GVN. This PR addresses #5796: currently, ops that are effectful, i.e., remain in the side-effecting skeleton (which we keep in the `Layout` while the egraph exists), but are idempotent and thus mergeable by a GVN pass, are not handled properly. GVN is still possible on effectful but idempotent ops precisely because our GVN does not create partial redundancies: it removes an instruction only when it is dominated by an identical instruction. An isntruction will not be "hoisted" to a point where it could execute in the optimized code but not in the original. However, there are really two parts to the egraph implementation that produce this effect: the deduplication on insertion into the egraph, and the elaboration with a scoped hashmap. The deduplication lets us give a single name (value ID) to all copies of an identical instruction, and then elaboration will re-create duplicates if GVN should not hoist or merge some of them. Because deduplication need not worry about dominance or scopes, we use a simple (non-scoped) hashmap to dedup/intern ops as "egraph nodes". When we added support for GVN'ing effectful but idempotent ops (#5594), we kept the use of this simple dedup'ing hashmap, but these ops do not get elaborated; instead they stay in the side-effecting skeleton. Thus, we inadvertently created potential for weird code-motion effects. The proposal in #5796 would solve this in a clean way by treating these ops as pure again, and keeping them out of the skeleton, instead putting "force" pseudo-ops in the skeleton. However, this is a little more complex than I would like, and I've realized that @jameysharp's earlier suggestion is much simpler: we can keep an actual scoped hashmap separately just for the effectful-but-idempotent ops, and use it to GVN while we build the egraph. In effect, we're fusing a separate GVN pass with the egraph pass (but letting it interact corecursively with egraph rewrites. This is in principle similar to how we keep a separate map for loads and fuse this pass with the egraph rewrite pass as well. Note that we can use a `ScopedHashMap` here without the "context" (as needed by `CtxHashMap`) because, as noted by @jameysharp, in practice the ops we want to GVN have all their args inline. Equality on the `InstructinoData` itself is conservative: two insts whose struct contents compare shallowly equal are definitely identical, but identical insts in a deep-equality sense may not compare shallowly equal, due to list indirection. This is fine for GVN, because it is still sound to skip any given GVN opportunity (and keep the original instructions). Fixes #5796. * Add comments from review.	2023-03-02 02:10:42 +00:00
Alex Crichton	f05babc744	x64: Add `shuffle` cases for `punpck{h,l}bw` (#5905 ) * x64: Add `shuffle` cases for `punpck{h,l}bw` I noticed this difference between LLVM and Cranelift for something I was looking at recently, and while it's probably not all that common I figured I'd add it here since it should be somewhat useful nevertheless. * Review feedback * Use u128 extractor instead	2023-03-01 21:49:00 +00:00
Alexa VanHattum	6f6fcfa437	Add filetest for unexpected imm12_from_negated aarch64 lowering (#5904 )	2023-03-01 20:31:24 +00:00
Andrew Brown	eaf4e9d3cc	doc: add a page listing supported proposals (#5781 ) * doc: add a page listing supported proposals This adds a table showing Wasmtime's support for various WASI proposals, much like the one available for WebAssembly proposals. This change is related to [#2423], which provides guidelines for implementing WASI proposals but was never merged. [#2423]: https://github.com/bytecodealliance/wasmtime/pull/2423 * review: remove phase-gating sentence	2023-03-01 18:13:17 +00:00
Alex Crichton	c4a2c1e818	clif: Remove the type variable from `swizzle` (#5897 ) This instruction is only defined with i8x16 inputs and outputs so there's no need for a type variable, so shadow the otherwise-generic `a` result with a concrete i8x16 type.	2023-03-01 00:38:53 +00:00
Alex Crichton	e0ef0b7c72	x64: Add support for `phadd{w,d}` instructions (#5896 ) This commit adds support for the bare lowering of the `iadd_pairwise` instruction with `i16x8` and `i32x4` types on the x64 backend. These lowerings are achieved with the `phaddw` and `phaddd` instructions, respectively. Additionally AVX encodings of these instructions are added too. The motivation for these new lowerings comes from the relaxed-simd proposal which will use them in the deterministic lowering of some instructions on the x64 backend.	2023-02-28 23:35:53 +00:00
yuyang	32cfd60877	fix codegen riscv64 normalize_cmp_value. (#5873 ) * fix issue5839 * add target. * fix normalize_cmp_value. * fix test failutre. * fix test failure. * fix parameter type. * Update cranelift/codegen/src/isa/riscv64/inst.isle Co-authored-by: Jamey Sharp <jamey@minilop.net> * Update cranelift/codegen/src/isa/riscv64/lower.isle Co-authored-by: Jamey Sharp <jamey@minilop.net> * remove convert rule from IntCC to ExtendOp --------- Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-02-28 23:00:23 +00:00
Sven Sauleau	0e9a48afd5	add basic coredump generation (#5868 ) This change adds a basic coredump generation after a WebAssembly trap was entered. The coredump includes rudimentary stack / process debugging information. A new CLI argument is added to enable coredump generation: ``` wasmtime --coredump-on-trap=/path/to/coredump/file module.wasm ``` See ./docs/examples-coredump.md for a working example. Refs https://github.com/bytecodealliance/wasmtime/issues/5732	2023-02-28 20:27:52 +00:00
Afonso Bordado	2dd6064005	fuzzgen: Generate multiple functions per testcase (#5765 ) * fuzzgen: Generate multiple functions per testcase * fuzzgen: Fix typo Co-authored-by: Jamey Sharp <jamey@minilop.net> --------- Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-02-28 18:47:09 +00:00
Alex Crichton	aad8eaeb5a	Add more vets for core dumps (#5894 ) Required by #5868	2023-02-28 17:32:59 +00:00
Afonso Bordado	480c45b854	fuzzgen: Initial SIMD support (#5885 ) * fuzzgen: Initial SIMD support * riscv64: Address PR Feedback Thanks!	2023-02-28 11:33:11 +00:00
Afonso Bordado	ae881407cd	cranelift-jit: Implement RISC-V Call relocation (#5835 )	2023-02-28 11:14:50 +00:00
Afonso Bordado	ef8a1340df	fuzzgen: Disable unaligned atomics for RISCV (#5883 ) * fuzzgen: Disable unaligned atomics for RISCV * riscv64: Cleanup atomic alignment logic Co-authored-by: Jamey Sharp <jamey@minilop.net> --------- Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-02-28 10:48:14 +00:00
Afonso Bordado	ddbaf6afba	fuzzgen: Add `atomic_cas` instruction (#5886 )	2023-02-28 10:24:24 +00:00
Dan Gohman	c19b742d1c	Change the name of wit-bindgen's host implementation traits. (#5890 ) * Change the name of wit-bindgen's host implementation traits. Instead of naming the host implementation trait something like `wasi_filesystem::WasiFilesystem`, name it `wasi_filesystem::Host`, and avoid using the identifier `Host` in other places. This fixes a collision when generating bindings for the current wasi-clock API, which contains an interface `wall-clock` which contains a type `wall-clock`, which created a naming collision on the name `WallClock`. * Update tests to use the new trait name. * Fix one more. * Add the new test interface to the simple-wasi world.	2023-02-27 23:14:55 +00:00
Alex Crichton	f2dce812c3	x64: Sink constant loads into xmm instructions (#5880 ) A number of places in the x64 backend make use of 128-bit constants for various wasm SIMD-related instructions although most of them currently use the `x64_xmm_load_const` helper to load the constant into a register. Almost all xmm instructions, however, enable using a memory operand which means that these loads can be folded into instructions to help reduce register pressure. Automatic conversions were added for a `VCodeConstant` into an `XmmMem` value and then explicit loads were all removed in favor of forwarding the `XmmMem` value directly to the underlying instruction. Note that some instances of `x64_xmm_load_const` remain since they're used in contexts where load sinking won't work (e.g. they're the first operand, not the second for non-commutative instructions).	2023-02-27 22:02:42 +00:00
Alex Crichton	9b86a0b9b1	Remove the `widening_pairwise_dot_product_s` clif instruction (#5889 ) This was added for the wasm SIMD proposal but I've been poking around at this recently and the instruction can instead be represented by its component parts with the same semantics I believe. This commit removes the instruction and instead represents it with the existing `iadd_pairwise` instruction (among others) and updates backends to with new pattern matches to have the same codegen as before. This interestingly entirely removed the codegen rule with no replacement on the AArch64 backend as the existing rules all existed to produce the same codegen.	2023-02-27 18:43:43 +00:00
Jamey Sharp	6cf7155052	Cranelift: Generalize `(x << k) >> k` optimization (#5746 ) * Generalize unsigned `(x << k) >> k` optimization Split the existing rule into three parts: - A dual of the rule for `(x >> k) << k` that is only valid for unsigned shifts. - Known-bits analysis for `(band (uextend x) k)`. - A new rule for converting `sextend` to `uextend` if the sign-extended bits are masked out anyway. The first two together cover the existing rule. * Generalize signed `(x << k) >> k` optimization * Review comments * Generalize sign-extending shifts further The shifts can be eliminated even if the shift amount isn't exactly equal to the difference in bit-widths between the narrow and wide types. * Add filetests	2023-02-27 17:34:46 +00:00
Volker Mische	6f64e39dda	Fix function call on component instance (#5887 ) The exported function in the instance is not called directly by its name, but by `call_<the-name>`.	2023-02-27 15:10:56 +00:00
yuyang	3864286596	fix issue 5714. (#5845 ) * fix issue 5714. * add target for regression test. * remove x86_64 test because of not implemented.	2023-02-26 16:25:38 +00:00
Jan-Justin van Tonder	66cb13cb4b	cranelift: Add atomic_cas to interpreter (#5875 ) As per issue #5818, atomic_cas was implemented without specific regard for thread safety.	2023-02-25 14:36:49 +00:00
Afonso Bordado	e9095050be	cranelift-interpreter: Implement `call_indirect` and `return_call_indirect` (#5877 ) * cranelift-interpreter: Implement `call_indirect` * cranelift: Fix typo * riscv64: Enable `call_indirect` tests	2023-02-25 13:16:59 +00:00
Afonso Bordado	36e92add6f	riscv64: Move `is_null`/`is_invalid` to ISLE (#5874 ) * riscv64: Move `is_null`/`is_invalid` to ISLE * riscv64: Fix `is_invalid` codegen * Implement review suggestions Thanks! Co-authored-by: Jamey Sharp <jamey@minilop.net> --------- Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-02-25 12:48:44 +00:00
Dan Gohman	67e2e57b02	Allow WASI preopen file descriptors to be closed. (#5828 ) Early on in WASI, we weren't sure whether we should allow preopens to be closed, so conservatively, we disallowed them. Among other things, this protected assumptions in wasi-libc that it can hold onto preopen file descriptors and rely on them always being open. However now, I think it makes sense to relax this restriction. wasi-libc itself doesn't expose the preopen file descriptors, so users shouldn't ever be closing them naively, unless they have wild closes. And toolchains other than wasi-libc may want to close preopens as a way to drop priveleges once the main file handles are opened.	2023-02-24 21:06:38 +00:00
Alex Crichton	fb2cbec34a	Add vet entries for coredump support (#5878 ) * Update the `num_cpus` crate Audits for this update provided from our import from Mozilla. * Add vet entries for coredump support	2023-02-24 18:26:39 +00:00
Trevor Elliott	4c88acbb89	Test all backends when a runtest is modified (#5872 ) * Test all backends when a runtest is modified * Check that this triggers all backend tests * Revert "Check that this triggers all backend tests" This reverts commit 1d12536d04f5a3b01fa5420f407960d7ab81da8f.	2023-02-24 15:39:37 +00:00
Jamey Sharp	5cfb461945	Only emit ISLE/egraph terms for single-value insts (#5848 ) For instructions with no results (such as branches and stores) or instructions with multiple results (such as add with carry), we have assertions checking that an optimization rule doesn't try to match on or construct such instructions. When we generate terms for matching or constructing instructions, the terms for these instructions are guaranteed to panic if they're ever used. So let's just not generate them. In the future we may wish to generate terms with different types for these instructions, to make them usable in ISLE rules for optimization that fall outside our current egraph constraints.	2023-02-24 15:38:48 +00:00
Ryan Levick	6d6bd0ea1c	Result alias for convienient use of anyhow::Error without depending on anyhow (#5853 ) * Add a Result type alias * Refer to the type in top-level docs * Use this inside the documentation for the bindgen! macro * Fix tests * Address small PR feedback * Simply re-export anyhow types	2023-02-24 15:37:34 +00:00
Jamey Sharp	7d790fcdfe	x64: Only branch once in br_table (#5850 ) This uses the `cmov`, which was previously necessary for Spectre mitigation, to clamp the table index instead of zeroing it. By then placing the default target as the last entry in the table, we can use just one branch instruction in all cases. Since there isn't a bounds-check branch any more, this sequence no longer needs Spectre mitigation. And since we don't need to be careful about preserving flags, half the instructions can be removed from this pseudoinstruction and emitted as regular instructions instead. This is a net savings of three bytes in the encoding of x64's br_table pseudoinstruction. The generated code can sometimes be longer overall because the blocks are emitted in a slightly different order. My benchmark results show a very small effect on runtime performance with this change. The spidermonkey benchmark in Sightglass runs "1.01x faster" than main by instructions retired, but with no significant difference in CPU cycles. I think that means it rarely hit the default case in any br_table instructions it executed. The pulldown-cmark benchmark in Sightglass runs "1.01x faster" than main by CPU cycles, but main runs "1.00x faster" by instructions retired. I think that means this benchmark hit the default case a significant amount of the time, so it executes a few more instructions per br_table, but maybe the branches were predicted better.	2023-02-24 04:46:38 +00:00
Trevor Elliott	c5d9d5b10f	Remove module-level code generation tests (#5870 ) * Remove module-level code generation tests * Add cold block tests for each backend * Better cold block tests	2023-02-24 01:19:26 +00:00
Alex Crichton	f91640ffab	Fix a panic due to a race in unpark and park (#5871 ) * Remove globals from parking spot tests Use `std:🧵:scope` to keep everything local to just the tests. * Fix a panic due to a race in `unpark` and `park` This commit fixes a panic in the `ParkingSpot` implementation where an `unpark` signal may not get acknowledged when a waiter times out, causing the waiter to remove itself from the internal map but panic thinking that it missed an unpark signal. The fix in this commit is to consume unpark signals when a timeout happens. This can lead to another possible race I've detailed in the comments which I believe is allowed by the specification of park/unpark in wasm. * Update crates/runtime/src/parking_spot.rs Co-authored-by: Andrew Brown <andrew.brown@intel.com> --------- Co-authored-by: Andrew Brown <andrew.brown@intel.com>	2023-02-23 23:20:05 +00:00
Alex Crichton	3fc3bc9ec8	x64: Fill out more AVX instructions (#5849 ) * x64: Fill out more AVX instructions This commit fills out more AVX instructions for SSE counterparts currently used. Many of these instructions do not benefit from the 3-operand form that AVX uses but instead benefit from being able to use `XmmMem` instead of `XmmMemAligned` which may be able to avoid some extra temporary registers in some cases. * Review comments	2023-02-23 22:31:31 +00:00
Trevor Elliott	8abfe928d6	Reuse the DominatorTree postorder travesal in BlockLoweringOrder (#5843 ) * Rework the blockorder module to reuse the dom tree's cfg postorder * Update domtree tests * Treat br_table with an empty jump table as multiple block exits * Bless tests * Change branch_idx to succ_idx and fix the comment	2023-02-23 22:05:20 +00:00
Ulrich Weigand	4314210162	s390x: Fix implementation of {s,u}{min,max} (#5864 ) When expanding a min/max operation to a pair of icmp + select, do not attempt to expand the input value operands twice, as this might fail with memory operands. Fixes https://github.com/bytecodealliance/wasmtime/issues/5859.	2023-02-23 20:01:51 +00:00
Afonso Bordado	fc080c739e	fuzzgen: Add `AtomicRMW` (#5861 )	2023-02-23 18:34:28 +00:00
Ulrich Weigand	9719147f91	s390x: Fix integer overflow during negation (#5866 ) Use wrapping_neg in i{64,32,16}_from_negated_value to avoid Rust aborts due to integer overflow. The resulting INT_MIN is already handled correctly in subsequent operations. Fixes https://github.com/bytecodealliance/wasmtime/issues/5863.	2023-02-23 16:32:10 +00:00
Alex Crichton	761e44bd36	Fix running WASI tests in isolation (#5865 ) Closes #5860	2023-02-23 16:04:15 +00:00
Noa	4f7746da60	Have StoreContext::data return &'a T (#5855 )	2023-02-23 15:32:35 +00:00
Andrew Brown	f6b16a7178	wasi-threads: fix use of `wait` in test (#5858 ) As @yamt points out [here], the `wait`/`notify` pairing used in this manual WAT test was not effective. The `wait` always immediately returned, meaning that the main thread essentially spins until a counter is atomically incremented. This is fine for test correctness, but was not the original intent, which was lost in a refactoring. This change uses the `$i` local to keep track of the counter value we expect to see for the `wait`, so that the `wait`/`notify` pair actually waits as expected. [here]: https://github.com/bytecodealliance/wasmtime/pull/5484#discussion_r1101200012	2023-02-23 15:23:58 +00:00
Jan-Justin van Tonder	0521155896	cranelift: Add atomic_rmw to interpreter (#5817 ) (#5856 ) As per the linked issue, atomic_rmw was implemented without specific regard for thread safety. Additionally, the relevant filetest (atomic-rmw-little.clif) was enabled and altered to fix an inccorrect call to test function `%atomic_rmw_and_i64` after setting up test function `%atomic_rmw_and_i32`.	2023-02-23 10:24:56 +00:00

1 2 3 4 5 ...

10954 Commits