wasmtime

Author	SHA1	Message	Date
Nick Fitzgerald	48b401c6f5	Merge pull request #2437 from abrown/bench-api Introduce benchmarking API	2020-12-15 09:42:48 -08:00
Ulrich Weigand	fe10cc9d52	Fix jitdump header magic field on big-endian platforms (#2511) The jitdump header contains a "magic" field that is defined to hold the value 0x4A695444 as u32 in native endianness. (This allows consumers of the file to detect the endianness of the platform where the file was written, and apply it when reading other fields.) However, current code always writes 0x4A695444 in little-endian byte order, even on big-endian systems. This makes consumers fail when attempting to read files written on big-endian platforms. Fixed by always writing the magic in native endianness.	2020-12-15 08:44:43 -06:00
whitequark	fc4eeae896	Alias `yanix::file::OFlags::RSYNC` to `SYNC` on Android. Android defines O_RSYNC to be the same as O_SYNC: `35bb46188c/libc/include/fcntl.h (L57)`	2020-12-14 22:03:43 -08:00
whitequark	1ec1834d6e	Treat Android like Linux in `wasi_common::sys::unix`. Just like it happens in `yanix::sys`.	2020-12-14 22:03:43 -08:00
whitequark	476ee59774	Define local `seekdir` and `telldir` prototypes on Android. These functions are not yet defined in the `libc` crate. See https://github.com/rust-lang/libc/pull/1996.	2020-12-14 22:03:43 -08:00
whitequark	919600d10b	Stub out the utimesat emulation logic on Android. Android always has `utimensat` available, so it is not necessary (or possible, for that matter) to emulate it. Mark the fallback path as `unreachable!()`.	2020-12-14 22:03:43 -08:00
Chris Fallin	743529b4eb	Merge pull request #2492 from uweigand/endian-memory-v5 Support explicit endianness in Cranelift IR MemFlags	2020-12-14 13:59:08 -08:00
Alex Crichton	2d7c2fba5f	Update wasm-smith (#2509) Brings in a few bug fixes for generating module-linking modules	2020-12-14 13:39:38 -06:00
Ulrich Weigand	467a1af83a	Support explicit endianness in Cranelift IR MemFlags WebAssembly memory operations are by definition little-endian even on big-endian target platforms. However, other memory accesses will require native target endianness (e.g. to access parts of the VMContext that is also accessed by VM native code). This means on big-endian targets, the code generator will have to handle both little- and big-endian memory accesses. However, there is currently no way to encode that distinction into the Cranelift IR that describes memory accesses. This patch provides such a way by adding an (optional) explicit endianness marker to an instance of MemFlags. Since each Cranelift IR instruction that describes memory accesses already has an instance of MemFlags attached, this can now be used to provide endianness information. Note that by default, memory accesses will continue to use the native target ISA endianness. To override this to specify an explicit endianness, a MemFlags value that was built using the set_endianness routine must be used. This patch does so for accesses that implement WebAssembly memory operations. This patch addresses issue #2124.	2020-12-14 20:15:37 +01:00
Nick Fitzgerald	ed971fcfcc	Merge pull request #2505 from alexcrichton/fix-reserve-bug Fix a memory reservation bug in `reserve_modules`	2020-12-14 10:12:46 -08:00
Nick Fitzgerald	59d435bef4	Merge pull request #2506 from alexcrichton/fix-expect-valid Fix fuzzer expectation about valid modules	2020-12-14 09:53:01 -08:00
Alex Crichton	42adeba65d	Fix fuzzer expectation about valid modules Recent changes to fuzzers made expectations more strict about handling errors while fuzzing, but this erroneously changed a module compilation step to always assume that the input wasm is valid. Instead a flag is now passed through indicating whether the wasm blob is known valid or invalid, and only if compilation fails and it's known valid do we panic.	2020-12-14 08:31:46 -08:00
Alex Crichton	c83dee07b7	Fix a memory reservation bug in `reserve_modules` This method attempted to reserve space in the `results` list of final modules. Unfortunately `results.reserve(nmodules)` isn't enough here because this can be called many times before a module is actually finished and pushed onto the vector. The attempted logic to work around this was buggy, however, and would simply trigger geometric growth on every single reservation because it erroneously assumed that a reservation would be exactly met. This is fixed by avoiding looking at the vector's capacity and instead keeping track of modules-to-be in a side field. This is then incremented and passed to `reserve` as it represents the number of modules that will eventually make their way into the result vector.	2020-12-14 07:57:47 -08:00
bjorn3	8f7f8ee0b4	Fix iconst.i8 0 miscompilation	2020-12-12 09:44:05 +01:00
Dan Gohman	1d90c329b4	Remove an unused variable.	2020-12-11 10:52:54 -08:00
Dan Gohman	88a073eac3	rustfmt	2020-12-11 10:52:54 -08:00
Dan Gohman	c3f0471ff2	Fix the return value from wasi-common's `fd_readdir`. `fd_readdir` returns a "bufused" value, which indicates the number of bytes read into the buffer. WASI libc expects this value to be equal to the size of the buffer if the end of the directory has not yet been scanned. Previously, wasi-common's `fd_readdir` was writing as many complete entries as it could fit and then stopping, but this meant it was returning size less than the buffer size even when the directory had more entries. This patch makes it continue writing up until the end of the buffer, and return that number of bytes, to let WASI libc know that there's more to be read. Fixes #2493.	2020-12-11 10:52:54 -08:00
Nick Fitzgerald	9ccef597c2	Merge pull request #2497 from alexcrichton/fuzz-module-linking Enable fuzzing the module linking implementation	2020-12-11 09:46:03 -08:00
Alex Crichton	25000afe69	Enable fuzzing the module linking implementation This commit updates all the wasm-tools crates that we use and enables fuzzing of the module linking proposal in our various fuzz targets. This also refactors some of the dummy value generation logic to not be fallible and to always succeed, the thinking being that we don't want to accidentally hide errors while fuzzing. Additionally instantiation is only allowed to fail with a `Trap`, other failure reasons are unwrapped.	2020-12-11 08:36:52 -08:00
Andrew Brown	41d668b4da	Introduce benchmarking API The new crate introduced here, `wasmtime-bench-api`, creates a shared library, e.g. `wasmtime_bench_api.so`, for executing Wasm benchmarks using Wasmtime. It allows us to measure several phases separately by exposing `engine_compile_module`, `engine_instantiate_module`, and `engine_execute_module`, which pass around an opaque pointer to the internally initialized state. This state is initialized and freed by `engine_create` and `engine_free`, respectively. The API also introduces a way of passing in functions to satisfy the `"bench" "start"` and `"bench" "end"` symbols that we expect Wasm benchmarks to import. The API is exposed in a C-compatible way so that we can dynamically load it (carefully) in our benchmark runner.	2020-12-10 15:02:10 -08:00
Alex Crichton	e09b9400f8	Restrict threads spawned during fuzzing (#2485) I was having limited success fuzzing locally because apparently the fuzzer was spawning too many threads. Looking into it, that indeed appears to be the case! The threads which time out runtime of wasm only exit after the sleep has completely finished, meaning that if we execute a ton of wasm that exits quickly, each run will generate a sleeping thread. This commit fixes the issue by using some synchronization to ensure the sleeping thread exits when our fuzzed run also exits.	2020-12-09 10:34:37 -06:00
Chris Fallin	267d4a8bdb	Merge pull request #2490 from cfallin/fix-popcnt-load-width x64 lowering fix: i32.popcnt should not merge load and make it 64-bit.	2020-12-08 22:28:41 -08:00
Chris Fallin	06fc28dc85	Merge pull request #1697 from Y-Nak/fix-licm-jt-bug Fix missing modification of jump table in licm	2020-12-08 20:18:59 -08:00
Y-Nak	855a6374dd	Fix missing modification of jump table in licm	2020-12-09 11:13:33 +09:00
Chris Fallin	6632c45c01	x64 lowering fix: i32.popcnt should not merge load and make it 64-bit. As a subtle consequence of the recent load-op fusion, popcnt of a value that came from a load.i32 was compiling into a 64-bit load. This is a result of the way in which x86 infers the width of loads: it is a consequence of the instruction containing the memory reference, not the memory reference itself. So the `input_to_reg_mem()` helper (convert an instruction input into a register or memory reference) was providing the appropriate memory reference for the result of a load.i32, but never encoded the assumption that it would only be used in a 32-bit instruction. It turns out that popcnt.i32 uses a 64-bit instruction to load this RM op, hence widening a 32-bit to 64-bit load (which is problematic when the offset is (memory_length - 4)). Separately, popcnt was using the RM operand twice, resulting in two loads if we merged a load. This isn't a correctness bug in practice because only a racy sequence (store interleaving between the loads) would produce incorrect results, but we decided earlier to treat loads as effectful for now, neither reordering nor duplicating them, to deliberately reduce complexity. Because of the second issue, the fix is just to force the operand into a register always, so any source load will not be merged. Discovered via fuzzing with oss-fuzz.	2020-12-08 12:24:34 -08:00
Chris Fallin	2cec20aa57	Merge pull request #2486 from cfallin/fix-probestack Two Lucet-related fixes to stack overflow handling.	2020-12-07 16:47:37 -08:00
Chris Fallin	3a01d14712	Two Lucet-related fixes to stack overflow handling. Lucet uses stack probes rather than explicit stack limit checks as Wasmtime does. In bytecodealliance/lucet#616, I have discovered that I previously was not running some Lucet runtime tests with the new backend, so was missing some test failures due to missing pieces in the new backend. This PR adds (i) calls to probestack, when enabled, in the prologue of every function with a stack frame larger than one page (configurable via flags); and (ii) trap metadata for every instruction on x86-64 that can access the stack, hence be the first point at which a stack overflow is detected when the stack pointer is decremented.	2020-12-07 16:08:53 -08:00
Andrew Brown	2c765c18c2	Update spec tests	2020-12-07 10:59:55 -08:00
Andrew Brown	f1025322fa	Update wasmparser to 0.69.2	2020-12-07 10:59:55 -08:00
Chris Fallin	6ce1658e4d	Merge pull request #2483 from cfallin/x64-regalloc-register-order x64 regalloc register order: put caller-saves (volatiles) first.	2020-12-07 08:53:12 -08:00
Chris Fallin	1dddba649a	x64 regalloc register order: put caller-saves (volatiles) first. The x64 backend currently builds the `RealRegUniverse` in a way that is generating somewhat suboptimal code. In many blocks, we see uses of callee-save (non-volatile) registers (r12, r13, r14, rbx) first, even in very short leaf functions where there are plenty of volatiles to use. This is leading to unnecessary spills/reloads. On one (local) test program, a medium-sized C benchmark compiled to Wasm and run on Wasmtime, I am seeing a ~10% performance improvement with this change; it will be less pronounced in programs with high register pressure (there we are likely to use all registers regardless, so the prologue/epilogue will save/restore all callee-saves), or in programs with fewer calls, but this is a clear win for small functions and in many cases removes prologue/epilogue clobber-saves altogether. Separately, I think the RA's coalescing is tripping up a bit in some cases; see e.g. the filetest touched by this commit that loads a value into %rsi then moves to %rax and returns immediately. This is an orthogonal issue, though, and should be addressed (if worthwhile) in regalloc.rs.	2020-12-06 22:37:43 -08:00
Andrew Brown	fc752efa89	Fix incorrect arithmetic NaN check and document	2020-12-04 14:44:43 -08:00
Andrew Brown	651f405220	Print WAST assertion failures in a hexadecimal pattern format Fixes #1681.	2020-12-04 14:44:43 -08:00
bjorn3	411ec3a857	Rename SimpleJIT to JIT as it isn't simple anymore	2020-12-04 13:21:13 -08:00
Nick Fitzgerald	502b39606f	Merge pull request #2364 from fitzgen/peepmatic-rebuild-test peepmatic: Make the test-we-can-get-and-rebuild peephole optimizers test work on arm64	2020-12-04 10:32:39 -08:00
Julian Seward	8f34d2dc59	aarch64 isel: collect_address_addends: correctly handle `ExtendOp::UXTW(negative immediate)`. The current code doesn't correctly handle the case where `ExtendOp::UXTW` has as source, a constant-producing insn that produces a negative (32-bit) value. Then the value is incorrectly sign-extended to 64 bits (in fact, this has already been done by `ctx.get_constant(insn)`), whereas it needs to be zero extended. The obvious fix, done here, is just to force bits 63:32 of the extension to zero, hence zero-extending it.	2020-12-04 19:21:40 +01:00
Andrew Brown	d07fffeb41	Use AlexNet for wasi-nn example (#2474)	2020-12-04 09:07:19 -06:00
Chris Fallin	bfd10512c1	Merge pull request #2473 from cfallin/fix-lowering Fix lowering instruction-sinking (load-merging) bug.	2020-12-03 15:56:50 -08:00
Nick Fitzgerald	76d3fb61c3	Merge pull request #2293 from fitzgen/souper-no-assign-constant souper-harvest: Do not generate assignments of constants	2020-12-03 15:20:41 -08:00
Chris Fallin	3e516e784b	Fix lowering instruction-sinking (load-merging) bug. This fixes a subtle corner case exposed during fuzzing. If we have a bit of CLIF like: ``` v0 = load.i64 ... v1 = iadd.i64 v0, ... v2 = do_other_thing v1 v3 = load.i64 v1 ``` and if this is lowered using a machine backend that can merge loads into ALU ops, and that has an addressing mode that can look through add ops, then the following can happen: 1. We lower the load at `v3`. This looks backward at the address operand tree and finds that `v1` is `v0` plus other things; it has an addressing mode that can add `v0`'s register and the other things directly; so it calls `put_value_in_reg(v0)` and uses its register in the amode. At this point, the add producing `v1` has no references, so it will not (yet) be codegen'd. 2. We lower `do_other_thing`, which puts `v1` in a register and uses it. The `iadd` now has a reference. 3. We reach the `iadd` and, because it has a reference, lower it. Our machine has the ability to merge a load into an ALU operation. Crucially, we think the load at `v0` is mergeable because it has only one user, the add at `v1` (!). So we merge it. 4. We reach the `load` at `v0` and because it has been merged into the `iadd`, we do not separately codegen it. The register that holds `v0` is thus never written, and the use of this register by the final load (Step 1) will see an undefined value. The logic error here is that in the presence of pattern matching that looks through pure ops, we can end up with multiple uses of a value that originally had a single use (because we allow lookthrough of pure ops in all cases). In other words, the multiple-use-ness of `v1` "passes through" in some sense to `v0`. However, the load sinking logic is not aware of this. The fix, I think, is pretty simple: we disallow an effectful instruction from sinking/merging if it already has some other use when we look back at it. If we disallowed lookthrough of any op that had multiple uses, even pure ones, then we would avoid this scenario; but earlier experiments showed that to have a non-negligible performance impact, so (given that we've worked out the logic above) I think this complexity is worth it.	2020-12-03 14:59:12 -08:00
Nick Fitzgerald	1efdf10ca7	souper-harvest: ensure that `select` conditions are of type `i1`	2020-12-03 14:43:28 -08:00
Nick Fitzgerald	f7cf771ee6	souper-harvest: Do not generate assignments of constants It turns out that Souper does not allow a constant to be assigned to a variable, they may only be used as operands. The 2.0.0 version of the `souper-ir` crate correctly reflects this. In the `cranelift_codegen::souper_harvest` module, we need to modify our Souper IR harvester so that it delays converting `iconst` and `bconst` into Souper IR until their values are used as operands. Finally, some unit tests in the `peepmatic-souper` crate need some small updates as well.	2020-12-03 14:43:14 -08:00
Alex Crichton	41caf67af3	Update the C API with module linking support (#2472) * Update the C API with module linking support This commit does everything necessary (ideally) to support the module linking proposal in the C API. The changes here are: * New `wasm_{module,instance}type_t` types and accessors * New `wasm_{module,instance}_type` functions * Conversions between `wasm_extern_t` and `wasm_{instance,module}_t`, as well as `wasm_externtype_t` and the new types. * Addition of `WASM_EXTERN_{MODULE,INSTANCE}` constants * New `wasm_config_t` modifier to enable/disable module linking With these functions it should be possible to pass instances/modules to instances and also acquire them from exports. Altogether this should enable everything for module linking. An important point for this is that I've opted to add all these items under the `wasm_` name prefix instead of `wasmtime_`. I've done this since they're all following the idioms of existing APIs and while not standard the intention would be to standardize them (unlike many other Wasmtime-specific APIs). cc #2094 * Appease doxygen	2020-12-03 15:51:38 -06:00
Pat Hickey	0f1dc9a735	Merge pull request #2403 from bjorn3/simplejit_hot_swapping SimpleJIT hot code swapping	2020-12-03 13:36:32 -08:00
Alex Crichton	09662fa716	Fix module-linking handling of instance subtypes (#2466) * Fix module-linking handling of instance subtypes When we alias the nth export of an instance, due to subtyping the nth export may not actually be what we want. Instead we need to look at our local type definition's nth export's name, and lookup that name off the export. * Update crates/wasmtime/src/instance.rs Co-authored-by: Peter Huene <peter@huene.dev> Co-authored-by: Peter Huene <peter@huene.dev>	2020-12-03 13:41:32 -06:00
Chris Fallin	111f6dc18d	Merge pull request #2471 from cfallin/disable-simd-ci Disable some x64 SIMD tests that seem to be nondeterministic.	2020-12-03 10:51:18 -08:00
bjorn3	937a3fde40	Fix simplejit tests for x64 by disabling is_pic	2020-12-03 19:24:59 +01:00
Chris Fallin	d2721064df	Disable all x64 SIMD tests, as some seem to be nondeterministic. We are seeing some repeated but nondeterministic failures of x64 SIMD tests, as tracked in #2470. In order to err on the side of not inadvertently breaking/blocking CI for others, we will disable all SIMD tests for now while we investigate.	2020-12-03 10:01:01 -08:00
bjorn3	69d041faf1	Restore support for non-pic code in SimpleJIT	2020-12-03 18:49:18 +01:00
Chris Fallin	8e0e44b802	Merge pull request #2413 from akirilov-arm/fmov_vector Cranelift AArch64: Further vector constant improvements	2020-12-03 09:43:24 -08:00
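
Note on fe10cc9d52 (jitdump magic): the difference between the old and new behaviour described in that commit message can be sketched in a few lines of Rust. This is only an illustration under assumptions: the function names and the `WriterEndianness` type are hypothetical and are not taken from the Wasmtime sources; only the magic value 0x4A695444 comes from the commit message.

```rust
/// Magic value of the jitdump header, as quoted in the commit message.
const JITDUMP_MAGIC: u32 = 0x4A69_5444;

/// Behaviour described as buggy: the magic is always emitted little-endian.
fn magic_bytes_little_endian() -> [u8; 4] {
    JITDUMP_MAGIC.to_le_bytes()
}

/// Behaviour described as the fix: the magic is emitted in the byte order
/// of the platform that writes the file.
fn magic_bytes_native() -> [u8; 4] {
    JITDUMP_MAGIC.to_ne_bytes()
}

/// Hypothetical consumer-side helper: which byte order did the writer use?
#[derive(Debug, PartialEq, Eq)]
enum WriterEndianness {
    Little,
    Big,
}

fn detect_endianness(header: [u8; 4]) -> Option<WriterEndianness> {
    if u32::from_le_bytes(header) == JITDUMP_MAGIC {
        Some(WriterEndianness::Little)
    } else if u32::from_be_bytes(header) == JITDUMP_MAGIC {
        Some(WriterEndianness::Big)
    } else {
        // Not a recognisable jitdump magic in either byte order.
        None
    }
}
```

With the native-endian writer, a consumer that probes both byte orders learns the endianness it should apply to the remaining header fields, which is the purpose the commit message gives for the field.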
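
Note on c83dee07b7 (`reserve_modules`): a minimal sketch of the "side field" idea from that commit message, assuming a hypothetical `ModuleCollector` type that stands in for the real data structure. `Vec::reserve` takes the number of additional elements beyond the current length, so this sketch subtracts the modules already pushed; that subtraction is a choice made for this example and is not claimed to match the actual patch.

```rust
/// Hypothetical stand-in for the structure that collects finished modules.
struct ModuleCollector {
    /// Finished modules (names only, to keep the sketch small).
    results: Vec<String>,
    /// Running total of modules that will eventually land in `results`,
    /// including ones that were announced but are not finished yet.
    expected: usize,
}

impl ModuleCollector {
    fn new() -> Self {
        ModuleCollector { results: Vec::new(), expected: 0 }
    }

    /// May be called many times before any module is pushed. The running
    /// total is tracked in `expected` instead of being inferred from the
    /// vector's capacity.
    fn reserve_modules(&mut self, nmodules: usize) {
        self.expected += nmodules;
        // `reserve` wants "additional beyond len()", so discount what has
        // already been pushed.
        let additional = self.expected.saturating_sub(self.results.len());
        self.results.reserve(additional);
    }

    /// Called once a module is actually finished.
    fn finish_module(&mut self, name: &str) {
        self.results.push(name.to_string());
    }
}
```

Because the reservation is derived from a counter rather than from `capacity()`, repeated calls do not depend on a reservation being met exactly.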
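
Note on c3f0471ff2 (`fd_readdir`): the "keep writing up to the end of the buffer" behaviour can be sketched as follows. This is a simplified model under assumptions: entries are treated as already-serialized byte strings and `fill_readdir_buf` is a hypothetical name, whereas the real implementation serializes WASI directory entries.

```rust
/// Copies serialized directory entries into `buf` and returns the number of
/// bytes used. The last entry is truncated if it does not fit, so the return
/// value equals `buf.len()` whenever the entries do not all fit.
fn fill_readdir_buf(entries: &[Vec<u8>], buf: &mut [u8]) -> usize {
    let mut used = 0;
    for entry in entries {
        let remaining = buf.len() - used;
        if remaining == 0 {
            // Buffer is full; the caller will see `used == buf.len()`.
            break;
        }
        // Write as much of this entry as fits, even if that is a partial entry.
        let n = entry.len().min(remaining);
        buf[used..used + n].copy_from_slice(&entry[..n]);
        used += n;
    }
    used
}
```

A return value smaller than the buffer size then only occurs when every entry fit completely, which matches what the commit message says WASI libc expects.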
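
Note on e09b9400f8 (fuzzing threads): one way to make a timeout thread exit as soon as the run it guards has finished is to replace a plain sleep with a wait on a channel. The sketch below uses only the Rust standard library; `spawn_watchdog` is a hypothetical helper, and the commit message does not say which synchronization primitive the actual fix uses.

```rust
use std::sync::mpsc::{self, RecvTimeoutError, Sender};
use std::thread::{self, JoinHandle};
use std::time::Duration;

/// Spawns a watchdog thread. It calls `on_timeout` if `timeout` elapses
/// first, and exits promptly if the returned sender is signalled or dropped
/// before that. The thread's result is `true` if the timeout fired.
fn spawn_watchdog<F>(timeout: Duration, on_timeout: F) -> (Sender<()>, JoinHandle<bool>)
where
    F: FnOnce() + Send + 'static,
{
    let (done_tx, done_rx) = mpsc::channel::<()>();
    let handle = thread::spawn(move || match done_rx.recv_timeout(timeout) {
        // Nothing arrived in time: the guarded run is taking too long.
        Err(RecvTimeoutError::Timeout) => {
            on_timeout();
            true
        }
        // A message or a dropped sender both mean the run finished first,
        // so the thread returns without sleeping for the full timeout.
        Ok(()) | Err(RecvTimeoutError::Disconnected) => false,
    });
    (done_tx, handle)
}
```

Dropping the sender when the run ends is enough to wake the watchdog, so quickly finishing runs no longer leave sleeping threads behind.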
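
Note on 8f34d2dc59 (aarch64 `ExtendOp::UXTW`): the arithmetic behind that fix is small enough to show directly. The function names are hypothetical and the sketch is plain integer arithmetic, not code from the aarch64 backend.

```rust
/// A negative 32-bit constant after sign extension to 64 bits, which the
/// commit message says has already happened by the time the value is seen.
fn sign_extended(value: i32) -> u64 {
    value as i64 as u64
}

/// The fix described in the commit message: force bits 63:32 to zero so the
/// value is zero-extended, as UXTW requires.
fn zero_extend_low32(value: u64) -> u64 {
    value & 0xFFFF_FFFF
}
```

For example, the constant -16 is 0xFFFF_FFFF_FFFF_FFF0 once sign-extended, while the UXTW interpretation of the same 32-bit pattern is 0x0000_0000_FFFF_FFF0.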


7459 Commits