This PR switches the default backend on x86, for both the
`cranelift-codegen` crate and for Wasmtime, to the new
(`MachInst`-style, `VCode`-based) backend that has been under
development and testing for some time now.
The old backend remains the default in builds with the
`old-x86-backend` feature enabled, and is otherwise still available by
requesting `BackendVariant::Legacy` from the appropriate APIs.
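For embedders, selecting the legacy variant looks roughly like the sketch below. Note that `lookup_variant` is assumed here as the variant-aware counterpart to `isa::lookup`, so treat the exact entry point as an assumption:

```rust
use cranelift_codegen::isa::{self, BackendVariant};
use cranelift_codegen::settings;
use target_lexicon::Triple;

// A minimal sketch, not the exact API surface: build an ISA for the host
// target while explicitly requesting the legacy x86 backend.
fn legacy_x86_isa() -> Box<dyn isa::TargetIsa> {
    let flags = settings::Flags::new(settings::builder());
    isa::lookup_variant(Triple::host(), BackendVariant::Legacy)
        .expect("unsupported target")
        .finish(flags)
}
```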
As part of that switch, it adds some more runtime-configurable plumbing
to the testing infrastructure so that tests can be run using the
appropriate backend. `clif-util test` is now capable of parsing a
backend selector option from filetests and instantiating the correct
backend.
CI has been updated so that the old x86 backend continues to run its
tests, just as we used to run the new x64 backend separately.
At some point, we will remove the old x86 backend entirely, once we are
satisfied that the new backend has not caused any unforeseen issues and
we do not need to revert.
In preparation for adding new encoding modes to the x64 backend (e.g. VEX,
EVEX), this change moves all of the current instruction encoding functions to
`encodings::rex`. This refactor does not change any logic.
This commit changes how both the shared flags and ISA flags are stored in the
serialized module to detect incompatibilities when a serialized module is
instantiated.
It improves the error reporting when a compiled module has mismatched shared
flags.
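The check has roughly the following shape (a hypothetical sketch; Wasmtime's actual serialized representation differs): every flag value recorded at compile time must match the value configured in the engine doing the instantiation.

```rust
use std::collections::HashMap;

// Hypothetical helper illustrating the compatibility check described
// above; the names and the string-map representation are invented.
fn check_flags(
    serialized: &HashMap<String, String>,
    current: &HashMap<String, String>,
) -> Result<(), String> {
    for (name, saved) in serialized {
        match current.get(name) {
            Some(now) if now == saved => {}
            Some(now) => {
                return Err(format!(
                    "setting `{name}` was `{saved}` at compile time but is `{now}` now"
                ));
            }
            None => return Err(format!("unknown setting `{name}`")),
        }
    }
    Ok(())
}
```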
This commit adds the `wasmtime settings` command to print out available
Cranelift settings for a target (defaults to the host).
The compile command has been updated to remove the Cranelift ISA options in
favor of encouraging users to use `wasmtime settings` to discover what settings
are available. This will reduce the maintenance cost for syncing the compile
command with Cranelift ISA flags.
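For example (output elided; it lists each available Cranelift setting with a description):

```text
$ wasmtime settings
```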
This commit adds a `compile` command to the Wasmtime CLI.
The command can be used to compile WebAssembly modules ahead of time (AOT).
With the `all-arch` feature enabled, AOT compilation can be performed for
non-native architectures (i.e. cross-compilation).
The `Module::compile` method has been added to perform AOT compilation.
A few of the CLI flags relating to "on by default" Wasm features have been
changed to be "--disable-XYZ" flags.
A simple example of using the `wasmtime compile` command:
```text
$ wasmtime compile input.wasm
$ wasmtime input.cwasm
```
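Programmatically, the same AOT step is exposed via `Module::compile`. A minimal sketch, assuming a signature of engine plus bytes returning the serialized artifact (treat the exact shape as an assumption):

```rust
use wasmtime::{Engine, Module};

// Produce the same artifact `wasmtime compile` writes to a `.cwasm` file;
// the assumed signature is `Module::compile(&Engine, &[u8]) -> Result<Vec<u8>>`.
fn aot_compile(wasm: &[u8]) -> anyhow::Result<Vec<u8>> {
    let engine = Engine::default();
    Module::compile(&engine, wasm)
}
```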
* Use stable Rust on CI to test the x64 backend
This commit leverages the newly-released 1.51.0 compiler to test the
new backend on Windows and Linux with a stable compiler instead of a
nightly compiler. This isolates nightly usage to just documentation
generation and fuzzing, both of which rely on nightly for the best
results right now.
* Use updated stable in book build job
* Run rustfmt for new stable
* Silence new warnings for wasi-nn
* Allow some dead code in the x64 backend
Looks like new rustc is better about emitting some dead-code warnings
* Update rust in peepmatic job
* Fix a test in the pooling allocator
* Remove `package.metadata.docs.rs` temporarily
Needs resolution of https://github.com/rust-lang/cargo/pull/9300 first
* Fix a warning in a wasi-nn example
The codegen for div/rem ops has two modes, depending on the
`avoid_div_traps` flag: it can either check for all trapping conditions
explicitly, emit explicit trap instructions, and then use a hardware
divide instruction that will never fault (`avoid_div_traps == true`); or
it can run in a mode where a hardware fault on the divide instruction
implies a Wasm trap (`avoid_div_traps == false`). Wasmtime uses the
former while Lucet (for example) uses the latter.
It turns out that because we run all our spec tests under Wasmtime, we
missed a spec corner case that fails in the latter mode: per the spec,
INT_MIN % -1 == 0, but the x86 signed divide/remainder instruction traps
on those operands. Hence, in Lucet, this specific remainder computation
would incorrectly result in a Wasm trap.
This PR fixes the issue by just forcing use of the explicit-checks
implementation for `srem` even when `avoid_div_traps` is false.
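For reference, the required semantics match Rust's `wrapping_rem`, which makes the corner case easy to state:

```rust
fn main() {
    // The Wasm spec requires INT_MIN % -1 == 0, with no trap; x86's signed
    // divide instruction faults on these operands, so the lowering must
    // special-case them.
    assert_eq!(i32::MIN.wrapping_rem(-1), 0);
    assert_eq!(i64::MIN.wrapping_rem(-1), 0);
}
```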
This logging step may be quite expensive, since logging has never been
optimized at all. Removing it is a clear win in compile times on my
machine for a large wasm module: parallel compilation drops from 6
seconds to 1.5 seconds.
Co-authored-by: bjorn3 <bjorn3@users.noreply.github.com>
This bumps target-lexicon and adds support for the AppleAarch64 calling
convention. Specifically for WebAssembly support, we only have to worry
about the new stack-slot convention. Stack slots no longer need to be
at least 8 bytes; they can be as small as the data type's size. For
instance, stack slots for (i32, i32) can be located at offsets (+0, +4).
Note that they still need to be aligned to the data type they contain,
though, so for stack slots holding (i32, i64) we can't start the i64
slot at the +4 offset (it must start at the +8 offset).
Added one test that was failing on the Mac M1, as well as other tests
stressing different yet similar situations.
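The rule is easy to model. A minimal sketch (hypothetical helper, not Cranelift's actual ABI code) that reproduces the offsets from the examples above:

```rust
// Illustrative model of the AppleAarch64 stack-argument rule: each slot
// takes only the type's size, but stays naturally aligned to that size.
fn apple_arg_offsets(sizes: &[u32]) -> Vec<u32> {
    let mut next = 0;
    sizes
        .iter()
        .map(|&size| {
            let off = (next + size - 1) / size * size; // align up to `size`
            next = off + size;
            off
        })
        .collect()
}

fn main() {
    assert_eq!(apple_arg_offsets(&[4, 4]), vec![0, 4]); // (i32, i32) -> +0, +4
    assert_eq!(apple_arg_offsets(&[4, 8]), vec![0, 8]); // (i32, i64) -> +0, +8
}
```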
* Switch macOS to using mach ports for trap handling
This commit moves macOS to using mach ports instead of signals for
handling traps. The motivation for this is listed in #2456: once mach
ports are used in a process, traditional UNIX signal handlers won't get
used. This means that if Wasmtime is integrated with
Breakpad, for example, then Wasmtime's trap handler never fires and
traps don't work.
The `traphandlers` module is refactored as part of this commit to split
the platform-specific bits into their own files (it was growing quite a
lot for one inline `cfg_if!`). The `unix.rs` and `windows.rs` files
remain the same as they were before with a few minor tweaks for some
refactored interfaces. The `macos.rs` file is brand new and lifts almost
its entire implementation from SpiderMonkey, adapted for Wasmtime
though.
The main gotcha with mach ports is that a separate thread is what
services the exception. Some unsafe magic allows this separate thread to
read non-`Send` and temporary state from other threads, and this is
hoped to be safe in this context. The unfortunate downside is that
calling wasm on macOS now involves taking a global lock and modifying a
global hash map twice per call. I'm not entirely sure how to get out of
this cost for now, but hopefully for any embeddings on macOS it's not
the end of the world.
Closes #2456
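To illustrate the per-call bookkeeping described above, a hypothetical sketch (all names invented; the real code lives in `macos.rs` and differs in the details): each wasm entry registers the current thread in a global map so the exception-servicing thread can find the faulting thread's trap state.

```rust
use std::collections::HashMap;
use std::sync::{Mutex, OnceLock};

// Global map from (assumed) mach thread port to a pointer-sized handle
// for that thread's trap state.
fn registry() -> &'static Mutex<HashMap<u32, usize>> {
    static MAP: OnceLock<Mutex<HashMap<u32, usize>>> = OnceLock::new();
    MAP.get_or_init(|| Mutex::new(HashMap::new()))
}

// First global-map modification per call: register on wasm entry.
fn enter_wasm(thread_port: u32, trap_state: usize) {
    registry().lock().unwrap().insert(thread_port, trap_state);
}

// Second modification per call: deregister on wasm exit.
fn exit_wasm(thread_port: u32) {
    registry().lock().unwrap().remove(&thread_port);
}
```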
* Add a sketch of arm64 apple support
* store: maintain CallThreadState mapping when switching fibers
* cranelift/aarch64: generate unwind directives to disable pointer auth
Aarch64 post ARMv8.3 has a feature called pointer authentication,
designed to fight ROP/JOP attacks: some pointers may be signed using new
instructions, adding payloads to the high (previously unused) bits of
the pointers. More on this here: https://lwn.net/Articles/718888/
Unwinders on aarch64 need to know whether pointers stored in the call
frame contain an authentication code or not, to be able to properly
authenticate them or use them directly. Since native code may have
enabled it by default (as is the case on the Mac M1), and this
configuration value is inherited by default, we need to explicitly
disable it for the only kind of supported pointers (return addresses).
To do so, we set the DWARF pseudo-register 34 (`RA_SIGN_STATE`, which
does not correspond to a real register) to 0, as documented in
https://github.com/ARM-software/abi-aa/blob/master/aadwarf64/aadwarf64.rst#note-8.
This is done at function granularity, in the spirit of Cranelift's
compilation model. Alternatively, a single directive could be generated
in the CIE, producing less information per module.
* Make exception handling work on Mac aarch64 too
* fibers: use a breakpoint instruction after the final call in wasmtime_fiber_start
Co-authored-by: Alex Crichton <alex@alexcrichton.com>
This commit enables Cranelift's AArch64 backend to generate code
for instruction set extensions (previously only the base Armv8-A
architecture was supported); also, it makes it possible to detect
the extensions supported by the host when JIT compiling. The new
functionality is applied to the IR instruction `AtomicCas`.
Copyright (c) 2021, Arm Limited.
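As a concrete illustration of how such an extension is enabled when building an ISA, here is a sketch. `has_lse` is assumed to be the flag gating the LSE atomics that give `AtomicCas` a direct lowering, and the builder calls may differ slightly:

```rust
use std::str::FromStr;

use cranelift_codegen::isa;
use cranelift_codegen::settings::{self, Configurable};
use target_lexicon::Triple;

// Sketch: request an AArch64 ISA with the (assumed) `has_lse` extension
// flag enabled.
fn aarch64_isa_with_lse() -> Box<dyn isa::TargetIsa> {
    let shared = settings::Flags::new(settings::builder());
    let triple = Triple::from_str("aarch64-unknown-linux-gnu").unwrap();
    let mut builder = isa::lookup(triple).expect("aarch64 backend not compiled in");
    builder.enable("has_lse").expect("unknown flag");
    builder.finish(shared)
}
```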
Our previous implementation of unwind infrastructure was somewhat
complex and brittle: it parsed generated instructions in order to
reverse-engineer unwind info from prologues. It also relied on some
fragile linkage to communicate instruction-layout information that VCode
was not designed to provide.
A much simpler, more reliable, and easier-to-reason-about approach is to
embed unwind directives as pseudo-instructions in the prologue as we
generate it. That way, we can say what we mean and just emit it
directly.
The usual reasoning that leads to the reverse-engineering approach is
that metadata is hard to keep in sync across optimization passes; but
here, (i) prologues are generated at the very end of the pipeline, and
(ii) if we ever do a post-prologue-gen optimization, we can treat unwind
directives as black boxes with unknown side-effects, just as we do for
some other pseudo-instructions today.
It turns out that it was easier to just build this for both x64 and
aarch64 (since they share a factored-out ABI implementation), and wire
up the platform-specific unwind-info generation for Windows and SystemV.
Now we have simpler unwind on all platforms and we can delete the old
unwind infra as soon as we remove the old backend.
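The shape of the idea, as a sketch (variant names invented; the real pseudo-instructions carry more detail): the prologue emitter records what it just did, inline with the real instructions, and a platform-specific pass later translates those records into Windows or SystemV unwind info.

```rust
// Illustrative only: unwind directives as pseudo-instructions that are
// interleaved with the prologue instructions as they are generated. They
// emit no machine code themselves; they only carry metadata.
enum UnwindDirective {
    /// The old frame pointer and return address have just been pushed.
    PushFrameRegs,
    /// A new frame of `size` bytes has just been allocated.
    DefineFrame { size: u32 },
    /// A callee-saved register was just stored at `offset` in the frame.
    SaveReg { reg: u16, offset: u32 },
}
```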
There were a few consequences to supporting Fastcall unwind in
particular that led to a refactor of the common ABI. Windows only
supports naming clobbered-register save locations within 240 bytes of
the frame-pointer register, whatever one chooses that to be (RSP or
RBP). We had previously saved clobbers below the fixed frame (and below
nominal-SP). The 240-byte range has to include the old RBP too, so we're
forced to place clobbers at the top of the frame, just below saved
RBP/RIP. This is fine; we always keep a frame pointer anyway because we
use it to refer to stack args. It does mean that offsets of fixed-frame
slots (spillslots, stackslots) from RBP are no longer known before we do
regalloc, so if we ever want to index these off of RBP rather than
nominal-SP because we add support for `alloca` (dynamic frame growth),
then we'll need a "nominal-BP" mode that is resolved after regalloc and
clobber-save code is generated. I added a comment to this effect in
`abi_impl.rs`.
The above refactor touched both x64 and aarch64 because of shared code.
This had a further effect in that the old aarch64 prologue generation
subtracted from `sp` once to allocate space, then used stores to `[sp,
offset]` to save clobbers. Unfortunately the offset only has 7-bit
range, so if there are enough clobbered registers (and there can be --
aarch64 has 384 bytes of registers; at least one unit test hits this)
the stores/loads will be out of range. I really don't want to
synthesize large-offset sequences here; better to go back to the simpler
pre-indexed/post-indexed form (e.g. `stp x1, x2, [sp, #-16]!`) that
works just like a "push". It's likely not much worse
microarchitecturally (dependence
chain on SP, but oh well) and it actually saves an instruction if
there's no other frame to allocate. As a further advantage, it's much
simpler to understand; simpler is usually better.
This PR adds the new backend on Windows to CI as well.
The Wasm SIMD specification has added new instructions that allow inserting into a lane of a vector from a memory location and, conversely, extracting from a lane of a vector to a memory location. The simplest implementation lowers these instructions, `load[8|16|32|64]_lane` and `store[8|16|32|64]_lane`, to a sequence of either `load + insertlane` or `extractlane + store` (in CLIF). With the new backend's pattern matching, we expect these CLIF sequences to compile to a single machine instruction (at least on x64).
This commit refactors module instantiation in the runtime to allow for
different instance allocation strategy implementations.
It adds an `InstanceAllocator` trait with the current implementation put behind
the `OnDemandInstanceAllocator` struct.
The Wasmtime API has been updated to allow a `Config` to have an instance
allocation strategy set which will determine how instances get allocated.
This change is in preparation for an alternative *pooling* instance allocator
that can reserve all needed host process address space in advance.
This commit also makes changes to the `wasmtime_environ` crate to represent
compiled modules in a way that reduces copying at instantiation time.
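In outline, the new seam looks like the sketch below (illustrative only; the real trait in the wasmtime runtime has more methods and different signatures):

```rust
// Stand-ins for the real types (module, imports, limits, ...).
struct InstanceRequest;
struct InstanceHandle;

// Illustrative shape of the allocation seam: strategies differ in where
// instance memory comes from, not in how instances behave afterwards.
trait InstanceAllocator {
    fn allocate(&self, req: InstanceRequest) -> Result<InstanceHandle, String>;
    fn deallocate(&self, handle: InstanceHandle);
}

// The current behavior, factored behind the trait: allocate memories,
// tables, and the instance itself on demand from the host.
struct OnDemandInstanceAllocator;

impl InstanceAllocator for OnDemandInstanceAllocator {
    fn allocate(&self, _req: InstanceRequest) -> Result<InstanceHandle, String> {
        Ok(InstanceHandle)
    }
    fn deallocate(&self, _handle: InstanceHandle) {}
}
```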
This adds support for the "fastcall" ABI, which is the native C/C++ ABI
on Windows platforms on x86-64. It is similar to but not exactly like
System V; primarily, its argument register assignments are different,
and it requires stack shadow space.
Note that this also adjusts the handling of multi-register values in the
shared ABI implementation, and with this change, adjusts handling of
`i128`s on *both* Fastcall/x64 *and* SysV/x64 platforms. This was done
to align with actual behavior by the "rustc ABI" on both platforms, as
mapped out experimentally (Compiler Explorer link in comments). This
behavior is gated under the `enable_llvm_abi_extensions` flag.
Note also that this does *not* add x64 unwind info on Windows. That will
come in a future PR (but is planned!).
This instruction has a single-instruction lowering in AVX512F/VL and a three-instruction lowering in AVX, but neither extension is currently supported in the x64 backend. To implement this, we instead subtract the vector from 0 and use a blending instruction to pick the lanes containing the absolute value.
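A scalar model of that lowering, per lane (purely illustrative):

```rust
fn main() {
    // Per-lane model: compute 0 - x, then "blend" by picking whichever of
    // {x, 0 - x} holds the absolute value.
    fn abs_lane(x: i32) -> i32 {
        let neg = 0i32.wrapping_sub(x);
        if x >= 0 { x } else { neg }
    }
    assert_eq!(abs_lane(-5), 5);
    assert_eq!(abs_lane(7), 7);
    assert_eq!(abs_lane(i32::MIN), i32::MIN); // wraps, matching Wasm semantics
}
```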
* Update wasm-tools crates
* Update Wasm SIMD spec tests
* Invert 'experimental_x64_should_panic' logic
By doing this, it is easier to see which spec tests currently panic. The new tests correspond to recently-added instructions.
* Fix: ignore new spec tests for all backends
Add a bunch of test vectors that actually expose this (previously the
shift-by-zero test had equal lower and upper halves and hid the bug),
including the most basic of all, 1 << 0 == 1 (thanks @bjorn3 for finding
this).
If an instruction has more than one trap record associated with it (for
example: a divide instruction that has participated in load-op fusion,
so we have both a heap-out-of-bounds trap record due to its load and a
divide-by-zero trap record due to its divide op), the current MachBuffer
code would emit only one of the trap records to the sink.
Separately, divide instructions probably shouldn't merge loads, because
the two separate possible traps at one location might be confusing for
some embedders (certainly in Lucet). Divide seems to be the only case in
our current codegen where such merging might occur. This PR changes the
lowering to always force the divisor into a register.
Finally, while working out why trap records were not appearing, I
noticed that `isa::x64::emit_std_enc_mem()` was only emitting heap-OOB
trap metadata for loads/stores when it had a srcloc. This PR ensures
that the metadata is emitted even when the srcloc is empty.
Note that none of the above presents a security or correctness problem;
trap metadata only affects the status that we return to the embedder
when a Wasm program terminates with a trap.
When a block is unreachable, the `unreachable_code` pass will remove it,
which is perfectly sensible. Jump tables factor into unreachability in
an expected way: even if a block is listed in a jump table, the block
might be unreachable if the jump table itself is unused (or used in an
unreachable block). Unfortunately, the verifier still expects all
block refs in all jump tables to be valid, even after DCE, which will
not always be the case.
This makes a simple change to the pass: after removing blocks, it scans
jump tables. Any jump table that refers to an unreachable block must
itself be unused, and so we just clear its entries. We do not bother
removing it (and renumbering all later jump tables), and we do not
bother computing full unused-ness of all jump tables, as that would be
more expensive; it's sufficient to clear out the ones that refer to
unreachable blocks, which are a subset of all unused jump tables.
Fixes #2670.
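A hypothetical sketch of the cleanup (the real code lives in the `unreachable_code` pass and uses Cranelift's actual data structures):

```rust
// Model jump tables as lists of block indices; `reachable[b]` records
// whether block `b` survived dead-code elimination.
fn clear_dead_jump_tables(jump_tables: &mut [Vec<usize>], reachable: &[bool]) {
    for table in jump_tables.iter_mut() {
        // A table naming any unreachable block must itself be unused, so
        // clearing its entries is safe and keeps the verifier happy.
        if table.iter().any(|&b| !reachable[b]) {
            table.clear();
        }
    }
}
```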
This fixes #2672 and #2679, and also fixes an incorrect instruction
emission (`test` with a small immediate) that we had missed earlier.
The shift-related fixes have to do with (i) shifts by 0 bits, a special
case that must be handled; and (ii) shifts by a 128-bit amount, which we
can handle by just dropping the upper half (we only use 3 to 7 bits of
the shift amount).
This adjusts the lowerings appropriately, and also adds run-tests to
ensure that the lowerings actually execute correctly (previously we only
had compile-tests with golden lowerings; I'd like to correct this for
more ops eventually, adding run-tests beyond what the Wasm spec and
frontend cover).
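The masking rule these run-tests exercise is easy to state in Rust:

```rust
fn main() {
    let x: u128 = 0x0123_4567_89ab_cdef_0011_2233_4455_6677;
    // A shift by 0 bits is the identity (the case the old tests hid).
    assert_eq!(1u128 << 0, 1);
    assert_eq!(x << 0, x);
    // Only the low bits of the shift amount matter: 128 wraps to 0.
    assert_eq!(x.wrapping_shl(128), x);
}
```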