Commit Graph

8197 Commits

Peter Huene
b775b68cfb Make module information lookup from runtime safe.
This commit uses a two-phase lookup of stack map information from modules
rather than giving back raw pointers to stack maps.

First, the runtime looks up information about a module from a pc value, which
returns an `Arc` that it keeps a reference to while completing the stack map
lookup.

Second, it queries the module information for the stack map at that pc value,
getting a reference to the stack map (which is now safe because of the `Arc`
held by the runtime).
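
A minimal sketch of this two-phase shape, with illustrative stand-in types
rather than the actual wasmtime-runtime API:

```rust
use std::collections::BTreeMap;
use std::sync::Arc;

// Hypothetical stand-ins for the real module/stack-map types.
struct StackMap;
struct ModuleInfo {
    // Stack maps keyed by the end PC of the range they describe.
    stack_maps: BTreeMap<usize, StackMap>,
}

impl ModuleInfo {
    // Phase two: resolve the stack map for a pc within this module.
    fn lookup_stack_map(&self, pc: usize) -> Option<&StackMap> {
        self.stack_maps.range(pc..).next().map(|(_, m)| m)
    }
}

struct Runtime {
    // Registered modules keyed by their starting PC.
    modules: BTreeMap<usize, Arc<ModuleInfo>>,
}

impl Runtime {
    // Phase one: find the owning module and clone its `Arc`, keeping the
    // module (and thus the stack map memory) alive for phase two.
    fn module_info_for_pc(&self, pc: usize) -> Option<Arc<ModuleInfo>> {
        self.modules.range(..=pc).next_back().map(|(_, m)| m.clone())
    }
}
```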
2021-04-16 12:30:10 -07:00
Peter Huene
6ac1321162 Minor corrections with latest changes. 2021-04-16 11:08:22 -07:00
Peter Huene
726a936474 Remove ArcModuleCode as it is no longer used. 2021-04-16 11:08:22 -07:00
Peter Huene
510fc71728 Code review feedback.
* Make `FunctionInfo` public and `CompiledModule::func_info` return it.
* Make the `StackMapLookup` trait unsafe.
* Add comments for the purpose of `EngineHostFuncs`.
* Rework ownership model of shared signatures: `SignatureCollection` in
  conjunction with `SignatureRegistry` is now used so that the `Engine`,
  `Store`, and `Module` don't need to worry about unregistering shared
  signatures.
* Implement `Func::param_arity` and `Func::result_arity` in terms of
  `Func::ty`.
* Make looking up a trampoline with the module registry more efficient by doing
  a binary search on the function's starting PC value for the owning module and
  then looking up the trampoline with only that module (see the sketch below).
* Remove reference to the shared signatures from `GlobalRegisteredModule`.
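
As referenced above, a rough sketch of the binary-search lookup, using
simplified, hypothetical types rather than the real module registry:

```rust
// Hypothetical, simplified stand-ins for the registry described above.
struct Trampoline;

struct RegisteredModule {
    start_pc: usize,
    end_pc: usize,
}

impl RegisteredModule {
    fn trampoline_for(&self, _pc: usize) -> Option<&Trampoline> {
        None // the per-module lookup is elided here
    }
}

struct ModuleRegistry {
    // Kept sorted by each module's starting PC.
    modules: Vec<RegisteredModule>,
}

impl ModuleRegistry {
    fn lookup_trampoline(&self, pc: usize) -> Option<&Trampoline> {
        // Binary search on the starting PC to find the owning module...
        let idx = match self.modules.binary_search_by_key(&pc, |m| m.start_pc) {
            Ok(i) => i,
            Err(0) => return None,
            Err(i) => i - 1,
        };
        let module = &self.modules[idx];
        if pc >= module.end_pc {
            return None;
        }
        // ...then look up the trampoline with only that module.
        module.trampoline_for(pc)
    }
}
```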
2021-04-16 11:08:21 -07:00
Peter Huene
ea72c621f0 Remove the stack map registry.
This commit removes the stack map registry and instead uses the existing
information from the store's module registry to lookup stack maps.

A trait is now used to pass the lookup context to the runtime, implemented by
`Store` to do the lookup.

With this change, module registration in `Store` is now entirely limited to
inserting the module into the module registry.
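
A minimal sketch of this trait shape (illustrative stand-in types, not the
exact wasmtime signatures):

```rust
// Illustrative stand-in types, not the exact wasmtime signatures.
struct StackMap;

struct ModuleRegistry;

impl ModuleRegistry {
    fn stack_map_for(&self, _pc: usize) -> Option<&StackMap> {
        None // the real registry lookup is elided
    }
}

// The runtime only sees this trait and has no direct knowledge of `Store`.
trait StackMapLookup {
    fn lookup(&self, pc: usize) -> Option<&StackMap>;
}

struct Store {
    modules: ModuleRegistry,
}

// `Store` implements the trait by deferring to its module registry.
impl StackMapLookup for Store {
    fn lookup(&self, pc: usize) -> Option<&StackMap> {
        self.modules.stack_map_for(pc)
    }
}
```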
2021-04-16 11:08:21 -07:00
Peter Huene
a2466b3c23 Move the signature registry into Engine.
This commit moves the shared signature registry out of `Store` and into
`Engine`.

This helps eliminate work that was performed whenever a `Module` was
instantiated into a `Store`.

Now a `Module` is registered with the shared signature registry upon creation,
storing the mapping from the module's signature index space to the shared index
space.

This also refactors the "frame info" registry into a general purpose "module
registry" that is used to look up trap information, signature information, and
(soon) stack map information.
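
A simplified sketch of the index-space mapping described here (hypothetical
types; the real `SignatureRegistry` does considerably more):

```rust
use std::collections::HashMap;

// Hypothetical simplified types for illustration.
#[derive(Clone, PartialEq, Eq, Hash)]
struct WasmFuncType(Vec<u8>); // stand-in for a real function type

#[derive(Clone, Copy)]
struct VMSharedSignatureIndex(u32);

// Lives in the `Engine`, shared by every `Store` created from it.
#[derive(Default)]
struct SignatureRegistry {
    shared: HashMap<WasmFuncType, VMSharedSignatureIndex>,
}

impl SignatureRegistry {
    // Called once at `Module` creation: translate the module's signature
    // index space (position in `types`) into the engine-wide shared space.
    fn register_module(&mut self, types: &[WasmFuncType]) -> Vec<VMSharedSignatureIndex> {
        types
            .iter()
            .map(|ty| {
                let next = VMSharedSignatureIndex(self.shared.len() as u32);
                *self.shared.entry(ty.clone()).or_insert(next)
            })
            .collect()
    }
}
```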
2021-04-16 11:06:44 -07:00
Benjamin Bouvier
f26449f03d Merge pull request #2845 from bnjbvr/fix-unwind-win64-old-backend
Generate unwind information on Win64 with the old backend
2021-04-16 18:59:19 +02:00
Benjamin Bouvier
ba73b458b8 Introduce a new API that allows notifying that a Store has moved to a new thread (#2822)
* Introduce a new API that allows notifying that a Store has moved to a new thread

* Add backlink to documentation, and mention the new API in the multithreading doc;
2021-04-16 11:15:35 -05:00
Benjamin Bouvier
8ab3511b3b Generate unwind information on Win64 with the old backend
Following the new ABI introduced for efficient support of multiple return values, the old-backend test for generating unwind information was incomplete, resulting in no unwind information being generated and traps not being correctly caught by the runtime.
2021-04-16 18:05:49 +02:00
Benjamin Bouvier
82f6556bc2 Merge pull request #2758 from bnjbvr/revert-log
cranelift: Use a deferred display mechanism instead of `log_enabled!`
2021-04-16 11:49:44 +02:00
Benjamin Bouvier
50aa645769 cranelift: use a deferred display wrapper for logging the vcode's IR 2021-04-16 10:27:19 +02:00
Chris Fallin
03077e0de9 Merge pull request #2843 from uweigand/spillslot-fix
cranelift: Fix spillslot regression on big-endian platforms
2021-04-15 13:28:33 -07:00
Ulrich Weigand
10efe8e780 cranelift: Fix spillslot regression on big-endian platforms
PR 2840 changed the store_spillslot routine to always store
integer registers in full word size to a spill slot.  However,
the load_spillslot routine was not updated, which may cause
the contents to be reloaded with a different type.  On big-endian
systems this will fetch the wrong data.

Fixed by using the same type override in load_spillslot.
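
A tiny illustration of the shape of the fix, using a made-up type enum
instead of Cranelift's real `Type`:

```rust
// Illustrative stand-in for Cranelift's `Type`; not the real code.
#[derive(Clone, Copy, PartialEq, Debug)]
enum Ty {
    I8,
    I16,
    I32,
    I64,
}

// The same override must be applied by both store_spillslot and
// load_spillslot so a slot is written and read with one and the same type.
fn spillslot_ty(_ty: Ty) -> Ty {
    Ty::I64 // widen all integer types to a full word
}

fn main() {
    // Storing an I32 spills a full I64 word; the load must agree on I64,
    // otherwise a big-endian target reads the wrong half of the slot.
    assert_eq!(spillslot_ty(Ty::I32), spillslot_ty(Ty::I64));
}
```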
2021-04-15 21:39:14 +02:00
Andrew Brown
0acc1451ea x64: lower iabs.i64x2 using a single AVX512 instruction when possible (#2819)
* x64: add EVEX encoding mechanism

Also, includes an empty stub module for the VEX encoding.

* x64: lower abs.i64x2 to VPABSQ when available

* x64: refactor EVEX encodings to use `EvexInstruction`

This change replaces the `encode_evex` function with a builder-style struct, `EvexInstruction`. This approach clarifies the code, adds documentation, and results in slight speedups when benchmarked. (A simplified sketch of the builder shape follows below.)

* x64: rename encoding CodeSink to ByteSink
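
As referenced above, a simplified sketch of the builder-style shape
(hypothetical fields and encoding; not the actual `EvexInstruction`
implementation):

```rust
// A hypothetical, heavily simplified builder; real EVEX encoding has more
// state (masks, vector lengths, displacement scaling, ...), all elided.
#[derive(Default)]
struct EvexInstruction {
    w: bool,
    opcode: u8,
    reg: u8,
    rm: u8,
}

impl EvexInstruction {
    fn new() -> Self {
        Self::default()
    }
    fn w(mut self, w: bool) -> Self {
        self.w = w;
        self
    }
    fn opcode(mut self, op: u8) -> Self {
        self.opcode = op;
        self
    }
    fn reg(mut self, reg: u8) -> Self {
        self.reg = reg;
        self
    }
    fn rm(mut self, rm: u8) -> Self {
        self.rm = rm;
        self
    }
    // Push bytes into a sink; the EVEX payload bytes themselves are elided.
    fn encode(&self, sink: &mut Vec<u8>) {
        sink.push(0x62); // EVEX escape byte
        sink.push(self.opcode);
        sink.push(0xC0 | ((self.reg & 0x7) << 3) | (self.rm & 0x7)); // ModRM
    }
}

fn main() {
    let mut sink = Vec::new();
    EvexInstruction::new().w(true).opcode(0x1F).reg(1).rm(2).encode(&mut sink);
    println!("{:02x?}", sink);
}
```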
2021-04-15 11:53:58 -07:00
Ulrich Weigand
1243cea455 Update cap-std dependency to 0.13.9
This fixes a build failure on s390x.
2021-04-14 14:11:46 -07:00
Chris Fallin
36c667d58d Merge pull request #2837 from uweigand/outgoing-args
Add back support for accumulating outgoing arguments
2021-04-14 12:54:06 -07:00
Chris Fallin
fd4bfbe5a7 Merge pull request #2836 from uweigand/framesizefix
Fix frame size after unwind rework
2021-04-14 12:19:38 -07:00
Chris Fallin
1f21b32e99 Merge pull request #2838 from uweigand/optionalfp
Allow unwind support to work without a frame pointer
2021-04-14 10:58:51 -07:00
Chris Fallin
337cc47d2f Merge pull request #2840 from bnjbvr/fix-2839
cranelift: always spill i32 with i64 stores
2021-04-14 10:11:47 -07:00
Benjamin Bouvier
e7bced9512 cranelift: always spill i32 with i64 stores;
Fixes #2839. See the issue description and the comments in this commit for
details of what the fix is about.
2021-04-14 18:08:52 +02:00
Ulrich Weigand
5904c09682 Allow unwind support to work without a frame pointer
The patch extends the unwinder to support targets that do not need
to use a dedicated frame pointer register.  Specifically, the
changes include:

- Change the "fp" routine in the RegisterMapper to return an
  *optional* frame pointer register via Option<Register>.

- On targets that choose to not define a FP register via the above
  routine, the UnwindInst::DefineNewFrame operation no longer switches
  the CFA to be defined in terms of the FP.  (The operation still can
  be used to define the location of the clobber area.)

- In addition, on targets that choose not to define a FP register, the
  UnwindInst::PushFrameRegs operation is not supported.

- There is a new operation UnwindInst::StackAlloc that needs to be
  called on targets without FP whenever the stack pointer is updated.
  This causes the CFA offset to be adjusted accordingly.  (On
  targets with FP this operation is a no-op.)  A rough sketch of these
  changes follows below.
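
The sketch referenced above, with simplified, hypothetical types standing in
for the real `RegisterMapper` and `UnwindInst`:

```rust
// Simplified stand-ins for illustration; not the real cranelift types.
type Register = u16;

trait RegisterMapper {
    // `None` means "this target has no dedicated frame pointer register".
    fn fp(&self) -> Option<Register>;
}

enum UnwindInst {
    // Only supported when `fp()` returns `Some(..)`.
    PushFrameRegs { offset_upward_to_caller_sp: u32 },
    // Without an FP this no longer switches the CFA to be FP-based; it
    // still records where the clobber area lives (elided here).
    DefineNewFrame { offset_downward_to_clobbers: u32 },
    // New: on FP-less targets every SP adjustment must be reported so the
    // CFA offset can follow it; with an FP this is a no-op.
    StackAlloc { size: u32 },
}

fn process(inst: &UnwindInst, mapper: &dyn RegisterMapper, cfa_offset: &mut u32) {
    match inst {
        UnwindInst::StackAlloc { size } => {
            if mapper.fp().is_none() {
                *cfa_offset += size; // CFA stays SP-relative, so track SP
            }
        }
        UnwindInst::PushFrameRegs { .. } | UnwindInst::DefineNewFrame { .. } => {
            // FP-based handling elided.
        }
    }
}
```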
2021-04-14 15:32:31 +02:00
Ulrich Weigand
336c6369b4 Add back support for accumulating outgoing arguments
The unwind rework (commit 2d5db92a) removed support for the
feature to allow a target to allocate the space for outgoing
function arguments right in the prologue (originally added
via commit 80c2d70d).   This patch adds it back.
2021-04-14 13:51:16 +02:00
Ulrich Weigand
e3bb36ba77 Fix frame size after unwind rework
After the unwind rework (commit 2d5db92a) the space used to save
clobbered registers now lies between the nominal SP and the FP.
Therefore, the size of that space should now be included in the
frame size as reported by frame_size(), since this value is used
to compute the nominal_sp_to_fp offset.
2021-04-14 13:46:08 +02:00
Andrew Brown
45bee40f33 wasi-nn: use the newly-published Rust bindings for wasi-nn
The bindings are now published on [crates.io](https://crates.io/crates/wasi-nn) and have been moved to their [own repository](https://github.com/bytecodealliance/wasi-nn).
2021-04-13 16:00:06 -07:00
Andrew Brown
e9e4afe2c7 wasi-nn: use the MobileNet model instead of AlexNet
The MobileNet model is significantly smaller in size (14MB) than the AlexNet model (233MB); this change should reduce bandwidth used during CI.
2021-04-13 16:00:06 -07:00
Chris Fallin
27b3162f87 Merge pull request #2833 from abrown/2826
x64: fix Inst::store to understand all scalar types
2021-04-13 15:36:41 -07:00
Chris Fallin
8caac9ed79 Merge pull request #2823 from akirilov-arm/callee_saves
Cranelift AArch64: Improve the handling of callee-saved registers
2021-04-13 15:35:46 -07:00
Chris Fallin
f222802b7a Merge pull request #2828 from bjorn3/fix_srem_i8
Fix srem.{i8,i16}
2021-04-13 15:35:40 -07:00
Andrew Brown
6bdef48473 x64: refactor to use Inst::store during lowering
This refactoring replaces uses of `Inst::mov_r_m` with `Inst::store` to ensure there is only one code location to troubleshoot when generating store instructions for a specific type.
2021-04-13 13:09:07 -07:00
Andrew Brown
9b25b06d86 x64: store to all scalar sizes
Previously, `Inst::store` only understood a subset of the scalar types, which resulted in failures seen in #2826. This change allows `Inst::store` to generate instructions for all scalar widths (`8 | 16 | 32 | 64`) since all of these are supported in the emission code of `Inst::MovRM`.
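
Conceptually the dispatch now covers every scalar width; a simplified
illustration (a made-up helper, not the actual `Inst::store` lowering):

```rust
// Simplified illustration of width-based dispatch for integer stores;
// this is a made-up helper, not the actual lowering code.
fn store_mnemonic(bits: u8) -> &'static str {
    match bits {
        8 => "movb",
        16 => "movw",
        32 => "movl",
        64 => "movq",
        _ => panic!("unsupported scalar width: {}", bits),
    }
}

fn main() {
    for bits in [8u8, 16, 32, 64] {
        println!("{:>2}-bit store -> {}", bits, store_mnemonic(bits));
    }
}
```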
2021-04-13 12:38:35 -07:00
bjorn3
b272d4b7da Fix srem.{i8,i16} 2021-04-13 21:28:27 +02:00
Anton Kirilov
7248abd591 Cranelift AArch64: Improve the handling of callee-saved registers
SIMD & FP registers are now saved and restored in pairs, similarly
to general-purpose registers. Also, only the bottom 64 bits of the
registers are saved and restored (in case of non-Baldrdash ABIs),
which is the requirement from the Procedure Call Standard for the
Arm 64-bit Architecture.

As for the callee-saved general-purpose registers, if a procedure
needs to save and restore an odd number of them, it no longer uses
store and load pair instructions for the last register.

Copyright (c) 2021, Arm Limited.
2021-04-13 20:23:08 +01:00
Chris Fallin
8387bc0d76 Merge pull request #2830 from cfallin/pin-nightly
CI: pin nightly Rust version to limit breakages to explicit pinning updates.
2021-04-13 12:06:44 -07:00
Chris Fallin
908d47011f CI: pin nightly Rust version to limit breakages to explicit pinning updates. 2021-04-13 11:12:07 -07:00
Chris Fallin
67cc42d4c3 Merge pull request #2750 from bjorn3/anon_allocs
Support declaring anonymous functions and data objects
2021-04-12 12:11:11 -07:00
Nick Fitzgerald
2a32567871 Merge pull request #2821 from alexcrichton/faster-vmoffsets
Precompute fields in `VMOffsets`
2021-04-08 14:17:11 -07:00
Alex Crichton
18dd82ba7d Improve signature lookup happening during instantiation (#2818)
This commit is intended to be a perf improvement for instantiation of
modules with lots of functions. Previously the `lookup_shared_signature`
callback was showing up quite high in profiles as part of instantiation.

As some background, this callback is used to translate from a module's
`SignatureIndex` to a `VMSharedSignatureIndex` which the instance
stores. This callback is called for two reasons, one is to translate all
of the module's own types into `VMSharedSignatureIndex` for the purposes
of `call_indirect` (the translation of that loads from this table to
compare indices). The second reason is that a `VMCallerCheckedAnyfunc`
is prepared for all functions and this embeds a `VMSharedSignatureIndex`
inside of it.

The slow part today is that the lookup callback was called
once per function, and each lookup involved hashing a full
`WasmFuncType`. Our hash algorithm is still Rust's default SipHash,
which is quite slow, but either way we shouldn't need to re-hash each
signature every time we see it.

The fix applied in this commit is to change this lookup callback to an
`enum` where one variant is that there's a table to lookup from. This
table is a `PrimaryMap` which means that lookup is quite fast. The only
thing we need to do is to prepare the table ahead of time. Currently
this happens on the instantiation path because in my measurements the
creation of the table is quite fast compared to the rest of
instantiation. If this becomes an issue, though, we can look into
creating the table as part of `SigRegistry::register_module` and caching
it somewhere (I'm not entirely sure where but I'm sure we can figure it
out).

There's generally not a ton of efficiency around the `SigRegistry`
type. I'm hoping, though, that this fixes the next-lowest-hanging fruit in
terms of performance without complicating the implementation too much. I
tried a few variants and this change seemed like the best balance
between simplicity and a nice performance gain.

Locally I measured an improvement in instantiation time for a large-ish
module by reducing the time from ~3ms to ~2.6ms per instance.
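
A rough sketch of the callback-to-table change (names simplified and
hypothetical; the real code uses a `PrimaryMap` and the actual
wasmtime-runtime types):

```rust
// Hypothetical simplified types for illustration.
#[derive(Clone, Copy)]
struct SignatureIndex(usize);
#[derive(Clone, Copy)]
struct VMSharedSignatureIndex(u32);

// Before: always a callback that may hash a full `WasmFuncType` per call.
// After: instantiation can hand over a precomputed, index-based table.
enum SharedSignatures<'a> {
    Table(&'a [VMSharedSignatureIndex]),
    Callback(&'a dyn Fn(SignatureIndex) -> VMSharedSignatureIndex),
}

impl SharedSignatures<'_> {
    fn lookup(&self, idx: SignatureIndex) -> VMSharedSignatureIndex {
        match self {
            // O(1) array indexing, no hashing per function.
            SharedSignatures::Table(table) => table[idx.0],
            SharedSignatures::Callback(f) => f(idx),
        }
    }
}
```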
2021-04-08 15:04:18 -05:00
Alex Crichton
c91e14d83f Precompute fields in VMOffsets
This commit updates the implementation of `VMOffsets` to frontload all
checked arithmetic on construction of the `VMOffsets` which allows
eliding all checked arithmetic when accessing the fields of `VMOffsets`.
For testing and such this adds a new constructor as well from a new
`VMOffsetsFields` structure which is a clone of the old definition.

This should help speed up some profile hot spots I've been seeing, where
the checked arithmetic on field sizes was slowing down the various
accessors during instantiation (which uses `VMOffsets` to initialize
various fields of the `VMContext`).
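
A minimal sketch of the frontloading idea (hypothetical field names; the real
`VMOffsets` tracks many more regions):

```rust
// Input description, analogous to the old definition.
struct VMOffsetsFields {
    pointer_size: u8,
    num_defined_globals: u32,
}

// All checked arithmetic happens once, here, at construction time...
struct VMOffsets {
    globals_begin: u32,
    size: u32,
}

impl VMOffsets {
    fn new(fields: VMOffsetsFields) -> VMOffsets {
        let globals_begin = u32::from(fields.pointer_size)
            .checked_mul(2)
            .expect("overflow computing header size");
        let size = globals_begin
            .checked_add(
                fields
                    .num_defined_globals
                    .checked_mul(16)
                    .expect("overflow computing globals size"),
            )
            .expect("overflow computing total size");
        VMOffsets { globals_begin, size }
    }

    // ...so the hot accessors used during instantiation are plain reads.
    #[inline]
    fn vmctx_globals_begin(&self) -> u32 {
        self.globals_begin
    }
}
```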
2021-04-08 12:46:17 -07:00
Andrew Brown
8e495ac79d x64: match multiple ISA requirements before emitting
Because there are instructions that are present in more than one ISA feature set, we need to see if any of the ISA requirements match before emitting. This change includes the `VPABSQ` instruction as an example, which is present in both `AVX512F` and `AVX512VL`.
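
Conceptually the check becomes an any-of match over the instruction's possible
feature sets; a hedged sketch with made-up flag names:

```rust
// Hypothetical feature flags for illustration.
#[derive(Clone, Copy, PartialEq)]
enum IsaFeature {
    Avx512f,
    Avx512vl,
}

struct IsaFlags {
    enabled: Vec<IsaFeature>,
}

impl IsaFlags {
    // Emit only if *any* of the listed requirements is available; an
    // instruction like VPABSQ appears in more than one feature set.
    fn matches_any(&self, requirements: &[IsaFeature]) -> bool {
        requirements.iter().any(|r| self.enabled.contains(r))
    }
}

fn main() {
    let flags = IsaFlags { enabled: vec![IsaFeature::Avx512vl] };
    assert!(flags.matches_any(&[IsaFeature::Avx512f, IsaFeature::Avx512vl]));
}
```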
2021-04-08 10:30:39 -07:00
Alex Crichton
c77ea0c5c7 Add some more #[inline] annotations for trivial functions (#2817)
Looking at some profiles these or their related functions were all
showing up, so this commit adds `#[inline]` to allow cross-crate
inlining by default.
2021-04-08 12:23:54 -05:00
Alex Crichton
5c4c03d278 Don't hash for the cache if it's disabled (#2816)
This fixes an issue where, even if the wasmtime cache was disabled, we
were still calculating the sha256 of modules for the hash key. This hash
was then simply discarded because the cache was disabled!
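
The shape of the fix, sketched with placeholder types:

```rust
// Placeholder types for illustration.
struct CacheConfig {
    enabled: bool,
}
struct Module;

fn compile(wasm: &[u8], cache: &CacheConfig) -> Module {
    if cache.enabled {
        // Only pay for hashing when the result can actually be used as a key.
        let _key = expensive_sha256(wasm);
        // ... consult the cache with `_key` (elided) ...
    }
    // Fall through to a normal compilation (elided).
    Module
}

fn expensive_sha256(_wasm: &[u8]) -> [u8; 32] {
    [0; 32] // stand-in; the real code hashes the entire module
}
```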
2021-04-08 11:48:24 -05:00
Alex Crichton
812c8ef368 Update wast to 35.0.2 (#2815)
Fixes a panic or two in translation found by fuzzing.
2021-04-08 11:45:10 -05:00
Peter Huene
2ca97ed7e4 Merge pull request #2814 from fitzgen/inline-vm-offsets
wasmtime-environ: Mark all VM offset functions as `#[inline]`
2021-04-07 18:16:29 -07:00
Peter Huene
45a500701f Merge pull request #2811 from peterhuene/improve-store-registration
Refactor store frame information.
2021-04-07 18:16:10 -07:00
Peter Huene
ad9fa11d48 Code review feedback.
* Remove `once-cell` dependency.
* Remove function address `BTreeMap` from `CompiledModule` in favor of binary
  searching finished functions directly.
* Use `with_capacity` when populating `CompiledModule` finished functions and
  trampolines.
2021-04-07 16:37:04 -07:00
Nick Fitzgerald
ed31f28158 wasmtime-environ: Mark all VM offset functions as #[inline]
Otherwise they won't get inlined across crates unless we enable LTO, and much of
the usage of these functions is across crates (e.g. from the `wasmtime-runtime`
crate).
2021-04-07 16:17:26 -07:00
Alex Crichton
e43d94033f Document guidance around multithreading and Wasmtime (#2812)
* Document guidance around multithreading and Wasmtime

This commit writes a page of documentation for the Wasmtime book to
serve as guidance for embedders looking to add multithreading with
Wasmtime support. As always with any safe Rust API this reading is
optional because you can't mis-use Wasmtime without `unsafe`, but I'm
hoping that this documentation can serve as a point of reference for
folks who want to add multithreading but are confused/annoyed that
Wasmtime's types do not implement the `Send` and `Sync` traits.

Closes #793

* I can type
2021-04-07 16:34:07 -05:00
Peter Huene
875cb92cf0 Refactor store frame information.
This commit refactors the store frame information to eliminate the copying of
data out from `CompiledModule`.

It also moves the population of a `BTreeMap` out of the frame information and
into `CompiledModule` where it is only ever calculated once rather than at
every new module instantiation into a `Store`. The map is also lazy-initialized
so the cost of populating the map is incurred only when a trap occurs.

This should help improve instantiation time of modules with a large number of
functions and functions with lots of instructions.
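
A small sketch of the lazy-initialization idea, using std's `OnceLock` purely
for illustration (the actual crate and map details differ):

```rust
use std::collections::BTreeMap;
use std::sync::OnceLock;

struct FunctionInfo;

struct CompiledModule {
    // Built once, on first trap, instead of on every instantiation.
    func_map: OnceLock<BTreeMap<usize, FunctionInfo>>,
}

impl CompiledModule {
    fn func_by_pc(&self, pc: usize) -> Option<&FunctionInfo> {
        let map = self.func_map.get_or_init(|| {
            // Populate the PC -> function map here (elided); this cost is
            // only paid the first time a trap needs symbolication.
            BTreeMap::new()
        });
        map.range(..=pc).next_back().map(|(_, info)| info)
    }
}
```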
2021-04-07 12:47:04 -07:00
Alex Crichton
195bf0e29a Fully support multiple returns in Wasmtime (#2806)
* Fully support multiple returns in Wasmtime

For quite some time now Wasmtime has "supported" multiple return values,
but only in the most bare-bones of ways. Up until recently you couldn't get
a typed version of functions with multiple return values, and never have
you been able to use `Func::wrap` with functions that return multiple
values. Even recently, where `Func::typed` can call functions that return
multiple values, it uses a double indirection by calling a trampoline
which calls the real function.

The underlying reason for this lack of support is that cranelift's ABI
for returning multiple values is not possible to write in Rust. For
example if a wasm function returns two `i32` values there is no Rust (or
C!) function you can write to correspond to that. This commit, however,
fixes that.

This commit adds two new ABIs to Cranelift: `WasmtimeSystemV` and
`WasmtimeFastcall`. The intention is that these Wasmtime-specific ABIs
match their corresponding ABI (e.g. `SystemV` or `WindowsFastcall`) for
everything *except* how multiple values are returned. For multiple
return values we simply define our own version of the ABI, which Wasmtime
implements: for N return values, the first is returned as if the function
returned only that value, and the latter N-1 return values are returned
via an out-ptr that's the last parameter to the function.

These custom ABIs provide the ability for Wasmtime to bind these in
Rust, meaning that `Func::wrap` can now wrap functions that return
multiple values and `Func::typed` no longer uses trampolines when
calling functions that return multiple values. Although there's lots of
internal changes there's no actual changes in the API surface area of
Wasmtime, just a few more impls of more public traits which means that
more types are supported in more places!

Another change made with this PR is a consolidation of how the ABI of
each function in a wasm module is selected. The native `SystemV` ABI,
for example, is more efficient at returning multiple values than the
wasmtime version of the ABI (since more things are in more registers).
To continue to take advantage of this Wasmtime will now classify some
functions in a wasm module with the "fast" ABI. Only functions that are
not reachable externally from the module are classified with the fast
ABI (e.g. those not exported, not used in tables, and not used with
`ref.func`).
This should enable purely internal functions of modules to have a faster
calling convention than those which might be exposed to Wasmtime itself.

Closes #1178
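
As an illustration of why these ABIs are now bindable from Rust: under the
scheme above, a wasm function returning `(i32, f64, i64)` corresponds roughly
to a host signature like the following (a hedged sketch, not Wasmtime's actual
generated signature):

```rust
// First return value comes back in the normal return register; the
// remaining N-1 returns are written through a retptr passed as the
// last parameter.
unsafe extern "C" fn multi_return_example(
    _vmctx: *mut u8,        // stand-in for the real VMContext pointers
    param: i32,
    retptr: *mut MultiRets, // out-pointer for returns 2..N
) -> i32 {
    unsafe {
        (*retptr).ret1 = param as f64 * 2.0;
        (*retptr).ret2 = param as i64 + 1;
    }
    param // return value 1, as if the function returned only this
}

#[repr(C)]
struct MultiRets {
    ret1: f64,
    ret2: i64,
}
```

This is the kind of signature `Func::wrap` can now bind directly, which is
what removes the trampoline double indirection mentioned above.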

* Tweak some names and add docs

* "fix" lightbeam compile

* Fix TODO with dummy environ

* Unwind info is a property of the target, not the ABI

* Remove lightbeam unused imports

* Attempt to fix arm64

* Document new ABIs aren't stable

* Fix filetests to use the right target

* Don't always do 64-bit stores with cranelift

This was overwriting upper bits when 32-bit registers were being stored
into return values, so fix the code inline to do a sized store instead
of a one-size-fits-all store.

* At least get tests passing on the old backend

* Fix a typo

* Add some filetests with mixed abi calls

* Get `multi` example working

* Fix doctests on old x86 backend

* Add a mixture of wasmtime/system_v tests
2021-04-07 12:34:26 -05:00
Benjamin Bouvier
7588565078 Tweaks some tests for Mac aarch64
- some tests don't pass because of bad interactions with the system's
libunwind; ignore them for now.
- the page size on mac aarch64 is 16K, not 4K; tweak some tests which
were expecting 4K or multiples of 4K pages to use a multiple of host page size
instead.
- a cranelift-native test needed an update for the new calling convention.
2021-04-07 14:54:50 +02:00