wasmtime

Author	SHA1	Message	Date
Chris Fallin	207da989ac	Merge pull request #2862 from akirilov-arm/simd_boolean Enable the simd_boolean test for AArch64	2021-04-27 21:29:18 -07:00
Anton Kirilov	480670e17f	Enable the simd_boolean test for AArch64 Also, enable the simd_i64x2_arith2 test because it doesn't need any code changes. Copyright (c) 2021, Arm Limited.	2021-04-27 20:19:51 +01:00
Chris Fallin	b89c959e4a	Merge pull request #2854 from uweigand/debug-endian debug: Support big-endian architectures	2021-04-27 10:27:22 -07:00
Alex Crichton	8384f3a347	Bring back `Module::deserialize` (#2858) * Bring back `Module::deserialize` I thought I was being clever suggesting that `Module::deserialize` was removed from #2791 by funneling all module constructors into `Module::new`. As our studious fuzzers have found, though, this means that `Module::new` is not currently safe to pass arbitrary user-defined input into. Now one might pretty reasonably expect to be able to do that, however, being a WebAssembly engine and all. This PR as a result separates the `deserialize` part of `Module::new` back into `Module::deserialize`. This means that binary blobs created with `Module::serialize` and `Engine::precompile_module` will need to be passed to `Module::deserialize` to "rehydrate" them back into a `Module`. This restores the property that it should be safe to pass arbitrary input to `Module::new` since it's always expected to be a wasm module. This also means that fuzzing will no longer attempt to fuzz `Module::deserialize` which isn't something we want to do anyway. * Fix an example * Mark `Module::deserialize` as `unsafe`	2021-04-27 10:55:12 -05:00
Peter Huene	4a830b1159	Merge pull request #2857 from workingjubilee/byte-less Factor out byteorder in cranelift	2021-04-23 13:42:39 -07:00
Jubilee Young	a8c956ede1	Factor out byteorder in cranelift This removes an existing dependency on the byteorder crate in favor of using std equivalents directly. While not an issue for wasmtime per se, cranelift is now part of the critical path of building and testing Rust, and minimizing dependencies, even small ones, can help reduce the time and bandwidth required.	2021-04-23 12:05:18 -07:00
StackDoubleFlow	9637bc5a09	Fix cranelift `Module` and `ObjectModule` docs links (#2852)	2021-04-21 06:29:02 -07:00
Ulrich Weigand	801358333d	debug: Support big-endian architectures This fixes some hard-coded assumptions in the debug crate that the native ELF files being accessed are little-endian; specifically in create_gdbjit_image as well as in emit_dwarf. In addition, data in WebAssembly memory always uses little-endian byte order. Therefore, if the native architecture is big-endian, all references to base types need to be marked as little-endian using the DW_AT_endianity attribute, so that the debugger will be able to correctly access them.	2021-04-21 14:14:59 +02:00
Alex Crichton	196bcec6cf	Process declared element segments for "possibly exported funcs" (#2851) Now that we're using "possibly exported" as an impactful decision for codegen (which trampolines to generate and which ABI a function has) it's important that we calculate this property of a wasm function correctly! Previously Wasmtime forgot to process "declared" elements apart from active/passive element segments, but this updates Wasmtime to ensure that these entries are processed and all the functions contained within are flagged as "possibly exported". Closes #2850	2021-04-20 16:52:51 -05:00
Alex Crichton	200d7f1df6	Delete signature for no-longer-present function (#2849) Accidental omission from #2736	2021-04-19 20:49:33 -05:00
Alex Crichton	193551a8d6	Optimize `table.init` instruction and instantiation (#2847) * Optimize `table.init` instruction and instantiation This commit optimizes table initialization as part of instance instantiation and also applies the same optimization to the `table.init` instruction. One part of this commit is to remove some preexisting duplication between instance instantiation and the `table.init` instruction itself. After this, the actual implementation of `table.init` is optimized to effectively have fewer bounds checks in fewer places and have a much tighter loop for instantiation. A big fallout from this change is that memory/table initializer offsets are now stored as `u32` instead of `usize` to remove a few casts in a few places. This ended up requiring moving some overflow checks that happened in parsing to later in the code itself because otherwise the wrong spec test errors are emitted during testing. I've tried to trace where these can possibly overflow and I think that I managed to get everything. In a local synthetic test with an empty module containing a single 80,000-element initializer, this improves total instantiation time by 4x (562us => 141us) * Review comments	2021-04-19 18:44:48 -05:00
Nick Fitzgerald	2864bb4a0f	Merge pull request #2848 from fitzgen/map-or-in-table-element-into-raw Use `map_or` instead of `map` and `unwrap_or` in `TableElement::into_raw`	2021-04-19 15:45:12 -07:00
Nick Fitzgerald	8507eb7708	Use `map_or` instead of `map` and `unwrap_or` in `TableElement::into_raw`	2021-04-19 14:18:55 -07:00
Peter Huene	f12b4c467c	Add resource limiting to the Wasmtime API. (#2736) * Add resource limiting to the Wasmtime API. This commit adds a `ResourceLimiter` trait to the Wasmtime API. When used in conjunction with `Store::new_with_limiter`, this can be used to monitor and prevent WebAssembly code from growing linear memories and tables. This is particularly useful when hosts need to take into account host resource usage to determine if WebAssembly code can consume more resources. A simple `StaticResourceLimiter` is also included with these changes that will simply limit the size of linear memories or tables for all instances created in the store based on static values. * Code review feedback. * Implemented `StoreLimits` and `StoreLimitsBuilder`. * Moved `max_instances`, `max_memories`, `max_tables` out of `Config` and into `StoreLimits`. * Moved storage of the limiter in the runtime into `Memory` and `Table`. * Made `InstanceAllocationRequest` use a reference to the limiter. * Updated docs. * Made `ResourceLimiterProxy` generic to remove a level of indirection. * Fixed the limiter not being used for `wasmtime::Memory` and `wasmtime::Table`. * Code review feedback and bug fix. * `Memory::new` now returns `Result<Self>` so that an error can be returned if the initial requested memory exceeds any limits placed on the store. * Changed an `Arc` to `Rc` as the `Arc` wasn't necessary. * Removed `Store` from the `ResourceLimiter` callbacks. Custom resource limiter implementations are free to capture any context they want, so no need to unnecessarily store a weak reference to `Store` from the proxy type. * Fixed a bug in the pooling instance allocator where an instance would be leaked from the pool. Previously, this would only have happened if the OS was unable to make the necessary linear memory available for the instance. With these changes, however, the instance might not be created due to limits placed on the store. We now properly deallocate the instance on error. * Added more tests, including one that covers the fix mentioned above. * Code review feedback. * Add another memory to `test_pooling_allocator_initial_limits_exceeded` to ensure a partially created instance is successfully deallocated. * Update some doc comments for better documentation of `Store` and `ResourceLimiter`.	2021-04-19 09:19:20 -05:00
Taiki Endo	52b1166918	Update iter-enum to 1 (#2846)	2021-04-19 09:08:15 -05:00
Peter Huene	6b6a6463a2	Merge pull request #2842 from peterhuene/engine-sig-registry Additional performance improvements for module instantiation.	2021-04-16 13:59:01 -07:00
Peter Huene	ef2ad6375d	Consolidate module construction. This commit adds `Module::from_parts` as an internal constructor that shares the implementation between `Module::from_binary` and module deserialization.	2021-04-16 12:34:38 -07:00
Peter Huene	dfab471ce5	Remove unused file. This file hasn't been used for a while and was mistakenly not deleted.	2021-04-16 12:30:14 -07:00
Peter Huene	b775b68cfb	Make module information lookup from runtime safe. This commit uses a two-phase lookup of stack map information from modules rather than giving back raw pointers to stack maps. First the runtime looks up information about a module from a pc value, which returns an `Arc` it keeps a reference on while completing the stack map lookup. Second it then queries the module information for the stack map from a pc value, getting a reference to the stack map (which is now safe because of the `Arc` held by the runtime).	2021-04-16 12:30:10 -07:00
Peter Huene	6ac1321162	Minor corrections with latest changes.	2021-04-16 11:08:22 -07:00
Peter Huene	726a936474	Remove ArcModuleCode as it is no longer used.	2021-04-16 11:08:22 -07:00
Peter Huene	510fc71728	Code review feedback. * Make `FunctionInfo` public and `CompiledModule::func_info` return it. * Make the `StackMapLookup` trait unsafe. * Add comments for the purpose of `EngineHostFuncs`. * Rework ownership model of shared signatures: `SignatureCollection` in conjunction with `SignatureRegistry` is now used so that the `Engine`, `Store`, and `Module` don't need to worry about unregistering shared signatures. * Implement `Func::param_arity` and `Func::result_arity` in terms of `Func::ty`. * Make looking up a trampoline with the module registry more efficient by doing a binary search on the function's starting PC value for the owning module and then looking up the trampoline with only that module. * Remove reference to the shared signatures from `GlobalRegisteredModule`.	2021-04-16 11:08:21 -07:00
Peter Huene	ea72c621f0	Remove the stack map registry. This commit removes the stack map registry and instead uses the existing information from the store's module registry to lookup stack maps. A trait is now used to pass the lookup context to the runtime, implemented by `Store` to do the lookup. With this change, module registration in `Store` is now entirely limited to inserting the module into the module registry.	2021-04-16 11:08:21 -07:00
Peter Huene	a2466b3c23	Move the signature registry into `Engine`. This commit moves the shared signature registry out of `Store` and into `Engine`. This helps eliminate work that was performed whenever a `Module` was instantiated into a `Store`. Now a `Module` is registered with the shared signature registry upon creation, storing the mapping from the module's signature index space to the shared index space. This also refactors the "frame info" registry into a general purpose "module registry" that is used to look up trap information, signature information, and (soon) stack map information.	2021-04-16 11:06:44 -07:00
Benjamin Bouvier	f26449f03d	Merge pull request #2845 from bnjbvr/fix-unwind-win64-old-backend Generate unwind information on Win64 with the old backend	2021-04-16 18:59:19 +02:00
Benjamin Bouvier	ba73b458b8	Introduce a new API that allows notifying that a Store has moved to a new thread (#2822) * Introduce a new API that allows notifying that a Store has moved to a new thread * Add backlink to documentation, and mention the new API in the multithreading doc	2021-04-16 11:15:35 -05:00
Benjamin Bouvier	8ab3511b3b	Generate unwind information on Win64 with the old backend Following the new ABI introduced for efficient support of multiple return values, the old-backend test for generating unwind information was incomplete, resulting in no unwind information being generated and traps not being correctly caught by the runtime.	2021-04-16 18:05:49 +02:00
Benjamin Bouvier	82f6556bc2	Merge pull request #2758 from bnjbvr/revert-log cranelift: Use a deferred display mechanism instead of `log_enabled!`	2021-04-16 11:49:44 +02:00
Benjamin Bouvier	50aa645769	cranelift: use a deferred display wrapper for logging the vcode's IR	2021-04-16 10:27:19 +02:00
Chris Fallin	03077e0de9	Merge pull request #2843 from uweigand/spillslot-fix cranelift: Fix spillslot regression on big-endian platforms	2021-04-15 13:28:33 -07:00
Ulrich Weigand	10efe8e780	cranelift: Fix spillslot regression on big-endian platforms PR 2840 changed the store_spillslot routine to always store integer registers in full word size to a spill slot. However, the load_spillslot routine was not updated, which may cause the contents to be reloaded in a different type. On big-endian systems this will fetch wrong data. Fixed by using the same type override in load_spillslot.	2021-04-15 21:39:14 +02:00
Andrew Brown	0acc1451ea	x64: lower iabs.i64x2 using a single AVX512 instruction when possible (#2819) * x64: add EVEX encoding mechanism Also, includes an empty stub module for the VEX encoding. * x64: lower abs.i64x2 to VPABSQ when available * x64: refactor EVEX encodings to use `EvexInstruction` This change replaces the `encode_evex` function with a builder-style struct, `EvexInstruction`. This approach clarifies the code, adds documentation, and results in slight speedups when benchmarked. * x64: rename encoding CodeSink to ByteSink	2021-04-15 11:53:58 -07:00
Ulrich Weigand	1243cea455	Update cap-std dependency to 0.13.9 This fixes a build failure on s390x.	2021-04-14 14:11:46 -07:00
Chris Fallin	36c667d58d	Merge pull request #2837 from uweigand/outgoing-args Add back support for accumulating outgoing arguments	2021-04-14 12:54:06 -07:00
Chris Fallin	fd4bfbe5a7	Merge pull request #2836 from uweigand/framesizefix Fix frame size after unwind rework	2021-04-14 12:19:38 -07:00
Chris Fallin	1f21b32e99	Merge pull request #2838 from uweigand/optionalfp Allow unwind support to work without a frame pointer	2021-04-14 10:58:51 -07:00
Chris Fallin	337cc47d2f	Merge pull request #2840 from bnjbvr/fix-2839 cranelift: always spill i32 with i64 stores	2021-04-14 10:11:47 -07:00
Benjamin Bouvier	e7bced9512	cranelift: always spill i32 with i64 stores; Fixes #2839. See also the issue description and comments in this commit for details of what the fix is about here.	2021-04-14 18:08:52 +02:00
Ulrich Weigand	5904c09682	Allow unwind support to work without a frame pointer The patch extends the unwinder to support targets that do not need to use a dedicated frame pointer register. Specifically, the changes include: - Change the "fp" routine in the RegisterMapper to return an optional frame pointer register via Option<Register>. - On targets that choose to not define a FP register via the above routine, the UnwindInst::DefineNewFrame operation no longer switches the CFA to be defined in terms of the FP. (The operation still can be used to define the location of the clobber area.) - In addition, on targets that choose not to define a FP register, the UnwindInst::PushFrameRegs operation is not supported. - There is a new operation UnwindInst::StackAlloc that needs to be called on targets without FP whenever the stack pointer is updated. This causes the CFA offset to be adjusted accordingly. (On targets with FP this operation is a no-op.)	2021-04-14 15:32:31 +02:00
Ulrich Weigand	336c6369b4	Add back support for accumulating outgoing arguments The unwind rework (commit `2d5db92a`) removed support for the feature to allow a target to allocate the space for outgoing function arguments right in the prologue (originally added via commit `80c2d70d`). This patch adds it back.	2021-04-14 13:51:16 +02:00
Ulrich Weigand	e3bb36ba77	Fix frame size after unwind rework After the unwind rework (commit `2d5db92a`) the space used to save clobbered registers now lies between the nominal SP and the FP. Therefore, the size of that space should now be included in the frame size as reported by frame_size(), since this value is used to compute the nominal_sp_to_fp offset.	2021-04-14 13:46:08 +02:00
Andrew Brown	45bee40f33	wasi-nn: use the newly-published Rust bindings for wasi-nn The bindings are now published on [crates.io](https://crates.io/crates/wasi-nn) and have been moved to their [own repository](https://github.com/bytecodealliance/wasi-nn).	2021-04-13 16:00:06 -07:00
Andrew Brown	e9e4afe2c7	wasi-nn: use the MobileNet model instead of AlexNet The MobileNet model is significantly smaller in size (14MB) than the AlexNet model (233MB); this change should reduce bandwidth used during CI.	2021-04-13 16:00:06 -07:00
Chris Fallin	27b3162f87	Merge pull request #2833 from abrown/2826 x64: fix Inst::store to understand all scalar types	2021-04-13 15:36:41 -07:00
Chris Fallin	8caac9ed79	Merge pull request #2823 from akirilov-arm/callee_saves Cranelift AArch64: Improve the handling of callee-saved registers	2021-04-13 15:35:46 -07:00
Chris Fallin	f222802b7a	Merge pull request #2828 from bjorn3/fix_srem_i8 Fix srem.{i8,i16}	2021-04-13 15:35:40 -07:00
Andrew Brown	6bdef48473	x64: refactor to use Inst::store during lowering This re-factoring replaces uses of `Inst::mov_r_m` with `Inst::store` to ensure there is only one code location to troubleshoot when generating store instructions for a specific type.	2021-04-13 13:09:07 -07:00
Andrew Brown	9b25b06d86	x64: store to all scalar sizes Previously, `Inst::store` only understood a subset of the scalar types, which resulted in failures seen in #2826. This change allows `Inst::store` to generate instructions for all scalar widths (`8 | 16 | 32 | 64`) since all of these are supported in the emission code of `Inst::MovRM`.	2021-04-13 12:38:35 -07:00
bjorn3	b272d4b7da	Fix srem.{i8,i16}	2021-04-13 21:28:27 +02:00
Anton Kirilov	7248abd591	Cranelift AArch64: Improve the handling of callee-saved registers SIMD & FP registers are now saved and restored in pairs, similarly to general-purpose registers. Also, only the bottom 64 bits of the registers are saved and restored (in case of non-Baldrdash ABIs), which is the requirement from the Procedure Call Standard for the Arm 64-bit Architecture. As for the callee-saved general-purpose registers, if a procedure needs to save and restore an odd number of them, it no longer uses store and load pair instructions for the last register. Copyright (c) 2021, Arm Limited.	2021-04-13 20:23:08 +01:00
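
Commit a8c956ede1 above replaces the byteorder crate with std equivalents. As a rough illustration of what such a replacement can look like (a minimal sketch only: `read_u32_le` and `write_u32_le` are hypothetical helper names, not functions from the cranelift sources), the standard library's `from_le_bytes` and `to_le_bytes` cover the common fixed-width cases:

```rust
// Hypothetical helpers for illustration; not taken from cranelift.

/// Read a little-endian u32 from the first four bytes of `bytes`.
/// Returns `None` if fewer than four bytes are available.
fn read_u32_le(bytes: &[u8]) -> Option<u32> {
    let mut head = [0u8; 4];
    head.copy_from_slice(bytes.get(..4)?);
    Some(u32::from_le_bytes(head))
}

/// Append `value` to `out` in little-endian byte order.
fn write_u32_le(out: &mut Vec<u8>, value: u32) {
    out.extend_from_slice(&value.to_le_bytes());
}

fn main() {
    let mut buf = Vec::new();
    write_u32_le(&mut buf, 0x1234_5678);
    // The least significant byte comes first in little-endian order.
    assert_eq!(buf, vec![0x78u8, 0x56, 0x34, 0x12]);
    assert_eq!(read_u32_le(&buf), Some(0x1234_5678));
    // A short input is rejected instead of panicking.
    assert_eq!(read_u32_le(&buf[..3]), None);
}
```

The same pattern applies to the other integer widths and to big-endian data via `from_be_bytes` and `to_be_bytes`.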

8215 Commits