wasmtime

Author	SHA1	Message	Date
Alex Crichton	e8aa7bb53b	Reimplement how unwind information is stored (#3180 ) * Reimplement how unwind information is stored This commit is a major refactoring of how unwind information is stored after compilation of a function has finished. Previously we would store the raw `UnwindInfo` as a result of compilation and this would get serialized/deserialized alongside the rest of the ELF object that compilation creates. Whenever functions were registered with `CodeMemory` this would also result in registering unwinding information dynamically at runtime, which in the case of Unix, for example, would dynamically created FDE/CIE entries on-the-fly. Eventually I'd like to support compiling Wasmtime without Cranelift, but this means that `UnwindInfo` wouldn't be easily available to decode into and create unwinding information from. To solve this I've changed the ELF object created to have the unwinding information encoded into it ahead-of-time so loading code into memory no longer needs to create unwinding tables. This change has two different implementations for Windows/Unix: * On Windows the implementation was much easier. The unwinding information on Windows is already stored after the function itself in the text section. This was actually slightly duplicated in object building and in code memory allocation. Now the object building continues to do the same, recording unwinding information after functions, and code memory no longer manually tracks this. Additionally Wasmtime will emit a special custom section in the object file with unwinding information which is the list of `RUNTIME_FUNCTION` structures that `RtlAddFunctionTable` expects. This means that the object file has all the information precompiled into it and registration at runtime is simply passing a few pointers around to the runtime. * Unix was a little bit more difficult than Windows. Today a `.eh_frame` section is created on-the-fly with offsets in FDEs specified as the absolute address that functions are loaded at. This absolute address hindered the ability to precompile the FDE into the object file itself. I've switched how addresses are encoded, though, to using `DW_EH_PE_pcrel` which means that FDE addresses are now specified relative to the FDE itself. This means that we can maintain a fixed offset between the `.eh_frame` loaded in memory and the beginning of code memory. When doing so this enables precompiling the `.eh_frame` section into the object file and at runtime when loading an object no further construction of unwinding information is needed. The overall result of this commit is that unwinding information is no longer stored in its cranelift-data-structure form on disk. This means that this unwinding information format is only present during compilation, which will make it that much easier to compile out cranelift in the future. This commit also significantly refactors `CodeMemory` since the way unwinding information is handled is not much different from before. Previously `CodeMemory` was suitable for incrementally adding more and more functions to it, but nowadays a `CodeMemory` either lives per module (in which case all functions are known up front) or it's created once-per-`Func::new` with two trampolines. In both cases we know all functions up front so the functionality of incrementally adding more and more segments is no longer needed. This commit removes the ability to add a function-at-a-time in `CodeMemory` and instead it can now only load objects in their entirety. A small helper function is added to build a small object file for trampolines in `Func::new` to handle allocation there. Finally, this commit also folds the `wasmtime-obj` crate directly into the `wasmtime-cranelift` crate and its builder structure to be more amenable to this strategy of managing unwinding tables. It is not intentional to have any real functional change as a result of this commit. This might accelerate loading a module from cache slightly since less work is needed to manage the unwinding information, but that's just a side benefit from the main goal of this commit which is to remove the dependence on cranelift unwinding information being available at runtime. * Remove isa reexport from wasmtime-environ * Trim down reexports of `cranelift-codegen` Remove everything non-essential so that only the bits which will need to be refactored out of cranelift remain. * Fix debug tests * Review comments	2021-08-17 17:14:18 -05:00
Nick Fitzgerald	9311c38f7e	Merge pull request #3192 from alexcrichton/no-comp-dir Don't require `DW_AT_comp_dir` for debuginfo	2021-08-17 14:43:35 -07:00
Alex Crichton	0642e62f16	Use wasm-smith to canonicalize NaN in differential fuzzing (#3195 ) * Update wasm-smith to 0.7.0 * Canonicalize NaN with wasm-smith for differential fuzzing This then also enables floating point executing in wasmi in addition to the spec interpreter. With NaN canonicalization at the wasm level this means that we should be producing deterministic results between Wasmtime and these alternative implementations.	2021-08-17 11:42:22 -05:00
Alex Crichton	bd47a74dab	Always call the resource limiter for memory allocations (#3189 ) * Always call the resource limiter for memory allocations Previously the memory64 support meant that sometimes we wouldn't call the limiter because the calculation for the minimum size requested would overflow. Instead Wasmtime now wraps the minimum size in something a bit smaller than the address space to inform the limiter, which should guarantee that although the limiter is called with "incorrect" information it's effectively correct and is allowed a pass to learn that a massive memory was requested. This was found by the fuzzers where a request for the absolute maximal size of 64-bit memory (e.g. the entire 64-bit address space) didn't actually invoke the limiter which means that we mis-classified an instantiation error and didn't realize that it was an OOM. * Add a test	2021-08-16 12:49:56 -05:00
Alex Crichton	1bdafbf226	Don't require `DW_AT_comp_dir` for debuginfo I'm not too well-versed in this area of debuginfo, but I think this should address #3184 where it appears not all compilers emit `DW_AT_comp_dir`. This seems to match the default behavior of `gimli` when it maps an existing line program to a new line program as well (choosing an empty name for the compilation directory). Closes #3184	2021-08-16 10:48:10 -07:00
Alex Crichton	0313e30d76	Remove dependency on `TargetIsa` from Wasmtime crates (#3178 ) This commit started off by deleting the `cranelift_codegen::settings` reexport in the `wasmtime-environ` crate and then basically played whack-a-mole until everything compiled again. The main result of this is that the `wasmtime-` family of crates have generally less of a dependency on the `TargetIsa` trait and type from Cranelift. While the dependency isn't entirely severed yet this is at least a significant start. This commit is intended to be largely refactorings, no functional changes are intended here. The refactorings are: A `CompilerBuilder` trait has been added to `wasmtime_environ` which server as an abstraction used to create compilers and configure them in a uniform fashion. The `wasmtime::Config` type now uses this instead of cranelift-specific settings. The `wasmtime-jit` crate exports the ability to create a compiler builder from a `CompilationStrategy`, which only works for Cranelift right now. In a cranelift-less build of Wasmtime this is expected to return a trait object that fails all requests to compile. * The `Compiler` trait in the `wasmtime_environ` crate has been souped up with a number of methods that Wasmtime and other crates needed. * The `wasmtime-debug` crate is now moved entirely behind the `wasmtime-cranelift` crate. * The `wasmtime-cranelift` crate is now only depended on by the `wasmtime-jit` crate. * Wasm types in `cranelift-wasm` no longer contain their IR type, instead they only contain the `WasmType`. This is required to get everything to align correctly but will also be required in a future refactoring where the types used by `cranelift-wasm` will be extracted to a separate crate. * I moved around a fair bit of code in `wasmtime-cranelift`. * Some gdb-specific jit-specific code has moved from `wasmtime-debug` to `wasmtime-jit`.	2021-08-16 09:55:39 -05:00
Alex Crichton	e9f33fc618	Move all trampoline compilation to `wasmtime-cranelift` (#3176 ) * Move all trampoline compilation to `wasmtime-cranelift` This commit moves compilation of all the trampolines used in wasmtime behind the `Compiler` trait object to live in `wasmtime-cranelift`. The long-term goal of this is to enable depending on cranelift only from the `wasmtime-cranelift` crate, so by moving these dependencies we should make that a little more flexible. * Fix windows build	2021-08-12 16:58:21 -05:00
Alex Crichton	2da1b9d375	Delete unused code in `wasmtime-obj` (#3179 ) I believe this was likely used at some point historically, but nowadays this code isn't used so let's delete it.	2021-08-12 13:28:00 -05:00
Alex Crichton	e0c8961333	Add memory64 support to the Wasmtime CLI and C API (#3182 ) Accidentally forgotten from #3153!	2021-08-12 12:33:57 -05:00
Alex Crichton	e68aa99588	Implement the memory64 proposal in Wasmtime (#3153 ) * Implement the memory64 proposal in Wasmtime This commit implements the WebAssembly [memory64 proposal][proposal] in both Wasmtime and Cranelift. In terms of work done Cranelift ended up needing very little work here since most of it was already prepared for 64-bit memories at one point or another. Most of the work in Wasmtime is largely refactoring, changing a bunch of `u32` values to something else. A number of internal and public interfaces are changing as a result of this commit, for example: * Acessors on `wasmtime::Memory` that work with pages now all return `u64` unconditionally rather than `u32`. This makes it possible to accommodate 64-bit memories with this API, but we may also want to consider `usize` here at some point since the host can't grow past `usize`-limited pages anyway. * The `wasmtime::Limits` structure is removed in favor of minimum/maximum methods on table/memory types. * Many libcall intrinsics called by jit code now unconditionally take `u64` arguments instead of `u32`. Return values are `usize`, however, since the return value, if successful, is always bounded by host memory while arguments can come from any guest. * The `heap_addr` clif instruction now takes a 64-bit offset argument instead of a 32-bit one. It turns out that the legalization of `heap_addr` already worked with 64-bit offsets, so this change was fairly trivial to make. * The runtime implementation of mmap-based linear memories has changed to largely work in `usize` quantities in its API and in bytes instead of pages. This simplifies various aspects and reflects that mmap-memories are always bound by `usize` since that's what the host is using to address things, and additionally most calculations care about bytes rather than pages except for the very edge where we're going to/from wasm. Overall I've tried to minimize the amount of `as` casts as possible, using checked `try_from` and checked arithemtic with either error handling or explicit `unwrap()` calls to tell us about bugs in the future. Most locations have relatively obvious things to do with various implications on various hosts, and I think they should all be roughly of the right shape but time will tell. I mostly relied on the compiler complaining that various types weren't aligned to figure out type-casting, and I manually audited some of the more obvious locations. I suspect we have a number of hidden locations that will panic on 32-bit hosts if 64-bit modules try to run there, but otherwise I think we should be generally ok (famous last words). In any case I wouldn't want to enable this by default naturally until we've fuzzed it for some time. In terms of the actual underlying implementation, no one should expect memory64 to be all that fast. Right now it's implemented with "dynamic" heaps which have a few consequences: * All memory accesses are bounds-checked. I'm not sure how aggressively Cranelift tries to optimize out bounds checks, but I suspect not a ton since we haven't stressed this much historically. * Heaps are always precisely sized. This means that every call to `memory.grow` will incur a `memcpy` of memory from the old heap to the new. We probably want to at least look into `mremap` on Linux and otherwise try to implement schemes where dynamic heaps have some reserved pages to grow into to help amortize the cost of `memory.grow`. The memory64 spec test suite is scheduled to now run on CI, but as with all the other spec test suites it's really not all that comprehensive. I've tried adding more tests for basic things as I've had to implement guards for them, but I wouldn't really consider the testing adequate from just this PR itself. I did try to take care in one test to actually allocate a 4gb+ heap and then avoid running that in the pooling allocator or in emulation because otherwise that may fail or take excessively long. [proposal]: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md * Fix some tests * More test fixes * Fix wasmtime tests * Fix doctests * Revert to 32-bit immediate offsets in `heap_addr` This commit updates the generation of addresses in wasm code to always use 32-bit offsets for `heap_addr`, and if the calculated offset is bigger than 32-bits we emit a manual add with an overflow check. * Disable memory64 for spectest fuzzing * Fix wrong offset being added to heap addr * More comments! * Clarify bytes/pages	2021-08-12 09:40:20 -05:00
Andrew Brown	76a93dc112	fuzz: log Wasm contents to file when `log::debug` is enabled Previously, the WAT was printed as a log message. This change standardizes all of the oracles to use `log_wasm`, which emits a `.wasm` and `.wat` file for each case if `log::debug` is enabled and prints a message with the names of the created files. Closes #3140.	2021-08-11 09:10:20 -07:00
Sergei Shulepov	cbabcacb0f	wasmtime: Option to disable parallel compilation (#3169 ) * Introduce parallel-compilation configuration switch * Plumb parallel_compilation config to compilation * Adjust obj.rs * Address review * Fix compilation fail in `cache` crate * Fix obj.rs Also remove the now unneeded feature in /Cargo.toml * fmt	2021-08-10 14:09:15 -05:00
Andrew Brown	42acb72c54	fuzz: retrieve the WebAssembly spec repository in `build.rs` To avoid the large download size of the spec repository mentioned [here](https://github.com/bytecodealliance/wasmtime/pull/3124#discussion_r684605984), this change removes it as a submodule and instead clones it shallowly when the directory is empty (or not present) when `build.rs` is run.	2021-08-10 11:56:07 -07:00
Andrew Brown	651a321f1a	fuzz: add differential_spec fuzzing target This new target compares the outputs of executing the first exported function of a Wasm module in Wasmtime and in the official Wasm spec interpreter (using the `wasm-spec-interpreter` crate). This is an initial step towards more fully-featured fuzzing (e.g. compare memories, add `v128`, add references, add other proposals, etc.)	2021-08-10 11:56:07 -07:00
Andrew Brown	f3955fa62a	refactor: rename `DifferentialWasmiModuleConfig` to `SingleFunctionModuleConfig` Since we plan to reuse this configuration, we rename it and ensure it has at least 1 type (this resulted in invalid modules).	2021-08-10 11:56:07 -07:00
Andrew Brown	a7f592a026	Add a crate to interface with the WebAssembly spec interpreter The WebAssembly spec interpreter is written in OCaml and the new crate uses `ocaml-interop` along with a small OCaml wrapper to interpret Wasm modules in-process. The build process for this crate is currently Linux-specific: it requires several OCaml packages (e.g. `apt install -y ocaml-nox ocamlbuild`) as well as `make`, `cp`, and `ar`.	2021-08-10 11:56:07 -07:00
Andrew Brown	2e95d4e7c6	wasi-nn: refactor wasi-nn context to use multiple backends	2021-08-10 10:05:52 -07:00
Andrew Brown	f0147f23e8	wiggle: emit `From<#ident> for #tag_type` for variants	2021-08-10 10:05:52 -07:00
Andrew Brown	c3bbdead7c	wasi-nn: add backend abstraction	2021-08-10 10:05:52 -07:00
Alex Crichton	480dff21e8	fuzz: Disable more features for spectests fuzzer (#3159 ) The previous commit to eanble multi-memory and simd leaked into the spectest fuzzer, but to pass the spec tests we can't enable these features.	2021-08-06 16:27:42 -05:00
Alex Crichton	33c3d00f10	Remove rss prediction from api_calls fuzzer (#3156 ) This functionality is now subsumed by the limiter built-in to all fuzzing stores, so there's no longer any need for it. It was also triggering arithmetic overflows in fuzzing, so instead of fixing I'm removing it!	2021-08-06 12:43:22 -05:00
Nick Fitzgerald	e5ef1455a3	Merge pull request #3157 from alexcrichton/centralize-error-handling fuzz: Centralize handling instantiation errors	2021-08-06 10:38:48 -07:00
Alex Crichton	45896e0533	Decrease memory limit in fuzzing to 1gb (#3155 ) This should keep us under the default 2gb limit when fuzzing	2021-08-06 12:28:49 -05:00
Alex Crichton	3bdf6c7a48	fuzz: Centralize handling instantiation errors At the same time remove some string matching in favor of checking for oom explicitly.	2021-08-06 07:47:28 -07:00
Alex Crichton	bb85366a3b	Enable simd fuzzing on oss-fuzz (#3152 ) * Enable simd fuzzing on oss-fuzz This commit generally enables the simd feature while fuzzing, which should affect almost all fuzzers. For fuzzers that just throw random data at the wall and see what sticks, this means that they'll now be able to throw simd-shaped data at the wall and have it stick. For wasm-smith-based fuzzers this commit also updates wasm-smith to 0.6.0 which allows further configuring the `SwarmConfig` after generation, notably allowing `instantiate-swarm` to generate modules using simd using `wasm-smith`. This should much more reliably feed simd-related things into the fuzzers. Finally, this commit updates wasmtime to avoid usage of the general `wasm_smith::Module` generator to instead use a Wasmtime-specific custom default configuration which enables various features we have implemented. * Allow dummy table creation to fail Tables might creation for imports may exceed the memory limit on the store, which we'll want to gracefully recover from and not fail the fuzzers.	2021-08-05 16:24:42 -05:00
Alex Crichton	214c5f862d	fuzz: Implement finer memory limits per-store (#3149 ) * fuzz: Implement finer memory limits per-store This commit implements a custom resource limiter for fuzzing. Locally I was seeing a lot of ooms while fuzzing and I believe it was generally caused from not actually having any runtime limits for wasm modules. I'm actually surprised that this hasn't come up more on oss-fuzz more in reality, but with a custom store limiter I think this'll get the job done where we have an easier knob to turn for controlling the memory usage of fuzz-generated modules. For now I figure a 2gb limit should be good enough for limiting fuzzer execution. Additionally the "out of resources" check if instantiation fails now looks for the `oom` flag to be set instead of pattern matching on some error messages about resources. * Fix tests	2021-08-05 15:07:33 -05:00
Alex Crichton	d8c4ac2c25	Improve output of expectation failures of the `wast` commands (#3150 ) This commit updates the output of failed expectations in the `wast` crate to fold in the check-is-the-value-the-same with the generate-a-nice-message. Additionally this tries to make sure that everything is aligned in the output to make it a bit more easily readable. Vectors should notably be improved where lane differences can be compared vertically in the case of integers and printed out specifically in the case of floats.	2021-08-05 14:31:55 -05:00
Alex Crichton	9e142f8792	Fix some warnings on nightly Rust (#3148 ) Looks like these trailing-semicolons-in-macros are likely to become a hard error in the future, so this updates to remove them as necessary.	2021-08-05 13:02:44 -05:00
Alex Crichton	4cfa031c5f	Implement API support for v128-globals (#3147 ) Found via fuzzing, and looks like these were accidentally left out along the way SIMD was taking shape.	2021-08-05 13:02:34 -05:00
Alex Crichton	85f16f488d	Consolidate address calculations for atomics (#3143 ) * Consolidate address calculations for atomics This commit consolidates all calcuations of guest addresses into one `prepare_addr` function. This notably remove the atomics-specifics paths as well as the `prepare_load` function (now renamed to `prepare_addr` and folded into `get_heap_addr`). The goal of this commit is to simplify how addresses are managed in the code generator for atomics to use all the shared infrastrucutre of other loads/stores as well. This additionally fixes #3132 via the use of `heap_addr` in clif for all operations. I also added a number of tests for loads/stores with varying alignments. Originally I was going to allow loads/stores to not be aligned since that's what the current formal specification says, but the overview of the threads proposal disagrees with the formal specification, so I figured I'd leave it as-is but adding tests probably doesn't hurt. Closes #3132 * Fix old backend * Guarantee misalignment checks happen before out-of-bounds	2021-08-04 15:57:56 -05:00
Alex Crichton	a33caec9be	Bump the wasm-tools crates (#3139 ) * Bump the wasm-tools crates Pulls in some updates here and there, mostly for updating crates to the latest version to prepare for later memory64 work. * Update lightbeam	2021-08-04 09:53:47 -05:00
Chris Fallin	a13a777230	Bump to Wasmtime v0.29.0 and Cranelift 0.76.0.	2021-08-02 11:24:09 -07:00
Alex Crichton	63a3bbbf5a	Change VMMemoryDefinition::current_length to `usize` (#3134 ) * Change VMMemoryDefinition::current_length to `usize` This commit changes the definition of `VMMemoryDefinition::current_length` to `usize` from its previous definition of `u32`. This is a pretty impactful change because it also changes the cranelift semantics of "dynamic" heaps where the bound global value specifier must now match the pointer type for the platform rather than the index type for the heap. The motivation for this change is that the `current_length` field (or bound for the heap) is intended to reflect the current size of the heap. This is bound by `usize` on the host platform rather than `u32` or` u64`. The previous choice of `u32` couldn't represent a 4GB memory because we couldn't put a number representing 4GB into the `current_length` field. By using `usize`, which reflects the host's memory allocation, this should better reflect the size of the heap and allows Wasmtime to support a full 4GB heap for a wasm program (instead of 4GB minus one page). This commit also updates the legalization of the `heap_addr` clif instruction to appropriately cast the address to the platform's pointer type, handling bounds checks along the way. The practical impact for today's targets is that a `uextend` is happening sooner than it happened before, but otherwise there is no intended impact of this change. In the future when 64-bit memories are supported there will likely need to be fancier logic which handles offsets a bit differently (especially in the case of a 64-bit memory on a 32-bit host). The clif `filetest` changes should show the differences in codegen, and the Wasmtime changes are largely removing casts here and there. Closes #3022 * Add tests for memory.size at maximum memory size * Add a dfg helper method	2021-08-02 13:09:40 -05:00
Shamil	072d5dc978	Fix typo in doc (#3127 )	2021-07-29 08:59:41 -05:00
Andrew Brown	e3c56efd3e	Fix unused borrow warning `#[warn(unused_must_use)]` is on, prompting a compiler warning like: "unused borrow that must be used".	2021-07-28 15:39:45 -07:00
Alex Crichton	65378422bf	Add a `wasmtime_linker_define_func` C API function (#3122 ) This exposes the functionality of the `Linker` type where a store-independent function can be created and inserted, allowing a linker's functions to be used across many stores (instead of requiring one linker-per-store). Closes #3110	2021-07-27 18:56:52 -05:00
Alex Crichton	9b088756b3	Implement `Linker::module_async` (#3121 ) This implements and adds the async counterpart of the `Linker::module` method. Closes #3077	2021-07-27 16:17:45 -05:00
Alex Crichton	b5f7b2f86a	Remove thread local for mach port (#3119 ) This was needed a long time ago in the original implementation when the function being called here was hotter than it was before, but nowadays this function isn't hot as it's protected elsewhere from being repeatedly called, so the caching thread local is no longer necessary.	2021-07-27 11:07:15 -05:00
Dan Gohman	784a380e5f	Add comments about vmctx pointers in various datastructures. (#2925 ) This forward-ports the relevant parts of #1396.	2021-07-27 09:33:27 -05:00
Nick Fitzgerald	10eead18c8	Update `object` to 0.26.0	2021-07-26 12:10:41 -07:00
Nick Fitzgerald	514bbb20b4	Update `backtrace` to 0.3.61	2021-07-26 12:05:44 -07:00
Nick Fitzgerald	3d76cbdf34	Update `gimli` to 0.25; `addr2line` to 0.16	2021-07-26 11:04:53 -07:00
Peter Huene	ad054c6bce	Add more documentation to `ModuleLimits` and `InstanceLimits`. This commit adds some clarifying documentation to both the `ModuleLimits` and `InstanceLimits` types in the Wasmtime API. It clarifies how each setting relates to the memory allocated by the pooling instance allocator. Closes #3080.	2021-07-23 14:26:48 -07:00
Nick Fitzgerald	f136f73033	Reword `get_export` mutable context docs to be more user-facing	2021-07-21 11:25:49 -07:00
Nick Fitzgerald	943d027757	Document reborrowing issues for `AsContextMut` and workarounds	2021-07-21 09:28:36 -07:00
Nick Fitzgerald	4c9f90e89d	Document why `get_export` requires a mutable context	2021-07-21 09:10:09 -07:00
Qiu Wenbo	f628d06118	Upgrade capstone to v0.9	2021-07-19 17:14:28 +08:00
Alex Crichton	ff1ae6e10c	Flag another error as ok to hit when fuzzing (#3092 ) We've got a lot of fuzz failures right now of modules instantiating memories of 65536 pages, which we specifically disallow since the representation of limits within Wasmtime don't support full 4GB memories. This is ok, however, and it's not a fuzz failure that we're interested in, so this commit allows strings of that error to pass through the fuzzer.	2021-07-16 14:37:27 -05:00
Pat Hickey	83f7872ace	Merge pull request #3090 from bytecodealliance/pch/wiggle_dummy_executor_crashes wiggle: dummy executor traps instead of panics, improve testing	2021-07-16 11:34:56 -07:00
Pat Hickey	906182a304	fix wasi-tokio	2021-07-16 10:28:09 -07:00

1 2 3 4 5 ...

1785 Commits