wasmtime

Author	SHA1	Message	Date
Alex Crichton	b759514124	Allow wasmtime/v8 to differ on errors slightly (#3348 ) I'm not sure why when run repeatedly v8 has different limits on call-stack-size but it's not particularly interesting to assert exact matches here, so this should fix a fuzz-bug-failure found on oss-fuzz.	2021-09-14 10:40:24 -05:00
Dan Gohman	d1fce1e836	Modify the `poll_oneoff_files` test to tolerate OS differences. (#3346 ) Modify the `poll_oneoff_files` test to avoid assuming that `poll_oneoff` returns all pending events, as it may sometimes return only a subset of events. When multiple events are expected, use a loop, and loop until all events have been recorded.	2021-09-13 14:59:50 -05:00
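The loop-until-all-events-recorded approach described in this commit can be sketched roughly as follows. This is only an illustration: `FakePoller` and `collect_all` are hypothetical names standing in for the real `poll_oneoff` call and the test code, and the sketch assumes events are plain integer ids.

```rust
use std::collections::HashSet;

/// Hypothetical stand-in for `poll_oneoff`: each call returns at most
/// `max_per_call` of the pending events, i.e. possibly only a subset.
struct FakePoller {
    pending: Vec<u32>,
    max_per_call: usize,
}

impl FakePoller {
    fn poll(&mut self) -> Vec<u32> {
        let n = self.max_per_call.min(self.pending.len());
        self.pending.drain(..n).collect()
    }
}

/// Keep polling until every expected event has been recorded, instead of
/// assuming that a single call reports all of them.
fn collect_all(poller: &mut FakePoller, expected: &HashSet<u32>) -> HashSet<u32> {
    let mut seen = HashSet::new();
    while seen.len() < expected.len() {
        let batch = poller.poll();
        assert!(!batch.is_empty(), "poller ran dry before all events arrived");
        for ev in batch {
            assert!(expected.contains(&ev), "unexpected event {}", ev);
            seen.insert(ev);
        }
    }
    seen
}

fn main() {
    let expected = HashSet::from([1, 2, 3]);
    let mut poller = FakePoller { pending: vec![1, 2, 3], max_per_call: 1 };
    let seen = collect_all(&mut poller, &expected);
    assert_eq!(seen, expected);
}
```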
Dan Gohman	4d86f0ca10	Update to cap-std 0.19.0 and rsix 0.22.4. (#3331 ) This pulls in the s390x fix needed by #3330. Also a small `rsix` API update; `PollFdVec` has been removed in favor of just using `Vec<PollFd>`.	2021-09-11 12:28:30 -05:00
Dan Gohman	256e942aa0	Tidy up redundant `use` declarations. (#3333 ) This is just a minor code cleanup.	2021-09-11 12:26:54 -05:00
Nick Fitzgerald	4b256ab968	Place unwind info directly after the text section, even when debug info is enabled When debug info was enabled, we would put the debug info sections in between the text section and the unwind info section. But the unwind info is encoded in a position-independent manner (so that we don't need relocs for it) that relies on it directly following the text section. The result of the misplacement was some crashes inside the unwinder.	2021-09-09 13:39:30 -07:00
Nick Fitzgerald	0499cca2fa	Name unwind info `.eh_frame` in Wasmtime's compiled ELF artifact We were previously using `_wasmtime_eh_frame` but there is no good reason to add the Wasmtime-specific prefix. Using the standard name allows for better inspection with standard tools like `dwarfdump`.	2021-09-09 12:54:49 -07:00
Nick Fitzgerald	dd0bc3237e	Do not write a DWARF section if it is empty There is no point in writing an empty DWARF section, and this will make our ELF files a tiny bit smaller.	2021-09-09 12:54:13 -07:00
Alex Crichton	e26b91e890	Remove unnecessary annotations on `Module::get_export` (#3318 ) I think these were historically needed but nowadays not necessary!	2021-09-09 11:54:50 -07:00
Pat Hickey	bd19f43f84	rewrite `Store::{entering,exiting}_native_code_hook` into `Store::call_hook` (#3313 ) which provides additional detail on what state transition is being made	2021-09-09 09:20:45 -05:00
Alex Crichton	c73673559b	Avoid vector allocations in wasm->host calls (#3294 ) This commit improves the runtime support for wasm-to-host invocations for functions created with `Func::new` or `wasmtime_func_new` in the C API. Previously a `Vec` (sometimes a `SmallVec`) would be dynamically allocated on each host call to store the arguments that are coming from wasm and going to the host. In the case of the `wasmtime` crate we need to decode the `u128`-stored values, and in the case of the C API we need to decode the `Val` into the C API's `wasmtime_val_t`. The technique used in this commit is to store a singular `Vec<T>` inside the "store", be it the literal `Store<T>` or within the `T` in the case of the C API, which can be reused across wasm->host calls. This means that we're unlikely to actually perform dynamic memory allocation and instead we should hit a faster path where the `Vec` always has enough capacity. Note that this is just a mild improvement for `Func::new`-based functions. It's still the case that `Func::wrap` is much faster, but unfortunately the C API doesn't have access to `Func::wrap`, so the main motivation here is accelerating the C API.	2021-09-03 15:14:21 -05:00
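A minimal sketch of the buffer-reuse idea from this commit, assuming a simplified store that holds a single scratch `Vec`. The `Store` type, its `scratch` field, and `call_host` below are hypothetical and are not Wasmtime's actual types or signatures.

```rust
/// Hypothetical store holding one scratch buffer that is reused across
/// wasm->host calls, so steady-state calls do not allocate.
struct Store {
    scratch: Vec<u64>,
}

impl Store {
    fn call_host(&mut self, raw_args: &[u64], host: impl FnOnce(&[u64]) -> u64) -> u64 {
        // Take the buffer out of the store for the duration of the call;
        // this leaves an empty (non-allocating) Vec behind.
        let mut buf = std::mem::take(&mut self.scratch);
        buf.clear();
        buf.extend_from_slice(raw_args);
        let ret = host(&buf);
        // Put the buffer back so its capacity is available to the next call.
        self.scratch = buf;
        ret
    }
}

fn main() {
    let mut store = Store { scratch: Vec::new() };
    let sum = store.call_host(&[1, 2, 3, 4], |args| args.iter().sum());
    assert_eq!(sum, 10);
    // A later call with fewer arguments fits in the retained capacity.
    let cap = store.scratch.capacity();
    store.call_host(&[5, 6], |args| args.iter().sum());
    assert_eq!(store.scratch.capacity(), cap);
}
```

Taking the buffer out and putting it back, rather than borrowing it in place, is one way to keep the rest of the store usable while the host function runs; whether Wasmtime does it exactly this way is not stated in the commit message.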
Alex Crichton	ca3947911e	Refactor the internals of `Store<T>` (#3291 ) * Refactor the internals of `Store<T>` This commit is an overdue refactoring and renaming of some internals of the `Store` type in Wasmtime. The actual implementation of `Store<T>` has evolved from the original implementation to the point where some of the aspects of how things are structured no longer make sense. There's also always been a lot of unnecessary gymnastics when trying to get access to various store pieces depending on where you are in `wasmtime`. This refactoring aims to simplify all this and make the internals much easier to read/write. The following changes were made: * The `StoreOpaque<'_>` type is deleted, along with the `opaque()` method. * The `StoreInnermost` type was renamed to `StoreOpaque`. `StoreOpaque<'_>` is dead. Long live `StoreOpaque`. This renaming and a few small tweaks mean that this type now suffices for all consumers. * The `AsContextMut` and `AsContext` traits are now implemented for `StoreInner<T>`. These changes, while subtly small, help clean up a lot of the internals of `wasmtime`. There's a lot less verbose `&mut store.as_context_mut().opaque()` now. Additionally many methods can simply start with `let store = store.as_context_mut().0;` and use things internally. One of the nicer aspects of using references directly is that the compiler automatically reborrows references as necessary meaning there's a lot less manual reborrowing. The main motivation for this change was actually somewhat roundabout where I found that when `StoreOpaque<'_>` was being captured in closures and iterators it's 3 pointers wide which is a lot of data to move around. Now things capture over `&mut StoreOpaque` which is just one nice and small pointer to move around. In any case though I've long wanted to revisit the design of these internals to improve the ergonomics.
It's not expected that this change alone will really have all that much impact on the performance of `wasmtime`. Finally a doc comment was added to `store.rs` to try to explain all the `Store`-related types since there are a nontrivial amount. * Rustfmt	2021-09-03 13:55:18 -05:00
Alex Crichton	50ce19a4a4	Remove an indirect function call in `Func::new` (#3293 ) This commit optimizes the runtime execution of `Func::new` by removing an indirect function call that happens whenever a host function is called. This indirection was generally done to prevent monomorphizing a lot into consumer code but the few extra functions this makes monomorphic are fairly small, and in general wasm->host call performance is pretty important. While not a massive win this is expected to improve codegen, especially because with the indirect call removed the compiler should now be able to prove more often when a `Func::new` closure doesn't panic or return an error.	2021-09-03 13:40:51 -05:00
Alex Crichton	c33700087d	Align order of wasm types/values across Wasmtime (#3292 ) Wasmtime has a few representations of `Val` and `ValType` across the internal crates, the `wasmtime` crate, and the C API. These were previously sometimes mentioned in different orders which means that converting between the two took a little extra code than before. This commit is a micro-optimization to align the types across the various places we define these to help reduce the codegen burden when converting between these types. This is not expected to have a major impact on performance, rather it's a small cleanup which should be easy-ish to preserve I've noticed while staring at assembly.	2021-09-03 11:43:56 -05:00
Nick Fitzgerald	dd71acd7e3	Merge pull request #3281 from alexcrichton/small-opts Some small optimizations for calling wasm functions	2021-09-02 15:06:42 -07:00
Pat Hickey	fa15adfdd0	Merge pull request #3271 from bytecodealliance/pch/flexible_ser_module_versioning More flexible versioning for module serialization	2021-09-02 12:51:03 -07:00
Alex Crichton	37b9fc5333	Fix async build	2021-09-02 07:38:29 -07:00
Alex Crichton	6b5e21d80e	Inline some trivial store accessors These were showing up in some profiles, but they're trivial functions, so `#[inline]` them.	2021-09-02 07:26:10 -07:00
Alex Crichton	230159efa7	Inline some type conversions for `()` The `()` type accidentally wasn't getting its trivial type conversions inlined because it doesn't actually have any type parameters. This commit adds `#[inline]` to the relevant functions to ensure that these get inlined across crates.	2021-09-02 07:26:07 -07:00
Alex Crichton	c8f55ed688	Optimize codegen slightly calling wasm functions Currently wasm-calls work with `Result<T, Trap>` internally but `Trap` is an enum defined in `wasmtime-runtime` which is actually quite large. Since traps are supposed to be rare this commit changes these functions to return a `Box<Trap>` which is un-boxed later up in the `wasmtime` crate within a `#[cold]` function.	2021-09-02 07:26:03 -07:00
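The effect of boxing a large error type can be sketched as below. `BigTrap` is a hypothetical stand-in for the real `Trap` enum, and the function names are illustrative only; the point is that the hot path returns a pointer-sized error and the unboxing happens in a `#[cold]` function.

```rust
use std::mem::size_of;

/// Hypothetical stand-in for a large trap enum.
#[derive(Debug)]
#[allow(dead_code)]
enum BigTrap {
    User([u8; 64]),
    OutOfBounds { pc: usize, backtrace: [usize; 8] },
}

/// Hot path: the error is boxed, so the `Result` stays small.
fn call_inner(fail: bool) -> Result<u32, Box<BigTrap>> {
    if fail {
        Err(Box::new(BigTrap::OutOfBounds { pc: 0x1234, backtrace: [0; 8] }))
    } else {
        Ok(7)
    }
}

/// Traps are expected to be rare, so unboxing lives in a cold function.
#[cold]
fn unbox_trap(trap: Box<BigTrap>) -> BigTrap {
    *trap
}

/// Outer API: converts back to the unboxed error type.
fn call(fail: bool) -> Result<u32, BigTrap> {
    call_inner(fail).map_err(unbox_trap)
}

fn main() {
    println!(
        "boxed result: {} bytes, unboxed result: {} bytes",
        size_of::<Result<u32, Box<BigTrap>>>(),
        size_of::<Result<u32, BigTrap>>()
    );
    assert_eq!(call(false).unwrap(), 7);
    assert!(call(true).is_err());
}
```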
Benjamin Bouvier	fb94b81538	Use 16K code pages on Mac M1 Fixes #3278.	2021-09-02 09:16:34 +02:00
Benjamin Bouvier	f871e8cf8f	Correctly set the address of FP when unwinding from within fibers on aarch64 Fixes #3256.	2021-09-02 08:58:03 +02:00
Pat Hickey	f46f58ecc2	replace Config::deserialize_check_wasmtime_version with Config::module_version which is more expressive than the former. Instead of just configuring Module::deserialize to ignore version information, we can configure Module::serialize to emit a custom version string, and Module::deserialize to check for that string. A new enum ModuleVersionStrategy is declared, and Config::deserialize_check_wasmtime_version:bool is replaced with Config::module_version:ModuleVersionStrategy.	2021-09-01 17:12:15 -07:00
Alex Crichton	1532516a36	Use relative `call` instructions between wasm functions (#3275 ) * Use relative `call` instructions between wasm functions This commit is a relatively major change to the way that Wasmtime generates code for Wasm modules and how functions call each other. Prior to this commit all function calls between functions, even if they were defined in the same module, were done indirectly through a register. To implement this the backend would emit an absolute 8-byte relocation near all function calls, load that address into a register, and then call it. While this technique is simple to implement and easy to get right, it has two primary downsides associated with it: * Function calls are always indirect which means they are more difficult to predict, resulting in worse performance. * Generating a relocation-per-function call requires expensive relocation resolution at module-load time, which can be a large contributing factor to how long it takes to load a precompiled module. To fix these issues, while also somewhat compromising on the previously simple implementation technique, this commit switches wasm calls within a module to using the `colocated` flag enabled in Cranelift-speak, which basically means that a relative call instruction is used with a relocation that's resolved relative to the pc of the call instruction itself. When switching the `colocated` flag to `true` this commit is also then able to move much of the relocation resolution from `wasmtime_jit::link` into `wasmtime_cranelift::obj` during object-construction time. This frontloads all relocation work which means that there's actually no relocations related to function calls in the final image, solving both of our points above. The main gotcha in implementing this technique is that there are hardware limitations to relative function calls which mean we can't simply blindly use them. 
AArch64, for example, can only go +/- 64 MB from the `bl` instruction to the target, which means that if the function we're calling is a greater distance away then we would fail to resolve that relocation. On x86_64 the limits are +/- 2GB which are much larger, but theoretically still feasible to hit. Consequently the main increase in implementation complexity is fixing this issue. This issue is actually already present in Cranelift itself, and is internally one of the invariants handled by the `MachBuffer` type. When generating a function relative jumps between basic blocks have similar restrictions. This commit adds new methods for the `MachBackend` trait and updates the implementation of `MachBuffer` to account for all these new branches. Specifically the changes to `MachBuffer` are: * For AArch64 the `LabelUse::Branch26` value now supports veneers, and AArch64 calls use this to resolve relocations. * The `emit_island` function has been rewritten internally to handle some cases which didn't come up before, such as: * When emitting an island the deadline is now recalculated, where previously it was always set to infinitely in the future. This was ok prior since only a `Branch19` supported veneers and once it was promoted no veneers were supported, so without multiple layers of promotion the lack of a new deadline was ok. * When emitting an island all pending fixups had veneers forced if their branch target wasn't known yet. This was generally ok for 19-bit fixups since the only kind getting a veneer was a 19-bit fixup, but with mixed kinds it's a bit odd to force veneers for a 26-bit fixup just because a nearby 19-bit fixup needed a veneer. Instead fixups are now re-enqueued unless they're known to be out-of-bounds. This may run the risk of generating more islands for 19-bit branches but it should also reduce the number of islands for between-function calls.
* Otherwise the internal logic was tweaked to ideally be a bit more simple, but that's a pretty subjective criterion in compilers... I've added some simple testing of this for now. A synthetic compiler option was created to simply add padded 0s between functions and test cases implement various forms of calls that at least need veneers. A test is also included for x86_64, but it is unfortunately pretty slow because it requires generating 2GB of output. I'm hoping for now it's not too bad, but we can disable the test if it's prohibitive and otherwise just comment the necessary portions to be sure to run the ignored test if these parts of the code have changed. The final end-result of this commit is that for a large module I'm working with the number of relocations dropped to zero, meaning that nothing actually needs to be done to the text section when it's loaded into memory (yay!). I haven't run final benchmarks yet but this is the last remaining source of significant slowdown when loading modules, after I land a number of other PRs both active and ones that I only have locally for now. * Fix arm32 * Review comments	2021-09-01 13:27:38 -05:00
Dan Gohman	05d113148d	Use `std::alloc::alloc` instead of `libc::posix_memalign`. This makes Cranelift use the Rust `alloc` API for its allocations, rather than directly calling into `libc`, which makes it respect the `#[global_allocator]` configuration. Also, use `region::page::ceil` instead of having our own copies of that logic.	2021-08-31 15:49:50 -07:00
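As a rough illustration of allocating page-aligned memory through the Rust allocator API rather than `posix_memalign`, consider the sketch below. The `PageAligned` type and the fixed `PAGE_SIZE` of 4096 are assumptions made for the example, and `round_up_to_page` is a local helper standing in for the `region::page::ceil` call mentioned in the commit.

```rust
use std::alloc::{alloc, dealloc, Layout};

/// Assumed page size for this sketch; real code would query the OS.
const PAGE_SIZE: usize = 4096;

/// Local helper: round `size` up to a multiple of the page size.
fn round_up_to_page(size: usize) -> usize {
    (size + PAGE_SIZE - 1) & !(PAGE_SIZE - 1)
}

/// Hypothetical owner of a page-aligned allocation made via `std::alloc`,
/// which goes through whatever `#[global_allocator]` is configured.
struct PageAligned {
    ptr: *mut u8,
    layout: Layout,
}

impl PageAligned {
    fn new(size: usize) -> Option<Self> {
        // `alloc` requires a non-zero size, so allocate at least one page.
        let layout = Layout::from_size_align(round_up_to_page(size.max(1)), PAGE_SIZE).ok()?;
        // SAFETY: `layout` has a non-zero size.
        let ptr = unsafe { alloc(layout) };
        if ptr.is_null() {
            None
        } else {
            Some(Self { ptr, layout })
        }
    }
}

impl Drop for PageAligned {
    fn drop(&mut self) {
        // SAFETY: `ptr` was returned by `alloc` with exactly this layout.
        unsafe { dealloc(self.ptr, self.layout) }
    }
}

fn main() {
    let mem = PageAligned::new(10).expect("allocation failed");
    assert_eq!(mem.ptr as usize % PAGE_SIZE, 0);
    assert_eq!(mem.layout.size(), PAGE_SIZE);
}
```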
Dan Gohman	197aec9a08	Update io-lifetimes, cap-std, and rsix (#3269 ) - Fixes for compiling on OpenBSD - io-lifetimes 0.3.0 has an option (io_lifetimes_use_std, which is off by default) for testing the `io_safety` feature in Rust nightly.	2021-08-31 13:02:37 -07:00
Alex Crichton	9e0c910023	Add a `Module::deserialize_file` method (#3266 ) * Add a `Module::deserialize_file` method This commit adds a new method to the `wasmtime::Module` type, `deserialize_file`. This is intended to be the same as the `deserialize` method except that the serialized module is present as an on-disk file. This enables Wasmtime to internally use `mmap` to avoid copying bytes around and generally makes loading a module much faster. A C API is added in this commit as well for various bindings to use this accelerated path now as well. Another option perhaps for a Rust-based API is to have an API taking a `File` itself to allow for a custom file descriptor in one way or another, but for now that's left for a possible future refactoring if we find a use case. * Fix compat with main - handle readonly mmap * wip * Try to fix Windows support	2021-08-31 13:05:51 -05:00
Alex Crichton	4376cf2609	Add differential fuzzing against V8 (#3264 ) * Add differential fuzzing against V8 This commit adds a differential fuzzing target to Wasmtime along the lines of the wasmi and spec interpreters we already have, but with V8 instead. The intention here is that wasmi is unlikely to receive updates over time (e.g. for SIMD), and the spec interpreter is not suitable for fuzzing against in general due to its performance characteristics. The hope is that V8 is indeed appropriate to fuzz against because it's naturally receiving updates and it also is expected to have good performance. Here the `rusty_v8` crate is used which provides bindings to V8 as well as precompiled binaries by default. This matches exactly the use case we need and at least for now I think the `rusty_v8` crate will be maintained by the Deno folks as they continue to develop it. If maintaining it becomes an issue, though, we can evaluate other options to differentially fuzz against. For now this commit enables the SIMD and bulk-memory features of fuzz-target-generation which should enable them to get differentially-fuzzed with V8 in addition to the compilation fuzzing we're already getting. * Use weak linkage for GDB jit helpers This should help us deduplicate our symbol with other JIT runtimes, if any. For now this leans on some C helpers to define the weak linkage since Rust doesn't support that on stable yet. * Don't use rusty_v8 on MinGW They don't have precompiled libraries there. * Fix msvc build * Comment about execution	2021-08-31 09:34:55 -05:00
Alex Crichton	ef3ec594ce	Don't copy executable code into a `CodeMemory` (#3265 ) * Don't copy executable code into a `CodeMemory` This commit removes a copy from compiled artifacts into a `CodeMemory`. In general this commit drastically changes the meaning of a `CodeMemory`. Previously it was an iteratively-pushed-on structure that would accumulate executable code over time. Afterwards, however, it's a manager for an `MmapVec` which updates the permissions on the text section to ensure that the pages are executable. By taking ownership of an `MmapVec` within a `CodeMemory` there's no need to copy any data around, which means that the `.text` section in the ELF image produced by Wasmtime is usable as-is after placement in memory and relocations have been resolved. This moves Wasmtime one step closer to being able to directly use a module after it's `mmap`'d into memory, optimizing when a module is loaded. * Fix windows section alignment * Review comments	2021-08-30 13:38:35 -05:00
Alex Crichton	eb251deca9	Remove `scroll` dependency from `wasmtime-jit` (#3260 ) Similar functionality to `scroll` is provided with the `object` crate and doesn't have a `*_derive` crate to go with it. This commit updates the jitdump linux support to use `object` instead of `scroll` to achieve the needs of writing structs-as-bytes onto disk.	2021-08-30 13:26:07 -05:00
Nick Fitzgerald	1c8f0b4652	Merge pull request #3261 from jlb6740/fix-build-for-benchmark-api Bench-api cargo update to allow seeing Module functions	2021-08-30 09:39:49 -07:00
Alex Crichton	a237e73b5a	Remove some allocations in `CodeMemory` (#3253 ) * Remove some allocations in `CodeMemory` This commit removes the `FinishedFunctions` type as well as allocations associated with trampolines when allocating inside of a `CodeMemory`. The main goal of this commit is to improve the time spent in `CodeMemory` where currently today a good portion of time is spent simply parsing symbol names and trying to extract function indices from them. Instead this commit implements a new strategy (different from #3236) where compilation records offset/length information for all functions/trampolines so this doesn't need to be re-learned from the object file later. A consequence of this commit is that this offset information will be decoded/encoded through `bincode` unconditionally, but we can also optimize that later if necessary as well. Internally this involved quite a bit of refactoring since the previous map for `FinishedFunctions` was relatively heavily relied upon. * comments	2021-08-30 10:35:17 -05:00
Alex Crichton	c73be1f13a	Use an mmap-friendly serialization format (#3257 ) * Use an mmap-friendly serialization format This commit reimplements the main serialization format for Wasmtime's precompiled artifacts. Previously they were generally a binary blob of `bincode`-encoded metadata prefixed with some versioning information. The downside of this format, though, is that loading a precompiled artifact required pushing all information through `bincode`. This is inefficient when some data, such as trap/address tables, are rarely accessed. The new format added in this commit is one which is designed to be `mmap`-friendly. This means that the relevant parts of the precompiled artifact are already page-aligned for updating permissions of pieces here and there. Additionally the artifact is optimized so that if data is rarely read then we can delay reading it until necessary. The new artifact format for serialized modules is an ELF file. This is not a public API guarantee, so it cannot be relied upon. In the meantime though this is quite useful for exploring precompiled modules with standard tooling like `objdump`. The ELF file is already constructed as part of module compilation, and this is the main contents of the serialized artifact. There is some extra information, though, not encoded in each module's individual ELF file such as type information. This information continues to be `bincode`-encoded, but it's intended to be much smaller and much faster to deserialize. This extra information is appended to the end of the ELF file. This means that the original ELF file is still a valid ELF file, we just get to have extra bits at the end. More information on the new format can be found in the module docs of the serialization module of Wasmtime. Another refactoring implemented as part of this commit is to deserialize and store object files directly in `mmap`-backed storage.
This avoids the need to copy bytes after the artifact is loaded into memory for each compiled module, and in a future commit it opens up the door to avoiding copying the text section into a `CodeMemory`. For now, though, the main change is that copies are not necessary when loading from a precompiled compilation artifact once the artifact is itself in mmap-based memory. To assist with managing `mmap`-based memory a new `MmapVec` type was added to `wasmtime_jit` which acts as a form of `Vec<T>` backed by a `wasmtime_runtime::Mmap`. This type notably supports `drain(..N)` to slice the buffer into disjoint regions that are all separately owned, such as having a separately owned window into one artifact for all object files contained within. Finally this commit implements a small refactoring in `wasmtime-cache` to use the standard artifact format for cache entries rather than a bincode-encoded version. This required some more hooks for serializing/deserializing but otherwise the crate still performs as before. * Review comments	2021-08-30 09:19:20 -05:00
Johnnie Birch	6e1015c0b6	Bench-api cargo update to allow seeing Module functions	2021-08-28 12:41:13 -07:00
Andrew Brown	4ccdcb110a	typo: change 'sharedable' to 'shareable' (#3259 )	2021-08-27 11:50:11 -07:00
Alex Crichton	12515e6646	Move trap information to a section of the compiled image (#3241 ) This commit moves the `traps` field of `FunctionInfo` into a section of the compiled artifact produced by Cranelift. This section is quite large and when previously encoded/decoded with `bincode` this can take quite some time to process. Traps are expected to be relatively rare and it's not necessarily the right tradeoff to spend so much time serializing/deserializing this data, so this commit offloads the section into a custom-encoded binary format located elsewhere in the compiled image. This is similar to #3240 in its goal which is to move very large pieces of metadata to their own sections to avoid decoding anything when we load a precompiled module. This also has a small benefit that it's slightly more efficient storage for the trap information too, but that's a negligible benefit. This is part of #3230 to make loading modules fast.	2021-08-27 01:09:55 -05:00
Alex Crichton	fc91176685	Move address maps to a section of the compiled image (#3240 ) This commit moves the `address_map` field of `FunctionInfo` into a custom-encoded section of the executable. The goal of this commit is, as with previous commits, to push less data through `bincode`. The `address_map` field is actually extremely large and has huge benefits of not being decoded when we load a module. This data is only used for traps and such as well, so it's not overly important that it's massaged into precise data the runtime can extremely speedily use. The `FunctionInfo` type does retain a tiny bit of information about the function itself (its start source location), but other than that the `FunctionAddressMap` structure is moved from `wasmtime-environ` to `wasmtime-cranelift` since it's now no longer needed outside of that context.	2021-08-26 23:06:41 -05:00
Alex Crichton	d12f1d77e6	Convert compilation artifacts to just bytes (#3239 ) * Convert compilation artifacts to just bytes This commit strips the `CompilationArtifacts` type down to simply a list of bytes. This moves all extra metadata elsewhere to live within the list of bytes itself as `bincode`-encoded information. Small affordance is made to avoid an in-process serialize-then-deserialize round-trip for use cases like `Module::new`, but otherwise this is mostly just moving some data around. * Rename data section to `.rodata.wasm`	2021-08-26 21:17:02 -05:00
Peter Huene	a2a6be72c4	Merge pull request #3245 from peterhuene/add-paged-init-setting Add `paged_memory_initialization` to Config.	2021-08-26 18:54:16 -07:00
Peter Huene	e2b9b54301	Add `paged_memory_initialization` to Config. This commit adds a `paged_memory_initialization` setting to `Config`. The setting controls whether or not an attempt is made to organize data segments into Wasm pages during compilation. When used in conjunction with the `uffd` feature on Linux, Wasmtime can completely skip initializing linear memories and instead initialize any pages that are accessed for the first time during Wasm execution.	2021-08-26 16:56:38 -07:00
Alex Crichton	d74cc33856	Merge `wasmtime-jit` and `wasmtime-profiling` (#3247 ) * Merge `wasmtime-jit` and `wasmtime-profiling` This commit merges the `wasmtime-profiling` crate into the `wasmtime-jit` crate. It wasn't really buying a ton being a separate crate and an upcoming refactoring I'd like to do is to remove the `FinishedFunctions` structure. To enable the profilers to work as they used to this commit changes them to pass `CompiledModule` as the argument, but this only works if the profiling trait can see the `CompiledModule` type. * Fix a length calculation	2021-08-26 16:22:11 -05:00
Alex Crichton	def394eca2	Rewrite gdbjit support with safety and fewer deps (#3246 ) This refactoring primarily removes the dependency of the gdbjit image creation on the `finished_functions` array, which shouldn't be necessary given the input object being passed in since information can be read from the object instead. Additionally, though, this commit also removes all `unsafe` from the file, relying on various tools in the `object` crate to parse the internals and update various fields.	2021-08-26 10:44:05 -05:00
Nick Fitzgerald	78c1e4032f	Include the function name in `Instance::get_typed_func` error context (#3243 )	2021-08-26 09:18:43 -05:00
Alex Crichton	6fbddc1931	Replace some cfg(debug) with cfg(debug_assertions) (#3242 ) * Replace some cfg(debug) with cfg(debug_assertions) Neither Cargo nor rustc ever sets `cfg(debug)` automatically, so it's expected that these usages were intended to be `cfg(debug_assertions)`. * Fix MachBuffer debug-assertion invariant checks. We should only check invariants when we expect them to be true -- specifically, before the branch-simplification algorithm runs. At other times, they may be temporarily violated: e.g., after `add_{cond,uncond}_branch()` but before emitting the branch bytes. This is the expected sequence, and the rest of the code is consistent with that. Some of the checks also were not quite right (w.r.t. the written invariants); specifically, we should not check validity of a label's offset when the label has been aliased to another label. It seems that this is an unfortunate consequence of leftover debug-assertions that weren't actually being run, so weren't kept up-to-date. Should no longer happen now that we actually check these! Co-authored-by: Chris Fallin <chris@cfallin.org>	2021-08-25 22:15:24 -05:00
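For readers unfamiliar with the distinction, the sketch below shows the `cfg(debug_assertions)` form. The `check_invariants` function and its sortedness check are hypothetical and not the actual `MachBuffer` checks. Code gated on `#[cfg(debug)]` instead would simply never be compiled unless a build passed `--cfg debug` by hand, which is how stale checks can go unnoticed.

```rust
/// Invariant check compiled only when debug assertions are on
/// (the default for `cargo build` and `cargo test` without `--release`).
#[cfg(debug_assertions)]
fn check_invariants(offsets: &[u32]) {
    assert!(
        offsets.windows(2).all(|w| w[0] <= w[1]),
        "offsets must be sorted"
    );
}

/// No-op replacement when debug assertions are disabled.
#[cfg(not(debug_assertions))]
fn check_invariants(_offsets: &[u32]) {}

/// Reports whether the checks above are active in this build.
fn checks_enabled() -> bool {
    cfg!(debug_assertions)
}

fn main() {
    check_invariants(&[1, 2, 3]);
    println!("invariant checks enabled: {}", checks_enabled());
}
```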
Alex Crichton	da5c82b786	Fix a possible use-after-free introduced in #3231 (#3238 ) In #3231 the wasm data sections were moved from the `wasmtime_environ::Module` structure into the `CompilationArtifacts`. Each `wasmtime_runtime::Instance` holds raw pointers into the data section owned by the compilation artifacts under the assumption that the runtime keeps the artifacts alive while the module is in use. Data is needed beyond original initialization for `memory.init` instructions as well as lazy-initialization with the `uffd` feature. The intention of #3231 was that all `CompiledModule` structures, which own `CompilationArtifacts` were owned by a store's `ModuleRegistry`, so this was already taken care of. It turns out, however, that empty modules which contain no functions are not held within a `ModuleRegistry` since there was no need prior to retain them. This commit remedies this mistake by retaining the `CompiledModule` structure, even if there aren't any functions compiled in. This should unblock #3235 and fixes the spurious error found there. The test here, at least on Linux, will deterministically reproduce the error before this commit since `uffd` was initializing wasm memory with free'd host memory.	2021-08-25 12:14:13 -05:00
Alex Crichton	7d05ebe7ff	Move wasm data/debuginfo into the ELF compilation image (#3235 ) * Move wasm data/debuginfo into the ELF compilation image This commit moves existing allocations of `Box<[u8]>` stored separately from compilation's final ELF image into the ELF image itself. The goal of this commit is to reduce the amount of data which `bincode` will need to process in the future. DWARF debugging information and wasm data segments can be quite large, and they're relatively rarely read, so there's typically no need to copy them around. Instead by moving them into the ELF image this opens up the opportunity in the future to eliminate copies and use data directly as-found in the image itself. For information accessed possibly-multiple times, such as the wasm data ranges, the indexes of the data within the ELF image are computed when a `CompiledModule` is created. These indexes are then used to directly index into the image without having to root around in the ELF file each time they're accessed. One other change located here is that the symbolication context previously cloned the debug information into it to adhere to the `'static` lifetime safely, but this isn't actually ever used in `wasmtime` right now so the unsafety around this has been removed and instead borrowed data is returned (no more clones, yay!). * Fix lightbeam	2021-08-25 09:03:07 -05:00
Alex Crichton	a662f5361d	Move wasm data sections out of wasmtime_environ::Module (#3231 ) * Reduce indentation in `to_paged` Use a few early-returns from `match` to avoid lots of extra indentation. * Move wasm data sections out of `wasmtime_environ::Module` This is the first step down the road of #3230. The long-term goal is that `Module` is always `bincode`-decoded, but wasm data segments are a possibly very-large portion of this residing in modules which we don't want to shove through bincode. This refactors the internals of wasmtime to be ok with this data living separately from the `Module` itself, providing access at necessary locations. Wasm data segments are now extracted from a wasm module and concatenated directly. Data sections then describe ranges within this concatenated list of data, and passive data works the same way. This implementation does not lend itself to eventually optimizing the case where passive data is dropped and no longer needed. That's left for a future PR.	2021-08-24 14:04:03 -05:00
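The "concatenate the data, then describe each segment as a range" layout can be sketched as follows. `ConcatenatedData` and its methods are hypothetical names for illustration and do not reflect the actual Wasmtime structures.

```rust
use std::ops::Range;

/// Hypothetical container: all data segments are concatenated into one
/// buffer, and each segment is described by a byte range into it.
struct ConcatenatedData {
    bytes: Vec<u8>,
    ranges: Vec<Range<usize>>,
}

impl ConcatenatedData {
    fn from_segments(segments: &[&[u8]]) -> Self {
        let mut bytes = Vec::new();
        let mut ranges = Vec::with_capacity(segments.len());
        for seg in segments {
            let start = bytes.len();
            bytes.extend_from_slice(seg);
            ranges.push(start..bytes.len());
        }
        Self { bytes, ranges }
    }

    /// Look up one segment by index without copying it.
    fn segment(&self, index: usize) -> Option<&[u8]> {
        let range = self.ranges.get(index)?.clone();
        self.bytes.get(range)
    }
}

fn main() {
    let data = ConcatenatedData::from_segments(&[
        "hello".as_bytes(),
        "".as_bytes(),
        "wasm".as_bytes(),
    ]);
    assert_eq!(data.segment(0), Some("hello".as_bytes()));
    assert_eq!(data.segment(2), Some("wasm".as_bytes()));
    assert_eq!(data.segment(3), None);
}
```

Because every segment is only a range into one shared buffer, the buffer can live outside the `bincode`-encoded metadata, which matches the motivation given in the commit message.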
Alex Crichton	b05cd2e023	Bounds-check all relocations we apply in linking (#3237 ) This commit removes the unsafety present in the `link_module` function by bounds-checking all relocations that we apply, using utilities from the `object` crate for convenience. This isn't intended to have any actual functional change, just ideally improving the safety a bit here in the case of future bugs.	2021-08-24 13:44:28 -05:00
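A minimal sketch of applying a relocation with bounds checks instead of raw pointer writes. The commit uses utilities from the `object` crate; the `apply_abs64` function below is a hypothetical standard-library-only illustration of the same idea for an absolute 8-byte little-endian relocation.

```rust
/// Write `target` as an 8-byte little-endian value at `offset` in `text`,
/// returning an error instead of writing out of bounds.
fn apply_abs64(text: &mut [u8], offset: usize, target: u64) -> Result<(), String> {
    let end = offset
        .checked_add(8)
        .ok_or_else(|| format!("relocation offset {} overflows", offset))?;
    let len = text.len();
    let slot = text.get_mut(offset..end).ok_or_else(|| {
        format!(
            "relocation at {}..{} is outside text of length {}",
            offset, end, len
        )
    })?;
    slot.copy_from_slice(&target.to_le_bytes());
    Ok(())
}

fn main() {
    let mut text = vec![0u8; 16];
    apply_abs64(&mut text, 4, 0xdead_beef).expect("in-bounds relocation");
    // An out-of-bounds relocation is reported, not applied.
    assert!(apply_abs64(&mut text, 12, 1).is_err());
}
```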
Alex Crichton	f3977f1d97	Fix determinism of compiled modules (#3229 ) * Fix determinism of compiled modules Currently wasmtime's compilation artifacts are not deterministic due to the usage of `HashMap` during serialization which has randomized order of its elements. This commit fixes that by switching to a sorted `BTreeMap` for various maps. A test is also added to ensure determinism. If in the future the performance of `BTreeMap` is not as good as `HashMap` for some of these cases we can implement a fancier `serialize_with`-style solution where we sort keys during serialization, but only during serialization and otherwise use a `HashMap`. * fix lightbeam	2021-08-23 17:08:19 -05:00
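The determinism argument can be illustrated with a small sketch: a `BTreeMap` iterates in sorted key order, so serializing it yields the same bytes regardless of insertion order, whereas `HashMap` iteration order is randomized per instance. The `serialize` function and its byte format below are made up for the example and are not Wasmtime's serialization.

```rust
use std::collections::BTreeMap;

/// Toy serializer: length-prefixed name followed by a little-endian index,
/// emitted in the map's iteration order.
fn serialize(map: &BTreeMap<String, u32>) -> Vec<u8> {
    let mut out = Vec::new();
    for (name, index) in map {
        out.extend_from_slice(&(name.len() as u32).to_le_bytes());
        out.extend_from_slice(name.as_bytes());
        out.extend_from_slice(&index.to_le_bytes());
    }
    out
}

fn main() {
    // Same entries, inserted in different orders.
    let mut a = BTreeMap::new();
    a.insert("memory".to_string(), 0);
    a.insert("_start".to_string(), 1);

    let mut b = BTreeMap::new();
    b.insert("_start".to_string(), 1);
    b.insert("memory".to_string(), 0);

    // Sorted iteration makes the output independent of insertion order.
    assert_eq!(serialize(&a), serialize(&b));
}
```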
Alex Crichton	eb21ae149a	Move definition of ModuleMemoryOffset (#3228 ) This was historically defined in `wasmtime-environ` but it's only used in `wasmtime-cranelift`, so this commit moves the definition to the `debug` module where it's primarily used.	2021-08-23 14:42:21 -05:00
Alex Crichton	22ab535ad9	Parse fewer names in linking (#3226 ) We don't need an auxiliary map to tell us function addresses, we can query the symbol instead.	2021-08-23 14:35:48 -05:00


1846 Commits