wasmtime

Author	SHA1	Message	Date
Alex Crichton	2c6841041d	Validate modules while translating (#2059) * Validate modules while translating This commit is a change to cranelift-wasm to validate each function body as it is translated. Additionally top-level module translation functions will perform module validation. This commit builds on changes in wasmparser to perform module validation intertwined with parsing and translation. This will be necessary for future wasm features such as module linking where the type behind a function index, for example, can be far away in another module. Additionally this also brings a nice benefit where parsing the binary only happens once (instead of having an up-front serial validation step) and validation can happen in parallel for each function. Most of the changes in this commit are plumbing to make sure everything lines up right. The major functional change here is that module compilation should be faster by validating in parallel (or skipping function validation entirely in the case of a cache hit). Otherwise from a user-facing perspective nothing should be that different. This commit does mean that cranelift's translation now inherently validates the input wasm module. This means that the Spidermonkey integration of cranelift-wasm will also be validating the function as it's being translated with cranelift. The associated PR for wasmparser (bytecodealliance/wasmparser#62) provides the necessary tools to create a `FuncValidator` for Gecko, but this is something I'll want careful review for before landing! * Read function operators until EOF This way we can let the validator take care of any issues with mismatched `end` instructions and/or trailing operators/bytes.	2020-10-05 11:02:01 -05:00
Julian Seward	25e31739a6	Implement Wasm Atomics for Cranelift/newBE/aarch64. The implementation is pretty straightforward. Wasm atomic instructions fall into 5 groups * atomic read-modify-write * atomic compare-and-swap * atomic loads * atomic stores * fences and the implementation mirrors that structure, at both the CLIF and AArch64 levels. At the CLIF level, there are five new instructions, one for each group. Some comments about these: * for those that take addresses (all except fences), the address is contained entirely in a single `Value`; there is no offset field as there is with normal loads and stores. Wasm atomics require alignment checks, and removing the offset makes implementation of those checks a bit simpler. * atomic loads and stores get their own instructions, rather than reusing the existing load and store instructions, for two reasons: - per above comment, makes alignment checking simpler - reuse of existing loads and stores would require extension of `MemFlags` to indicate atomicity, which sounds semantically unclean. For example, then any instruction carrying `MemFlags` could be marked as atomic, even in cases where it is meaningless or ambiguous. * I tried to specify, in comments, the behaviour of these instructions as tightly as I could. Unfortunately there is no way (per my limited CLIF knowledge) to enforce the constraint that they may only be used on I8, I16, I32 and I64 types, and in particular not on floating point or vector types. The translation from Wasm to CLIF, in `code_translator.rs` is unremarkable. At the AArch64 level, there are also five new instructions, one for each group. All of them except `::Fence` contain multiple real machine instructions. Atomic r-m-w and atomic c-a-s are emitted as the usual load-linked store-conditional loops, guarded at both ends by memory fences. Atomic loads and stores are emitted as a load preceded by a fence, and a store followed by a fence, respectively. 
The amount of fencing may be overkill, but it reflects exactly what the SM Wasm baseline compiler for AArch64 does. One reason to implement r-m-w and c-a-s as a single insn which is expanded only at emission time is that we must be very careful what instructions we allow in between the load-linked and store-conditional. In particular, we cannot allow any extra memory transactions in there, since -- particularly on low-end hardware -- that might cause the transaction to fail, hence deadlocking the generated code. That implies that we can't present the LL/SC loop to the register allocator as its constituent instructions, since it might insert spills anywhere. Hence we must present it as a single indivisible unit, as we do here. It also has the benefit of reducing the total amount of work the RA has to do. The only other notable feature of the r-m-w and c-a-s translations into AArch64 code, is that they both need a scratch register internally. Rather than faking one up by claiming, in `get_regs` that it modifies an extra scratch register, and having to have a dummy initialisation of it, these new instructions (`::LLSC` and `::CAS`) simply use fixed registers in the range x24-x28. We rely on the RA's ability to coalesce V<-->R copies to make the cost of the resulting extra copies zero or almost zero. x24-x28 are chosen so as to be call-clobbered, hence their use is less likely to interfere with long live ranges that span calls. One subtlety regarding the use of completely fixed input and output registers is that we must be careful how the surrounding copy from/to of the arg/result registers is done. In particular, it is not safe to simply emit copies in some arbitrary order if one of the arg registers is a real reg. For that reason, the arguments are first moved into virtual regs if they are not already there, using a new method `<LowerCtx for Lower>::ensure_in_vreg`. Again, we rely on coalescing to turn them into no-ops in the common case. 
There is also a ridealong fix for the AArch64 lowering case for `Opcode::Trapif | Opcode::Trapff`, which removes a bug in which two trap insns in a row were generated. In the patch as submitted there are 6 "FIXME JRS" comments, which mark things which I believe to be correct, but for which I would appreciate a second opinion. Unless otherwise directed, I will remove them for the final commit but leave the associated code/comments unchanged.	2020-08-04 09:35:50 +02:00
Alex Crichton	026fb8d388	Don't re-parse wasm for debuginfo (#2085) * Don't re-parse wasm for debuginfo This commit updates debuginfo parsing to happen during the main translation of the original wasm module. This avoids parsing the wasm module twice (at least the section-level headers). Additionally this ties debuginfo directly to a `ModuleTranslation` which makes it easier to process debuginfo for nested modules in the upcoming module linking proposal. The changes here are summarized by taking the `read_debuginfo` function and merging it with the main module translation that happens which is driven by cranelift. Some new hooks were added to the module environment trait to support this, but most of it was integrating with existing hooks. * Fix tests in debug crate	2020-08-03 09:59:20 -05:00
Yury Delendik	b2551bb4d0	Make wasmtime_environ::Module serializable (#2005) * Define WasmType/WasmFuncType in the Cranelift * Make `Module` serializable	2020-07-10 15:56:43 -05:00
Nick Fitzgerald	8c5f59c0cf	wasmtime: Implement `table.get` and `table.set` These instructions have fast, inline JIT paths for the common cases, and only call out to host VM functions for the slow paths. This required some changes to `cranelift-wasm`'s `FuncEnvironment`: instead of taking a `FuncCursor` to insert an instruction sequence within the current basic block, `FuncEnvironment::translate_table_{get,set}` now take a `&mut FunctionBuilder` so that they can create whole new basic blocks. This is necessary for implementing GC read/write barriers that involve branching (e.g. checking for null, or whether a store buffer is at capacity). Furthermore, it required that the `load`, `load_complex`, and `store` instructions handle loading and storing through an `r{32,64}` rather than just `i{32,64}` addresses. This involved making `r{32,64}` types acceptable instantiations of the `iAddr` type variable, plus a few new instruction encodings. Part of #929	2020-06-30 12:00:57 -07:00
Nick Fitzgerald	ddc2ce8080	cranelift-wasm: Make `FuncEnvironment::translate_ref_func` take a `FuncIndex` It was previously taking a raw `u32`. This change makes it more clear what index space that index points into.	2020-06-23 16:36:10 -07:00
Nick Fitzgerald	03165e0cb5	cranelift-wasm: Allow more customization of ref type representations * Allow different Cranelift IR types to be used for different Wasm reference types. * Do not assume that all Wasm reference types are always a Cranelift IR reference type. For example, `funcref`s might not need GC in some implementations, and can therefore be represented with a pointer rather than a reference type.	2020-06-23 16:36:10 -07:00
Nick Fitzgerald	28fccaedc4	cranelift-wasm: Pass `ir::Table`s into all the `translate_table_*` methods This serves two purposes: 1. It ensures that we call `get_or_create_table` to ensure that the embedder already had a chance to create the given table (although this is mostly redundant due to validation). 2. It allows the embedder to easily get the `ir::TableData` associated with this table, and more easily emit whatever inline JIT code to translate the table instruction (rather than falling back to VM calls).	2020-06-23 16:36:10 -07:00
Nick Fitzgerald	acf8ad0df7	cranelift_wasm: expose the original Wasm function signature In the `ModuleEnvironment::declare_signature` callback, also pass the original Wasm function signature, so that consumers may associate this information with each compiled function. This is often necessary because while each Wasm signature gets compiled down into a single native signature, multiple Wasm signatures might compile down into the same native signature, and in these cases the original Wasm signature is required for dynamic type checking of calls.	2020-06-01 14:53:10 -07:00
Nick Fitzgerald	01a92aef95	cranelift_wasm: Use the `TableIndex` type instead of raw `u32` About half of the `FuncEnvironment::translate_table_*` methods were using the `TableIndex` newtype, while the other half were using raw `u32`s. This commit makes everything use `TableIndex`.	2020-06-01 14:53:10 -07:00
Alex Crichton	65e32b3660	Store module name on `wasmtime_environ::Module` (#1309) * Store module name on `wasmtime_environ::Module` This keeps all name information in one place so we don't have to keep extra structures around in `wasmtime::Module`. * rustfmt	2020-03-13 17:51:10 -05:00
Alex Crichton	8597930eed	rename PassiveElemIndex to ElemIndex and same for PassiveDataIndex (#1188) * rename PassiveElemIndex to ElemIndex and same for PassiveDataIndex (#1411) * rename PassiveDataIndex to DataIndex * rename PassiveElemIndex to ElemIndex * Apply renamings to wasmtime as well * Run rustfmt Co-authored-by: csmoe <csmoe@msn.com>	2020-03-02 08:55:25 -06:00
Nick Fitzgerald	9b3ac10ebc	wasm: Add support for passive data and element segments (#1389) This is part of the bulk memory and reference types proposals.	2020-02-15 14:53:32 -08:00
Ryan Hunt	41f225804b	Wasm: Allow environment to translate some global.set/get operations Spidermonkey will need to emit pre/post barriers for global.set/get to a reference type. #1176 and #1299 plan to add a template concept that could be used to implement this. Once that has been stabilized, we should be able to remove this code in favor of templates easily.	2020-01-23 13:37:11 -06:00
Ryan Hunt	f41bf5ecca	Wasm: Use environment to translate reference types instructions and add support for multiple tables This commit introduces environment functions to handle the translation of reference type instructions, analogous to how bulk-memory was implemented. Additionally, the bulk-memory instructions that operate on tables are extended to support multiple table indices.	2020-01-23 13:37:11 -06:00
Dan Gohman	1d504ecf6d	Correctly count the number of wasm parameters. (#1337) * Correctly count the number of wasm parameters. Following up on #1329, this further replaces `num_normal_params` with a function which calls `is_wasm_parameter` to correctly count the number of wasm parameters a function has. * Move is_wasm_parameter's implementation into the trait.	2020-01-14 11:42:22 -08:00
Dan Gohman	d765677fcc	Add a `is_wasm_parameter` method to the wasm `FuncEnvironment`. (#1329) This provides a more flexible way to allow embedding to tell cranelift-wasm which function parameters are hidden, and which should be translated as wasm user variables. This replaces https://github.com/bytecodealliance/cranelift/pull/1086.	2020-01-10 04:40:25 -08:00
Yury Delendik	2c51341888	Add wasm reference/pointers translation. (#1073)	2019-12-06 17:46:03 -06:00
Ryan Hunt	aabf6559a0	Add hooks for implementing bulk-memory-operations (#1258)	2019-12-06 16:13:53 +01:00
Benjamin Bouvier	9080a02e10	Replace CraneStation by bytecodealliance everywhere; (#1221)	2019-11-12 10:09:31 -08:00
Nick Fitzgerald	ca53090f1b	cranelift-wasm: Create `ModuleTranslationState` and polish API a little (#1111) * cranelift-wasm: replace `WasmTypesMap` with `ModuleTranslationState` The `ModuleTranslationState` contains information decoded from the Wasm module that must be referenced during each Wasm function's translation. This is only for data that is maintained by `cranelift-wasm` itself, as opposed to being maintained by the embedder. Data that is maintained by the embedder is represented with `ModuleEnvironment`. A `ModuleTranslationState` is returned by `translate_module`, and can then be used when translating functions from that module. * cranelift-wasm: rename `TranslationState` to `FuncTranslationState` To disambiguate a bit with the new `ModuleTranslationState`. * cranelift-wasm: Reorganize the internal `state` module into submodules One module for the `ModuleTranslationState` and another for the `FuncTranslationState`. * cranelift-wasm: replace `FuncTranslator` with methods on `ModuleTranslationState` `FuncTranslator` was two methods that always took ownership of `self`, so it didn't really make sense as an object as opposed to two different functions, or in this case methods on the object that actually persists for a longer time. I think this improves ergonomics nicely. Before: ```rust let module_translation = translate_module(...)?; for body in func_bodies { let mut translator = FuncTranslator::new(); translator.translate(body, ...)?; } ``` After: ```rust let module_translation = translate_module(...)?; for body in func_bodies { module_translation.translate_func(body, ...)?; } ``` Note that this commit does not remove `FuncTranslator`. It still exists, but is just a wrapper over the `ModuleTranslationState` methods, and it is marked deprecated, so that downstream users get a heads up. This should make the transition easier. 
* Revert "cranelift-wasm: replace `FuncTranslator` with methods on `ModuleTranslationState`" This reverts commit 075f9ae933bcaae39348b61287c8f78a4009340d.	2019-10-11 12:37:17 -07:00
Nick Fitzgerald	10be3e4ba8	cranelift-wasm: support multi-value Wasm (#1049) This commit introduces initial support for multi-value Wasm. Wasm blocks and calls can now take and return an arbitrary number of values. The encoding for multi-value blocks means that we need to keep the contents of the "Types" section around when translating function bodies. To do this, we introduce a `WasmTypesMap` type that maps the type indices to their parameters and returns, construct it when parsing the "Types" section, and shepherd it through a bunch of functions and methods when translating function bodies.	2019-10-02 12:40:35 -07:00
Erin Power	5426e42a27	Revert "Remove FunctionBuilderContext from API, and change FunctionBuilder API" This reverts commit 39e638af99dbe6537bc935bfb1a74669b62877b3.	2019-09-17 08:58:46 +02:00
data-pup	ac2ca6116b	allow module environment to parse name section	2019-09-10 11:30:54 +02:00
Aaron Power	8fd1128990	Remove FunctionBuilderContext from API, and change FunctionBuilder API	2019-09-07 14:43:07 -07:00
Adam C. Foltzer	73670aab43	Return a `WasmResult` from `ModuleEnvironment` methods (#886) * [wasm] return a WasmResult from `declare_table_elements` This method in particular needs to accommodate failure because any table index other than zero is currently invalid. * [wasm] additional failure handling improvements - Adds `WasmResult<()>` as the return type for most of the `ModuleEnvironment` methods that previously returned nothing. - Replaces some panics with `WasmError::Unsupported` now that the methods can return a result. - Adds a `wasm_unsupported!()` macro for early returns with a formatted unsupported message.	2019-08-07 13:23:32 -07:00
Dan Gohman	da1baf7481	Use `try_from` instead of the cast crate. Now that `try_from` is in stable Rust, we can use it here.	2019-06-03 12:40:58 +02:00
Yury Delendik	8f95c51730	Reconstruct locations of the original source variable	2019-05-09 00:35:44 -07:00
Benjamin Bouvier	02e114cf3d	[wasm] Make FuncEnvironment functions fallible (fixes #752);	2019-04-30 13:58:18 +02:00
Yury Delendik	27b0933a4a	Preserve global wasm module offset in SourceLoc.	2019-03-05 14:51:40 +01:00
lazypassion	747ad3c4c5	moved crates in lib/ to src/, renamed crates, modified some files' text (#660)	2019-01-28 15:56:54 -08:00
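
Commits 01a92aef95 and ddc2ce8080 above replace raw `u32` arguments with the `TableIndex` and `FuncIndex` newtypes so that each index is tied to one index space. A minimal sketch of that pattern follows. The struct definitions and the two `describe_*` helpers are hypothetical stand-ins written for illustration; they are not cranelift-wasm's actual definitions.

```rust
/// Hypothetical newtype for an index into the table index space.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct TableIndex(u32);

/// Hypothetical newtype for an index into the function index space.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub struct FuncIndex(u32);

impl TableIndex {
    pub fn from_u32(raw: u32) -> Self {
        TableIndex(raw)
    }
    pub fn as_u32(self) -> u32 {
        self.0
    }
}

impl FuncIndex {
    pub fn from_u32(raw: u32) -> Self {
        FuncIndex(raw)
    }
    pub fn as_u32(self) -> u32 {
        self.0
    }
}

/// Illustrative helper: accepts only a function index.
/// Passing a `TableIndex` here is a compile-time type error,
/// which a raw `u32` parameter could not catch.
pub fn describe_ref_func(func: FuncIndex) -> String {
    format!("ref.func {}", func.as_u32())
}

/// Illustrative helper: accepts only a table index.
pub fn describe_table_get(table: TableIndex) -> String {
    format!("table.get {}", table.as_u32())
}
```

With raw `u32` parameters, swapping a table index for a function index compiles silently; with the newtypes, the mix-up is rejected by the type checker.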

31 Commits
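
Commit 73670aab43 above changes `ModuleEnvironment` methods to return a `WasmResult`, replaces some panics with `WasmError::Unsupported`, and adds a `wasm_unsupported!()` macro for early returns. The sketch below shows one way such a pattern could look. It is a hypothetical re-creation under stated assumptions: the enum has a single variant, the macro body is a guess at the described behaviour, and the free function `declare_table_elements` with its signature is invented for illustration. None of it is copied from cranelift-wasm.

```rust
/// Hypothetical error type with a single variant; the real crate's
/// error type may carry more variants and different payloads.
#[derive(Debug, PartialEq)]
pub enum WasmError {
    Unsupported(String),
}

/// Result alias in the style the commit message describes.
pub type WasmResult<T> = Result<T, WasmError>;

/// Assumed shape of an "early return with a formatted message" macro.
/// It expands to a `return Err(...)` in the calling function.
macro_rules! wasm_unsupported {
    ($($arg:tt)*) => {
        return Err(WasmError::Unsupported(format!($($arg)*)))
    };
}

/// Illustrative function: rejects any table index other than zero
/// with an error instead of panicking, and otherwise reports how
/// many elements were declared.
pub fn declare_table_elements(table_index: u32, elements: &[u32]) -> WasmResult<usize> {
    if table_index != 0 {
        wasm_unsupported!("table index {} is not supported", table_index);
    }
    Ok(elements.len())
}
```

Returning an error here lets an embedder report an unsupported module to its caller, whereas a panic would abort translation outright.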