wasmtime

Author	SHA1	Message	Date
Kasey Carrothers	7bd96c8e2f	Refactor x64::Insts that use an is_64 bool to use OperandSize.	2021-02-03 10:40:11 -08:00
Kasey Carrothers	3306408100	Refactor x64::Inst to use OperandSize instead of u8s. TODO: some types take a 'is_64_bit' bool. Those are left unchanged for now.	2021-02-03 10:40:11 -08:00
Kasey Carrothers	b12d41bfe9	Expand x64 OperandSize to support 8 and 16-bit operands. This is in preparation for refactoring all x64::Inst arms to use OperandSize. Current uses of OperandSize fall into two categories: 1. XMM operations which require 32/64 bit operands 2. Immediates which only care about 64-bit or not. Adds assertions to existing Inst constructors to check that they are passed valid sizes. This change also removes the implicit widening of 1 and 2 byte values to 4 bytes. from_bytes() is only used by category 2, so removing this behavior will not change any visible behavior. Overall this change should be a no-op.	2021-02-03 10:40:11 -08:00
bjorn3	76d615049d	Make the stackslot offsets available for debuginfo	2021-02-03 17:48:52 +01:00
bjorn3	81b4e48f9f	Remove some uses of riscv in tests (#2600) * Remove some uses of riscv in tests * Fix typo * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Benjamin Bouvier <public@benj.me>	2021-01-30 23:54:48 +01:00
Benjamin Bouvier	13027ad670	cranelift x64: add instruction set checks for popcnt/tzcnt/lzcnt;	2021-01-30 13:38:55 +01:00
Benjamin Bouvier	2275519cb1	cranelift x64: use the POPCNT instruction for Popcount when it's available;	2021-01-29 19:41:01 +01:00
Benjamin Bouvier	6bf6612d96	cranelift x64: use the TZCNT instruction for Ctz when it's available;	2021-01-29 19:41:01 +01:00
Benjamin Bouvier	d3acd9a283	cranelift x64: use the LZCNT instruction for Clz when it's available;	2021-01-29 19:41:01 +01:00
Chris Fallin	b1b078e2bc	Merge pull request #2621 from kaseyc/nop Replace MachInst::gen_zero_len_nop with gen_nop(0)	2021-01-29 09:01:36 -08:00
Alex Crichton	0e41861662	Implement limiting WebAssembly execution with fuel (#2611) * Consume fuel during function execution This commit adds codegen infrastructure necessary to instrument wasm code to consume fuel as it executes. Currently nothing is really done with the fuel, but that'll come in later commits. The focus of this commit is to implement the codegen infrastructure necessary to consume fuel and account for fuel consumed correctly. * Periodically check remaining fuel in wasm JIT code This commit enables wasm code to periodically check to see if fuel has run out. When fuel runs out an intrinsic is called which can do what it needs to do in the result of fuel running out. For now a trap is thrown to have at least some semantics in synchronous stores, but another planned use for this feature is for asynchronous stores to periodically yield back to the host based on fuel running out. Checks for remaining fuel happen in the same locations as interrupt checks, which is to say the start of the function as well as loop headers. * Improve codegen by caching `const VMInterrupts` The location of the shared interrupt value and fuel value is through a double-indirection on the vmctx (load through the vmctx and then load through that pointer). The second pointer in this chain, however, never changes, so we can alter codegen to account for this and remove some extraneous load instructions and hopefully reduce some register pressure even maybe. Add tests fuel can abort infinite loops * More fuzzing with fuel Use fuel to time out modules in addition to time, using fuzz input to figure out which. * Update docs on trapping instructions * Fix doc links * Fix a fuzz test * Change setting fuel to adding fuel * Fix a doc link * Squelch some rustdoc warnings	2021-01-29 08:57:17 -06:00
Amanieu d'Antras	78f312799e	Optimize EntityList::extend and add EntityList::from_iter	2021-01-29 14:09:52 +01:00
Kasey Carrothers	99be82c866	Replace MachInst::gen_zero_len_nop with gen_nop(0)	2021-01-29 01:15:08 -08:00
Chris Fallin	ac60ad6c9a	Merge pull request #2614 from kaseyc/nop Avoid creating 0-sized nops in x64's gen_nop().	2021-01-28 21:37:39 -08:00
Kasey Carrothers	f76a9d436e	Clean up handling of NOPs in the x64 backend. 1. Restricts max nop size to 15 instead of 16. 2. Fixes an edge case where gen_nop() would return a zero-sized instruction on multiples of 16. 3. Clarifies the documentation of the gen_nop interface to state that returning zero is allowed when preferred_size is zero.	2021-01-28 20:45:00 -08:00
Johnnie Birch	cbd7a6a80e	Add sse41 lowering for rounding x64	2021-01-28 17:37:17 -08:00
Alex Crichton	7f840870c7	cranelift-native: Use libstd feature detection (#2607) This commit switches cranelift-native to using the `is_x86_feature_detected!` macro in the standard library instead of the `raw-cpuid` crate.	2021-01-26 16:42:11 -06:00
Alex Crichton	503129ad91	Add a method to share `Config` across machines (#2608) With `Module::{serialize,deserialize}` it should be possible to share wasmtime modules across machines or CPUs. Serialization, however, embeds a hash of all configuration values, including cranelift compilation settings. By default wasmtime's selection of the native ISA would enable ISA flags according to CPU features available on the host, but the same CPU features may not be available across two machines. This commit adds a `Config::cranelift_clear_cpu_flags` method which allows clearing the target-specific ISA flags that are automatically inferred by default for the native CPU. Options can then be incrementally built back up as desired with the `cranelift_other_flag` method.	2021-01-26 15:59:12 -06:00
omar	79649a15f6	Update README.md	2021-01-25 15:29:51 -08:00
Kasey Carrothers	c6c5fe48b6	Add i128.icmp run tests for the x64 backend.	2021-01-25 13:02:21 -08:00
Kasey Carrothers	c55c5e0506	Add additional tests for icmp-i128. Fixes #1136. Tests added: * eq with nonzero values * gt with nonzero values * ge with nonzero values	2021-01-25 13:02:20 -08:00
Chris Fallin	3c5416446c	Fix cargo-deny issue with raw-cpuid advisory. cargo-deny tells us that we should upgrade raw-cpuid to v9.0.0. This new version also seems to lack the `nightly` feature (perhaps it has been incorporated into the base functionality) so I had to remove this feature selector to build.	2021-01-25 08:32:06 -08:00
Chris Fallin	f54d0d05c7	Address review comments.	2021-01-22 16:02:29 -08:00
Chris Fallin	7e12abce71	Fix a few comment typos and add a clarifying comment.	2021-01-21 16:01:46 -08:00
Chris Fallin	997fab55d5	Skip value-label analysis if no value labels are present.	2021-01-21 15:59:52 -08:00
Chris Fallin	c84d6be6f4	Detailed debug-info (DWARF) support in new backends (initially x64). This PR propagates "value labels" all the way from CLIF to DWARF metadata on the emitted machine code. The key idea is as follows: - Translate value-label metadata on the input into "value_label" pseudo-instructions when lowering into VCode. These pseudo-instructions take a register as input, denote a value label, and semantically are like a "move into value label" -- i.e., they update the current value (as seen by debugging tools) of the given local. These pseudo-instructions emit no machine code. - Perform a dataflow analysis at the machine-code level, tracking value-labels that propagate into registers and into [SP+constant] stack storage. This is a forward dataflow fixpoint analysis where each storage location can contain a set of value labels, and each value label can reside in a set of storage locations. (Meet function is pairwise intersection by storage location.) This analysis traces value labels symbolically through loads and stores and reg-to-reg moves, so it will naturally handle spills and reloads without knowing anything special about them. - When this analysis converges, we have, at each machine-code offset, a mapping from value labels to some number of storage locations; for each offset for each label, we choose the best location (prefer registers). Note that we can choose any location, as the symbolic dataflow analysis is sound and guarantees that the value at the value_label instruction propagates to all of the named locations. - Then we can convert this mapping into a format that the DWARF generation code (wasmtime's debug crate) can use. This PR also adds the new-backend variant to the gdb tests on CI.	2021-01-21 15:59:49 -08:00
Alex Crichton	4a351ab7fe	Update a number of dependencies (#2594) This commit goes through the dependencies that wasmtime has and updates versions where possible. This notably brings in a wasmparser/wast update which has some simd spec changes with new instructions. Otherwise most of these are just routine updates.	2021-01-21 15:49:13 -06:00
bjorn3	81d248c057	Implement Mach-O TLS access for x64 newBE	2021-01-21 18:25:56 +01:00
Anton Kirilov	043a8434d2	Cranelift AArch64: Improve the Popcnt implementation Now the backend uses the CNT instruction, which results into a major simplification. Copyright (c) 2021, Arm Limited.	2021-01-19 16:49:47 +00:00
Chris Fallin	c7de8f5efb	Merge pull request #2541 from cfallin/struct-arg-ret x64 and aarch64: allow StructArgument and StructReturn args.	2021-01-17 23:50:19 -08:00
Chris Fallin	456561f431	x64 and aarch64: allow StructArgument and StructReturn args. The StructReturn ABI is fairly simple at the codegen/isel level: we only need to take care to return the sret pointer as one of the return values if that wasn't specified in the initial function signature. Struct arguments are a little more complex. A struct argument is stored as a chunk of memory in the stack-args space. However, the CLIF semantics are slightly special: on the caller side, the parameter passed in is a pointer to an arbitrary memory block, and we must memcpy this data to the on-stack struct-argument; and on the callee side, we provide a pointer to the passed-in struct-argument as the CLIF block param value. This is necessary to support various ABIs other than Wasm, such as that of Rust (with the cg_clif codegen backend).	2021-01-17 23:11:45 -08:00
Chris Fallin	0f563f786a	Add ELF TLS support in new x64 backend. This follows the implementation in the legacy x86 backend, including hardcoded sequence that is compatible with what the linker expects. We could potentially do better here, but it is likely not necessary. Thanks to @bjorn3 for a bugfix to an earlier version of this.	2021-01-17 22:48:51 -08:00
Peter Huene	8640025d8b	Merge pull request #2585 from alexcrichton/module-linking-update Update support for the module linking proposal	2021-01-14 15:48:14 -08:00
Chris Fallin	71ead6e31d	x64 backend: implement 128-bit ops and misc fixes. This implements all of the ops on I128 that are implemented by the legacy x86 backend, and includes all that are required by at least one major use-case (cg_clif rustc backend). The sequences are open-coded where necessary; for e.g. the bit operations, this can be somewhat complex, but these sequences have been tested carefully. This PR also includes a drive-by fix of clz/ctz for 8- and 16-bit cases where they were incorrect previously. Also includes ridealong fixes developed while bringing up cg_clif support, because they are difficult to completely separate due to other refactors that occurred in this PR: - fix REX prefix logic for some 8-bit instructions. When using an 8-bit register in 64-bit mode on x86-64, the REX prefix semantics are somewhat subtle: without the REX prefix, register numbers 4--7 correspond to the second-to-lowest byte of the first four registers (AH, CH, BH, DH), whereas with the REX prefix, these register numbers correspond to the usual encoding (SPL, BPL, SIL, DIL). We could always emit a REX byte for instructions with 8-bit cases (this is harmless even if unneeded), but this would unnecessarily inflate code size; instead, the usual approach is to emit it only for these registers. This logic was present in some cases but missing for some other instructions: divide, not, negate, shifts. Fixes #2508. - avoid unaligned SSE loads on some f64 ops. The implementations of several FP ops, such as fabs/fneg, used SSE instructions. This is not a problem per-se, except that load-op merging did not take alignment into account. Specifically, if an op on an f64 loaded from memory happened to merge that load, and the instruction into which it was merged was an SSE instruction, then the SSE instruction imposes stricter (128-bit) alignment requirements than the load.f64 did. This PR simply forces any instruction lowerings that could use SSE instructions to implement non-SIMD operations to take inputs in registers only, and avoid load-op merging. Fixes #2507. - two bugfixes exposed by cg_clif: urem/srem.i8, select.b1. - urem/srem.i8: the 8-bit form of the DIV instruction on x86-64 places the remainder in AH, not RDX, different from all the other width-forms of this instruction. - select.b1: we were not recognizing selects of boolean values as integer-typed operations, so we were generating XMM moves instead (!).	2021-01-14 13:45:50 -08:00
Alex Crichton	703762c49e	Update support for the module linking proposal This commit updates the various tooling used by wasmtime which has new updates to the module linking proposal. This is done primarily to sync with WebAssembly/module-linking#26. The main change implemented here is that wasmtime now supports creating instances from a set of values, not just from instantiating a module. Additionally subtyping handling of modules with respect to imports is now properly handled by desugaring two-level imports to imports of instances. A number of small refactorings are included here as well, but most of them are in accordance with the changes to `wasmparser` and the updated binary format for module linking.	2021-01-14 10:37:39 -08:00
Johnnie Birch	d17815a239	Zero newly allocated registers whose immediate use depends on content not being NaN An intermittent failure during SIMD spectests is described in #2432. This patch corrects code written in a way that assumes comparing fp equality of a register with itself will always return true. This is not true when the register value is NaN: in this case, as with all ordered comparisons involving NaN, the comparison always returns false. This patch corrects that assumption for SIMD Fabs and Fneg which seem to be the only instructions generating the failure with #2432.	2021-01-13 19:44:00 -08:00
Chris Fallin	4638de673c	x64 bugfix: prevent load-op fusion of cmp because it could be emitted multiple times. On x64, the new backend generates `cmp` instructions at their use-sites when possible (when the icmp that generates a boolean is known) so that the condition flows directly through flags rather than a materialized boolean. E.g., both `bint` (boolean to int) and `select` (conditional select) instruction lowerings invoke `emit_cmp()` to do so. Load-op fusion in `emit_cmp()` nominally allowed `cmp` to use its `cmp reg, mem` form. However, the mergeable-load condition (load has only single use) was not adequately checked. Consider the sequence: ``` v2 = load.i64 v1 v3 = icmp eq v0, v2 v4 = bint.i64 v3 v5 = select.i64 v3, v0, v1 ``` The load `v2` is only used in the `icmp` at `v3`. However, the cmp will be separately codegen'd twice, once for the `bint` and once for the `select`. Prior to this fix, the above example would result in the load at `v2` sinking to the `cmp` just above the `select`; we then emit another `cmp` for the `bint`, but the load has already been used once so we do not allow merging. We thus (i) expect the register for `v2` to contain the loaded value, but (ii) skip the codegen for the load because it has been sunk. This results in a regalloc error (unexpected livein) as the unfilled register is upward-exposed to the entry point. Because of this, we need to accept only the reg, reg form in `emit_cmp()` (and the FP equivalent). We could get marginally better code by tracking whether the `cmp` we are emitting comes from an `icmp`/`fcmp` with only one use; but IMHO simplicity is a better rule here when subtle interactions occur.	2021-01-13 09:48:51 -08:00
Chris Fallin	7ed7c088a4	Merge pull request #2564 from cfallin/load-coalesce-bug machinst lowering: update inst color when scanning across branch to allow more load-op merging.	2021-01-11 12:06:29 -08:00
Chris Fallin	b4426be072	machinst lowering: update inst color when scanning across branch to allow more load-op merging. A branch is considered side-effecting and so updates the instruction color (which is our way of computing how far instructions can sink). However, in the lowering loop, we did not update current instruction color when scanning backward across branches, which are side-effecting. As a result, the color was stale and fewer load-op merges were permitted than are actually possible. Note that this would not have resulted in any correctness issues, as the stale color is too high (so no merges are permitted that should have been disallowed). Fixes #2562.	2021-01-11 11:20:44 -08:00
Nick Fitzgerald	5ce6e009fc	Add Cargo.toml metadata to `peepmatic-test-operator` crate	2021-01-11 10:46:00 -08:00
Julian Seward	07652ca0d4	wasm->CLIF: fn translate_operator: Select/TypedSelect: add missing bitcasts The translation of Operator::Select and Operator::TypedSelect for vector-typed operands, lacks the relevant bitcasting of the operands to I8X16. This commit adds it.	2021-01-11 11:59:05 +01:00
Andrew Brown	2adb0e8964	security: upgrade smallvec to 1.6.1 Fixes advisory https://rustsec.org/advisories/RUSTSEC-2021-0003.	2021-01-08 16:54:54 -08:00
Andrew Brown	b25a3c387e	fix: `dst` should be `Writable`, not `ValueRegs`	2021-01-08 16:49:28 -08:00
Andrew Brown	09a5b91b9d	x64: make several structures debuggable	2021-01-08 16:21:57 -08:00
Andrew Brown	bb2dd5b68b	[machinst x64]: implement load*_zero for x64	2021-01-08 16:21:57 -08:00
Chris Fallin	81bc811236	Merge pull request #2558 from cfallin/pic-symbol-refs x64: support PC-rel symbol references using the GOT when in PIC mode.	2021-01-08 10:03:10 -08:00
Yury Delendik	3580205f12	[Cranelift][Atomics] Add address folding for atomic notify/wait. (#2556) * fold address in wasm wait and notify ops * add atomics addr folding tests	2021-01-08 11:55:21 -06:00
Chris Fallin	3ee898cb2c	x64: support PC-rel symbol references using the GOT when in PIC mode.	2021-01-07 22:46:56 -08:00
Nick Fitzgerald	5ad82de3c5	Bump Wasmtime to 0.22.0; Cranelift to 0.69.0	2021-01-07 14:51:12 -08:00
Chris Fallin	6eea015d6c	Multi-register value support: framework for Values wider than machine regs. This will allow for support for `I128` values everywhere, and `I64` values on 32-bit targets (e.g., ARM32 and x86-32). It does not alter the machine backends to build such support; it just adds the framework for the MachInst backends to reason about a `Value` residing in more than one register.	2021-01-05 17:45:02 -08:00
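
The REX-prefix rule described in commit 71ead6e31d can be sketched in a few lines of Rust. This is a minimal illustration under stated assumptions, not the code from that commit: the `OperandSize` enum below is a local stand-in, and the function name `needs_rex_for_8bit` is hypothetical. It covers only the 8-bit special case from the commit message (register encodings 4 to 7); registers 8 to 15 need a REX prefix for a different reason and are not modelled here.

```rust
/// Operand width. A local stand-in for illustration only; it is not
/// Cranelift's own type.
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
pub enum OperandSize {
    Size8,
    Size16,
    Size32,
    Size64,
}

/// Hypothetical helper: returns true when an otherwise optional REX prefix
/// must be emitted for an 8-bit operand.
///
/// Without REX, hardware encodings 4..=7 select the high-byte registers
/// (AH, CH, DH, BH). With REX, the same encodings select SPL, BPL, SIL, DIL.
pub fn needs_rex_for_8bit(size: OperandSize, reg_enc: u8) -> bool {
    size == OperandSize::Size8 && (4..=7).contains(&reg_enc)
}
```

Emitting the prefix only when this predicate holds matches the trade-off stated in the commit message: always emitting REX would be harmless but would inflate code size.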

2926 Commits
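
The nop-size rules from commit f76a9d436e (maximum nop size of 15 bytes, no zero-sized nop unless the preferred size is zero) can be illustrated with a short Rust sketch. The names `MAX_NOP_LEN`, `next_nop_len`, and `nop_lens` are hypothetical and chosen for this example; the actual `gen_nop()` builds instructions rather than returning lengths, so treat this as a model of the sizing contract only.

```rust
/// Largest single nop, in bytes. 15 is the maximum length of an x86-64
/// instruction, which is why the limit is 15 rather than 16.
const MAX_NOP_LEN: usize = 15;

/// Hypothetical model of the sizing contract: the length of the next nop for
/// a given preferred size. Returns zero only when `preferred_size` is zero.
pub fn next_nop_len(preferred_size: usize) -> usize {
    preferred_size.min(MAX_NOP_LEN)
}

/// Splits `padding` bytes into a sequence of nop lengths, each in 1..=15.
pub fn nop_lens(mut padding: usize) -> Vec<usize> {
    let mut lens = Vec::new();
    while padding > 0 {
        let len = next_nop_len(padding);
        lens.push(len);
        // `len` is at least 1 here, so the loop always makes progress.
        padding -= len;
    }
    lens
}
```

Under this model a request for 16 bytes of padding yields a 15-byte nop followed by a 1-byte nop, so the multiples-of-16 edge case mentioned in the commit never produces a zero-sized instruction.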