wasmtime

Author	SHA1	Message	Date
MaxGraey	305659427d	changes according review	2020-07-07 01:55:18 +03:00
Peter Huene	3a33749404	Remove 'set frame pointer' unwind code from Windows x64 unwind. This commit removes the "set frame pointer" unwind code and frame pointer information from Windows x64 unwind information. In Windows x64 unwind information, a "frame pointer" is actually the base address of the static part of the local frame and would be at some negative offset to RSP upon establishing the frame pointer. Currently Cranelift uses a "traditional" notion of a frame pointer, one that is the highest address in the local frame (i.e. pointing at the previous frame pointer on the stack). Windows x64 unwind doesn't describe such frame pointers and only needs one described if the frame contains a dynamic stack allocation. Fixes #1967.	2020-07-06 14:22:57 -07:00
MaxGraey	2048278d2a	typo	2020-07-03 21:45:40 +03:00
Joey Gouly	3a67d25ed6	arm64: Implement SIMD shift instructions Copyright (c) 2020, Arm Limited.	2020-07-03 16:47:43 +01:00
Benjamin Bouvier	9d5be00de4	Address review comments - put the division in the synthetic instruction as well, - put the branch table check in the inst's emission code, - replace OneWayCondJmp by TrapIf vcode instruction, - add comments describing code generated by the synthetic instructions	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	2115e70acb	machinst x64: implement enough to support branch tables;	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	f86ecdcb86	machinst x64: lower and implement div/idiv; ADD TESTS	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	da30527368	machinst x64: implement one way conditional jmp	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	9d1bcfb2e8	machinst x64: implement cmov	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	bd88065571	machinst x64: lower binary ALU operators;	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	faeed3c7a4	machinst x64: implement spills and reloads	2020-07-03 14:33:52 +02:00
Benjamin Bouvier	08efcbd9d5	machinst x64: use movss for f32 loads;	2020-07-03 11:19:32 +02:00
MaxGraey	d21ca7f2a3	fixed added fits-in-native-word contrain	2020-07-03 11:46:37 +03:00
MaxGraey	1fcb215895	trailing new line	2020-07-03 11:03:03 +03:00
MaxGraey	1033cf5748	use (isub 1 $C) instead (iadd -1 $C)	2020-07-03 11:01:35 +03:00
MaxGraey	d839d83f7e	add reminder by a power of two peephole rule	2020-07-03 09:44:59 +03:00
Chris Fallin	b700646c93	Merge pull request #1962 from cfallin/aarch64-lowering-condbr AArch64: avoid branches with explicit offsets at lowering stage.	2020-07-02 14:05:40 -07:00
Chris Fallin	9bc0f3eceb	Merge pull request #1966 from jgouly/simd-arith arm64: Implement basic SIMD arithmetic	2020-07-02 11:07:14 -07:00
Chris Fallin	b7ecad1d74	AArch64: avoid branches with explicit offsets at lowering stage. In discussions with @bnjbvr, it came up that generating `OneWayCondBr`s with explicit, hardcoded PC-offsets as part of lowered instruction sequences is actually unsafe, because the register allocator might insert a spill or reload into the middle of our sequence. We were careful about this in some cases but somehow missed that it was a general restriction. Conceptually, all inter-instruction references should be via labels at the VCode level; explicit offsets are only ever known at emission time, and resolved by the `MachBuffer`. To allow for conditional trap checks without modifying the CFG (as seen by regalloc) during lowering, this PR instead adds a `TrapIf` pseudo-instruction that conditionally skips a single embedded trap instruction. It lowers to the same `condbr label ; trap ; label: ...` sequence, but without the hardcoded branch-target offset in the lowering code.	2020-07-02 11:02:27 -07:00
Andrew Brown	057c93b64e	Add `unarrow` instruction with x86 implementation Adds a shared `unarrow` instruction in order to lower the Wasm SIMD specification's unsigned narrowing (see https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#integer-to-integer-narrowing). Additionally, this commit implements the instruction for x86 using PACKUSWB and PACKUSDW for the applicable encodings.	2020-07-02 09:35:45 -07:00
Andrew Brown	65e6de2344	Replace `x86_packss` with `snarrow` Since the Wasm specification contains narrowing instructions (see https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#integer-to-integer-narrowing) that lower to PACKSS*, the x86-specific instruction is not necessary in the CLIF IR.	2020-07-02 09:35:45 -07:00
Joey Gouly	62e7b7f838	arm64: Implement basic SIMD arithmetic Copyright (c) 2020, Arm Limited.	2020-07-02 13:17:33 +01:00
Johnnie Birch	0aa56c500c	Add x64 lowering of Clif flt load instruction for new backend Adds support for the clif flt load instruction.	2020-07-01 17:48:33 -07:00
Johnnie Birch	f2dd1535d5	Add x64 lowering of Clif flt store instruction for new backend Adds support for the clif flt store instruction.	2020-07-01 14:54:59 -07:00
Chris Fallin	0a59a321bd	Merge pull request #1954 from cfallin/b1649432 AArch64: fix shift ops: mask shift amount.	2020-07-01 09:33:29 -07:00
Chris Fallin	a351fa52b5	Merge pull request #1930 from cfallin/spectre-heap Spectre mitigation on heap access overflow checks.	2020-07-01 09:23:04 -07:00
Chris Fallin	533f1c8d8b	Aarch64: fix shift ops: mask shift amount. The failure to mask the amount triggered a panic due to a subtraction overflow check; see https://bugzilla.mozilla.org/show_bug.cgi?id=1649432. Attempting to shift by an out-of-range amount should be defined to shift by an amount mod the operand size (i.e., masked to 5 bits for 32-bit shifts, or 6 bits for 64-bit shifts).	2020-07-01 08:57:56 -07:00
Chris Fallin	e694fb1312	Spectre mitigation on heap access overflow checks. This PR adds a conditional move following a heap bounds check through which the address to be accessed flows. This conditional move ensures that even if the branch is mispredicted (access is actually out of bounds, but speculation goes down in-bounds path), the acually accessed address is zero (a NULL pointer) rather than the out-of-bounds address. The mitigation is controlled by a flag that is off by default, but can be set by the embedding. Note that in order to turn it on by default, we would need to add conditional-move support to the current x86 backend; this does not appear to be present. Once the deprecated backend is removed in favor of the new backend, IMHO we should turn this flag on by default. Note that the mitigation is unneccessary when we use the "huge heap" technique on 64-bit systems, in which we allocate a range of virtual address space such that no 32-bit offset can reach other data. Hence, this only affects small-heap configurations.	2020-07-01 08:36:09 -07:00
Benjamin Bouvier	de9fbfa095	machinst x64: correctly assign FP registers for incoming args; Fixes #1943. Thanks to @jlb6740 for noticing the issue and @bjorn3 for catching the error!	2020-07-01 15:00:43 +02:00
Andrew Brown	26bdf9c333	Add `iabs` instructions.rs This is necessary for implementing SIMD [abs](https://github.com/WebAssembly/simd/blob/master/proposals/simd/SIMD.md#lane-wise-integer-absolute-value).	2020-06-30 14:00:17 -07:00
Nick Fitzgerald	8c5f59c0cf	wasmtime: Implement `table.get` and `table.set` These instructions have fast, inline JIT paths for the common cases, and only call out to host VM functions for the slow paths. This required some changes to `cranelift-wasm`'s `FuncEnvironment`: instead of taking a `FuncCursor` to insert an instruction sequence within the current basic block, `FuncEnvironment::translate_table_{get,set}` now take a `&mut FunctionBuilder` so that they can create whole new basic blocks. This is necessary for implementing GC read/write barriers that involve branching (e.g. checking for null, or whether a store buffer is at capacity). Furthermore, it required that the `load`, `load_complex`, and `store` instructions handle loading and storing through an `r{32,64}` rather than just `i{32,64}` addresses. This involved making `r{32,64}` types acceptable instantiations of the `iAddr` type variable, plus a few new instruction encodings. Part of #929	2020-06-30 12:00:57 -07:00
Alex Crichton	0acd2072c2	Fix doc warnings and link failures (#1948 ) Also add configuration to CI to fail doc generation if any links are broken. Unfortunately we can't blanket deny all warnings in rustdoc since some are unconditional warnings, but for now this is hopefully good enough. Closes #1947	2020-06-30 13:01:49 -05:00
Declan Kelly	a3feb8a412	Implement customer [de]serialization for `IntegerInterner` Added some basic tests that the representation of the interner is only the `values` sequence, and that it round-trips successfully.	2020-06-30 01:21:27 -07:00
Andrew Brown	c9d573d841	Provide spec-compliant legalization for SIMD floating point min/max	2020-06-25 14:48:16 -07:00
Alex Crichton	a92a31d850	Rename the `master` branch to `main` (#1924 ) * This PR is against a branch called `main` * Internally all docs/CI/etc is updated * The default branch of the repo is now `main` * All active PRs have been updated to retarget `main` Closes #1914	2020-06-25 14:03:21 -05:00
Benjamin Bouvier	c9a3f05afd	machinst x64: implement calls and int cmp/store/loads; This makes it possible to run a simple recursive fibonacci function in wasmtime.	2020-06-25 16:20:33 +02:00
Johnnie Birch	2d364f75bd	Remove xmm_r_r inst data structure and cases after related refactoring Removes unneeded data structure that was holding instructions for xmm based move instructions. These instructions can should be categorized as rm not just r. This change is intended to simplify organization and cases when lowering.	2020-06-25 14:31:51 +02:00
Johnnie Birch	f2f7706265	Implements vcode lowering for f32.copysign. This patch implements the required but not already available x64 instructions for copysign as well as the actual lowering sequence and tests for the newly implemented x64 instructions. Those instructions include: andps, andnps, movaps, movd, orps, The lowering sequence is based on the lowering for f32.copysign in the current cranelift backend. movd does not have a test yet due to some logic needed express a 32-bit register as a source for xmm_rm_r instructions. This code also begins some rethinking/refactoring of how the sse move instuctions are written and so also includes new emit cases that will replace current ones that match a different enum used to describe sse moves.	2020-06-24 11:47:26 -07:00
Chris Fallin	492000e945	MachInst isel and aarch64 backend: docs / clarity improvements. From discussion with Julian and Ben, this PR makes a few documentation- and naming-level changes (no functionality change): - Document that the `LowerCtx`-provided output register can be used as a scratch register during the lowered instruction sequence before placing the final result in it. - Rename `input_to_` helpers in the AArch64 backend to `put_input_in_`, emphasizing that these are side-effecting helpers that potentially generate code (e.g., sign/zero-extensions) to ensure an input value is in a register.	2020-06-18 12:18:50 -07:00
Chris Fallin	f84903fa43	Merge pull request #1892 from akirilov-arm/simd_fp_cmp AArch64: Implement SIMD floating-point comparisons	2020-06-18 12:17:09 -07:00
Andrew Brown	3675f95bb2	Legalize fcvt_to_sint_sat.i32x4 on x86 Use a lengthy sequence involving CVTTPS2DQ to quiet NaNs and saturate overflow.	2020-06-18 11:39:38 -07:00
Anton Kirilov	90bafae1dc	AArch64: Implement SIMD floating-point comparisons Copyright (c) 2020, Arm Limited.	2020-06-18 11:07:52 +01:00
Johnnie Birch	043571fee0	Adds f32.mul, f32.div for vcode backend for x64. Adds support for lowering clif instructions Fdiv and Fmul for new vcode backend. Misc adds lowering and test for sqrtss and removes a redundant to_string() func for the SseOpcode struct.	2020-06-17 17:19:57 -07:00
Chris Fallin	4dd1363581	Merge pull request #1890 from jgouly/simd-bool arm64: Implement AllTrue and AnyTrue	2020-06-17 09:38:10 -07:00
Benjamin Bouvier	c2692ecb8a	Wasmtime: allow using the experimental Cranelift x64 backend in cli; This introduces two changes: - first, a Cargo feature is added to make it possible to use the Cranelift x64 backend directly from wasmtime's CLI. - second, when passing a `cranelift-flags` parameter, and the given parameter's name doesn't exist at the target-independent flag level, try to set it as a target-dependent setting. These two changes make it possible to try out the new x64 backend with: cargo run --features experimental_x64 -- run --cranelift-flags use_new_backend=true -- /path/to/a.wasm Right now, this will fail because most opcodes required by the trampolines are actually not implemented yet.	2020-06-17 17:18:46 +02:00
Joey Gouly	0f462330e0	arm64: Implement AllTrue and AnyTrue This enables the simd_boolean WASM SIMD spec test. Copyright (c) 2020, Arm Limited.	2020-06-17 15:40:51 +01:00
Nick Fitzgerald	f30ce1fe97	externref: implement stack map-based garbage collection For host VM code, we use plain reference counting, where cloning increments the reference count, and dropping decrements it. We can avoid many of the on-stack increment/decrement operations that typically plague the performance of reference counting via Rust's ownership and borrowing system. Moving a `VMExternRef` avoids mutating its reference count, and borrowing it either avoids the reference count increment or delays it until if/when the `VMExternRef` is cloned. When passing a `VMExternRef` into compiled Wasm code, we don't want to do reference count mutations for every compiled `local.{get,set}`, nor for every function call. Therefore, we use a variation of deferred reference counting, where we only mutate reference counts when storing `VMExternRef`s somewhere that outlives the activation: into a global or table. Simultaneously, we over-approximate the set of `VMExternRef`s that are inside Wasm function activations. Periodically, we walk the stack at GC safe points, and use stack map information to precisely identify the set of `VMExternRef`s inside Wasm activations. Then we take the difference between this precise set and our over-approximation, and decrement the reference count for each of the `VMExternRef`s that are in our over-approximation but not in the precise set. Finally, the over-approximation is replaced with the precise set. The `VMExternRefActivationsTable` implements the over-approximized set of `VMExternRef`s referenced by Wasm activations. Calling a Wasm function and passing it a `VMExternRef` moves the `VMExternRef` into the table, and the compiled Wasm function logically "borrows" the `VMExternRef` from the table. Similarly, `global.get` and `table.get` operations clone the gotten `VMExternRef` into the `VMExternRefActivationsTable` and then "borrow" the reference out of the table. When a `VMExternRef` is returned to host code from a Wasm function, the host increments the reference count (because the reference is logically "borrowed" from the `VMExternRefActivationsTable` and the reference count from the table will be dropped at the next GC). For more general information on deferred reference counting, see An Examination of Deferred Reference Counting and Cycle Detection by Quinane: https://openresearch-repository.anu.edu.au/bitstream/1885/42030/2/hon-thesis.pdf cc #929 Fixes #1804	2020-06-15 09:39:37 -07:00
Benjamin Bouvier	357fb11f46	Review comments;	2020-06-15 16:39:08 +02:00
Benjamin Bouvier	28c40ba0f7	machinst x64: refactor lowering too;	2020-06-15 16:39:08 +02:00
Benjamin Bouvier	48fb9291bc	machinst x64: refactor REX prefix emission;	2020-06-15 16:39:08 +02:00

... 24 25 26 27 28 ...

1724 Commits