In existing MachInst backends, many instructions -- any that can trap or
result in a relocation -- carry `SourceLoc` values in order to propagate
the original-source location used to describe any resulting traps or
relocation errors.
This is tedious and error-prone: the necessary plumbing is likely to be
missed in some cases, and in any case it is unnecessarily verbose.
This PR factors out the `SourceLoc` handling so that it is tracked
during emission as part of the `EmitState`, and plumbed through
automatically by the machine-independent framework. Instruction emission
code that directly emits trap or relocation records can query the
current location as necessary. Then we only need to ensure that memory
references and trap instructions, at their (one) emission point rather
than their (many) lowering/generation points, are wired up correctly.
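As an illustration, here is a minimal sketch of the shape of this idea (the type and method names are illustrative, not Cranelift's actual API): the emission state carries the current location, the framework updates it before each instruction, and emission code queries it when recording a trap or relocation.

```rust
/// Illustrative stand-in for Cranelift's source-location type.
#[derive(Clone, Copy, Default)]
struct SourceLoc(u32);

/// Sketch of per-emission state that tracks the current source location.
#[derive(Default)]
struct EmitState {
    cur_srcloc: SourceLoc,
}

impl EmitState {
    /// The machine-independent framework calls this before emitting each
    /// instruction, so no per-instruction plumbing is needed.
    fn pre_emit(&mut self, srcloc: SourceLoc) {
        self.cur_srcloc = srcloc;
    }

    /// Emission code for a trapping memory access or a relocation queries
    /// the current location when it records its metadata.
    fn cur_srcloc(&self) -> SourceLoc {
        self.cur_srcloc
    }
}
```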
This does have the side-effect that some loads and stores that do not
correspond directly to user code's heap accesses will have unnecessary
but harmless trap metadata. For example, the load that fetches a code
offset from a jump table will have a 'heap out of bounds' trap record
attached to it; but because it is bounds-checked, and will never
actually trap if the lowering is correct, this should be harmless. The
simplicity improvement here seemed more worthwhile to me than plumbing
through a "corresponds to user-level load/store" bit, because the latter
is a bit complex when we allow for op merging.
Closes #2290: though it does not implement a full "metadata" scheme as
described in that issue, this seems simpler overall.
This approach incurs some compile-time memory bloat, a consequence of de-duplicating the emitted constants in order to reduce runtime memory size. As a first step, though, it provides an end-to-end mechanism for emitting constants into MachBuffer islands.
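To make the compile-time cost concrete, here is a minimal sketch (hypothetical names, not the actual MachBuffer API) of the de-duplicating map whose retained keys account for that memory: each distinct constant is stored once, and repeated uses share a single island slot.

```rust
use std::collections::HashMap;

/// Illustrative label type; the real buffer has its own label machinery.
#[derive(Clone, Copy, PartialEq, Eq, Hash, Debug)]
struct Label(u32);

#[derive(Default)]
struct ConstantPool {
    dedup: HashMap<Vec<u8>, Label>, // constant bytes -> island slot
    next: u32,
}

impl ConstantPool {
    fn label_for(&mut self, data: &[u8]) -> Label {
        if let Some(&label) = self.dedup.get(data) {
            return label; // already registered; reuse the existing slot
        }
        let label = Label(self.next);
        self.next += 1;
        self.dedup.insert(data.to_vec(), label);
        label
    }
}
```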
In order to register traps for `load_splat`, several instruction formats need knowledge of `SourceLoc`s; however, since the x64 backend does not correctly and completely register traps for `RegMem::Mem` variants, I opened https://github.com/bytecodealliance/wasmtime/issues/2290 to discuss and resolve this issue. In the meantime, the current behavior (i.e. remaining largely unaware of `SourceLoc`s) is retained.
A new associated type `Info` is added to `MachInstEmit`, as the
immutable counterpart to `State`. It can't easily be constructed from an
`ABICallee`, since that would require adding an associated type to the
latter, and doing so leaks the associated type into many places in the
code base, making the code harder to read. Instead, the `EmitInfo`
state can simply be passed to the `VCode::emit` function directly.
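Schematically (simplified signatures, not the exact trait in the code base), the split looks like this:

```rust
/// Sketch of the emission trait with the new associated type.
trait MachInstEmit {
    /// Immutable, per-function configuration, constructed once by the
    /// caller and passed into emission directly.
    type Info;
    /// Mutable state threaded through emission (e.g. the current
    /// `SourceLoc`).
    type State: Default;

    fn emit(&self, sink: &mut Vec<u8>, info: &Self::Info, state: &mut Self::State);
}
```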
As found by @julian-seward1, movss/movsd aren't included in the
zero-latency move instructions section of the Intel optimization manual.
Use MOVAPS instead for those moves.
This PR updates the AArch64 ABI implementation so that it properly
respects the fact that v8-v15 inclusive have callee-save lower halves
and caller-save upper halves, by conservatively approximating (to full
registers) in the appropriate directions when generating prologue saves
and when informing the regalloc of clobbered regs across callsites.
In order to prevent saving all of these vector registers in the prologue
of every non-leaf function due to the above approximation, this also
makes use of a new regalloc.rs feature to exclude call instructions'
writes from the clobber set returned by register allocation. This is
safe whenever the caller and callee have the same ABI (because anything
the callee could clobber, the caller is allowed to clobber as well
without saving it in the prologue).
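A small sketch of the resulting clobber computation (illustrative names, with a plain `HashSet` standing in for regalloc.rs's set types):

```rust
use std::collections::HashSet;

type RealReg = u8; // stand-in for the allocator's real-register type

fn function_clobbers(
    all_writes: &HashSet<RealReg>,            // every reg written in the body
    written_only_by_calls: &HashSet<RealReg>, // regs written solely at callsites
    caller_callee_same_abi: bool,
) -> HashSet<RealReg> {
    if caller_callee_same_abi {
        // Anything the callee may clobber, our own caller tolerates too,
        // so callsite writes need not force prologue saves.
        all_writes.difference(written_only_by_calls).copied().collect()
    } else {
        all_writes.clone()
    }
}
```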
Fixes #2254.
This approach is not the best but avoids an extra instruction; perhaps at some point, as mentioned in https://github.com/bytecodealliance/wasmtime/pull/2248, we will add the extra instruction or refactor things in such a way that this `Inst` variant is unnecessary.
In particular:
- try to optimize the integer emission into a 32-bit emission when the
high bits are all zero, and stop relying on the caller of `imm_r` to
ensure this (see the sketch after this list).
- rename `Inst::imm_r`/`Inst::Imm_R` to `Inst::imm`/`Inst::Imm`.
- generate a sign-extending mov of a 32-bit immediate into a 64-bit
register whenever possible.
- fix a few places where the previous commit introduced the generation
of zero-constants with xor when calling `put_input_to_reg`, thus
clobbering the flags before they were read.
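A sketch of the immediate-size selection behind the first and third items (the enum and helper are hypothetical; the real code constructs `Inst::Imm` variants):

```rust
/// Hypothetical encoding choices for materializing a 64-bit constant.
#[derive(Debug, PartialEq)]
enum ImmKind {
    Mov32,       // 32-bit mov; zero-extends into the full register
    MovSx32To64, // `mov r64, imm32`; sign-extends a 32-bit immediate
    Mov64,       // full 64-bit immediate (movabs)
}

fn choose_imm_kind(value: u64) -> ImmKind {
    if value <= u32::MAX as u64 {
        // High 32 bits are all zero: the shorter 32-bit form suffices.
        ImmKind::Mov32
    } else if value as i64 == (value as i64 as i32) as i64 {
        // Value is the sign extension of some 32-bit immediate.
        ImmKind::MovSx32To64
    } else {
        ImmKind::Mov64
    }
}
```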
Previously, in #2128, we factored out a common "vanilla 64-bit ABI"
implementation from the AArch64 ABI code, with the idea that this should
be largely compatible with x64. This PR alters the new x64 backend to
make use of the shared infrastructure, removing the duplication that
existed previously. The generated code is nearly (not exactly) the same;
the only difference relates to how the clobber-save region is padded in
the prologue.
This also changes some register allocations in the aarch64 code because
call support in the shared ABI infra now passes a temp vreg in, rather
than requiring use of a fixed, non-allocable temp; tests have been
updated, and the runtime behavior is unchanged.
This adds atomics support to the new x64 backend by implementing the CLIF instructions `AtomicRmw`, `AtomicCas`,
`AtomicLoad`, `AtomicStore` and `Fence`.
The translation is straightforward. `AtomicCas` is translated into x64 `cmpxchg`, `AtomicLoad`
becomes a normal load because x64-TSO provides adequate sequencing, `AtomicStore` becomes a
normal store followed by `mfence`, and `Fence` becomes `mfence`. `AtomicRmw` is the only
complex case: it becomes a normal load, followed by a loop which computes an updated value,
tries to `cmpxchg` it back to memory, and repeats if necessary.
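The `AtomicRmw` loop has the same shape as this Rust sketch using std atomics (the backend emits the corresponding mov/op/`cmpxchg` machine-code loop, of course, rather than calling these APIs):

```rust
use std::sync::atomic::{AtomicU64, Ordering};

/// Sketch of the lowering strategy for an `add` AtomicRmw: a normal load,
/// then compute the updated value, try to cmpxchg it back, and repeat on
/// failure.
fn atomic_rmw_add(mem: &AtomicU64, operand: u64) -> u64 {
    let mut old = mem.load(Ordering::SeqCst);
    loop {
        let new = old.wrapping_add(operand);
        match mem.compare_exchange(old, new, Ordering::SeqCst, Ordering::SeqCst) {
            Ok(prev) => return prev, // cmpxchg succeeded; return the old value
            Err(prev) => old = prev, // lost a race; retry with the fresh value
        }
    }
}
```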
This is a minimum-effort initial implementation. `AtomicRmw` could be implemented more
efficiently using LOCK-prefixed integer read-modify-write instructions in the case where the old
value in memory is not required. Subsequent work could add that, if required.
The x64 emitter has been updated to emit the new instructions, obviously. The `LegacyPrefix`
mechanism has been revised to handle multiple prefix bytes, not just one, since it is now
sometimes necessary to emit both 0x66 (Operand Size Override) and 0xF0 (Lock).
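Sketched with illustrative names, the revised mechanism amounts to tracking each needed prefix separately and emitting every set byte before the opcode:

```rust
/// Sketch of a multi-prefix representation: one flag per legacy prefix
/// the backend currently needs.
#[derive(Clone, Copy, Default)]
struct LegacyPrefixes {
    lock: bool,                  // 0xF0
    operand_size_override: bool, // 0x66
}

impl LegacyPrefixes {
    fn emit(&self, sink: &mut Vec<u8>) {
        if self.lock {
            sink.push(0xF0);
        }
        if self.operand_size_override {
            sink.push(0x66);
        }
    }
}
```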
In the aarch64 implementation of atomics, there has been some minor renaming for the sake of
clarity, and for consistency with this x64 implementation.
This change primarily adds the ability to lower packed `[move|load|store]` instructions (the vector types were previously unimplemented); with the addition of the utility `Inst::[move|load|store]` functions, it also became possible to remove duplicated code (e.g. `stack_load` and `stack_store`) and to use these utility functions elsewhere (though not exhaustively).
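For flavor, here is a self-contained sketch (the enum and dispatch are illustrative, not the real `Inst` API) of the kind of type-driven choice such utilities centralize:

```rust
/// Which concrete x64 load instruction to use, by operand type.
#[derive(Debug, PartialEq)]
enum LoadOp {
    Mov32,  // scalar i32
    Mov64,  // scalar i64
    Movss,  // scalar f32
    Movsd,  // scalar f64
    Movups, // any 128-bit vector
}

fn load_op(bits: u32, is_float: bool, is_vector: bool) -> LoadOp {
    match (is_vector, is_float, bits) {
        (true, _, 128) => LoadOp::Movups,
        (false, false, 32) => LoadOp::Mov32,
        (false, false, 64) => LoadOp::Mov64,
        (false, true, 32) => LoadOp::Movss,
        (false, true, 64) => LoadOp::Movsd,
        _ => unimplemented!("type not yet supported"),
    }
}
```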
This change is a pure refactoring--no change to functionality. It removes `use crate::ir::types::*` imports and instead uses, e.g., `types::I32` throughout the x64 code. Though it increases code verbosity, this change makes it clearer where the type identifiers come from (they are generated by `cranelift-codegen-meta`, so without a prefix it is difficult to find their origin), avoids IDE confusion (e.g. CLion flags the un-prefixed identifiers as errors), and avoids importing unwanted identifiers into the namespace.
This change is a pure refactoring--no change to functionality. It removes newlines between the `use ...` statements in the x64 backend so that rustfmt can format them according to its convention. I noticed some files had followed a manual convention but subsequent additions did not seem to fit; this change fixes that and lightly coalesces some occurrences of `use a::b; use a::c;` into `use a::{b, c};`.
The offset that's loaded out of the jump table could be signed (if it
is an offset to a target before the jump table itself), so we should
use a sign extension there, not a zero extension.
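A two-line illustration of the failure mode, with plain Rust casts standing in for the emitted extension:

```rust
fn main() {
    // A 32-bit jump-table entry whose target lies *before* the table is
    // negative when reinterpreted as signed.
    let entry: u32 = -8i32 as u32;
    let zext = entry as u64;               // zero-extend: 0x0000_0000_ffff_fff8 (wrong)
    let sext = entry as i32 as i64 as u64; // sign-extend: 0xffff_ffff_ffff_fff8 (correct)
    assert_ne!(zext, sext);
}
```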