wasmtime

Author	SHA1	Message	Date
Joey Gouly	eec60c9b06	arm64: Use SignedOffset rather than PreIndexed addressing mode for callee-saved registers This also passes `fixed_frame_storage_size` (previously `total_sp_adjust`) into `gen_clobber_save` so that it can be combined with other stack adjustments. Copyright (c) 2020, Arm Limited.	2020-10-02 16:22:55 +01:00
Chris Fallin	b8f0dc429f	Merge pull request #2223 from cfallin/baldrdash-2020 Support for SpiderMonkey's "Wasm ABI 2020" in general and on AArch64.	2020-09-30 15:33:05 -07:00
Chris Fallin	835db11bea	Support for SpiderMonkey's "Wasm ABI 2020". As part of a Wasm JIT update, SpiderMonkey is changing its internal WebAssembly function ABI. The new ABI's frame format includes "caller TLS" and "callee TLS" slots. The details of where these come from are not important; from Cranelift's point of view, the only relevant requirement is that we have two on-stack args that are always present (offsetting other on-stack args), and that we define special argument purposes so that we can supply values for these slots. Note that this adds a new ABI (a variant of the Baldrdash ABI) because we do not want to tightly couple the landing of this PR to the landing of the changes in SpiderMonkey; it's better if both the old and new behavior remain available in Cranelift, so SpiderMonkey can continue to vendor Cranelift even if it does not land (or backs out) the ABI change. Furthermore, note that this needs to be a Cranelift-level change (i.e. cannot be done purely from the translator environment implementation) because the special TLS arguments must always go on the stack, which would not otherwise happen with the usual argument-placement logic; and there is no primitive to push a value directly in CLIF code (the notion of a stack frame is a lower-level concept).	2020-09-30 14:55:56 -07:00
Andrew Brown	4484a00ea5	[machinst x64]: calculate extension modes in one place	2020-09-29 14:48:59 -07:00
Andrew Brown	715be68101	[machinst x64]: assert lane is correct size for extractlane This change applies a good suggestion @bjorn3 made in #2230 that I forgot to implement there.	2020-09-29 09:34:22 -07:00
Andrew Brown	f50d905152	[machinst x64]: refactor using added RegMem::from(Writable<Reg>)	2020-09-29 08:45:12 -07:00
Andrew Brown	e3eb098c99	[machinst x64]: add swizzle implementation	2020-09-29 08:45:12 -07:00
Andrew Brown	050f078f86	[machinst x64]: add saturating addition implementation	2020-09-29 08:45:12 -07:00
Andrew Brown	a64abf9b76	[machinst x64]: add shuffle implementation	2020-09-29 08:45:12 -07:00
Andrew Brown	f4836f9ca9	[machinst x64]: add extractlane implementation	2020-09-29 08:45:12 -07:00
Andrew Brown	29fa894790	[machinst x64]: add insertlane implementation	2020-09-29 08:45:12 -07:00
Andrew Brown	48cf45491d	[machinst x64]: inform the register allocator of more types of packed moves	2020-09-25 18:59:01 -07:00
Andrew Brown	ac2bf9d246	[machinst x64]: add packed min/max implementations	2020-09-23 15:40:46 -07:00
Andrew Brown	7546d98844	[machinst x64]: add avg_round implementation	2020-09-23 15:40:46 -07:00
Andrew Brown	b202464fa0	[machinst x64]: add iabs implementation	2020-09-23 15:40:46 -07:00
Benjamin Bouvier	79cff73da5	machinst x64: implement loads/stores for v128 SIMD types; This made it possible to enable more SIMD tests from the spec test suite too.	2020-09-23 16:42:03 +02:00
Jakub Krauz	f6a140a662	arm32 codegen This commit adds arm32 code generation for some IR insts. Floating-point instructions are not supported, because regalloc does not allow to represent overlapping register classes, which are needed by VFP/Neon. There is also no support for big-endianness, I64 and I128 types.	2020-09-22 12:49:42 +02:00
Chris Fallin	1c7fa7f785	Merge pull request #2181 from jgouly/madd-opt arm64: Combine mul + add into madd	2020-09-15 11:52:33 -07:00
Joshua Nelson	d28abad441	Upgrade to target-lexicon 0.11 This allows downstream library users to use `CDataModel` without having to install two different versions of target-lexicon.	2020-09-15 11:40:09 -07:00
Johnnie Birch	07d0d32b69	Adds i64x2.mul for the new backend targeting x64	2020-09-11 13:17:42 -07:00
Joey Gouly	22369cfa0d	arm64: Combine mul + add into madd Copyright (c) 2020, Arm Limited.	2020-09-11 18:06:19 +01:00
Benjamin Bouvier	3849dc18b1	machinst x64: revamp integer immediate emission; In particular: - try to optimize the integer emission into a 32-bit emission, when the high bits are all zero, and stop relying on the caller of `imm_r` to ensure this. - rename `Inst::imm_r`/`Inst::Imm_R` to `Inst::imm`/`Inst::Imm`. - generate a sign-extending mov 32-bit immediate to 64-bits, whenever possible. - fix a few places where the previous commit did introduce the generation of zero-constants with xor, when calling `put_input_to_reg`, thus clobbering the flags before they were read.	2020-09-11 18:13:30 +02:00
Benjamin Bouvier	d9052d0a9c	machinst x64: generate copies of constants during lowering;	2020-09-11 17:41:44 +02:00
Benjamin Bouvier	cace32746f	machinst x64: pattern-match addresses that are base+cst index;	2020-09-11 17:41:44 +02:00
Benjamin Bouvier	a1bdf11602	machinst x64: fix gen_store_base_offset for multi-value returns; The previous method assumed that this could be used only for I64 values, but this is actually used for multi-value returns, which can have any type.	2020-09-10 11:17:41 +02:00
Chris Fallin	bd3ba0a774	Merge pull request #2189 from bnjbvr/x64-refactor-sub machinst x64: a few small refactorings/renamings	2020-09-09 12:40:59 -07:00
Benjamin Bouvier	b4a2dd37a4	machinst x64: rename input_to_reg to put_input_to_reg; Eventually, we should be able to unify this function's implementation with the aarch64 one; but the latter does much more, and this would require abstractions brought up in another pending PR#2142.	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	cb96d16ac7	machinst x64: inline helper used only once;	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	7a833f442a	machinst: common up some instruction data helpers;	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	a835c247c0	machinst: make get_output_reg target independent;	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	6a3c4fb54e	machinst x64: rename output_to_reg to get_output_reg;	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	9620ce6bdf	machinst x64: mask shift count too;	2020-09-09 18:03:59 +02:00
Benjamin Bouvier	9c328cc64b	machinst x64: Remove unfinished comment;	2020-09-09 18:03:59 +02:00
Anton Kirilov	f612e8e7b2	AArch64: Add various missing SIMD bits In addition, improve the code for stack pointer manipulation. Copyright (c) 2020, Arm Limited.	2020-09-09 13:37:50 +01:00
Chris Fallin	e8f772c1ac	x64 new backend: port ABI implementation to shared infrastructure with AArch64. Previously, in #2128, we factored out a common "vanilla 64-bit ABI" implementation from the AArch64 ABI code, with the idea that this should be largely compatible with x64. This PR alters the new x64 backend to make use of the shared infrastructure, removing the duplication that existed previously. The generated code is nearly (not exactly) the same; the only difference relates to how the clobber-save region is padded in the prologue. This also changes some register allocations in the aarch64 code because call support in the shared ABI infra now passes a temp vreg in, rather than requiring use of a fixed, non-allocable temp; tests have been updated, and the runtime behavior is unchanged.	2020-09-08 17:59:01 -07:00
Chris Fallin	3d6c4d312f	Merge pull request #2187 from akirilov-arm/ALUOp3 AArch64: Introduce an enum for ternary integer operations	2020-09-08 12:57:59 -07:00
Chris Fallin	e913bcb26a	Merge pull request #2179 from jgouly/mvn arm64: Don't always materialise a 64-bit constant	2020-09-08 09:17:08 -07:00
bjorn3	9428480230	Merge SignExtendAlAh and SignExtendRaxRdx	2020-09-08 15:00:24 +02:00
bjorn3	3dcda164dc	Fix nits	2020-09-08 15:00:24 +02:00
bjorn3	9999913a31	Fix sign extension Co-authored-by: Max Graey <maxgraey@gmail.com>	2020-09-08 15:00:24 +02:00
bjorn3	067255ef45	x64: Implement rotl and rotr for small integers	2020-09-08 15:00:24 +02:00
bjorn3	4251a950ba	x64: Implement ishl, ushr and sshr for small integers	2020-09-08 15:00:24 +02:00
bjorn3	cc35f1e9bb	x64: Misc small integer fixes	2020-09-08 15:00:24 +02:00
bjorn3	ce033f2a0c	x64: Fix udiv and sdiv for 8bit integers	2020-09-08 15:00:24 +02:00
bjorn3	74642b166f	x64: Implement ineg and bnot	2020-09-08 15:00:24 +02:00
Anton Kirilov	e92f949663	AArch64: Introduce an enum for ternary integer operations This commit performs a small cleanup in the AArch64 backend - after the MAdd and MSub variants have been extracted, the ALUOp enum is used purely for binary integer operations. Also, Inst::Mov has been renamed to Inst::Mov64 for consistency. Copyright (c) 2020, Arm Limited.	2020-09-08 13:22:22 +01:00
Johnnie Birch	a64af55cda	Adds x64 packed negation for the new backend	2020-09-07 11:56:05 -07:00
Joey Gouly	650d48cd84	arm64: Don't always materialise a 64-bit constant This improves the mov/movk/movn sequnce when the high half of the 64-bit value is all zero. Copyright (c) 2020, Arm Limited.	2020-09-01 13:29:01 +01:00
Benjamin Bouvier	a7f7c23bf9	machinst aarch64: in baldrdash, allow returning only one value across register classes; Baldrdash's API requires that there is at most one result in a register, across all the possible register classes: in particular, it's not possible to return an i64 value in a register while returning an v128 value in another register. This patch adds a notion of "remaining register values", so this is properly taking into account when choosing whether a return value may be put into a register or not.	2020-08-31 12:36:26 +02:00
Julian Seward	8ac4bd1d0d	CL/newBE/x64: Lowering of scalar shifts: fix shift-by-imm generation The logic for generation of shifts-by-immediate was not quite right. The result was that even shifts by an amount known at compile time were being done by moving the shift immediate into %cl and then doing a variable shift by %cl. The effect is worse than it sounds, because all of those shift constants are small and often used in multiple places, so they were GVN'd up and often ended up at the entry block of the function. Hence these were connected to the use points by long live ranges which got spilled. So all in all, most of the win here comes from avoiding spilling. The problem was caused by this line, in the `Opcode::Ishl \| Opcode::Ushr ..` case: ``` let (count, rhs) = if let Some(cst) = ctx.get_constant(inputs[1].insn) { ``` `inputs[]` appears to refer to this CLIF instruction's inputs, and bizarrely `inputs[].insn` all refer to the instruction (the shift) itself. Hence `ctx.get_constant(inputs[1].insn)` asks "does this shift instruction produce a constant" to which the answer is always "no", so the shift-by-unknown amount code is always generated. The fix here is to change that expression to ``` let (count, rhs) = if let Some(cst) = ctx.get_input(insn, 1).constant { ``` `get_input`'s result conveniently includes a `constant` field of type `Option<u64>`, so we just use that instead.	2020-08-27 11:48:35 +02:00

1 2 3 4 5 ...

410 Commits