wasmtime

Author	SHA1	Message	Date
Chris Fallin	0f725a3c5c	ARM64 backend, part 5 / 11: ABI implementation. This patch provides an ARM64 implementation of the ABI-related traits required by the new backend infrasturcture. It will be used by the lowering code, when that is in place in a subsequent patch. This patch contains code written by Julian Seward <jseward@acm.org> and Benjamin Bouvier <public@benj.me>, originally developed on a side-branch before rebasing and condensing into this patch series. See the `arm64` branch at `https://github.com/cfallin/wasmtime` for original development history. This patch also contains code written by Joey Gouly <joey.gouly@arm.com> and contributed to the above branch. These contributions are "Copyright (c) 2020, Arm Limited." Co-authored-by: Julian Seward <jseward@acm.org> Co-authored-by: Benjamin Bouvier <public@benj.me> Co-authored-by: Joey Gouly <joey.gouly@arm.com>	2020-04-11 17:52:11 -07:00
Chris Fallin	548ce947bf	ARM64 backend, part 4 / 11: ARM64 instruction definitions. This patch provides the bottom layer of the ARM64 backend: it defines the `Inst` type, which represents a single machine instruction, and defines emission routines to produce machine code from a `VCode` container of `Insts`. The backend cannot produce `Inst`s with just this patch; that will come with later parts. This patch contains code written by Julian Seward <jseward@acm.org> and Benjamin Bouvier <public@benj.me>, originally developed on a side-branch before rebasing and condensing into this patch series. See the `arm64` branch at `https://github.com/cfallin/wasmtime` for original development history. This patch also contains code written by Joey Gouly <joey.gouly@arm.com> and contributed to the above branch. These contributions are "Copyright (c) 2020, Arm Limited." Finally, a contribution from Joey Gouly contains the following notice: This is a port of VIXL's Assembler::IsImmLogical. Arm has the original copyright on the VIXL code this was ported from and is relicensing it under Apache 2 for Cranelift. Co-authored-by: Julian Seward <jseward@acm.org> Co-authored-by: Benjamin Bouvier <public@benj.me> Co-authored-by: Joey Gouly <joey.gouly@arm.com>	2020-04-11 17:51:45 -07:00
Chris Fallin	d83574261c	ARM64 backend, part 3 / 11: MachInst infrastructure. This patch adds the MachInst, or Machine Instruction, infrastructure. This is the machine-independent portion of the new backend design. It contains the implementation of the "vcode" (virtual-registerized code) container, the top-level lowering algorithm and compilation pipeline, and the trait definitions that the machine backends will fill in. This backend infrastructure is included in the compilation of the `codegen` crate, but it is not yet tied into the public APIs; that patch will come last, after all the other pieces are filled in. This patch contains code written by Julian Seward <jseward@acm.org> and Benjamin Bouvier <public@benj.me>, originally developed on a side-branch before rebasing and condensing into this patch series. See the `arm64` branch at `https://github.com/cfallin/wasmtime` for original development history. Co-authored-by: Julian Seward <jseward@acm.org> Co-authored-by: Benjamin Bouvier <public@benj.me>	2020-04-11 17:51:11 -07:00
Chris Fallin	f80fe949c6	ARM64 backend, part 2 / 11: remove old ARM64 backend. This removes the old ARM64 backend completely, leaving only an empty `arm64` module. The tree at this state will not build with the `arm64` feature enabled, but that feature has to be enabled explicitly (it is not default). Subsequent patches will fill in the new backend.	2020-04-11 17:51:06 -07:00
Chris Fallin	875d2758b1	ARM64 backend, part 1 / 11: misc changes to existing code. - Add a `simple_legalize()` function that invokes a predetermined set of legalizations, without depending on the details of the current backend design. This will be used by the new backend pipeline. - Separate out `has_side_effect()` from the DCE pass. This will be used by the new backends' lowering code. - Add documentation for the `Arm64Call` relocation type.	2020-04-11 17:50:51 -07:00
iximeow	4cca510085	Windows FPRs preservation (#1216 ) Preserve FPRs as required by the Windows fastcall calling convention. This exposes an implementation limit due to Cranelift's approach to stack layout, which conflicts with expectations Windows makes in SEH layout - functions where the Cranelift user desires fastcall unwind information, that require preservation of an ABI-reserved FPR, that have a stack frame 240 bytes or larger, now produce an error when compiled. Several wasm spectests were disabled because they would trip this limit. This is a temporary constraint that should be fixed promptly. Co-authored-by: bjorn3 <bjorn3@users.noreply.github.com>	2020-04-10 13:27:20 -07:00
Andrew Brown	6fd0451bc3	Add TargetIsa::map_dwarf_register; fixes #1471 This exposes the functionality of `fde::map_reg` on the `TargetIsa` trait, avoiding compilation errors on architectures where register mapping is not yet supported. The change is conditially compiled under the `unwind` feature.	2020-04-09 09:45:20 -07:00
Alex Crichton	c4e90f729c	wasmtime: Pass around more contexts instead of fields (#1486 ) * wasmtime: Pass around more contexts instead of fields This commit refactors some wasmtime internals to pass around more context-style structures rather than individual fields of each structure. The intention here is to make the addition of fields to a structure easier to plumb throughout the internals of wasmtime. Currently you need to edit lots of functions to pass lots of parameters, but ideally after this you'll only need to edit one or two struct fields and then relevant locations have access to the information already. Updates in this commit are: * `debug_info` configuration is now folded into `Tunables`. Additionally a `wasmtime::Config` now holds a `Tunables` directly and is passed into an internal `Compiler`. Eventually this should allow for direct configuration of the `Tunables` attributes from the `wasmtime` API, but no new configuration is exposed at this time. * `ModuleTranslation` is now passed around as a whole rather than passing individual components to allow access to all the fields, including `Tunables`. This was motivated by investigating what it would take to optionally allow loops and such to get interrupted, but that sort of codegen setting was currently relatively difficult to plumb all the way through and now it's hoped to be largely just an addition to `Tunables`. * Fix lightbeam compile	2020-04-08 19:02:49 -05:00
Benjamin Bouvier	6a68130d5b	cranelift codegen: add a supplementary method add_call_site to CodeSink; This allows keeping track of indirect call sites, for instance.	2020-04-07 11:52:39 +02:00
Benjamin Bouvier	f4c4a84b84	cranelift codegen: pass source locations with external relocations;	2020-04-07 11:52:39 +02:00
Andrew Brown	a799f9f6b5	Skip extra work when calculating sizes for recipes with inferred REX prefixes As explained in the added documentation and #1342, if we prevent `infer_rex()` and `w()` from being used together then we don't need to check whether the W bit is set when calculating the size of a recipe. This should improve compile time for x86 very slightly since all `infer_rex()` instructions will no longer need this check.	2020-04-02 16:50:07 -07:00
Andrew Brown	a4c1147045	Skip extra work when inferring REX prefixes As explained in the added documentation and #1342, if we prevent `infer_rex()` and `w()` from being used together then we don't need to check whether the W bit is set when figuring out if a REX prefix is needed in `needs_rex()`. This should improve compile time for x86 very slightly since all `infer_rex()` instructions will no longer need this check.	2020-04-02 16:50:07 -07:00
Andrew Brown	e425bfcebd	Infer REX prefixes for SIMD load and store with displacement	2020-04-02 11:28:42 -07:00
Andrew Brown	dc874a5b3b	Infer REX prefixes for SIMD load_extend	2020-04-02 11:28:42 -07:00
Johnnie Birch	9e11e8d019	Remove unnecessary braces around assigned value in simple_preopt Removes unnecessary braces that is causing compilation failure in a test scenario.	2020-04-02 15:54:19 +02:00
Andrew Brown	d3df275003	Remove duplication of map_reg; fixes #1245 Both cranelift-codegen and wasmtime-debug need to map Cranelift registers to Gimli registers. Previously both crates had an almost-identical `map_reg` implementation. This change: - removes the wasmtime-debug implementation - improves the cranelift-codegen implementation with custom errors - exposes map_reg in `cranelift_codegen::isa::fde::map_reg` and subsequently `wasmtime_environ::isa::fde::map_reg`	2020-03-31 15:42:02 -07:00
Benjamin Bouvier	1d5a678124	Fixes #1240 : Add a new accessor to indicate that an opcode requires spilling all registers;	2020-03-23 12:19:28 +01:00
Andrew Brown	0d63bd12d8	Infer REX prefix for SIMD operations; fixes #1127 - Convert recipes to have necessary size calculator - Add a missing binemit function, `put_dynrexmp3` - Modify the meta-encodings of x86 SIMD instructions to use `infer_rex()`, mostly through the `enc_both_inferred()` helper - Fix up tests that previously always emitted a REX prefix	2020-03-18 10:12:50 -07:00
Andrew Brown	8598295bc4	Remove FPR32; fixes #1303 Until #1306 is resolved (some spilling/regalloc issue with larger FPR register banks), this removes FPR32 support. Only Wasm's `i64x2.mul` was using this register class and that instruction is predicated on AVX512 support; for the time being, that instruction will have to make do with the 16 FPR registers.	2020-03-17 12:46:41 -07:00
Yury Delendik	f76b36f737	Write .debug_frame information (#53 ) * Write .debug_frame information * mv map_reg	2020-03-11 10:22:51 -05:00
Andrew Brown	965714d675	Add encoding functions for emitting EVEX formats Only the `reg, vvvv, rm` form is currently supported but it should not be difficult to add more forms.	2020-03-06 10:53:22 -08:00
Andrew Brown	079fcafcb1	Expand x86 registers to include 32 XMM registers The EVEX encoding format (e.g. in AVX-512) allows addressing 32 registers instead of 16. The FPR register class currently defines 16 registers, `%xmm0`-`%xmm15`; that class is kept as-is with this change. A larger class, FPR32, is added as a super-class of FPR using a larger bank of registers, `%xmm0`-`%xmm31`.	2020-03-06 10:53:22 -08:00
Andrew Brown	1d15054310	Remove the debug crate's hard-coded dependency on register ordering	2020-03-06 10:53:22 -08:00
Andrew Brown	3f53bcb740	Remove dependency on hard-coded ordering of x86 register banks With this change, register banks can now be re-ordered and other components (e.g. unwinding, regalloc) will no longer break. The previous behavior assumed that GPR registers always started at `RegUnit` 0.	2020-03-06 10:53:22 -08:00
Alex Crichton	3179dcf6f1	Update Cranelift's documentation after the merger. (#1238 ) Update the documentation for the merger, and also for various changes in Cranelift. Remove some old obsolete documentation, and convert the remaining Sphinx files to Markdown. Some of the remaining content is still out of date, but this is a step forward.	2020-03-05 15:51:12 -06:00
Ryan Hunt	4aa8776a9b	Skip non-branching blocks now that we're using basic blocks This is a rebase of [1]. In the long term, we'll want to simplify these analysis passes. For now, this is simple and will reduce the number of instructions processed in certain cases. [1] https://github.com/bytecodealliance/cranelift/pull/866	2020-03-05 16:11:13 +01:00
Ryan Hunt	07f335dca6	Rename 'an block' to 'a block' Missed this in the automatic rename of 'Ebb' to 'Block'.	2020-03-03 13:21:13 -06:00
bjorn3	0a1bb3ba6c	Add TLS support for ELF and MachO (#1174 ) * Add TLS support * Add binemit and legalize tests * Spill all caller-saved registers when necessary	2020-02-25 17:50:04 -08:00
Andrew Brown	1a9dc743d1	Infer REX prefix for SIMD `load` instruction	2020-02-19 09:24:05 -08:00
Andrew Brown	936120dcf9	Infer REX prefix for SIMD `store` and `vconst` instructions	2020-02-19 09:24:05 -08:00
Peter Delevoryas	18b40d1101	Add ineg legalization for scalar integer types (#1385 )	2020-02-14 13:16:02 -08:00
Y-Nak	58e5a62cde	Fix inverted result of is_leaf method	2020-02-13 11:02:22 +01:00
Ryan Hunt	832666c45e	Mass rename Ebb and relatives to Block (#1365 ) * Manually rename BasicBlock to BlockPredecessor BasicBlock is a pair of (Ebb, Inst) that is used to represent the basic block subcomponent of an Ebb that is a predecessor to an Ebb. Eventually we will be able to remove this struct, but for now it makes sense to give it a non-conflicting name so that we can start to transition Ebb to represent a basic block. I have not updated any comments that refer to BasicBlock, as eventually we will remove BlockPredecessor and replace with Block, which is a basic block, so the comments will become correct. * Manually rename SSABuilder block types to avoid conflict SSABuilder has its own Block and BlockData types. These along with associated identifier will cause conflicts in a later commit, so they are renamed to be more verbose here. * Automatically rename 'Ebb' to 'Block' in .rs Automatically rename 'EBB' to 'block' in .rs Automatically rename 'ebb' to 'block' in .rs Automatically rename 'extended basic block' to 'basic block' in .rs Automatically rename 'an basic block' to 'a basic block' in .rs Manually update comment for `Block` `Block`'s wikipedia article required an update. * Automatically rename 'an `Block`' to 'a `Block`' in .rs Automatically rename 'extended_basic_block' to 'basic_block' in .rs Automatically rename 'ebb' to 'block' in .clif Manually rename clif constant that contains 'ebb' as substring to avoid conflict * Automatically rename filecheck uses of 'EBB' to 'BB' 'regex: EBB' -> 'regex: BB' '$EBB' -> '$BB' * Automatically rename 'EBB' 'Ebb' to 'block' in .clif Automatically rename 'an block' to 'a block' in .clif Fix broken testcase when function name length increases Test function names are limited to 16 characters. This causes the new longer name to be truncated and fail a filecheck test. An outdated comment was also fixed.	2020-02-07 10:46:47 -06:00
Yury Delendik	169dbef784	Properly preserve and restore CFA state in FDE (#1373 ) * Properly preserve and restore CFA state in FDE	2020-02-03 14:08:40 -08:00
Joshua Nelson	5edf015ada	Make `get_libcall_funcref` `pub(crate)` (#1291 ) * Make `get_libcall_funcref` `pub(crate)` Closes https://github.com/bytecodealliance/cranelift/issues/1273. Since get_libcall_funcref is only used internally by the verifier, it doesn't make sense to have it be public. This will encourage users to look elsewhere for `memcpy` (they should be looking at https://docs.rs/cranelift-frontend/0.51.0/cranelift_frontend/struct.FunctionBuilder.html#method.emit_small_memcpy)	2020-01-24 16:43:44 +01:00
Ryan Hunt	c360007b19	Drop 'basic-blocks' feature (#1363 ) * All: Drop 'basic-blocks' feature This makes it so that 'basic-blocks' cannot be disabled and we can start assuming it everywhere. * Tests: Replace non-bb filetests with bb version * Tests: Adapt solver-fixedconflict filetests to use basic blocks	2020-01-23 22:36:06 -07:00
Ryan Hunt	946251e655	Codegen: Align representation of stackmap with SpiderMonkey This commit aligns the representation of stackmaps to be the same as Spidermonkey's by: * Reversing the order of the bitmap from low addresses to high addresses * Including incoming stack arguments * Excluding outgoing stack arguments Additionally, some accessor functions were added to allow Spidermonkey to access the internals of the bitmap.	2020-01-23 13:37:11 -06:00
Ryan Hunt	a15bb9cfcb	Codegen: Use GPR regclass for reference types on x86	2020-01-23 13:37:11 -06:00
Benjamin Bouvier	3125431ece	Address nits from #1325	2020-01-23 09:39:49 +01:00
bjorn3	e1446cff8d	Derive Ord for all entities (#1313 )	2020-01-22 18:18:23 +01:00
Sean Stangl	b4c6bfd371	When splitting a const, insert prior to the terminal branch group. (#1325 ) * When splitting a const, insert prior to the terminal branch group. Closes #1159 Given code like the following, on x86_64, which does not have i128 registers: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 brnz v2, ebb1 jump ebb2(v1) It would be split to: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 brnz v2, ebb1 v3, v4 = isplit.i128 v1 jump ebb2(v3, v4) But that fails basic-block invariants. This patch changes that to: ebb0(v0: i64): v1 = iconst.i128 0 v2 = icmp_imm eq v0, 1 v3, v4 = isplit.i128 v1 brnz v2, ebb1 jump ebb2(v3, v4) * Add isplit-bb.clif testcase	2020-01-22 17:14:41 +01:00
jmkrauz	ae6ba1e58c	Fix narrow_icmp_imm (#1343 )	2020-01-21 15:20:44 +01:00
Andrew Brown	fd04ea2b06	Fix incorrect assertion for `insertlane` (#1355 ) Previously, the assertion checked for `lane > 0` when it should have been `lane >= 0`; since lane is unsigned, this half of the assertion can be entirely removed.	2020-01-17 14:39:31 -08:00
Alex Crichton	1266b68f9a	Use `is_wasm_parameter` in translating wasm calls (#1352 ) * Use `is_wasm_parameter` in translating wasm calls Added in #1329 it's now possible for multiple parameters to be non-wasm parameters, so the previous `param_types` method is no longer suitable for acquiring all wasm-related parameters, rather then `FuncEnvironment` must be consulted. This removes usage of `param_types()` as a method from the wasm translation and instead adds a custom method inline for filtering the parameters based on `is_wasm_parameter`. * Apply feedback * Run rustfmt * Don't require `mut` * Run rustfmt	2020-01-17 12:11:54 -08:00
Andrew Brown	e1d513ab4b	Fix remaining clippy warnings (#1340 ) * clippy: allow complex encoding function * clippy: remove unnecessary main() function in doctest * clippy: remove redundant `Type` suffix on LaneType enum variants * clippy: ignore incorrect debug_assert_with_mut_call warning * clippy: fix FDE clippy warnings	2020-01-17 14:03:30 -06:00
Dan Gohman	1d504ecf6d	Correctly count the number of wasm parameters. (#1337 ) * Correctly count the number of wasm parameters. Following up on #1329, this further replaces `num_normal_params` with a function which calls `is_wasm_parameter` to correctly count the number of wasm parameters a function has. * Move is_wasm_parameter's implementation into the trait.	2020-01-14 11:42:22 -08:00
Benjamin Bouvier	dd497c19e1	Renames Settings ⚠️ (fixes #976 ) (#1321 ) This is a breaking API change: the following settings have been renamed: - jump_tables_enabled -> enable_jump_tables - colocated_libcalls -> use_colocated_libcalls - probestack_enabled -> enable_probestack - allones_funcaddrs -> emit_all_ones_funcaddrs	2020-01-13 14:42:49 -07:00
Yury Delendik	bd88155483	Refactor unwind; add FDE support. (#1320 ) * Refactor unwind * add FDE support * use sink directly in emit functions * pref off all unwinding generation with feature	2020-01-13 10:32:55 -06:00
data-pup	250ea0e5db	Verify that EBBs are non-empty (#1330 ) Co-authored-by: Joshua Nelson <joshua@yottadb.com>	2020-01-10 10:40:27 -08:00
Andrew Brown	e8c3302bc5	Fix some additional clippy warnings	2020-01-10 08:38:40 -08:00

... 6 7 8 9 10 ...

609 Commits