wasmtime

Author	SHA1	Message	Date
Saúl Cabrera	0f8393508a	cranelift-codegen: Expose `EmitState` and `EmitInfo` from aarch64 (#5640 ) This commit exposes `EmitState` and `EmitInfo` so that they can be consumed by Winch. This is a follow up to https://github.com/bytecodealliance/wasmtime/pull/5570, in which this should've been included.	2023-01-27 19:36:26 +00:00
Trevor Elliott	058d93bc64	Migrate cranelift-wasm to brif (#5638 ) Incrementally working towards removing brz and brnz completely.	2023-01-27 10:34:41 -08:00
Jamey Sharp	915801551b	Delete old cranelift-preopt crate (#5642 ) Most of these optimizations are in the egraph `cprop.isle` rules now, making a separate crate unnecessary. Also I think the `udiv` optimizations here are straight-up wrong (doing signed instead of unsigned division, and panicking instead of preserving traps on division by zero) so I'm guessing this crate isn't seriously used anywhere. At the least, bjorn3 confirms that cg_clif doesn't use this, and I've verified that Wasmtime doesn't either. Closes #1090.	2023-01-26 21:32:33 +00:00
Trevor Elliott	a181ad2932	Cleanup the use of `maybe_uextend` in the x64 lowerings (#5637 ) Use maybe_uextend for the brnz lowerings on x64.	2023-01-25 17:28:48 -08:00
Trevor Elliott	7926808e8e	riscv64: improve unordered comparison generated code (#5636 ) Improve the generated code for unordered floating point comparisons by negating the comparison and inveritng the branches. This allows us to pick the unordered versions, which generate significantly better code.	2023-01-25 17:28:28 -08:00
Alex Crichton	a7d0d00e57	Update wasm-tools crates (#5631 ) Nothing major pulled in here, but wanted to update to the latest versions which enable tail calls by default. When used in Wasmtime, however, the feature is disabled without the possibility of being enabled since it's not implemented.	2023-01-25 16:33:26 +00:00
Trevor Elliott	b58a197d33	cranelift: Add a conditional branch instruction with two targets (#5446 ) Add a conditional branch instruction with two targets: brif. This instruction will eventually replace brz and brnz, as it encompasses the behavior of both. This PR also changes the InstructionData layout for instruction formats that hold BlockCall values, taking the same approach we use for Value arguments. This allows branch_destination to return a slice to the BlockCall values held in the instruction, rather than requiring that we pattern match on InstructionData to fetch the then/else blocks. Function generation for fuzzing has been updated to generate uses of brif, and I've run the cranelift-fuzzgen target locally for hours without triggering any new failures.	2023-01-24 14:37:16 -08:00
bjorn3	ec6922ff24	Produce an error at runtime rather than at compile time for unsupported architectures in cranelift-native (#5627 )	2023-01-24 19:17:28 +00:00
Jamey Sharp	bfc6aad184	cranelift-isle: codegen from new IR (#5435 ) ISLE's existing code-generation strategy doesn't generate the most efficient matching order for rules. This PR completely replaces it. With this PR applied, wasmtime compile retires 2% fewer instructions on the pulldown-cmark and spidermonkey benchmarks from Sightglass. A dev build of cranelift-codegen from an empty target/ directory takes 2% less time. The build script, invoking ISLE, takes a little longer, but Rust can compile the generated code faster, so it balances out.	2023-01-23 12:27:51 -08:00
Jamey Sharp	fef9f64d2c	x86: Test paired udiv/urem (#5573 ) Ideally these pairs of CLIF instructions should emit a single x86 instruction, but they don't today. This test will tell us if somebody fixes that. Similar tests might make sense for imul/umulhi as well as signed versions, but I haven't tried that.	2023-01-23 11:44:27 -08:00
Afonso Bordado	ae441c50b6	fuzzgen: Reenable some RISCV instructions (#5615 )	2023-01-23 09:03:07 -08:00
Saúl Cabrera	0e6e802c34	docs: Fix typo (#5620 )	2023-01-23 09:05:33 -06:00
yuyang	7e10bd1f58	fix issue #5497 #5524 #5526 . (#5595 ) * fix issue 5497. * fix issue 5524 * fix issue 5497 5524 5526. * some clif change because of reg alloc.	2023-01-20 14:06:26 -08:00
yuyang	299b8187f8	fix issue 5525. (#5603 ) * fix issue 5525. * reg alloc changed.	2023-01-20 09:53:54 -08:00
Chris Fallin	1faff8c2ce	Enable egraph-based optimization by default. (#5587 ) This PR follows up on #5382 and #5391, which rebuilt the egraph-based optimization framework to be more performant, by enabling it by default. Based on performance results in #5382 (my measurements on SpiderMonkey and bjorn3's independent confirmation with cg_clif), it seems that this is reasonable to enable. Now that we have been fuzzing compiler configurations with egraph opts (#5388) for 6 weeks, having fixed a few fuzzbugs that came up (#5409, #5420, #5438) and subsequently received no further reports from OSS-Fuzz, I believe it is stable enough to rely on. This PR enables `use_egraphs`, and also normalizes its meaning: previously it forced optimization (it basically meant "turn on the egraph optimization machinery"), now it runs egraph opts if the opt level indicates (it means "use egraphs to optimize if we are going to optimize"). The conditionals in the top-level pass driver are a little subtle, but will get simpler once we can remove the non-egraph path (which we plan to do eventually!). Fixes #5181.	2023-01-19 15:46:53 -08:00
Chris Fallin	704f5a5772	Cranelift/egraph mid-end: support merging effectful-but-idempotent ops (#5594 ) * Support mergeable-but-side-effectful (idempotent) operations in general in the egraph's GVN. This mirrors the similar change made in #5534. * Add tests for egraph case.	2023-01-19 11:51:19 -08:00
Ulrich Weigand	a2e9a608c1	fuzzgen: Enable s390x and disable unimplemented ops (#5596 ) Also fix assertion failure when using "i128 uext" or "i128 sext" arguments or return values, as discovered by the fuzzer.	2023-01-19 10:08:32 -08:00
Kevin Rizzo	da03ff47f1	winch: Adding support for integration tests (#5588 ) * Adding in the foundations for Winch `filetests` This commit adds two new crates into the Winch workspace: `filetests` and `test-macros`. The intent is to mimic the structure of Cranelift `filetests`, but in a simpler way. * Updates to documentation This commits adds a high level document to outline how to test Winch through the `winch-tools` utility. It also updates some inline documentation which gets propagated to the CLI. * Updating test-macro to use a glob instead of only a flat directory	2023-01-19 07:34:48 -05:00
Trevor Elliott	7cea73a81d	Refactor BranchInfo::Table to no longer have an optional default branch (#5593 )	2023-01-18 17:17:03 -08:00
Saúl Cabrera	94b51cdb17	winch: Use cranelift-codegen x64 backend for emission. (#5581 ) This change substitutes the string based emission mechanism with cranelift-codegen's x64 backend. This change _does not_: * Introduce new functionality in terms of supported instructions. * Change the semantics of the assembler/macroassembler in terms of the logic to emit instructions. The most notable differences between this change and the previous version are: * Handling of shared flags and ISA-specific flags, which for now are left with the default value. * Simplification of instruction emission per operand size: previously the assembler defined different methods depending on the operand size (e.g. `mov` for 64 bits, and `movl` for 32 bits). This change updates such approach so that each assembler method takes an operand size as a parameter, reducing duplication and making the code more concise and easier to integrate with the x64's `Inst` enum. * Introduction of a disassembler for testing purposes. As of this change, Winch generates the following code for the following test programs: ```wat (module (export "main" (func $main)) (func $main (result i32) (i32.const 10) (i32.const 20) i32.add )) ``` ```asm 0: 55 push rbp 1: 48 89 e5 mov rbp, rsp 4: b8 0a 00 00 00 mov eax, 0xa 9: 83 c0 14 add eax, 0x14 c: 5d pop rbp d: c3 ret ``` ```wat (module (export "main" (func $main)) (func $main (result i32) (local $foo i32) (local $bar i32) (i32.const 10) (local.set $foo) (i32.const 20) (local.set $bar) (local.get $foo) (local.get $bar) i32.add )) ``` ```asm 0: 55 push rbp 1: 48 89 e5 mov rbp, rsp 4: 48 83 ec 08 sub rsp, 8 8: 48 c7 04 24 00 00 00 00 mov qword ptr [rsp], 0 10: b8 0a 00 00 00 mov eax, 0xa 15: 89 44 24 04 mov dword ptr [rsp + 4], eax 19: b8 14 00 00 00 mov eax, 0x14 1e: 89 04 24 mov dword ptr [rsp], eax 21: 8b 04 24 mov eax, dword ptr [rsp] 24: 8b 4c 24 04 mov ecx, dword ptr [rsp + 4] 28: 01 c1 add ecx, eax 2a: 48 89 c8 mov rax, rcx 2d: 48 83 c4 08 add rsp, 8 31: 5d pop rbp 32: c3 ret ``` ```wat (module (export "main" (func $main)) (func $main (param i32) (param i32) (result i32) (local.get 0) (local.get 1) i32.add )) ``` ```asm 0: 55 push rbp 1: 48 89 e5 mov rbp, rsp 4: 48 83 ec 08 sub rsp, 8 8: 89 7c 24 04 mov dword ptr [rsp + 4], edi c: 89 34 24 mov dword ptr [rsp], esi f: 8b 04 24 mov eax, dword ptr [rsp] 12: 8b 4c 24 04 mov ecx, dword ptr [rsp + 4] 16: 01 c1 add ecx, eax 18: 48 89 c8 mov rax, rcx 1b: 48 83 c4 08 add rsp, 8 1f: 5d pop rbp 20: c3 ret ```	2023-01-18 06:58:13 -05:00
Trevor Elliott	1e6c13d83e	cranelift: Rework block instructions to use BlockCall (#5464 ) Add a new type BlockCall that represents the pair of a block name with arguments to be passed to it. (The mnemonic here is that it looks a bit like a function call.) Rework the implementation of jump, brz, and brnz to use BlockCall instead of storing the block arguments as varargs in the instruction's ValueList. To ensure that we're processing block arguments from BlockCall values in instructions, three new functions have been introduced on DataFlowGraph that both sets of arguments: inst_values - returns an iterator that traverses values in the instruction and block arguments map_inst_values - applies a function to each value in the instruction and block arguments overwrite_inst_values - overwrite all values in an instruction and block arguments with values from the iterator Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-01-17 16:31:15 -08:00
Jamey Sharp	3a2ca67570	Generalize iterator types in compute_use_states (#5586 ) This is a cleanup to help prepare for #5464. Most of the diff is inlining the closure for `mark_all_uses_as_multiple` which was only called once. That avoids some borrow-checker challenges. The key change is that the former `push_args_on_stack` closure no longer actually pushes the iterator on the stack, but just returns it. That way this closure doesn't need the name of the stack's type. It also allows it to be reused in the debug_assert.	2023-01-17 22:17:17 +00:00
Afonso Bordado	3ae373b073	cranelift: Disable select rule for i128 types on riscv64 (#5584 ) * fuzzgen: Disable some selects for RISC-V * cranelift: Force disable gen_select_reg rule for i128 values	2023-01-17 10:01:23 -08:00
Ayomide Bamidele	e4dc9c7944	Update Intel x86 CPU presets to match LLVM (#5490 ) * Update Intel x86 CPU presets * Add LLVM reference * Remove 32bit CPU architectures * Rename silvermont to slm * Fix haswell presets * Add icelake alias * Group streaming simd presets * Add slm silvermont preset * Remove duplicate alderlake def	2023-01-13 21:02:36 +00:00
Saúl Cabrera	f0979af157	cranelift-codegen: Prepare aarch64 for usage from Winch (#5570 ) This commit exposes the necessary aarch64 pieces to be used by Winch for binary emission.	2023-01-13 19:46:25 +00:00
Afonso Bordado	d3e6b7bd2a	fuzzgen: Enable riscv64 and disable unimplemented ops (#5502 )	2023-01-12 08:46:37 -08:00
Afonso Bordado	82494661c1	cranelift: Add `atomic_{load,store}` and `fence` to the interpreter (#5503 ) * cranelift: Add `fence` to interpreter * cranelift: Add `atomic_{load,store}` to the interpreter * fuzzgen: Add `atomic_{load,store}` * Update cranelift/fuzzgen/src/function_generator.rs Co-authored-by: Jamey Sharp <jamey@minilop.net> * fuzzgen: Use type size as the alignment size. Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-01-12 08:36:04 -08:00
Saúl Cabrera	6cb68f3287	cranelift-codegen: Expose x64 settings (#5561 ) Exposes x64 settings so that they can be consumed from Winch for binary emission.	2023-01-11 18:33:03 -05:00
Afonso Bordado	9556cb190f	cranelift: Forbid argument extensions for floats and SIMD vectors (#5536 ) * fuzzgen: Generate argument extensions only for integer argumetns * cranelift: Add verifier check for argument extensions	2023-01-10 10:26:30 -08:00
Afonso Bordado	23ea435a91	fuzzgen: Reenable `srem.i8` on x86 (#5545 )	2023-01-09 12:30:57 -08:00
Afonso Bordado	8ac04612ae	cranelift: Remove predicate not macro branch (#5552 )	2023-01-09 12:19:30 -08:00
Afonso Bordado	f845ebb450	cranelift: Remove `is_pic` predicate from x86 backend (#5548 ) This is still present as shared flags and we don't use the predicate anywhere.	2023-01-09 10:04:45 -08:00
Afonso Bordado	437f448a8d	cranelift: Cleanup testing docs (#5549 ) Removes some docs from tests that we no longer have. Updates the ebb examples to the new block based examples.	2023-01-09 10:02:59 -08:00
Alexa VanHattum	44913825b5	cranelift: fix register for `srem.i8` on x86_64 (#5540 ) * Change register written to in specific srem case. Add regression test as filetest case. Fixes #5470 * Add another test case, newline * Update comment	2023-01-06 22:18:16 +00:00
Jamey Sharp	e65a2c9280	cranelift-isle: Record all binding names (#5538 ) ...not just the ones at the outer scope of a rule. Thanks to @avanhatt for pointing out that #5221 didn't capture as much information as I intended it to.	2023-01-06 22:00:50 +00:00
Sam Sartor	1efa3d6f8b	Add `clif-util compile` option to output object file (#5493 ) * add clif-util compile option to output object file * switch from a box to a borrow * update objectmodule tests to use borrowed isa * put targetisa into an arc	2023-01-06 12:53:48 -08:00
uint256_t	b00455135e	Cranelift: Implement 'iabs' for scalar types on x86_64 (#5527 ) * Implement 'iabs' for scalar types on x86_64 * Small fix	2023-01-05 21:33:12 -08:00
Nick Fitzgerald	c50bdf600e	Cranelift: GVN all idempotently trapping but otherwise pure instructions (#5534 )	2023-01-05 15:08:06 -08:00
Trevor Elliott	36e5bdfd0e	Fuzz multiple targets in cranelift-icache (#5482 ) Fuzz additional targets in the cranelift-icache target. The list of targets fuzzed is controlled by the targets enabled in fuzz/Cargo.toml. This PR also reworks how instruction disabling is done in function generator, moving the deny-list to a function to make the decision at runtime instead of compile time.	2023-01-05 18:49:23 +00:00
Afonso Bordado	ee6a909ccb	cranelift: Cleanup SIMD `icmp` tests (#5530 ) * cranelift: Enable more SIMD tests * cranelift: Reorganize icmp tests * cranelift: Enable SIMD icmp tests for unsigned ops * cranelift: Cleanup trailing newlines	2023-01-05 09:19:03 -08:00
wasmtime-publish	7bfbec1b57	Bump Wasmtime to 6.0.0 (#5521 ) Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com>	2023-01-05 09:46:01 -06:00
Nick Fitzgerald	f4a2d5337a	Cranelift: GVN `uadd_overflow_trap` (#5520 ) * Switch duplicate loads w/ dynamic memories test to `min_size = 0` This test was accidentally hitting a special case for bounds checks for when we know that `offset + access_size < min_size` and can skip some steps. This commit changes the `min_size` of the memory to zero so that we are forced to do fully general bounds checks. * Cranelift: Mark `uadd_overflow_trap` as okay for GVN Although this improves our test sequence for duplicate loads with dynamic memories, it unfortunately doesn't have any effect on sightglass benchmarks: ``` instantiation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [34448 35607.23 37158] gvn_uadd_overflow_trap.so [34566 35734.05 36585] main.so instantiation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [44101 60449.62 92712] gvn_uadd_overflow_trap.so [44011 60436.37 92690] main.so instantiation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [35595 36675.72 38153] gvn_uadd_overflow_trap.so [35440 36670.42 37993] main.so compilation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [17370195 17405125.62 17471222] gvn_uadd_overflow_trap.so [17369324 17404859.43 17470725] main.so execution :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [7055720520 7055886880.32 7056265930] gvn_uadd_overflow_trap.so [7055719554 7055843809.33 7056193289] main.so compilation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [683589861 683767276.00 684098366] gvn_uadd_overflow_trap.so [683590024 683767998.02 684097885] main.so execution :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [46436883 46437135.10 46437823] gvn_uadd_overflow_trap.so [46436883 46437087.67 46437785] main.so compilation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [126522461 126565812.58 126647044] gvn_uadd_overflow_trap.so [126522176 126565757.75 126647522] main.so execution :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [653010531 653010533.03 653010544] gvn_uadd_overflow_trap.so [653010531 653010533.18 653010537] main.so ``` * cranelift-codegen-meta: Rename `side_effects_okay_for_gvn` to `side_effects_idempotent` * cranelift-filetests: Ensure there is a trailing newline for blessed Wasm tests	2023-01-04 22:03:16 -08:00
Trevor Elliott	e2e98f694f	Remove lower_br_fcmp from the riscv64 backend (#5519 ) Remove the lower_br_fcmp function from the riscv64 backend. This PR only affects the emit implementation for FloatRound, replacing the uses of lower_br_fcmp with direct uses of FpuRRR and CondBr. Any changes in behavior here should be already covered by the runtests for ceil, floor, trunc, and nearest.	2023-01-04 14:22:35 -08:00
Trevor Elliott	5d429e46e8	Remove the MInst::TrapFf constructor from the riscv64 backend (#5515 ) Remove the MInst::TrapFf instruction in the riscv64 backend. It was only used in two places in the emit case for FloatRound, and was easily replaced with a combination of FpuRRR and TrapIf.	2023-01-04 13:34:46 -08:00
Alexa VanHattum	4bc4fae571	Small update for filename in `isle_integration.md` (#5516 ) * Small update for filename in `isle_integration.md` The name "clif.isle" is stale (since #4953), now two files "clif_lower.isle" and "clif_opt.isle" are generated. Not sure if that PR necessitates other changes this this doc. * Update cranelift/docs/isle-integration.md Co-authored-by: Jamey Sharp <jsharp@fastly.com>	2023-01-04 20:06:49 +00:00
Nick Fitzgerald	937601c7c3	Cranelift: GVN spectre guards and run redundant load elimination twice (#5517 ) * Cranelift: Make spectre guards GVN-able While these instructions have a side effect that is otherwise invisible to the optimizer, the side effect in question is idempotent, so it can be de-duplicated by GVN. * Cranelift: Run redundant load replacement and GVN twice This allows us to actually replace redundant Wasm loads with dynamic memories. While this improves our hand-crafted test sequences, it doesn't seem to have any improvement on sightglass benchmarks run with dynamic memories, however it also isn't a hit to compilation times, so seems generally good to land anyways: ``` $ cargo run --release -- benchmark -e ~/scratch/once.so -e ~/scratch/twice.so -m insts-retired --processes 20 --iterations-per-process 3 --engine-flags="--static-memory-maximum-size 0" -- benchmarks/default.suite compilation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [683595240 683768610.53 684097577] once.so [683597068 700115966.83 1664907164] twice.so instantiation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [44107 60411.07 92785] once.so [44138 59552.32 92097] twice.so compilation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [17369916 17404839.78 17471458] once.so [17369935 17625713.87 30700150] twice.so compilation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [126523640 126566170.80 126648265] once.so [126523076 127174580.30 163145149] twice.so instantiation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [34569 35686.25 36513] once.so [34651 35749.97 36953] twice.so instantiation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [35146 36639.10 37707] once.so [34472 36580.82 38431] twice.so execution :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [7055720115 7055841324.82 7056180024] once.so [7055717681 7055877095.85 7056225217] twice.so execution :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [46436881 46437081.28 46437691] once.so [46436883 46437127.68 46437766] twice.so execution :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [653010530 653010533.27 653010539] once.so [653010531 653010532.95 653010538] twice.so ```	2023-01-04 20:05:43 +00:00
Trevor Elliott	b2d5afdf83	riscv64: Implement fcmp in ISLE (#5512 ) Rework the compilation of fcmp in the riscv64 backend to be in ISLE, removing the need for the dedicated Fcmp instruction. This change is motivated by #5500, which showed that the riscv64 backend was generating branch instructions in the middle of a basic block. We can't remove lower_br_fcmp quite yet as it's used in a few places in the emit module, but it's now no longer reachable from the ISLE lowerings. Fixes #5500	2023-01-04 11:52:00 -08:00
Nick Fitzgerald	d1920f5a2d	cranelift: Add wasm tests for duplicate loads (#5514 ) * cranelift-filetests: Add the ability to test optimized CLIF in Wasm tests * cranelift: Add Wasm tests for identical loads, back to back	2023-01-04 18:52:32 +00:00
Afonso Bordado	52ba72f341	riscv64: Fix masking on `iabs` (#5505 ) * cranelift: Add `iabs.i128` runtest * riscv64: Fix incorrect extension in iabs When lowering iabs, we were accidentally comparing the unextended value this caused the instruction to misbehave with certain top bits. This commit also adds a zbb lowering that does not use jumps.	2023-01-03 17:37:25 -08:00
Nick Fitzgerald	276bc6ad2e	cranelift-wasm: Better track reachability after translating loads (#5511 ) We can sometimes statically determine that a given load will unconditionally trap. When this happens, we emit an unconditional trap, and we need to stop adding new instructions to the block. This commit introduces a `Reachability<T>` type that is like `Option<T>` but specifically for communicating reachability and is marked `must_use` to receivers to handle transitions from reachable to unreachable states. Additionally, adds handling of reachable -> unreachable state transitions to some SIMD op translations that weren't checking for it. Fixes #5455 Fixes #5456	2023-01-03 22:04:18 +00:00

1 2 3 4 5 ...

4251 Commits