wasmtime

Author	SHA1	Message	Date
Chris Fallin	d61e4e0559	Merge pull request #3709 from cfallin/cold-blocks-dead-code-bug Cranelift: Fix cold-blocks-related lowering bug.	2022-01-21 11:18:48 -08:00
Chris Fallin	ef1b2d2fa8	Cranelift: Fix cold-blocks-related lowering bug. If a block is marked cold but has side-effect-free code that is only used by side-effectful code in non-cold blocks, we will erroneously fail to emit it, causing a regalloc failure. This is due to the interaction of block ordering and lowering: we rely on block ordering to visit uses before defs (except for backedges) so that we can effectively do an inline liveness analysis and skip lowering operations that are not used anywhere. This "inline DCE" is needed because instruction lowering can pattern-match and merge one instruction into another, removing the need to generate the source instruction. Unfortunately, the way that I added cold-block support in #3698 was oblivious to this -- it just changed the block sort order. For efficiency reasons, we generate code in its final order directly, so it would not be tenable to generate it in e.g. RPO first and then reorder cold blocks to the bottom; we really do want to visit in the same order as the final code. This PR fixes the bug by moving the point at which cold blocks are sunk to emission-time instead. This is cheaper than either trying to visit blocks during lowering in RPO but add to VCode out-of-order, or trying to do some expensive analysis to recover proper liveness. It's not clear that the latter would be possible anyway -- the need to lower some instructions depends on other instructions' isel results/merging success, so we really do need to visit in RPO, and we can't simply lower all instructions as side-effecting roots (some can't be toplevel nodes). The one downside of this approach is that the VCode itself still has cold blocks inline; so in the text format (and hence compile-tests) it's not possible to see the sinking. This PR adds a test for cold-block sinking that actually verifies the machine code. (The test also includes an add-instruction in the cold path that would have been incorrectly skipped prior to this fix.) Fortunately this bug would not have been triggered by the one current use of cold blocks in #3699, because there the only operation in the cold block was an (always effectful) call instruction. The worst-case effect of the bug in other code would be a regalloc panic; no silent miscompilations could result.	2022-01-21 10:47:49 -08:00
Chris Fallin	51649d56b7	Add syntax for cold blocks to CLIF. This commit adds support for denoting cold blocks in the CLIF text format as follows: ```plain function %f() { block0(...): ... block1 cold: ... block2(...) cold: ... block3: ... ``` With this syntax, we are able to see the cold-block flag in CLIF, we can write tests using it, and it is preserved when round-tripping. Fixes #3701.	2022-01-20 16:49:52 -08:00
Chris Fallin	2615ef967f	Merge pull request #3702 from uweigand/isle-prep-s390x s390x: Codegen fixes and preparation for ISLE migration	2022-01-20 12:02:08 -08:00
Ulrich Weigand	be60a19623	ISLE standard prelude: Additional types and helpers In preparing to move the s390x back-end to ISLE, I noticed a few missing pieces in the common prelude code. This patch: - Defines the reference types $R32 / $R64. - Provides a trap_code_bad_conversion_to_integer helper. - Provides an avoid_div_traps helper. This requires passing the generic flags in addition to the ISA-specifc flags into the ISLE lowering context.	2022-01-20 17:23:31 +01:00
Ulrich Weigand	c08a013b53	s390x: Codegen fixes and preparation for ISLE migration In preparing the back-end to move to ISLE, I detected a number of codegen bugs in the existing code, which are fixed here: - Fix internal compiler error with uload16/icmp corner case. - Fix broken Cls lowering. - Correctly mask shift count for i8/i16 shifts. In addition, I made several changes to operand encodings in various MInst patterns. These should not have any functional effect, but will make the ISLE migration easier: - Encode floating-point constants as u32/u64 in MInst patterns. - Encode shift amounts as u8 and Reg in ShiftOp pattern. - Use MemArg in LoadMultiple64 and StoreMultiple64 patterns.	2022-01-20 16:59:18 +01:00
Chris Fallin	ae476fde60	Merge pull request #3698 from cfallin/cold-blocks Cranelift: add support for cold blocks.	2022-01-19 12:58:33 -08:00
Chris Fallin	f489b83835	Cranelift: add support for cold blocks. This PR adds a flag to each block that can be set via the frontend/builder interface that indicates that the block will not be frequently executed. As such, the compiler backend should place the block "out of line" in the final machine code, so that the ordinary, more frequent execution path that excludes the block does not have to jump around it. This is useful for adding handlers for exceptional conditions (slow-paths, guard violations) in a way that minimizes performance cost. Fixes #2747.	2022-01-19 12:17:41 -08:00
Freddie Liardet	b5531580e7	Improve code generation for floating-point constants Copyright (c) 2022, Arm Limited.	2022-01-18 10:39:05 +00:00
Anton Kirilov	89919f4b1f	Pass the ISA-specific compilation flags to the ABI implementations Copyright (c) 2021, Arm Limited.	2022-01-14 14:18:01 +00:00
Nick Fitzgerald	a052285340	Fix typo: s/sentinals/sentinels/	2022-01-13 16:50:15 -08:00
Nick Fitzgerald	658c5d33c1	cranelift: Port `trap` and `resumable_trap` lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5bb3645bd4	cranelift: Port `ineg` SIMD lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5917f1d2c2	cranelift: Port `ineg` scalar lowering to ISLE on x64	2022-01-13 15:08:01 -08:00
Nick Fitzgerald	b78731839b	cranelift: Use `x64_` prefix to disambiguate with clif in ISLE Instead of using `m_` like we used to, which was short for "mach inst" but not obvious or clear at all.	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	a41fdb0303	cranelift: Port `rotr` lowering to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4120e40318	cranelift: Update assertions to indicate that `rotl` is fully ported to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4e34dd8239	cranelift: Port `ushr` SIMD lowerings to ISLE on x64	2022-01-13 14:39:06 -08:00
Nick Fitzgerald	a7dba81c1d	cranelift: Port `ishl` SIMD lowerings to ISLE (#3686 )	2022-01-13 09:34:37 -06:00
Chris Fallin	13f17db297	Merge pull request #3680 from bjorn3/remove_code_sink Remove the CodeSink interface in favor of MachBufferFinalized	2022-01-12 10:47:23 -08:00
Nick Fitzgerald	7454f1f3af	cranelift: port `sshr` to ISLE on x64 (#3681 )	2022-01-12 09:13:58 -06:00
bjorn3	f0e821b9e0	Remove all Sink traits	2022-01-11 19:03:10 +01:00
bjorn3	b803514d55	Remove sink arguments from compile_and_emit The data can be accessed after the fact using context.mach_compile_result	2022-01-11 18:17:29 +01:00
bjorn3	55d722db05	Remove CodeSink	2022-01-11 17:10:37 +01:00
bjorn3	a48a60f958	Remove reloc_external from CodeSink And introduce MachBufferFinalized::relocs() in the place.	2022-01-11 16:54:27 +01:00
bjorn3	63e2360346	Remove trap from CodeSink And introduce MachBufferFinalized::traps() in the place.	2022-01-11 16:42:52 +01:00
bjorn3	38aaa6e1da	Remove add_call_site from CodeSink and RelocSink And introduce MachBufferFinalized::call_sites() in the place.	2022-01-11 16:32:57 +01:00
bjorn3	379c9c65a3	Inline MemoryCodeSink::write	2022-01-11 15:10:02 +01:00
bjorn3	37598ad170	Remove end_codegen method from CodeSink	2022-01-11 14:52:04 +01:00
bjorn3	354c4f7bf8	Remove unused CodeSink methods	2022-01-11 14:52:04 +01:00
bjorn3	88baac4ca6	Move the TestCodeSink functionality to MachBufferFinalized	2022-01-11 14:40:53 +01:00
Alex Crichton	3ab6ef048b	aarch64: Migrate `popcnt` to ISLE (#3662 ) Nothing too unusual here, the translation was quite straightforward!	2022-01-07 13:06:53 -06:00
Nick Fitzgerald	6b5e9d8732	Merge pull request #3659 from fitzgen/vselect-isle cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:51:33 -08:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Chris Fallin	a98f9982fd	Merge pull request #3655 from bjorn3/machinst_cleanups2 Remove MachBackend	2022-01-06 13:32:36 -08:00
Alex Crichton	72e2b7fe80	aarch64: Migrate bitrev/clz/cls/ctz to ISLE (#3658 ) This commit migrates these existing instructions to ISLE from the manual lowerings implemented today. This was mostly straightforward but while I was at it I fixed what appeared to be broken translations for I{8,16} for `clz`, `cls`, and `ctz`. Previously the lowerings would produce results as-if the input was 32-bits, but now I believe they all correctly account for the bit-width.	2022-01-06 15:18:32 -06:00
Nick Fitzgerald	23efaf2196	cranelift: Remove unused x64 instruction helpers	2022-01-06 11:22:54 -08:00
Nick Fitzgerald	09aa09fd76	cranelift: Port `bitselect` over to ISLE on x64	2022-01-06 11:22:54 -08:00
bjorn3	376c93bda0	Remove MachBackend It is identical to TargetIsa	2022-01-06 15:08:12 +01:00
bjorn3	58c25d9e24	Add text_section_builder method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	03dc74d8e7	Add emit_unwind_info method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	9eba87a6c8	Add compile_function method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	d50f27e8f9	Remove reg_universe method from MachBackend and MachInst	2022-01-06 14:39:50 +01:00
bjorn3	96b8879e4b	Take reg_universe as argument to machinst::compile	2022-01-06 14:39:50 +01:00
wasmtime-publish	8043c1f919	Release Wasmtime 0.33.0 (#3648 ) * Bump Wasmtime to 0.33.0 [automatically-tag-and-release-this-commit] * Update relnotes for 0.33.0 * Wordsmithing relnotes Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com> Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2022-01-05 13:26:50 -06:00
Chris Fallin	e2b37a57dc	Merge pull request #3639 from bjorn3/machinst_cleanups Various cleanups around machinst	2022-01-05 10:01:27 -08:00
Chris Fallin	be24edf9d8	Merge pull request #3645 from cfallin/fix-xmm-spillslot-fuzzbug Fix spillslot size bug in SIMD by removing type-dependent spillslot allocation.	2022-01-04 14:12:11 -08:00
Chris Fallin	833ebeed76	Fix spillslot size bug in SIMD by removing type-dependent spillslot allocation. This patch makes spillslot allocation, spilling and reloading all based on register class only. Hence when we have a 32- or 64-bit value in a 128-bit XMM register on x86-64 or vector register on aarch64, this results in larger spillslots and spills/restores. Why make this change, if it results in less efficient stack-frame usage? Simply put, it is safer: there is always a risk when allocating spillslots or spilling/reloading that we get the wrong type and make the spillslot or the store/load too small. This was one contributing factor to CVE-2021-32629, and is now the source of a fuzzbug in SIMD code that puns an arbitrary user-controlled vector constant over another stackslot. (If this were a pointer, that could result in RCE. SIMD is not yet on by default in a release, fortunately. In particular, we have not been particularly careful about using moves between values of different types, for example with `raw_bitcast` or with certain SIMD operations, and such moves indicate to regalloc.rs that vregs are in equivalence classes and some arbitrary vreg in the class is provided when allocating the spillslot or spilling/reloading. Since regalloc.rs does not track actual type, and since we haven't been careful about moves, we can't really trust this "arbitrary vreg in equivalence class" to provide accurate type information. In the fix to CVE-2021-32629 we fixed this for integer registers by always spilling/reloading 64 bits; this fix can be seen as the analogous change for FP/vector regs.	2022-01-04 13:24:40 -08:00
Teymour Aldridge	40072f844e	Clarify some documentation. (#3641 )	2022-01-04 11:15:19 -08:00
bjorn3	17c3c1813f	Remove MachInstEmitInfo	2022-01-04 18:06:01 +01:00

1 2 3 4 5 ...

1627 Commits