wasmtime

Author	SHA1	Message	Date
Chris Fallin	d61e4e0559	Merge pull request #3709 from cfallin/cold-blocks-dead-code-bug Cranelift: Fix cold-blocks-related lowering bug.	2022-01-21 11:18:48 -08:00
Chris Fallin	ef1b2d2fa8	Cranelift: Fix cold-blocks-related lowering bug. If a block is marked cold but has side-effect-free code that is only used by side-effectful code in non-cold blocks, we will erroneously fail to emit it, causing a regalloc failure. This is due to the interaction of block ordering and lowering: we rely on block ordering to visit uses before defs (except for backedges) so that we can effectively do an inline liveness analysis and skip lowering operations that are not used anywhere. This "inline DCE" is needed because instruction lowering can pattern-match and merge one instruction into another, removing the need to generate the source instruction. Unfortunately, the way that I added cold-block support in #3698 was oblivious to this -- it just changed the block sort order. For efficiency reasons, we generate code in its final order directly, so it would not be tenable to generate it in e.g. RPO first and then reorder cold blocks to the bottom; we really do want to visit in the same order as the final code. This PR fixes the bug by moving the point at which cold blocks are sunk to emission-time instead. This is cheaper than either trying to visit blocks during lowering in RPO but add to VCode out-of-order, or trying to do some expensive analysis to recover proper liveness. It's not clear that the latter would be possible anyway -- the need to lower some instructions depends on other instructions' isel results/merging success, so we really do need to visit in RPO, and we can't simply lower all instructions as side-effecting roots (some can't be toplevel nodes). The one downside of this approach is that the VCode itself still has cold blocks inline; so in the text format (and hence compile-tests) it's not possible to see the sinking. This PR adds a test for cold-block sinking that actually verifies the machine code. (The test also includes an add-instruction in the cold path that would have been incorrectly skipped prior to this fix.) Fortunately this bug would not have been triggered by the one current use of cold blocks in #3699, because there the only operation in the cold block was an (always effectful) call instruction. The worst-case effect of the bug in other code would be a regalloc panic; no silent miscompilations could result.	2022-01-21 10:47:49 -08:00
Chris Fallin	51649d56b7	Add syntax for cold blocks to CLIF. This commit adds support for denoting cold blocks in the CLIF text format as follows: ```plain function %f() { block0(...): ... block1 cold: ... block2(...) cold: ... block3: ... ``` With this syntax, we are able to see the cold-block flag in CLIF, we can write tests using it, and it is preserved when round-tripping. Fixes #3701.	2022-01-20 16:49:52 -08:00
Chris Fallin	2615ef967f	Merge pull request #3702 from uweigand/isle-prep-s390x s390x: Codegen fixes and preparation for ISLE migration	2022-01-20 12:02:08 -08:00
Ulrich Weigand	be60a19623	ISLE standard prelude: Additional types and helpers In preparing to move the s390x back-end to ISLE, I noticed a few missing pieces in the common prelude code. This patch: - Defines the reference types $R32 / $R64. - Provides a trap_code_bad_conversion_to_integer helper. - Provides an avoid_div_traps helper. This requires passing the generic flags in addition to the ISA-specifc flags into the ISLE lowering context.	2022-01-20 17:23:31 +01:00
Ulrich Weigand	c08a013b53	s390x: Codegen fixes and preparation for ISLE migration In preparing the back-end to move to ISLE, I detected a number of codegen bugs in the existing code, which are fixed here: - Fix internal compiler error with uload16/icmp corner case. - Fix broken Cls lowering. - Correctly mask shift count for i8/i16 shifts. In addition, I made several changes to operand encodings in various MInst patterns. These should not have any functional effect, but will make the ISLE migration easier: - Encode floating-point constants as u32/u64 in MInst patterns. - Encode shift amounts as u8 and Reg in ShiftOp pattern. - Use MemArg in LoadMultiple64 and StoreMultiple64 patterns.	2022-01-20 16:59:18 +01:00
Chris Fallin	ae476fde60	Merge pull request #3698 from cfallin/cold-blocks Cranelift: add support for cold blocks.	2022-01-19 12:58:33 -08:00
Chris Fallin	f489b83835	Cranelift: add support for cold blocks. This PR adds a flag to each block that can be set via the frontend/builder interface that indicates that the block will not be frequently executed. As such, the compiler backend should place the block "out of line" in the final machine code, so that the ordinary, more frequent execution path that excludes the block does not have to jump around it. This is useful for adding handlers for exceptional conditions (slow-paths, guard violations) in a way that minimizes performance cost. Fixes #2747.	2022-01-19 12:17:41 -08:00
Freddie Liardet	b5531580e7	Improve code generation for floating-point constants Copyright (c) 2022, Arm Limited.	2022-01-18 10:39:05 +00:00
Anton Kirilov	89919f4b1f	Pass the ISA-specific compilation flags to the ABI implementations Copyright (c) 2021, Arm Limited.	2022-01-14 14:18:01 +00:00
Nick Fitzgerald	a052285340	Fix typo: s/sentinals/sentinels/	2022-01-13 16:50:15 -08:00
Nick Fitzgerald	658c5d33c1	cranelift: Port `trap` and `resumable_trap` lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5bb3645bd4	cranelift: Port `ineg` SIMD lowering to ISLE on x64	2022-01-13 15:57:17 -08:00
Nick Fitzgerald	5917f1d2c2	cranelift: Port `ineg` scalar lowering to ISLE on x64	2022-01-13 15:08:01 -08:00
Nick Fitzgerald	b78731839b	cranelift: Use `x64_` prefix to disambiguate with clif in ISLE Instead of using `m_` like we used to, which was short for "mach inst" but not obvious or clear at all.	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	a41fdb0303	cranelift: Port `rotr` lowering to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4120e40318	cranelift: Update assertions to indicate that `rotl` is fully ported to ISLE on x64	2022-01-13 14:59:09 -08:00
Nick Fitzgerald	4e34dd8239	cranelift: Port `ushr` SIMD lowerings to ISLE on x64	2022-01-13 14:39:06 -08:00
Nick Fitzgerald	a7dba81c1d	cranelift: Port `ishl` SIMD lowerings to ISLE (#3686 )	2022-01-13 09:34:37 -06:00
Chris Fallin	13f17db297	Merge pull request #3680 from bjorn3/remove_code_sink Remove the CodeSink interface in favor of MachBufferFinalized	2022-01-12 10:47:23 -08:00
Nick Fitzgerald	7454f1f3af	cranelift: port `sshr` to ISLE on x64 (#3681 )	2022-01-12 09:13:58 -06:00
bjorn3	f0e821b9e0	Remove all Sink traits	2022-01-11 19:03:10 +01:00
bjorn3	b803514d55	Remove sink arguments from compile_and_emit The data can be accessed after the fact using context.mach_compile_result	2022-01-11 18:17:29 +01:00
bjorn3	55d722db05	Remove CodeSink	2022-01-11 17:10:37 +01:00
bjorn3	a48a60f958	Remove reloc_external from CodeSink And introduce MachBufferFinalized::relocs() in the place.	2022-01-11 16:54:27 +01:00
bjorn3	63e2360346	Remove trap from CodeSink And introduce MachBufferFinalized::traps() in the place.	2022-01-11 16:42:52 +01:00
bjorn3	38aaa6e1da	Remove add_call_site from CodeSink and RelocSink And introduce MachBufferFinalized::call_sites() in the place.	2022-01-11 16:32:57 +01:00
bjorn3	379c9c65a3	Inline MemoryCodeSink::write	2022-01-11 15:10:02 +01:00
bjorn3	37598ad170	Remove end_codegen method from CodeSink	2022-01-11 14:52:04 +01:00
bjorn3	354c4f7bf8	Remove unused CodeSink methods	2022-01-11 14:52:04 +01:00
bjorn3	88baac4ca6	Move the TestCodeSink functionality to MachBufferFinalized	2022-01-11 14:40:53 +01:00
Alex Crichton	1ef0abb12c	Update lots of `isa//.clif` tests to `precise-output` (#3677 ) * Update lots of `isa//.clif` tests to `precise-output` This commit goes through the `aarch64` and `x64` subdirectories and subjectively changes tests from `test compile` to add `precise-output`. This then auto-updates all the test expectations so they can be automatically instead of manually updated in the future. Not all tests were migrated, largely subject to the whims of myself, mainly looking to see if the test was looking for specific instructions or just checking the whole assembly output. * Filter out `;;` comments from test expctations Looks like the cranelift parser picks up all comments, not just those trailing the function, so use a convention where `;;` is used for human-readable-comments in test cases and `;`-prefixed comments are the test expectation.	2022-01-10 13:38:23 -06:00
Alex Crichton	a8ea0ec097	cranelift: Add ability to auto-update test expectations (#3612 ) * cranelift: Add ability to auto-update test expectations One of the problems of the current `.clif` testing is that the files are difficult to update when widespread changes are made (such as removing modification of the frame pointer). Additionally when changing register allocation or similar it can cause a large number of changes in tests but the tests themselves didn't actually break. For this reason this commit adds the ability to automatically update test expectations. The idea behind this commit is that tests of the form `test compile` can also optionally be flagged with the `precise-output` flag: test compile precise-output and when doing so the compiled form of each function is asserted to 100% match the following comments and their test expectations. If a match is not found then a `BLESS=1` environment variable can be used to automatically rewrite the test file itself with the correct assertion. If the environment variable isn't present and the expectation doesn't match then the test fails. It's hoped that, if approved, a follow-up commit can add `precise-output` to all current `test compile` tests (or make it the default) and all tests can be mass-updated. When developing locally test expectations need not be written and instead tests can be run with `BLESS=1` and the output can be manually verified. The environment variable will not be present on CI which means that changes to the output which don't also change the test expectation will cause CI to fail. Furthermore this should still make updates to the test output easily readable in review on CI because the test expectations are intended to look the same as before. Closes #1539 Use raw vcode output in tests * Fix a merge conflict * Review comments	2022-01-10 11:59:45 -06:00
Nick Fitzgerald	ab5aea7b28	Merge pull request #3665 from fitzgen/re-add-tests cranelift: Re-add some tests that were accidentally removed	2022-01-07 11:37:53 -08:00
Alex Crichton	3ab6ef048b	aarch64: Migrate `popcnt` to ISLE (#3662 ) Nothing too unusual here, the translation was quite straightforward!	2022-01-07 13:06:53 -06:00
Nick Fitzgerald	95d8dd1424	cranelift: Re-add some tests that were accidentally removed	2022-01-07 11:00:58 -08:00
Teymour Aldridge	8d50cf3e23	Add a link to the JIT demo.	2022-01-07 16:08:05 +00:00
Nick Fitzgerald	6b5e9d8732	Merge pull request #3659 from fitzgen/vselect-isle cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:51:33 -08:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Chris Fallin	a98f9982fd	Merge pull request #3655 from bjorn3/machinst_cleanups2 Remove MachBackend	2022-01-06 13:32:36 -08:00
Alex Crichton	72e2b7fe80	aarch64: Migrate bitrev/clz/cls/ctz to ISLE (#3658 ) This commit migrates these existing instructions to ISLE from the manual lowerings implemented today. This was mostly straightforward but while I was at it I fixed what appeared to be broken translations for I{8,16} for `clz`, `cls`, and `ctz`. Previously the lowerings would produce results as-if the input was 32-bits, but now I believe they all correctly account for the bit-width.	2022-01-06 15:18:32 -06:00
Nick Fitzgerald	b60a4df2af	cranelift: Move `bitselect` runtest file to shared runtests directory	2022-01-06 11:25:27 -08:00
Nick Fitzgerald	23efaf2196	cranelift: Remove unused x64 instruction helpers	2022-01-06 11:22:54 -08:00
Nick Fitzgerald	09aa09fd76	cranelift: Port `bitselect` over to ISLE on x64	2022-01-06 11:22:54 -08:00
bjorn3	376c93bda0	Remove MachBackend It is identical to TargetIsa	2022-01-06 15:08:12 +01:00
bjorn3	58c25d9e24	Add text_section_builder method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	03dc74d8e7	Add emit_unwind_info method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	9eba87a6c8	Add compile_function method to TargetIsa	2022-01-06 14:39:50 +01:00
bjorn3	d50f27e8f9	Remove reg_universe method from MachBackend and MachInst	2022-01-06 14:39:50 +01:00
bjorn3	96b8879e4b	Take reg_universe as argument to machinst::compile	2022-01-06 14:39:50 +01:00

1 2 3 4 5 ...

3559 Commits