wasmtime

Author	SHA1	Message	Date
Chris Fallin	13f17db297	Merge pull request #3680 from bjorn3/remove_code_sink Remove the CodeSink interface in favor of MachBufferFinalized	2022-01-12 10:47:23 -08:00
Nick Fitzgerald	7454f1f3af	cranelift: port `sshr` to ISLE on x64 (#3681)	2022-01-12 09:13:58 -06:00
bjorn3	b803514d55	Remove sink arguments from compile_and_emit. The data can be accessed after the fact using context.mach_compile_result.	2022-01-11 18:17:29 +01:00
bjorn3	55d722db05	Remove CodeSink	2022-01-11 17:10:37 +01:00
bjorn3	a48a60f958	Remove reloc_external from CodeSink and introduce MachBufferFinalized::relocs() in its place.	2022-01-11 16:54:27 +01:00
bjorn3	63e2360346	Remove trap from CodeSink and introduce MachBufferFinalized::traps() in its place.	2022-01-11 16:42:52 +01:00
bjorn3	37598ad170	Remove end_codegen method from CodeSink	2022-01-11 14:52:04 +01:00
bjorn3	354c4f7bf8	Remove unused CodeSink methods	2022-01-11 14:52:04 +01:00
Alex Crichton	1ef0abb12c	Update lots of `isa//.clif` tests to `precise-output` (#3677) * Update lots of `isa//.clif` tests to `precise-output` This commit goes through the `aarch64` and `x64` subdirectories and subjectively changes tests from `test compile` to add `precise-output`. This then auto-updates all the test expectations so they can be automatically instead of manually updated in the future. Not all tests were migrated, largely subject to the whims of myself, mainly looking to see if the test was looking for specific instructions or just checking the whole assembly output. * Filter out `;;` comments from test expectations Looks like the cranelift parser picks up all comments, not just those trailing the function, so use a convention where `;;` is used for human-readable-comments in test cases and `;`-prefixed comments are the test expectation.	2022-01-10 13:38:23 -06:00
Alex Crichton	a8ea0ec097	cranelift: Add ability to auto-update test expectations (#3612) * cranelift: Add ability to auto-update test expectations One of the problems of the current `.clif` testing is that the files are difficult to update when widespread changes are made (such as removing modification of the frame pointer). Additionally when changing register allocation or similar it can cause a large number of changes in tests but the tests themselves didn't actually break. For this reason this commit adds the ability to automatically update test expectations. The idea behind this commit is that tests of the form `test compile` can also optionally be flagged with the `precise-output` flag: `test compile precise-output`, and when doing so the compiled form of each function is asserted to 100% match the following comments and their test expectations. If a match is not found then a `BLESS=1` environment variable can be used to automatically rewrite the test file itself with the correct assertion. If the environment variable isn't present and the expectation doesn't match then the test fails. It's hoped that, if approved, a follow-up commit can add `precise-output` to all current `test compile` tests (or make it the default) and all tests can be mass-updated. When developing locally test expectations need not be written and instead tests can be run with `BLESS=1` and the output can be manually verified. The environment variable will not be present on CI which means that changes to the output which don't also change the test expectation will cause CI to fail. Furthermore this should still make updates to the test output easily readable in review on CI because the test expectations are intended to look the same as before. Closes #1539 * Use raw vcode output in tests * Fix a merge conflict * Review comments	2022-01-10 11:59:45 -06:00
Nick Fitzgerald	95d8dd1424	cranelift: Re-add some tests that were accidentally removed	2022-01-07 11:00:58 -08:00
Nick Fitzgerald	6b5e9d8732	Merge pull request #3659 from fitzgen/vselect-isle cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:51:33 -08:00
Nick Fitzgerald	056f7c2674	cranelift: Port `vselect` over to ISLE on x64	2022-01-06 14:10:57 -08:00
Alex Crichton	72e2b7fe80	aarch64: Migrate bitrev/clz/cls/ctz to ISLE (#3658) This commit migrates these existing instructions to ISLE from the manual lowerings implemented today. This was mostly straightforward but while I was at it I fixed what appeared to be broken translations for I{8,16} for `clz`, `cls`, and `ctz`. Previously the lowerings would produce results as-if the input was 32-bits, but now I believe they all correctly account for the bit-width.	2022-01-06 15:18:32 -06:00
Nick Fitzgerald	b60a4df2af	cranelift: Move `bitselect` runtest file to shared runtests directory	2022-01-06 11:25:27 -08:00
Nick Fitzgerald	09aa09fd76	cranelift: Port `bitselect` over to ISLE on x64	2022-01-06 11:22:54 -08:00
wasmtime-publish	8043c1f919	Release Wasmtime 0.33.0 (#3648) * Bump Wasmtime to 0.33.0 [automatically-tag-and-release-this-commit] * Update relnotes for 0.33.0 * Wordsmithing relnotes Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com> Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2022-01-05 13:26:50 -06:00
Chris Fallin	e2b37a57dc	Merge pull request #3639 from bjorn3/machinst_cleanups Various cleanups around machinst	2022-01-05 10:01:27 -08:00
Chris Fallin	833ebeed76	Fix spillslot size bug in SIMD by removing type-dependent spillslot allocation. This patch makes spillslot allocation, spilling and reloading all based on register class only. Hence when we have a 32- or 64-bit value in a 128-bit XMM register on x86-64 or vector register on aarch64, this results in larger spillslots and spills/restores. Why make this change, if it results in less efficient stack-frame usage? Simply put, it is safer: there is always a risk when allocating spillslots or spilling/reloading that we get the wrong type and make the spillslot or the store/load too small. This was one contributing factor to CVE-2021-32629, and is now the source of a fuzzbug in SIMD code that puns an arbitrary user-controlled vector constant over another stackslot. (If this were a pointer, that could result in RCE. SIMD is not yet on by default in a release, fortunately.) In particular, we have not been particularly careful about using moves between values of different types, for example with `raw_bitcast` or with certain SIMD operations, and such moves indicate to regalloc.rs that vregs are in equivalence classes and some arbitrary vreg in the class is provided when allocating the spillslot or spilling/reloading. Since regalloc.rs does not track actual type, and since we haven't been careful about moves, we can't really trust this "arbitrary vreg in equivalence class" to provide accurate type information. In the fix to CVE-2021-32629 we fixed this for integer registers by always spilling/reloading 64 bits; this fix can be seen as the analogous change for FP/vector regs.	2022-01-04 13:24:40 -08:00
bjorn3	4915162230	Remove unnecessary fields from CodeInfo	2022-01-04 18:05:45 +01:00
bjorn3	32c3afe4b3	Add regression runtests	2021-12-17 20:58:32 +01:00
Alex Crichton	e94ebc2263	aarch64: Translate rot{r,l} to ISLE (#3614) This commit translates the `rotl` and `rotr` lowerings already existing to ISLE. The port was relatively straightforward with the biggest change being the instructions generated around i128 rotl/rotr primarily due to register changes.	2021-12-17 12:37:17 -06:00
Alex Crichton	d8974ce6bc	aarch64: Migrate ishl/ushr/sshr to ISLE (#3608) * aarch64: Migrate ishl/ushr/sshr to ISLE This commit migrates the `ishl`, `ushr`, and `sshr` instructions to ISLE. These involve special cases for almost all types of integers (including vectors) and helper functions for the i128 lowerings since the i128 lowerings look to be used for other instructions as well. This doesn't delete the i128 lowerings in the Rust code just yet because they're still used by Rust lowerings, but they should be deletable in due time once those lowerings are translated to ISLE. * Use more descriptive names for i128 lowerings * Use a with_flags-lookalike for csel * Use existing `with_flags_` Comment backwards order * Update generated code	2021-12-16 17:37:53 -06:00
Chris Fallin	fd171ca063	Fix OperandSize: need clamp-to-32-bit behavior in most cases, but true-width for shifts.	2021-12-16 12:32:28 -08:00
Chris Fallin	1323ae417e	Fix some 16- and 8-bit behavior in x64 backend related to rotates. Uncovered by @bjorn3 (thanks!): 8- and 16-bit rotates were not working properly in recent versions of Cranelift with part of the lowering migrated to ISLE. This PR fixes a few issues: - 8- and 16-bit rotate-left needs to mask a constant amount, if any, because we use a 32-bit rotate instruction and so don't get the appropriate shift-amount masking for free from x86 semantics. - `operand_size_from_type` was incorrect: it only handled 32- and 64-bit types and silently returned `OperandSize::Size32` for everything else. Now uses the `OperandSize::from_ty(ty)` helper as the pre-ISLE code did. Our test coverage for narrow value types is not great; this PR adds some runtests for rotl/rotr but more would always be better!	2021-12-16 11:34:24 -08:00
Alex Crichton	d29b7c8a59	Fix a simd shuffle test (#3607) Cranelift shuffles require indices to be in-bounds, which the avx512-using backend also requires via a debug assert, so this commit fixes a test with simd shuffles to only use in-bounds indices. This is motivated by another failure on CI where the machine we were running on presumably had avx512 things enabled. This should fix those failures. Closes #3581	2021-12-16 10:36:52 -08:00
Alex Crichton	4236319a53	aarch64: Migrate some bit-ops to ISLE (#3602) * aarch64: Migrate some bit-ops to ISLE This commit migrates these instructions to ISLE: * `bnot` * `band` * `bor` * `bxor` * `band_not` * `bor_not` * `bxor_not` The translations were relatively straightforward but the interesting part here was trying to reduce the duplication between all these instructions. I opted for a route that's similar to what the lowering does today, having a `decl` which takes the `ALUOp` and then performs further pattern matching internally. This enabled each instruction's lowering to be pretty simple while we still get to handle all the fancy cases of shifts, constants, etc, for each instruction. * Actually delete previous lowerings * Remove dead code	2021-12-15 10:41:36 -06:00
Alex Crichton	d89410ec4e	aarch64: Migrate `uextend`/`sextend` to ISLE This commit migrates the sign/zero extension instructions from `lower_inst.rs` to ISLE. There's actually a fair amount going on in this migration since a few other pieces needed touching up along the way as well: * First is the actual migration of `uextend` and `sextend`. These instructions are relatively simple but end up having a number of special cases. I've attempted to replicate all the cases here but double-checks would be good. * This commit actually fixes a few issues where if the result of a vector extraction is sign/zero-extended into i128 that actually results in panics in the current backend. * This commit adds exhaustive testing that extension-of-a-vector-extraction is a noop wrt extraction. * A bugfix around ISLE glue was required to get this commit working, notably the case where the `RegMapper` implementation was trying to map an input to an output (meaning ISLE was passing through an input unmodified to the output) wasn't working. This requires a `mov` instruction to be generated and this commit updates the glue to do this. At the same time this commit updates the ISLE glue to share more infrastructure between x64 and aarch64 so both backends get this fix instead of just aarch64. Overall I think that the translation to ISLE was a net benefit for these instructions. It's relatively obvious what all the cases are now unlike before where it took a few reads of the code and some boolean switches to figure out which path was taken for each flavor of input. I think there are still possible improvements here where, for example, the `put_in_reg_{s,z}ext64` helper doesn't use this logic so technically those helpers could also pattern match the "well atomic loads and vector extractions automatically do this for us" but that's a possible future improvement for later (and shouldn't be too hard with some ISLE refactoring).	2021-12-14 07:01:37 -08:00
Alex Crichton	20e090b114	aarch64: Migrate {s,u}{div,rem} to ISLE (#3572) * aarch64: Migrate {s,u}{div,rem} to ISLE This commit migrates four different instructions at once to ISLE: * `sdiv` * `udiv` * `srem` * `urem` These all share similar codegen and center around the `div` instruction to use internally. The main feature of these was to model the manual traps since the `div` instruction doesn't trap on overflow, instead requiring manual checks to adhere to the semantics of the instruction itself. While I was here I went ahead and implemented an optimization for these instructions when the right-hand-side is a constant with a known value. For `udiv`, `srem`, and `urem` if the right-hand-side is a nonzero constant then the checks for traps can be skipped entirely. For `sdiv` if the constant is not 0 and not -1 then additionally all checks can be elided. Finally if the right-hand-side of `sdiv` is -1 the zero-check is elided, but it still needs a check for `i64::MIN` on the left-hand-side and currently there's a TODO where `-1` is still checked too. * Rebasing and review conflicts	2021-12-13 17:27:11 -06:00
wasmtime-publish	c1c4c59670	Release Wasmtime 0.32.0 (#3589) * Bump Wasmtime to 0.32.0 [automatically-tag-and-release-this-commit] * Update release notes for 0.32.0 Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com> Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2021-12-13 13:47:30 -06:00
Pat Hickey	cf03b2a513	cranelift codegen & filetests: silence new dead code warnings in rust 1.57	2021-12-03 10:33:09 -08:00
Alex Crichton	0e90d4b903	Update addr2line and gimli deps (#3580) Just a routine update, figured it was good to stay close to their most recent versions	2021-12-01 15:48:36 -06:00
Alex Crichton	98ce029bbd	Add commutative addition cases	2021-11-19 07:00:04 -08:00
Alex Crichton	7d0f6ab90f	aarch64: Migrate `iadd` and `isub` to ISLE This commit is the first "meaty" instruction added to ISLE for the AArch64 backend. I chose to pick the first two in the current lowering's `match` statement, `isub` and `iadd`. These two turned out to be particularly interesting for a few reasons: * Both had clearly migratable-to-ISLE behavior along the lines of special-casing per type. For example 128-bit and vector arithmetic were both easily translatable. * The `iadd` instruction has special cases for fusing with a multiplication to generate `madd` which is expressed pretty easily in ISLE. * Otherwise both instructions had a number of forms where they attempted to interpret the RHS as various forms of constants, extends, or shifts. There's a bit of a design space of how best to represent this in ISLE and what I settled on was to have a special case for each form of instruction, and the special cases are somewhat duplicated between `iadd` and `isub`. There are custom "extractors" for the special cases and instructions that support these special cases will have a `rule`-per-case. Overall I think the ISLE transition went pretty well. I don't think that the aarch64 backend is going to follow the x64 backend super closely, though. For example the x64 backend is having a helper-per-instruction at the moment but with AArch64 it seems to make more sense to only have a helper-per-enum-variant-of-`MInst`. This is because the same instruction (e.g. `ALUOp::Sub32`) can be expressed with multiple different forms depending on the payload. It's worth noting that the ISLE looks like it's a good deal larger than the code actually being removed from lowering as part of this commit. I think this is deceptive though because a lot of the logic in `put_input_in_rse_imm12_maybe_negated` and `alu_inst_imm12` is being inlined into the ISLE definitions for each instruction instead of having it all packed into the helper functions. Some of the "boilerplate" here is the addition of various ISLE utilities as well.	2021-11-19 06:51:38 -08:00
Nick Fitzgerald	d2d0a0f36b	Remove Peepmatic!!! Peepmatic was an early attempt at a DSL for peephole optimizations, with the idea that maybe sometime in the future we could use it for instruction selection as well. It didn't really pan out, however: * Peepmatic wasn't quite flexible enough, and adding new operators or snippets of code implemented externally in Rust was a bit of a pain. * The performance was never competitive with the hand-written peephole optimizers. It was very size efficient, but that came at the cost of run-time efficiency. Everything was table-based and interpreted, rather than generating any Rust code. Ultimately, because of these reasons, we never turned Peepmatic on by default. These days, we just landed the ISLE domain-specific language, and it is better suited than Peepmatic for all the things that Peepmatic was originally designed to do. It is more flexible and easy to integrate with external Rust code. It has better time efficiency, meeting or even beating hand-written code. I think a small part of the reason why ISLE excels in these things is because its design was informed by Peepmatic's failures. I still plan on continuing Peepmatic's mission to make Cranelift's peephole optimizer passes generated from DSL rewrite rules, but using ISLE instead of Peepmatic. Thank you Peepmatic, rest in peace!	2021-11-17 13:04:17 -08:00
Nick Fitzgerald	33fcd6b4a5	x64: special case `0` to use `xor` in `Inst::gen_constant` for `i128`s	2021-11-10 15:57:58 -08:00
Nick Fitzgerald	b5105c025c	MachInst: always rematerialize constants, rather than assign them registers There were a few previous code paths that attempted to handle this, but this new check handles it for all callers. Rematerializing constants, rather than assigning and reusing a register, allows for lower register pressure.	2021-11-10 15:45:43 -08:00
Nick Fitzgerald	b8494822dc	ISLE: finish porting `imul` lowering to ISLE	2021-11-05 15:41:24 -07:00
Nick Fitzgerald	d377b665c6	Initial ISLE integration with the x64 backend On the build side, this commit introduces two things: 1. The automatic generation of various ISLE definitions for working with CLIF. Specifically, it generates extern type definitions for clif opcodes and the clif instruction data `enum`, as well as extractors for matching each clif instruction. This happens inside the `cranelift-codegen-meta` crate. 2. The compilation of ISLE DSL sources to Rust code, that can be included in the main `cranelift-codegen` compilation. Next, this commit introduces the integration glue code required to get ISLE-generated Rust code hooked up in clif-to-x64 lowering. When lowering a clif instruction, we first try to use the ISLE code path. If it succeeds, then we are done lowering this instruction. If it fails, then we proceed along the existing hand-written code path for lowering. Finally, this commit ports many lowering rules over from hand-written, open-coded Rust to ISLE. In the process of supporting ISLE, this commit also makes the x64 `Inst` capable of expressing SSA by supporting 3-operand forms for all of the existing instructions that only have a 2-operand form encoding: `dst = src1 op src2` Rather than only the typical x86-64 2-operand form: `dst = dst op src` This allows `MachInst` to be in SSA form, since `dst` and `src1` are disentangled. ("3-operand" and "2-operand" are a little bit of a misnomer since not all operations are binary operations, but we do the same thing for, e.g., unary operations by disentangling the sole operand from the result.) There are two motivations for this change: 1. To allow ISLE lowering code to have value-equivalence semantics. We want ISLE lowering to translate a CLIF expression that evaluates to some value into a `MachInst` expression that evaluates to the same value. We want both the lowering itself and the resulting `MachInst` to be pure and referentially transparent. This is both a nice paradigm for compiler writers that are authoring and maintaining lowering rules and is a prerequisite to any sort of formal verification of our lowering rules in the future. 2. Better align `MachInst` with `regalloc2`'s API, which requires that the input be in SSA form.	2021-10-12 17:11:58 -07:00
Chris Fallin	5e96a447f0	Add back the `ifcmp_sp` CLIF opcode. This opcode was removed as part of the old-backend cleanup in #3446. While this opcode will definitely go away eventually, it is unfortunately still used today in Lucet (as we just discovered while working to upgrade Lucet's pinned Cranelift version). Lucet is deprecated and slated to eventually be completely sunset in favor of Wasmtime; but until that happens, we need to keep this opcode.	2021-11-01 13:34:31 -07:00
wasmtime-publish	c1a6a0523d	Release Wasmtime 0.31.0 (#3489) * Bump Wasmtime to 0.31.0 [automatically-tag-and-release-this-commit] * Update 0.31.0 release notes Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com> Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2021-10-29 09:09:35 -05:00
Nick Fitzgerald	8aa8dfe26a	cranelift: in `test_compile` filetest, log disasm not clif With the old backends, this would log the lowered+legalized clif, but the log is useless now with the new backends. Logging the disasm is the new moral equivalent.	2021-10-26 10:21:31 -07:00
Chris Fallin	e9921574d7	Update to regalloc.rs 0.0.32. It appears that some allocation heuristics have changed slightly since 0.0.31, so some of the golden-output filetests are updated as well. Ideally we would rely more on runtests rather than golden-compilation tests; but for now this is sufficient. (I'm not sure exactly what in regalloc.rs changed to alter these heuristics; it's actually been almost a year since the 0.0.31 release with several refactorings and tweaks merged since then.) Fixes #3441.	2021-10-20 15:28:42 -07:00
bjorn3	a05bf2bf42	Remove instructions necessary for the old regalloc	2021-10-12 14:37:36 +02:00
bjorn3	1fd491dadd	Remove fallthrough instruction	2021-10-12 14:22:07 +02:00
bjorn3	5b24e117ee	Remove instructions used by old br_table legalization	2021-10-12 14:18:52 +02:00
bjorn3	3f87b768d5	Update filetests	2021-10-11 17:44:21 +02:00
bjorn3	d78f436daf	Remove reloc_constant. It is no longer used by the new backends.	2021-10-10 14:43:55 +02:00
bjorn3	2db3b5b9df	Remove code offsets from Function (#3412) * Remove code offsets from Function * Remove reloc_jt and fix wasmtime-cranelift	2021-10-07 15:54:00 +02:00
Afonso Bordado	fc33700071	cranelift: Enable `umulhi` tests for s390x	2021-10-06 20:59:53 +01:00
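
The `precise-output` / `BLESS=1` workflow described in commit a8ea0ec097 above can be sketched roughly as follows. This is an illustrative sketch only, not the filetest harness's actual code: the names `Outcome`, `check_expectation`, and `bless_requested` are hypothetical, and the exact value the harness accepts for `BLESS` is assumed to be `1` as in the commit message.

```rust
/// Result of comparing a recorded expectation with fresh compiler output.
/// (Hypothetical type, for illustration only.)
#[derive(Debug, PartialEq, Eq)]
enum Outcome {
    /// The recorded expectation already matches the output.
    Pass,
    /// Mismatch with blessing requested: carries the text that would be
    /// written back into the test file as the new expectation.
    Blessed(String),
    /// Mismatch without blessing: the test fails with this message.
    Fail(String),
}

/// Compare `expected` with `actual`. `bless` stands in for
/// "the BLESS=1 environment variable is set".
fn check_expectation(expected: &str, actual: &str, bless: bool) -> Outcome {
    if expected == actual {
        Outcome::Pass
    } else if bless {
        Outcome::Blessed(actual.to_string())
    } else {
        Outcome::Fail(format!(
            "expectation mismatch\nexpected:\n{}\nactual:\n{}",
            expected, actual
        ))
    }
}

/// One way a harness might read the flag (assumption: only "1" enables it).
fn bless_requested() -> bool {
    std::env::var("BLESS").map(|v| v == "1").unwrap_or(false)
}
```

A caller would pass `bless_requested()` as the third argument. On CI the variable is unset, so any change in output that is not accompanied by an updated expectation comes back as `Outcome::Fail`.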

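
The constant-divisor optimization described in commit 20e090b114 above can be summarized as a small decision table. The sketch below is a reading of that commit message, not the ISLE rules themselves: `DivOp`, `Checks`, and `required_checks` are hypothetical names. The handling of a non-constant or constant-zero divisor (zero check always, overflow check only for `sdiv`) is an assumption, and the sketch ignores the TODO the commit mentions, under which `-1` is still checked too.

```rust
/// The four CLIF operations covered by the commit (hypothetical enum).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DivOp {
    Sdiv,
    Udiv,
    Srem,
    Urem,
}

/// Which manual trap checks must be emitted around the hardware divide.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct Checks {
    /// Trap if the divisor is zero.
    zero_divisor: bool,
    /// Trap on signed overflow (the `i64::MIN / -1` case).
    signed_overflow: bool,
}

/// `rhs_const` is `Some(c)` when the right-hand side is a known constant.
fn required_checks(op: DivOp, rhs_const: Option<i64>) -> Checks {
    // Assumed general case: every op guards against a zero divisor, and
    // only `sdiv` additionally guards against signed overflow.
    let general = Checks {
        zero_divisor: true,
        signed_overflow: op == DivOp::Sdiv,
    };
    match rhs_const {
        // Unknown divisor, or a constant zero: keep the general checks.
        None | Some(0) => general,
        // `sdiv` by -1: the zero check is elided, but the left-hand side
        // must still be checked against `i64::MIN`.
        Some(-1) if op == DivOp::Sdiv => Checks {
            zero_divisor: false,
            signed_overflow: true,
        },
        // Any other nonzero constant: no checks are needed.
        Some(_) => Checks {
            zero_divisor: false,
            signed_overflow: false,
        },
    }
}
```

For example, `required_checks(DivOp::Udiv, Some(7))` reports that no checks are needed, while `required_checks(DivOp::Sdiv, Some(-1))` keeps only the overflow check.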

1113 Commits