wasmtime

Author	SHA1	Message	Date
Ömer Sinan Ağacan	0592b5a995	Fix umbrella crate URL in docs/index.md (#1694)	2020-05-13 17:05:55 -07:00
Dan Gohman	fb0b9e3ae6	Change `proc_exit` to unwind the stack rather than exiting the host process. (#1646) * Remove Cranelift's OutOfBounds trap, which is no longer used. * Change proc_exit to unwind instead of exit the host process. This implements the semantics in https://github.com/WebAssembly/WASI/pull/235. Fixes #783. Fixes #993. * Fix exit-status tests on Windows. * Revert the wiggle changes and re-introduce the wasi-common implementations. * Move `wasi_proc_exit` into the wasmtime-wasi crate. * Revert the spec_testsuite change. * Remove the old proc_exit implementations. * Make `TrapReason` an implementation detail. * Allow exit status 2 on Windows too. * Fix a documentation link. * Really fix a documentation link.	2020-05-13 15:59:43 -07:00
Cerberuser	f5eab5225f	Fixed links in compare-llvm.md (#1690) Several links were broken by line-breaks between the link caption and the link itself. This commit fixes them by moving each on its own line. Co-authored-by: k.bagrov <k.bagrov@g.nsu.ru>	2020-05-13 11:52:36 +02:00
Benjamin Bouvier	5987cf5cda	machinst: add a linear-scan checked variant too;	2020-05-13 10:56:32 +02:00
Benjamin Bouvier	07c55fa50f	aarch64: suggest a scratch register that's not caller-saved; If the scratch register is caller-saved, then it might appear in fixed ranges because of call clobbers. Instead, use a register that's not caller-saved and has no predefined use in the ABI.	2020-05-13 10:56:32 +02:00
Julian Seward	94190d5724	cranelift/reader/src/parser.rs: fn parse_inst_results: produce the results as a SmallVec<[Value; 1]>, not as a Vec<Value>. This isn't a useful change for any non-developer use of Cranelift, but it does significantly reduce the amount of allocation "noise" seen when tuning the new backend pipeline as driven by clif-util reading .clif files. In one case the number of malloc calls declined by about 20% with this change.	2020-05-11 12:27:15 +02:00
Chris Fallin	ee2f861fdd	Merge pull request #1674 from cfallin/machinst-reg-universe-opt MachInst backend: don't reallocate RealRegUniverses for each function compilation.	2020-05-09 14:10:26 -07:00
whitequark	4ec16fa057	Legalize 64 bit shifts on x86_32 using PSLLQ/PSRLQ. Co-authored-by: iximeow <git@iximeow.net>	2020-05-09 03:28:19 -07:00
whitequark	2331403741	Extend X86 ABI to cover stack overflow checking on X86-32. In stark contrast with every reasonable architecture, X86-32 does not pass any parameters in registers. Because of that we have to resort to reading arguments from stack without being able to use the stack slot machinery. (This wouldn't have been avoidable even by pinning a register because there is a trampoline in wasmtime with the C ABI that Cranelift needs to be able to call.)	2020-05-09 03:27:06 -07:00
Chris Fallin	17cef9140c	MachInst backend: don't reallocate RealRegUniverses for each function compilation. This saves ~0.14% instruction count, ~0.18% allocated bytes, and ~1.5% allocated blocks on a `clif-util wasm` compilation of `bz2.wasm` for aarch64.	2020-05-08 15:35:16 -07:00
Julian Seward	0bc0503f3f	Add a transformation pass which removes phi nodes to which it can demonstrate that only one value ever flows. Has been observed to improve generated code run times by up to 8%. Compilation cost increases by about 0.6%, but up to 7% total cost has been observed to be saved; in other words, it can be a significant win in terms of compilation time overall.	2020-05-08 09:41:16 +02:00
Andrew Brown	b65bd1c8a2	Add an `interpret` command to clif-util	2020-05-07 16:51:09 -07:00
Andrew Brown	9cf90b836b	Move `iterate_files` to the utils module	2020-05-07 16:51:09 -07:00
Andrew Brown	b26ca3cbdd	Add `test interpret` support to filetests	2020-05-07 16:51:09 -07:00
Andrew Brown	8b18fc5937	Add a CLIF interpreter This is an incomplete version of a Cranelift IR interpreter: only a small subset of instructions are implemented and (known) missing parts are marked with TODO or FIXME.	2020-05-07 16:51:09 -07:00
Andrew Brown	b4238229c2	Cast DataValues to and from native types Also, returns a `Result` in the `RunCommand::run` helper.	2020-05-07 16:51:09 -07:00
Benjamin Bouvier	528d3c1355	machinst: Steal the used/defs Sets when emitting a call in ABICall;	2020-05-07 12:24:02 +02:00
Benjamin Bouvier	19d8a7f1fb	machinst: Reuse memory across loop iterations in lowering;	2020-05-07 12:24:02 +02:00
Benjamin Bouvier	b24b711c16	machinst: Reduce the number of vec allocations for edge blocks;	2020-05-07 12:24:02 +02:00
Benjamin Bouvier	9215b610ef	machinst: Avoid a lot of short-lived allocations in ABICall;	2020-05-07 12:24:02 +02:00
Benjamin Bouvier	4f919c6460	machinst: bump regalloc to 0.0.23 and return a slice on the successor indexes, in block_succs;	2020-05-07 12:24:02 +02:00
Julian Seward	48521393ae	Update to regalloc.rs version 0.22.	2020-05-06 20:16:31 +02:00
Chris Fallin	6d73fdb70a	Merge pull request #1607 from cfallin/aarch64-stack-frame Rework aarch64 stack frame implementation to use positive offsets.	2020-05-06 10:29:30 -07:00
Chris Fallin	a66724aafd	Rework aarch64 stack frame implementation. This PR changes the aarch64 ABI implementation to use positive offsets from SP, rather than negative offsets from FP, to refer to spill slots and stack-local storage. This allows for better addressing-mode options, and hence slightly better code: e.g., the unsigned scaled 12-bit offset mode can be used to reach anywhere in a 32KB frame without extra address-construction instructions, whereas negative offsets are limited to a signed 9-bit unscaled mode (-256 bytes). To enable this, the PR introduces a notion of "nominal SP offsets" as a virtual addressing mode, lowered during the emission pass. The offsets are relative to "SP after adjusting downward to allocate stack/spill slots", but before pushing clobbers. This allows the addressing-mode expressions to be generated before register allocation (or during it, for spill/reload sequences). To convert these offsets into true offsets from SP, we need to track how much further SP is moved downward, and compensate for this. We do so with "virtual SP offset adjustment" pseudo-instructions: these are seen by the emission pass, and result in no instruction (0 byte output), but update state that is now threaded through each instruction emission in turn. In this way, we can push e.g. stack args for a call and adjust the virtual SP offset, allowing reloads from nominal-SP-relative spillslots while we do the argument setup with "real SP offsets" at the same time.	2020-05-06 09:23:55 -07:00
Benjamin Bouvier	1d90751ba9	machinst: Avoid a full instructions traversal of all the blocks when computing the final block ordering;	2020-05-06 15:13:25 +02:00
whitequark	162fcd3d75	Legalize [su]extend.i64 to iconst/sshr_imm + iconcat. This was already done for [su]extend.i128, and is necessary for codegen for 32-bit x86.	2020-05-05 16:08:58 -07:00
whitequark	14bdaf3ce3	Legalize ireduce.iN.i2N to isplit.	2020-05-05 14:13:30 -07:00
Alex Crichton	a7d90af19d	Update wasmparser and wast dependencies (#1663) Brings in updates to SIMD spec ops renumbering.	2020-05-05 16:13:14 -05:00
Andrew Brown	cd49ed9582	Add x86 legalization for sshr.i64x2	2020-05-05 12:01:46 -07:00
Andrew Brown	4155d15e69	Fix masking of vector shift values Previously, the logic was wrong on two counts: - It used the bits of the entire vector (e.g. i32x4 -> 128) instead of just the lane bits (e.g. i32x4 -> 32). - It used the type of the first operand before it was bitcast to its correct type. Remember that, by default, vectors are handed around as i8x16 and we must bitcast them to their correct type for Cranelift's verifier; see https://github.com/bytecodealliance/wasmtime/issues/1147 for discussion on this. This fix simply uses the type of the instruction itself, which is equivalent and hopefully less fragile to any changes.	2020-05-05 12:01:46 -07:00
Chris Fallin	59039df001	Merge pull request #1570 from cfallin/fix-long-range-aarch64-call Fix long-range (non-colocated) aarch64 calls to not use Arm64Call reloc, and fix simplejit to use new long-distance call.	2020-05-05 10:45:55 -07:00
Chris Fallin	e39b4aba1c	Fix long-range (non-colocated) aarch64 calls to not use Arm64Call reloc, and fix simplejit to use it. Previously, every call was lowered on AArch64 to a `call` instruction, which takes a signed 26-bit PC-relative offset. Including the 2-bit left shift, this gives a range of +/- 128 MB. Longer-distance offsets would cause an impossible relocation record to be emitted (or rather, a record that a more sophisticated linker would fix up by inserting a shim/veneer). This commit adds a notion of "relocation distance" in the MachInst backends, and provides this information for every call target and symbol reference. The intent is that backends on architectures like AArch64, where there are different offset sizes / addressing strategies to choose from, can either emit a regular call or a load-64-bit-constant / call-indirect sequence, as necessary. This avoids the need to implement complex linking behavior. The MachInst driver code provides this information based on the "colocated" bit in the CLIF symbol references, which appears to have been designed for this purpose, or at least a similar one. Combined with the `use_colocated_libcalls` setting, this allows client code to ensure that library calls can link to library code at any location in the address space. Separately, the `simplejit` example did not handle `Arm64Call`; rather than doing so, it appears all that is necessary to get its tests to pass is to set the `use_colocated_libcalls` flag to false, to make use of the above change. This fixes the `libcall_function` unit-test in this crate.	2020-05-05 09:55:12 -07:00
Benjamin Bouvier	fa54422854	Add a work-in-progress backend for x86_64 using the new instruction selection; Most of the work is credited to Julian Seward. Co-authored-by: Julian Seward <jseward@acm.org> Co-authored-by: Chris Fallin <cfallin@mozilla.com>	2020-05-05 16:35:41 +02:00
Benjamin Bouvier	6bee767129	clif-util: try both global and target-dependent settings when parsing --set flags;	2020-05-05 16:35:41 +02:00
Andrew Brown	d6796d0d23	Improve documentation of the filetest `run` command (#1645) * Improve output display of RunCommand The previous use of Debug for displaying `print` and `run` results was less than clear. * Avoid checking the types of vectors during trampoline construction Because DataValue only understands `V128` vectors, we avoid type-checking vector values when constructing the trampoline arguments. * Improve the documentation of the filetest `run` command Adds an up-to-date example of how to use the `run` and `print` directives and includes an actual use of the new directives in a SIMD arithmetic filetest.	2020-05-04 14:08:27 -05:00
Nick Fitzgerald	4471a82b0c	Merge pull request #1635 from fitzgen/filetests-threads Allow setting the number of filetest threads via the CRANELIFT_FILETESTS_THREADS env var	2020-05-01 10:06:26 -07:00
Nick Fitzgerald	c0503455be	Add documentation about the `CRANELIFT_FILETESTS_THREADS` environment variable	2020-05-01 09:15:46 -07:00
Chris Fallin	8393412c40	Merge pull request #1632 from cfallin/aarch64-fix-srclocs MachInst backend: attach SourceLoc span information to all ranges.	2020-04-30 16:13:55 -07:00
Chris Fallin	964c6087bd	MachInst backend: attach SourceLoc span information to all ranges. Previously, the SourceLoc information transferred in `VCode` only included PC-spans for non-default SourceLocs. I realized that the invariant we're supposed to keep here is that every PC is covered; if no source information, just use `SourceLoc::default()`. This was spurred by @bjorn3's comment in #1575 (thanks!).	2020-04-30 15:40:55 -07:00
Andrew Brown	49622bde58	Use complex load-extend instructions in `optimize_complex_addresses`; fixes #1186	2020-04-30 11:38:01 -07:00
Andrew Brown	a312506262	Add x86 complex encodings for SIMD load-extend instructions	2020-04-30 11:38:01 -07:00
Andrew Brown	38dff29179	Add ability to call CLIF functions with arbitrary arguments in filetests This resolves the work started in https://github.com/bytecodealliance/cranelift/pull/1231 and https://github.com/bytecodealliance/wasmtime/pull/1436. Cranelift filetests currently have the ability to run CLIF functions with a signature like `() -> b*` and check that the result is true under the `test run` directive. This PR adds the ability to call functions with arbitrary arguments and non-boolean returns and either print the result or check against a list of expected results: - `run` commands look like `; run: %add(2, 2) == 4` or `; run: %add(2, 2) != 5` and verify that the executed CLIF function returns the expected value - `print` commands look like `; print: %add(2, 2)` and print the result of the function to stdout To make this work, this PR compiles a single Cranelift `Function` into a `CompiledFunction` using a `SingleFunctionCompiler`. Because we will not know the signature of the function until runtime, we use a `Trampoline` to place the values in the appropriate location for the calling convention; this should look a lot like what @alexcrichton is doing with `VMTrampoline` in wasmtime (see `3b7cb6ee64/crates/api/src/func.rs (L510-L526)`, `3b7cb6ee64/crates/jit/src/compiler.rs (L260)`). To avoid re-compiling `Trampoline`s for the same function signatures, `Trampoline`s are cached in the `SingleFunctionCompiler`.	2020-04-30 11:21:00 -07:00
Andrew Brown	2048d3d30c	Add x86 encodings for same-size bint conversions up to 64 bits	2020-04-30 11:21:00 -07:00
Nick Fitzgerald	c4292fb2be	Allow setting the number of filetest threads via the CRANELIFT_FILETESTS_THREADS env var	2020-04-30 09:20:23 -07:00
Yury Delendik	1873c0ae46	Fix value label ranges resolution (#1572) There was a bug in how value labels were resolved, which caused some DWARF expressions not to be transformed, e.g. those that are in registers. * Implements FIXME in expression.rs * Move TargetIsa from CompiledExpression structure * Fix expression format for GDB * Add tests for parsing * Proper logic in ValueLabelRangesBuilder::process_label * Tests for ValueLabelRangesBuilder * Refactor build_with_locals to return Iterator instead of Vec<_> * Misc comments and magical numbers	2020-04-30 08:07:55 -05:00
Benjamin Bouvier	b7cfd39b53	aarch64: split emit tests into its own file; This is done to satisfy a check done on the maximal file's size when vendoring Rust source code into Mozilla central's repository.	2020-04-30 13:50:45 +02:00
Benjamin Bouvier	4c066b1c73	codegen: split lower.rs into multiple files; This splits off lower.rs into two files: lower.rs keeps all the utility functions, while lower_inst.rs contains the (gigantic!) function lowering a single Cranelift instruction into vcode. This is done to satisfy a check done on the maximal file's size when vendoring Rust source code into Mozilla central's repository.	2020-04-30 13:50:45 +02:00
Benjamin Bouvier	a2b6c19861	Fix arm32 build: ensure that the expand group is always generated;	2020-04-30 13:50:45 +02:00
Dan Gohman	864cf98c8d	Update release notes, wasmtime 0.16, cranelift 0.63.	2020-04-29 17:30:25 -07:00
Alex Crichton	363cd2d20f	Expose memory-related options in `Config` (#1513) * Expose memory-related options in `Config` This commit was initially motivated by looking more into #1501, but it ended up ballooning a bit after finding a few issues. The high-level items in this commit are: * New configuration options via `wasmtime::Config` are exposed to configure the tunable limits of how memories are allocated and such. * The `MemoryCreator` trait has been updated to accurately reflect the required allocation characteristics that JIT code expects. * A bug has been fixed in the cranelift wasm code generation where if no guard page was present bounds checks weren't accurately performed. The new `Config` methods allow tuning the memory allocation characteristics of wasmtime. Currently 64-bit platforms will reserve 6GB chunks of memory for each linear memory, but by tweaking various config options you can change how this is allocated, perhaps at the cost of slower JIT code since it needs more bounds checks. The methods are intended to be pretty thoroughly documented as to the effect they have on the JIT code and what values you may wish to select. These new methods have been added to the spectest fuzzer to ensure that various configuration values for these methods don't affect correctness. The `MemoryCreator` trait previously only allocated memories with a `MemoryType`, but this didn't actually reflect the guarantees that JIT code expected. JIT code is generated with an assumption about the minimum size of the guard region, as well as whether memory is static or dynamic (whether the base pointer can be relocated). These properties must be upheld by custom allocation engines for JIT code to perform correctly, so extra parameters have been added to `MemoryCreator::new_memory` to reflect this. Finally the fuzzing with `Config` turned up an issue where if no guard pages were present the wasm code wouldn't correctly bounds-check memory accesses. The issue here was that with a guard page we only need to bounds-check the first byte of access, but without a guard page we need to bounds-check the last byte of access. This meant that the code generation needed to account for the size of the memory operation (load/store) and use this as the offset-to-check in the no-guard-page scenario. I've attempted to make the various comments in cranelift a bit more exhaustive too to hopefully make it a bit clearer for future readers! Closes #1501 * Review comments * Update a comment	2020-04-29 17:10:00 -07:00

2056 Commits