wasmtime

Author	SHA1	Message	Date
Anton Kirilov	a1b39276e1	Enable more CLIF tests on AArch64 The tests for the SIMD floating-point maximum and minimum operations require particular care because the handling of the NaN values is non-deterministic and may vary between platforms. There is no way to match several NaN values in a test, so the solution is to extract the non-deterministic test cases into a separate file that is subsequently replicated for every backend under test, with adjustments made to the expected results. Copyright (c) 2021, Arm Limited.	2021-08-17 13:27:58 +01:00
Chris Fallin	7c0948fe0b	Merge pull request #3102 from afonso360/fix-bool-trampolines cranelift: Fix trampoline args for b1 types	2021-08-14 15:50:30 -07:00
Alex Crichton	e68aa99588	Implement the memory64 proposal in Wasmtime (#3153 ) * Implement the memory64 proposal in Wasmtime This commit implements the WebAssembly [memory64 proposal][proposal] in both Wasmtime and Cranelift. In terms of work done Cranelift ended up needing very little work here since most of it was already prepared for 64-bit memories at one point or another. Most of the work in Wasmtime is largely refactoring, changing a bunch of `u32` values to something else. A number of internal and public interfaces are changing as a result of this commit, for example: * Acessors on `wasmtime::Memory` that work with pages now all return `u64` unconditionally rather than `u32`. This makes it possible to accommodate 64-bit memories with this API, but we may also want to consider `usize` here at some point since the host can't grow past `usize`-limited pages anyway. * The `wasmtime::Limits` structure is removed in favor of minimum/maximum methods on table/memory types. * Many libcall intrinsics called by jit code now unconditionally take `u64` arguments instead of `u32`. Return values are `usize`, however, since the return value, if successful, is always bounded by host memory while arguments can come from any guest. * The `heap_addr` clif instruction now takes a 64-bit offset argument instead of a 32-bit one. It turns out that the legalization of `heap_addr` already worked with 64-bit offsets, so this change was fairly trivial to make. * The runtime implementation of mmap-based linear memories has changed to largely work in `usize` quantities in its API and in bytes instead of pages. This simplifies various aspects and reflects that mmap-memories are always bound by `usize` since that's what the host is using to address things, and additionally most calculations care about bytes rather than pages except for the very edge where we're going to/from wasm. Overall I've tried to minimize the amount of `as` casts as possible, using checked `try_from` and checked arithemtic with either error handling or explicit `unwrap()` calls to tell us about bugs in the future. Most locations have relatively obvious things to do with various implications on various hosts, and I think they should all be roughly of the right shape but time will tell. I mostly relied on the compiler complaining that various types weren't aligned to figure out type-casting, and I manually audited some of the more obvious locations. I suspect we have a number of hidden locations that will panic on 32-bit hosts if 64-bit modules try to run there, but otherwise I think we should be generally ok (famous last words). In any case I wouldn't want to enable this by default naturally until we've fuzzed it for some time. In terms of the actual underlying implementation, no one should expect memory64 to be all that fast. Right now it's implemented with "dynamic" heaps which have a few consequences: * All memory accesses are bounds-checked. I'm not sure how aggressively Cranelift tries to optimize out bounds checks, but I suspect not a ton since we haven't stressed this much historically. * Heaps are always precisely sized. This means that every call to `memory.grow` will incur a `memcpy` of memory from the old heap to the new. We probably want to at least look into `mremap` on Linux and otherwise try to implement schemes where dynamic heaps have some reserved pages to grow into to help amortize the cost of `memory.grow`. The memory64 spec test suite is scheduled to now run on CI, but as with all the other spec test suites it's really not all that comprehensive. I've tried adding more tests for basic things as I've had to implement guards for them, but I wouldn't really consider the testing adequate from just this PR itself. I did try to take care in one test to actually allocate a 4gb+ heap and then avoid running that in the pooling allocator or in emulation because otherwise that may fail or take excessively long. [proposal]: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md * Fix some tests * More test fixes * Fix wasmtime tests * Fix doctests * Revert to 32-bit immediate offsets in `heap_addr` This commit updates the generation of addresses in wasm code to always use 32-bit offsets for `heap_addr`, and if the calculated offset is bigger than 32-bits we emit a manual add with an overflow check. * Disable memory64 for spectest fuzzing * Fix wrong offset being added to heap addr * More comments! * Clarify bytes/pages	2021-08-12 09:40:20 -05:00
Afonso Bordado	8862499529	cranelift: Fix trampoline args for b1 types Our DataValues only have one size of booleans so we are always going to have this mismatch of sizes	2021-08-08 17:42:50 +01:00
Chris Fallin	9c550fcf41	Merge pull request #3128 from sparker-arm/aarch64-atomics Re-implement AArch64 atomic load and stores	2021-08-06 14:38:25 -07:00
Alex Crichton	ee3ff52661	Refactor cranelift immediates slightly I've run up against the `Into`-vs-`From` impls a few times and figured I'd go ahead and put up a refactoring. This switches `Into` impls into `From` impls which allows using both traits instead of just the `Into` version. Additionally this removes a few small `as` casts in favor of infallible `from`/`into` or `try_from` with error handling.	2021-08-06 09:14:25 -07:00
Alex Crichton	c6b095f9a3	cranelift: Implement nan canonicalization for vectors (#3146 ) This fixes some fuzz bugs that came about enabling simd where nan canonicalization is performed on the fuzzers but cranelift would panic on these ops for vectors. This adds some custom codegen with `bitselect` to ensure any nan lanes are canonical-nan lanes in the canonicalized operations.	2021-08-05 13:44:16 -05:00
Alex Crichton	9e142f8792	Fix some warnings on nightly Rust (#3148 ) Looks like these trailing-semicolons-in-macros are likely to become a hard error in the future, so this updates to remove them as necessary.	2021-08-05 13:02:44 -05:00
Alex Crichton	4cfa031c5f	Implement API support for v128-globals (#3147 ) Found via fuzzing, and looks like these were accidentally left out along the way SIMD was taking shape.	2021-08-05 13:02:34 -05:00
Sam Parker	b6f6ac116a	Revert IR changes Along with the x64 and s390x changes. Now pattern matching the uextend(atomic_load) in the aarch64 backend.	2021-08-05 09:35:32 +01:00
Sam Parker	cbb7229457	Re-implement atomic load and stores The AArch64 support was a bit broken and was using Armv7 style barriers, which aren't required with Armv8 acquire-release load/stores. The fallback CAS loops and RMW, for AArch64, have also been updated to use acquire-release, exclusive, instructions which, again, remove the need for barriers. The CAS loop has also been further optimised by using the extending form of the cmp instruction. Copyright (c) 2021, Arm Limited.	2021-08-05 09:08:08 +01:00
Alex Crichton	a33caec9be	Bump the wasm-tools crates (#3139 ) * Bump the wasm-tools crates Pulls in some updates here and there, mostly for updating crates to the latest version to prepare for later memory64 work. * Update lightbeam	2021-08-04 09:53:47 -05:00
Sam Parker	3bc2f0c701	Enable simd_X_extadd_pairwise_X for AArch64 Lower to [u\|s]addlp for AArch64. Copyright (c) 2021, Arm Limited.	2021-08-03 10:25:09 +01:00
Chris Fallin	a13a777230	Bump to Wasmtime v0.29.0 and Cranelift 0.76.0.	2021-08-02 11:24:09 -07:00
Alex Crichton	63a3bbbf5a	Change VMMemoryDefinition::current_length to `usize` (#3134 ) * Change VMMemoryDefinition::current_length to `usize` This commit changes the definition of `VMMemoryDefinition::current_length` to `usize` from its previous definition of `u32`. This is a pretty impactful change because it also changes the cranelift semantics of "dynamic" heaps where the bound global value specifier must now match the pointer type for the platform rather than the index type for the heap. The motivation for this change is that the `current_length` field (or bound for the heap) is intended to reflect the current size of the heap. This is bound by `usize` on the host platform rather than `u32` or` u64`. The previous choice of `u32` couldn't represent a 4GB memory because we couldn't put a number representing 4GB into the `current_length` field. By using `usize`, which reflects the host's memory allocation, this should better reflect the size of the heap and allows Wasmtime to support a full 4GB heap for a wasm program (instead of 4GB minus one page). This commit also updates the legalization of the `heap_addr` clif instruction to appropriately cast the address to the platform's pointer type, handling bounds checks along the way. The practical impact for today's targets is that a `uextend` is happening sooner than it happened before, but otherwise there is no intended impact of this change. In the future when 64-bit memories are supported there will likely need to be fancier logic which handles offsets a bit differently (especially in the case of a 64-bit memory on a 32-bit host). The clif `filetest` changes should show the differences in codegen, and the Wasmtime changes are largely removing casts here and there. Closes #3022 * Add tests for memory.size at maximum memory size * Add a dfg helper method	2021-08-02 13:09:40 -05:00
Johnnie Birch	e519fca61c	Refactor and turn on lowering for extend-add-pairwise	2021-07-31 10:52:39 -07:00
Johnnie Birch	e373ddfe1b	Add extend-add-pairwise instructions x64	2021-07-30 15:06:58 -07:00
Andrew Brown	26c78c06ef	refactor: remove unused field PR #3131 fixed the failing builds by allowing this field to be dead. After looking at it further the field is not being used and can be removedi completely.	2021-07-30 10:58:37 -07:00
Alex Crichton	4632b6a816	Fix warning on new-stable (#3131 ) One of the fields of `TargetIsa` isn't used in the cranelift-codegen-meta crate, but instead of refactoring to try to remove it this just adds `#[allow(dead_code)]` for now in the assumption that when the old backends go away this will probably go away as well.	2021-07-30 11:13:21 -05:00
Johnnie Birch	4f601edc36	Add x64 support for remaining int-to-int extend simd instructions Adds remaming support for int to int extend simd instructions. Specifically adds support for remaining I32x4->I64x2 instructions	2021-07-28 23:33:42 -07:00
Sam Parker	5eb2dca9f1	Added doc comment And removed an accidental code move. Copyright (c) 2021, Arm Limited.	2021-07-28 13:14:20 +01:00
Sam Parker	f2806a9192	rebase and ran cargo fmt Copyright (c) 2021, Arm Limited.	2021-07-28 13:14:20 +01:00
Sam Parker	541a4ee428	Enable simd_extmul_* for AArch64 Lower simd_extmul_[low/high][signed/unsigned] to [s\|u]widen inputs to an imul node. Copyright (c) 2021, Arm Limited.	2021-07-28 13:14:20 +01:00
Johnnie Birch	500f530322	Add support for i32x4_trunc_sat_f64x2_s for x64	2021-07-26 22:24:30 -07:00
Johnnie Birch	23290f0450	Add support for i32x4_trunc_sat_f64x2_u for x64	2021-07-26 22:24:30 -07:00
Johnnie Birch	5deda27977	Add support for Saturating Rounding Q-format Multiplication for x64	2021-07-26 20:32:46 -07:00
Johnnie Birch	ffec1f9b41	Fix for 3089 X64 ext_mul_i8x16 has incorrect lowering Also factors out unnecessary temp register	2021-07-26 20:06:43 -07:00
Andrew Brown	766774e1f5	refactor: reorganize crate imports	2021-07-26 13:39:16 -07:00
Andrew Brown	6b86984c41	x64: avoid load-coalescing SIMD operations with non-aligned loads Fixes #2943, though not as optimally as may be desired. With x64 SIMD instructions, the memory operand must be aligned--this change adds that check. There are cases, however, where we can do better--see #3106.	2021-07-26 13:39:16 -07:00
Nick Fitzgerald	a2cfddff9c	Merge pull request #3116 from fitzgen/update-gimli-and-addr2line Update `gimli` to 0.25; `addr2line` to 0.16	2021-07-26 13:01:37 -07:00
Chris Fallin	0f068ac933	Merge pull request #3117 from fitzgen/log-levels cranelift: Move most debug-level logs to the trace level	2021-07-26 12:52:39 -07:00
Nick Fitzgerald	4283d2116d	cranelift: Move most debug-level logs to the trace level Cranelift crates have historically been much more verbose with debug-level logging than most other crates in the Rust ecosystem. We log things like how many parameters a basic block has, the color of virtual registers during regalloc, etc. Even for Cranelift hackers, these things are largely only useful when hacking specifically on Cranelift and looking at a particular test case, not even when using some Cranelift embedding (such as Wasmtime). Most of the time, when people want logging for their Rust programs, they do something like: RUST_LOG=debug cargo run This means that they get all that mostly not useful debug logging out of Cranelift. So they might want to disable logging for Cranelift, or change it to a higher log level: RUST_LOG=debug,cranelift=info cargo run The problem is that this is already more annoying to type that `RUST_LOG=debug`, and that Cranelift isn't one single crate, so you actually have to play whack-a-mole with naming all the Cranelift crates off the top of your head, something more like this: RUST_LOG=debug,cranelift=info,cranelift_codegen=info,cranelift_wasm=info,... Therefore, we're changing most of the `debug!` logs into `trace!` logs: anything that is very Cranelift-internal, unlikely to be useful/meaningful to the "average" Cranelift embedder, or prints a message for each instruction visited during a pass. On the other hand, things that just report a one line statistic for a whole pass, for example, are left as `debug!`. The more verbose the log messages are, the higher the bar they must clear to be `debug!` rather than `trace!`.	2021-07-26 11:50:16 -07:00
Nick Fitzgerald	3d76cbdf34	Update `gimli` to 0.25; `addr2line` to 0.16	2021-07-26 11:04:53 -07:00
Afonso Bordado	a2fb019ba7	cranelift: Add basic i128 support in interpreter	2021-07-23 11:22:07 -07:00
Afonso Bordado	084383f60a	cranelift: Add support for i128 values in DataValue	2021-07-23 11:22:07 -07:00
Afonso Bordado	3a38400447	aarch64: Refactor lower_icmp to use a single materialize_bool_result	2021-07-19 09:31:14 -07:00
Afonso Bordado	14d1c7ee9f	aarch64: Refactor lower_icmp to allow returning a different flag	2021-07-19 09:31:14 -07:00
Afonso Bordado	e628fb376f	aarch64: Fix incorrect code generation for overflow icmp in i16 values	2021-07-19 09:31:14 -07:00
Afonso Bordado	db5566dadb	aarch64: Fix lowering amounts for shifts This commit addresses two issues: * A panic when shifting any non i128 type by i128 amounts (#3064) * Wrong results when lowering shifts with small types (i8, i16) In these types when shifting for amounts larger than the size of the type, we would not get the wrapping behaviour that we see on i32 and i64. This is because in these larger types, the wrapping behaviour is automatically implemented by using the appropriate instruction, however we do not have i8 and i16 specific instructions, so we have to manually wrap the shift amount with an AND instruction. This issue is also found on x86_64 and s390x, and a separate issue will be filed for those. Closes #3064	2021-07-16 22:08:02 +01:00
Anton Kirilov	6c3d7092b9	Enable the simd_conversions test for AArch64 Copyright (c) 2021, Arm Limited.	2021-07-16 22:04:45 +01:00
Johnnie Birch	2452a4cd74	Refactor lowering structure for ext_mul on x64 and add comments	2021-07-15 01:07:52 -07:00
Johnnie Birch	e5b6bee968	Add emit tests to ext_mul_* instructions	2021-07-15 01:07:52 -07:00
Johnnie Birch	6fbe0b72bd	Add simd_extmul_* support for x64	2021-07-15 01:07:52 -07:00
Johnnie Birch	d8e813204e	Fold fcvt_low_from_uinit into previously existing clif instructions	2021-07-09 10:39:05 -07:00
Johnnie Birch	6dd2df4fb3	Update comment on fcvt_low_from_sint instruction	2021-07-09 10:39:05 -07:00
Johnnie Birch	2d676d838f	Implements f64x2.convert_low_i32x4_u for x64	2021-07-09 10:39:05 -07:00
Chris Fallin	c71ad9490e	Merge pull request #3056 from afonso360/aarch64-fix-overflow-imm aarch64: Fix incorrect encoding of large const values in icmp.	2021-07-03 16:05:49 -07:00
Afonso Bordado	eebae8d4c8	aarch64: Fix incorrect encoding of large const values in icmp. When encoding constants as immediates into an RSE Imm12 instruction we need to take special care to check if the value that we are trying to input does not overflow its type when viewed as a signed value. (i.e. iconst.i8 200) We cannot both put an immediate and sign extend it, so we need to lower it into a separate reg, and emit the sign extend into the instruction. For more details see the [cg_clif bug report](https://github.com/bjorn3/rustc_codegen_cranelift/issues/1184#issuecomment-873214796).	2021-07-03 22:42:15 +01:00
bjorn3	37115c10e0	Implement Display for settings::Value	2021-07-03 14:34:42 +02:00
Benjamin Bouvier	4c595f4f9d	Remove unused store_stackslot/load_stackslot trait methods.	2021-07-02 18:09:33 +02:00

1 2 3 4 5 ...

1372 Commits