wasmtime

Author	SHA1	Message	Date
Daniel Marin	b86cba98a9	fixed typo in examples/memory.rs (#5576 )	2023-01-16 20:02:23 -06:00
Bobby Holley	72a74efe2f	Bump cargo-vet to 0.3.1 (#5579 ) This relaxes import parsing to ensure that adoption of the next major release by other projects doesn't break imports for this project. See https://github.com/mozilla/cargo-vet/issues/360#issuecomment-1384605968	2023-01-16 20:01:51 -06:00
Ayomide Bamidele	e4dc9c7944	Update Intel x86 CPU presets to match LLVM (#5490 ) * Update Intel x86 CPU presets * Add LLVM reference * Remove 32bit CPU architectures * Rename silvermont to slm * Fix haswell presets * Add icelake alias * Group streaming simd presets * Add slm silvermont preset * Remove duplicate alderlake def	2023-01-13 21:02:36 +00:00
Saúl Cabrera	f0979af157	cranelift-codegen: Prepare aarch64 for usage from Winch (#5570 ) This commit exposes the necessary aarch64 pieces to be used by Winch for binary emission.	2023-01-13 19:46:25 +00:00
Jamey Sharp	7682a40d62	`generated-code.rs` is not itself generated code (#5571 ) cranelift/codegen/src/isa/*/lower/isle/generated_code.rs are hand-written modules that `include!` the actual generated code. Tagging these hand-written files as `linguist-generated` in .gitattributes tells GitHub not to show them in diffs by default. But they're actually important to review, so they should be included in diffs.	2023-01-13 11:03:50 -08:00
Alex Crichton	cbeec5ddb9	Optimize some functions in the `wiggle` crate (#5566 ) * wiggle: Inline some trivial functions This commit marks a number of functions in wiggle as `#[inline]` as they're otherwise trivial, mostly returning constants. This comes out of some work I looked at recently with Andrew where some of these functions showed up in profiles when they shouldn't. * wiggle: Optimize the `GuestMemory` for shared memory This commit implements a minor optimization to the `GuestMemory` implementation for Wasmtime to skip most methods if a shared memory is in play. Shared memories never get borrowed and this can be used to internally skip some borrow-checker methods. * wiggle: Optimize `GuestPtr::to_vec` This commit replaces the safe implementation of `GuestPtr::to_vec` with an unsafe implementation. The purpose of this is to speed up the function when used with shared memory which otherwise performs a bunch of atomic reads for types like `u8` which does validation-per-element and isn't vectorizable. On a benchmark I was helping Andrew with this sped up the host code enough to the point that guest code dwarfed the execution time. * Fix build	2023-01-12 21:49:56 +00:00
Afonso Bordado	d3e6b7bd2a	fuzzgen: Enable riscv64 and disable unimplemented ops (#5502 )	2023-01-12 08:46:37 -08:00
Afonso Bordado	82494661c1	cranelift: Add `atomic_{load,store}` and `fence` to the interpreter (#5503 ) * cranelift: Add `fence` to interpreter * cranelift: Add `atomic_{load,store}` to the interpreter * fuzzgen: Add `atomic_{load,store}` * Update cranelift/fuzzgen/src/function_generator.rs Co-authored-by: Jamey Sharp <jamey@minilop.net> * fuzzgen: Use type size as the alignment size. Co-authored-by: Jamey Sharp <jamey@minilop.net>	2023-01-12 08:36:04 -08:00
Dan Gohman	6a20ca5512	Treat `wasmtime::component::Val::Float{32,64}` zero and negative zero as inequal (#5562 ) Following up on #5535, treat positive and negative zero as inequal in wasmtime::component::Val::Float{32,64}'s `PartialEq` logic. IEEE 754 equality considers these values equal, but they are semantically distinct values, and testing and fuzzing should be aware of the difference.	2023-01-11 16:24:39 -08:00
Saúl Cabrera	6cb68f3287	cranelift-codegen: Expose x64 settings (#5561 ) Exposes x64 settings so that they can be consumed from Winch for binary emission.	2023-01-11 18:33:03 -05:00
Kyle Brown	963d73a83b	Add component model wasmtime feature to Docs.rs (#5558 )	2023-01-11 16:42:18 +00:00
Dan Gohman	4f0445b9e8	Update the documentation for `Caller::get_export`. (#5556 ) Update the documentation for `Caller::get_export` to clarify that it's not expected to be removed in the future. Components do offer an alternative to `Caller::get_export`, so add a brief note mentioning that. Also, as of #4431 `get_export` now works for all exports, not just memories and functions.	2023-01-10 17:46:23 -08:00
Afonso Bordado	9556cb190f	cranelift: Forbid argument extensions for floats and SIMD vectors (#5536 ) * fuzzgen: Generate argument extensions only for integer argumetns * cranelift: Add verifier check for argument extensions	2023-01-10 10:26:30 -08:00
Afonso Bordado	23ea435a91	fuzzgen: Reenable `srem.i8` on x86 (#5545 )	2023-01-09 12:30:57 -08:00
Afonso Bordado	8ac04612ae	cranelift: Remove predicate not macro branch (#5552 )	2023-01-09 12:19:30 -08:00
Andrew Brown	f85e3f8517	wasi: avoid buffer underflow with shared memory (#5543 ) * wasi: avoid buffer underflow with shared memory This change fixes an issue identified when using wasi-threads to perform file reads. In order to maintain Rust safety guarantees in the presence of WebAssembly shared memory, which can be modified concurrently by any of the running threads, the WASI implementations of `fd_read` and `fd_pread` were given special code paths when shared memory is detected: in these cases, the data is first read into a host-limited buffer and then subsequently copied into linear memory. The problem was that the rather-complex logic for doing this "buffer then copy" idea for multiple IO vectors could fail due to buffer underflow. If, e.g., a read was limited by the host to 64K (or even if the read returned less than the total buffer size) the `UnsafeGuestSlice::copy_from_slice` logic would fail, complaining that the sizes of both buffers were unequal. This change both simplifies and fixes the logic: - only the first IO vector is filled; this could represent a performance penalty for threaded programs, but the "buffer then copy" idea already imposes a non-trivial overhead. This simplifies the logic, allowing us to... - resize the shared memory buffer to the exact number of bytes read * review: early return when no IO vectors passed to shared memory * fix: add empty RoFlags on early exit	2023-01-09 19:28:09 +00:00
Afonso Bordado	f845ebb450	cranelift: Remove `is_pic` predicate from x86 backend (#5548 ) This is still present as shared flags and we don't use the predicate anywhere.	2023-01-09 10:04:45 -08:00
Afonso Bordado	437f448a8d	cranelift: Cleanup testing docs (#5549 ) Removes some docs from tests that we no longer have. Updates the ebb examples to the new block based examples.	2023-01-09 10:02:59 -08:00
Afonso Bordado	a5fc046161	wasmtime: Add Android AArch64 Check (#5507 ) We accidentally broke the build for Android when introducing the jit-icache-coherence crate. To avoid this happening again, add a check job just to ensure that it can build. See #5323 and #5331 for context.	2023-01-09 08:46:19 -06:00
Alex Crichton	6d24de74d5	Update release notes for 5.0.0 (#5539 )	2023-01-09 08:45:18 -06:00
Alexa VanHattum	44913825b5	cranelift: fix register for `srem.i8` on x86_64 (#5540 ) * Change register written to in specific srem case. Add regression test as filetest case. Fixes #5470 * Add another test case, newline * Update comment	2023-01-06 22:18:16 +00:00
Jamey Sharp	e65a2c9280	cranelift-isle: Record all binding names (#5538 ) ...not just the ones at the outer scope of a rule. Thanks to @avanhatt for pointing out that #5221 didn't capture as much information as I intended it to.	2023-01-06 22:00:50 +00:00
Nick Fitzgerald	a491eaca0f	Ability to disable cache after it's been configured on `Config` (#5542 ) * Wasmtime: Add `Config::disable_cache` * bench-api: Always disable the cache * bench-api: Always get a `Config` from CLI flags This commit fixes an issue that I ran into just now where benchmarking one `.so` with `--engine-flags` was giving wildly unexpected results comparing to something without `--engine-flags`. The root cause here appears to that when specifying `--engine-flags` the CLI parsing code is used to create a `Config` and when omitted a `Config::new` instance is created. The main difference between these is that for the CLI caching is enabled by default and for `Config::new` it is not. Coupled with the fact that caching doesn't really work for the `main` branch this ended up giving wild results. The fix here is to first always use the CLI parsing code to create a `Config` to ensure that a config is consistently created. Next the `--disable-cache` flag is unconditionally passed to the CLI parsing to ensure that compilation actually happens. Once applied this enables comparing an engine without flags and an engine with flags which provides consistent results. Fix compile error Co-authored-by: Alex Crichton <alex@alexcrichton.com>	2023-01-06 21:12:59 +00:00
Sam Sartor	1efa3d6f8b	Add `clif-util compile` option to output object file (#5493 ) * add clif-util compile option to output object file * switch from a box to a borrow * update objectmodule tests to use borrowed isa * put targetisa into an arc	2023-01-06 12:53:48 -08:00
Lann	9156cca8ca	Treat `wasmtime::component::Val::Float{32,64}` NaNs as equal (#5535 ) In #5510 we changed the value types of these variants from u{32,64} to f{32,64}. One side effect of this change was that two NaN values would no longer compare equal. While this is behavior complies with IEEE-754 floating point operations, it broke equality assumptions in fuzzing. This commit changes equality for Val to make NaNs compare equal. Since the component model requires NaN canonicalization, all NaN bit representations compare equal, which is different from the original behavior. This also gives Vals the semantics of Eq again, so that trait impl has been reintroduced to related types as well.	2023-01-06 10:40:19 -06:00
uint256_t	b00455135e	Cranelift: Implement 'iabs' for scalar types on x86_64 (#5527 ) * Implement 'iabs' for scalar types on x86_64 * Small fix	2023-01-05 21:33:12 -08:00
Nick Fitzgerald	c50bdf600e	Cranelift: GVN all idempotently trapping but otherwise pure instructions (#5534 )	2023-01-05 15:08:06 -08:00
Nick Fitzgerald	d5b8da6eea	Wasmtime: Avoid a multiplication overflow when given 64-bit memories whose minimum size is the maximum memory64 size (#5533 )	2023-01-05 21:49:37 +00:00
Trevor Elliott	36e5bdfd0e	Fuzz multiple targets in cranelift-icache (#5482 ) Fuzz additional targets in the cranelift-icache target. The list of targets fuzzed is controlled by the targets enabled in fuzz/Cargo.toml. This PR also reworks how instruction disabling is done in function generator, moving the deny-list to a function to make the decision at runtime instead of compile time.	2023-01-05 18:49:23 +00:00
Afonso Bordado	ee6a909ccb	cranelift: Cleanup SIMD `icmp` tests (#5530 ) * cranelift: Enable more SIMD tests * cranelift: Reorganize icmp tests * cranelift: Enable SIMD icmp tests for unsigned ops * cranelift: Cleanup trailing newlines	2023-01-05 09:19:03 -08:00
wasmtime-publish	7bfbec1b57	Bump Wasmtime to 6.0.0 (#5521 ) Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com>	2023-01-05 09:46:01 -06:00
Nick Fitzgerald	f4a2d5337a	Cranelift: GVN `uadd_overflow_trap` (#5520 ) * Switch duplicate loads w/ dynamic memories test to `min_size = 0` This test was accidentally hitting a special case for bounds checks for when we know that `offset + access_size < min_size` and can skip some steps. This commit changes the `min_size` of the memory to zero so that we are forced to do fully general bounds checks. * Cranelift: Mark `uadd_overflow_trap` as okay for GVN Although this improves our test sequence for duplicate loads with dynamic memories, it unfortunately doesn't have any effect on sightglass benchmarks: ``` instantiation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [34448 35607.23 37158] gvn_uadd_overflow_trap.so [34566 35734.05 36585] main.so instantiation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [44101 60449.62 92712] gvn_uadd_overflow_trap.so [44011 60436.37 92690] main.so instantiation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [35595 36675.72 38153] gvn_uadd_overflow_trap.so [35440 36670.42 37993] main.so compilation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [17370195 17405125.62 17471222] gvn_uadd_overflow_trap.so [17369324 17404859.43 17470725] main.so execution :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [7055720520 7055886880.32 7056265930] gvn_uadd_overflow_trap.so [7055719554 7055843809.33 7056193289] main.so compilation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [683589861 683767276.00 684098366] gvn_uadd_overflow_trap.so [683590024 683767998.02 684097885] main.so execution :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [46436883 46437135.10 46437823] gvn_uadd_overflow_trap.so [46436883 46437087.67 46437785] main.so compilation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [126522461 126565812.58 126647044] gvn_uadd_overflow_trap.so [126522176 126565757.75 126647522] main.so execution :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [653010531 653010533.03 653010544] gvn_uadd_overflow_trap.so [653010531 653010533.18 653010537] main.so ``` * cranelift-codegen-meta: Rename `side_effects_okay_for_gvn` to `side_effects_idempotent` * cranelift-filetests: Ensure there is a trailing newline for blessed Wasm tests	2023-01-04 22:03:16 -08:00
Nick Fitzgerald	b46ad1b54d	Wasmtime: set the `cranelift_wasm::Heap`'s min size (#5522 ) This unlocks certain bounds checking optimizations in some configurations. Wasn't able to measure any delta in sightglass, but still worth doing anyways.	2023-01-05 01:18:46 +00:00
Trevor Elliott	e2e98f694f	Remove lower_br_fcmp from the riscv64 backend (#5519 ) Remove the lower_br_fcmp function from the riscv64 backend. This PR only affects the emit implementation for FloatRound, replacing the uses of lower_br_fcmp with direct uses of FpuRRR and CondBr. Any changes in behavior here should be already covered by the runtests for ceil, floor, trunc, and nearest.	2023-01-04 14:22:35 -08:00
Trevor Elliott	5d429e46e8	Remove the MInst::TrapFf constructor from the riscv64 backend (#5515 ) Remove the MInst::TrapFf instruction in the riscv64 backend. It was only used in two places in the emit case for FloatRound, and was easily replaced with a combination of FpuRRR and TrapIf.	2023-01-04 13:34:46 -08:00
Alexa VanHattum	4bc4fae571	Small update for filename in `isle_integration.md` (#5516 ) * Small update for filename in `isle_integration.md` The name "clif.isle" is stale (since #4953), now two files "clif_lower.isle" and "clif_opt.isle" are generated. Not sure if that PR necessitates other changes this this doc. * Update cranelift/docs/isle-integration.md Co-authored-by: Jamey Sharp <jsharp@fastly.com>	2023-01-04 20:06:49 +00:00
Nick Fitzgerald	937601c7c3	Cranelift: GVN spectre guards and run redundant load elimination twice (#5517 ) * Cranelift: Make spectre guards GVN-able While these instructions have a side effect that is otherwise invisible to the optimizer, the side effect in question is idempotent, so it can be de-duplicated by GVN. * Cranelift: Run redundant load replacement and GVN twice This allows us to actually replace redundant Wasm loads with dynamic memories. While this improves our hand-crafted test sequences, it doesn't seem to have any improvement on sightglass benchmarks run with dynamic memories, however it also isn't a hit to compilation times, so seems generally good to land anyways: ``` $ cargo run --release -- benchmark -e ~/scratch/once.so -e ~/scratch/twice.so -m insts-retired --processes 20 --iterations-per-process 3 --engine-flags="--static-memory-maximum-size 0" -- benchmarks/default.suite compilation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [683595240 683768610.53 684097577] once.so [683597068 700115966.83 1664907164] twice.so instantiation :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [44107 60411.07 92785] once.so [44138 59552.32 92097] twice.so compilation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [17369916 17404839.78 17471458] once.so [17369935 17625713.87 30700150] twice.so compilation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [126523640 126566170.80 126648265] once.so [126523076 127174580.30 163145149] twice.so instantiation :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [34569 35686.25 36513] once.so [34651 35749.97 36953] twice.so instantiation :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [35146 36639.10 37707] once.so [34472 36580.82 38431] twice.so execution :: instructions-retired :: benchmarks/spidermonkey/benchmark.wasm No difference in performance. [7055720115 7055841324.82 7056180024] once.so [7055717681 7055877095.85 7056225217] twice.so execution :: instructions-retired :: benchmarks/pulldown-cmark/benchmark.wasm No difference in performance. [46436881 46437081.28 46437691] once.so [46436883 46437127.68 46437766] twice.so execution :: instructions-retired :: benchmarks/bz2/benchmark.wasm No difference in performance. [653010530 653010533.27 653010539] once.so [653010531 653010532.95 653010538] twice.so ```	2023-01-04 20:05:43 +00:00
Trevor Elliott	b2d5afdf83	riscv64: Implement fcmp in ISLE (#5512 ) Rework the compilation of fcmp in the riscv64 backend to be in ISLE, removing the need for the dedicated Fcmp instruction. This change is motivated by #5500, which showed that the riscv64 backend was generating branch instructions in the middle of a basic block. We can't remove lower_br_fcmp quite yet as it's used in a few places in the emit module, but it's now no longer reachable from the ISLE lowerings. Fixes #5500	2023-01-04 11:52:00 -08:00
Nick Fitzgerald	d1920f5a2d	cranelift: Add wasm tests for duplicate loads (#5514 ) * cranelift-filetests: Add the ability to test optimized CLIF in Wasm tests * cranelift: Add Wasm tests for identical loads, back to back	2023-01-04 18:52:32 +00:00
Andrew Brown	7c67378ab6	wiggle: copy guest strings from shared memory (#5475 ) * wiggle: copy guest strings from shared memory Along the same lines as #5471, this change adds a new smart pointer, `GuestStrCow`, to copy the string bytes over from Wasm memory to the host when the string is found in shared memory. This is necessary to maintain Rust guarantees: with shared memory, the bytes backing a `GuestStr` could be altered by another thread and this would invalidate the assumption that we can dereference at any point to `&str`. `GuestStrCow` is essentially a wrapper around `GuestStr` when the memory is not shared but copies the memory region into a `String` when the memory is shared. This change updates the uses of Wiggle strings in both wasi-common and wasi-crypto. * review: perform UTF-8 check on `GuestStr` construction	2023-01-04 10:10:00 -06:00
Afonso Bordado	52ba72f341	riscv64: Fix masking on `iabs` (#5505 ) * cranelift: Add `iabs.i128` runtest * riscv64: Fix incorrect extension in iabs When lowering iabs, we were accidentally comparing the unextended value this caused the instruction to misbehave with certain top bits. This commit also adds a zbb lowering that does not use jumps.	2023-01-03 17:37:25 -08:00
Nick Fitzgerald	276bc6ad2e	cranelift-wasm: Better track reachability after translating loads (#5511 ) We can sometimes statically determine that a given load will unconditionally trap. When this happens, we emit an unconditional trap, and we need to stop adding new instructions to the block. This commit introduces a `Reachability<T>` type that is like `Option<T>` but specifically for communicating reachability and is marked `must_use` to receivers to handle transitions from reachable to unreachable states. Additionally, adds handling of reachable -> unreachable state transitions to some SIMD op translations that weren't checking for it. Fixes #5455 Fixes #5456	2023-01-03 22:04:18 +00:00
Lann	0029ff95ac	Use floats for `wasmtime::component::Val::Float*` (#5510 ) The definitions of `wasmtime::component::Val::Float{32,64}` mirrored `wasmtime::Val::F{32,64}` by using integers as their wrapped types, storing the bit representation of their floating point values. This was necessary for the core Wasm `f32`/`f64` types because Rust floats don't have guaranteed NaN bit representations. The component model `float32`/`float64` types require NaN canonicalization, so we can use normal Rust `f{32,64}` instead. Closes #5480	2023-01-03 20:23:38 +00:00
Afonso Bordado	7e94704264	riscv64: Add masking for small types when lowering select (#5504 ) When lowering `select+icmp` we have an optimization that allows us to avoid materializing the icmp result. We were accidentally not masking the high bits for i8 and i16 in this case. Issue #5498 reported this as an illegal instruction but what was happening there was that the invalid select caused a division by zero.	2023-01-03 19:59:14 +00:00
Andrew Brown	f911855612	wiggle: copy guest slices back to shared memory (#5471 ) This change upgrades `UnsafeGuestSlice` in Wiggle to expose more functionality to be able to use `std::ptr::copy` for writing bytes into Wasm shared memory. Additionally, it adds a new `GuestCow` type for delineating between Wasm memory regions that can be borrowed (non-shared memory) or must be copied (shared memory) in order to maintain Rust guarantees. With these in place, it is now possible to implement the `preview1` "read" functions for shared memory. Previously, these would panic if attempting to copy to a shared memory. This change removes the panic and introduces some (rather complex) logic for handling both the shared and non-shared cases: - if reading into a Wasm non-shared memory, Wiggle guarantees that no other guest pointers will touch the memory region and, in the absence of concurrency, a WASI function can write directly to this memory - if reading into a Wasm shared memory, the memory region can be concurrently modified. At @alexcrichton's request re: Rust safety, this change copies all of the bytes into an intermediate buffer before using `std::ptr::copy` to move them into Wasm memory. This change only applies to the `preview0` and `preview1` implementations of `wasi-common`. Fixing up other WASI implementations (esp. wasi-crypto) is left for later.	2023-01-03 19:51:34 +00:00
Lann	69b7ecf90e	Add `wasmtime::UnknownImportError` (#5509 ) This adds a new error type `UnknownImportError` which will be returned (wrapped in an `anyhow::Error`) by `Linker::instantiate{,_async,_pre}` if a module has an unresolvable import. This error type is also used by `Linker::define_unknown_imports_as_traps`; any resulting traps will also downcast to `UnknownImportError`. Closes #5416	2023-01-03 19:01:57 +00:00
Afonso Bordado	c9c7d4991c	riscv64: Fix br-table segfault with zero sized jump tables (#5508 ) We had a off-by-one bounds check error when checking if we should jump to the default block in a br-table. Instead of always jumping to the default block when we have a jump table with 0 targets we would try to compute an offset past the end of the table. This sometimes would not crash, but it would crash if the there was no block after the br_table, thus adding a cold block would cause a segfault. The actual fix is quite simple, do not count the default block as a jump table entry when computing the limits. This commit also does a bunch of cleanup and adding some comments to the br_table emission code.	2023-01-03 10:22:48 -08:00
Afonso Bordado	0043f8e17a	wasmtime: Add FreeBSD x86_64 check (#5506 ) We accidentally broke the build for FreeBSD when introducing the jit-icache-coherence crate. To avoid this happening again, add a check job just to ensure that it can build. See #5323 and #5331 for context.	2023-01-03 18:03:27 +00:00
KarelPeeters	320d67fe8d	Cranelift: include return values in instruction pretty print output. (#5489 )	2023-01-03 09:06:47 -08:00
Alexander Günsche	e3c7bf638a	adding missing step of dependencies installation (#5492 )	2023-01-03 09:48:24 -06:00

1 2 3 4 5 ...

10727 Commits