wasmtime

Author	SHA1	Message	Date
Alex Crichton	88ff0247bf	Update wasm proposal support docs (#4175 ) * Update wasm proposal support docs Rename `--enable` flags to simply names and additionally replace module linking with the component model. * Fix a typo	2022-06-01 13:34:33 -07:00
Chris Fallin	8f61eb9341	Upgrade to regalloc2 version 0.2.1. (#4199 ) This resolves an edge-case where mul.i128 with an input that continues to be live after the instruction could cause an invalid regalloc constraint (basically, the regalloc did not previously support an instruction use and def both being constrained to the same physical reg; and the "mul" variant used for mul.i128 on x64 was the only instance of such operands in Cranelift). Causes two extra move instructions in the mul.i128 filetest, but that's the price to pay for the slightly more general (works in all cases) handling of the constraints.	2022-06-01 13:26:20 -07:00
Alex Crichton	704db02e00	Add a first-class `StoreId` type to Wasmtime (#4204 ) * Add a first-class `StoreId` type to Wasmtime This commit adds a `StoreId` type to uniquely identify a store internally within Wasmtime. This hasn't been created previously as it was never really needed but I've run across a case for its usage in the component model so I've gone ahead and split out a commit to add this type. While I was here in this file I opted to improve some other miscellaneous things as well: * Notes were added to the `Index` impls that unchecked indexing could be used in theory if we ever need it one day. * The check in `Index` for the same store should now be a bit lighter on codegen where instead of having a `panic!()` in the codegen for each `Index` there's now an out-of-line version which is `#[cold]`. This should improve codegen as calling a function with no arguments is slightly more efficient than calling the panic macro with one string argument. * An `assert!` guarded with a `cfg(debug_assertions)` was changed to a `debug_assert!`. * Allocation of a `StoreId` was refactored to a method on the `StoreId` itself. * Review comments * Fix an ordering	2022-06-01 14:46:21 -05:00
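The out-of-line `#[cold]` check described in the commit above can be sketched roughly as follows. This is a minimal illustration, not Wasmtime's actual code: the `allocate` and `assert_belongs_to` method names, the `store_id_mismatch` helper, and the panic message are all assumptions made for this example.

```rust
use std::sync::atomic::{AtomicU64, Ordering};

/// Hypothetical stand-in for the `StoreId` type described in the commit.
#[derive(Copy, Clone, Debug, PartialEq, Eq, Hash)]
pub struct StoreId(u64);

impl StoreId {
    /// Allocates a process-unique identifier (allocation lives on the type itself).
    pub fn allocate() -> StoreId {
        static NEXT_ID: AtomicU64 = AtomicU64::new(0);
        // Relaxed ordering suffices here: only uniqueness matters, not ordering
        // relative to other memory operations.
        StoreId(NEXT_ID.fetch_add(1, Ordering::Relaxed))
    }

    /// Checks that `self` and `other` name the same store. The failure path is
    /// a call to a zero-argument cold function rather than an inline `panic!`.
    #[inline]
    pub fn assert_belongs_to(&self, other: StoreId) {
        if *self == other {
            return;
        }
        store_id_mismatch();
    }
}

/// Out-of-line failure path (hypothetical name). `#[cold]` hints that this is
/// rarely called, keeping the hot path at each call site small.
#[cold]
#[inline(never)]
fn store_id_mismatch() -> ! {
    panic!("object used with the wrong store");
}
```

Each call site then only pays for a compare and a branch to a function taking no arguments, which is the codegen benefit the commit message describes.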
Alex Crichton	2a4851ad2b	Change some `VMContext` pointers to `()` pointers (#4190 ) * Change some `VMContext` pointers to `()` pointers This commit is motivated by my work on the component model implementation for imported functions. Currently all context pointers in wasm are `*mut VMContext` but with the component model my plan is to make some pointers instead along the lines of `*mut VMComponentContext`. In doing this though one worry I have is breaking what has otherwise been a core invariant of Wasmtime for quite some time, subtly introducing bugs by accident. To help assuage my worry I've opted here to erase knowledge of `*mut VMContext` where possible. Instead where applicable a context pointer is simply known as `*mut ()` and the embedder doesn't actually know anything about this context beyond the value of the pointer. This will help prevent Wasmtime from accidentally ever trying to interpret this context pointer as an actual `VMContext` when it might instead be a `VMComponentContext`. Overall this was a pretty smooth transition. The main change here is that the `VMTrampoline` (now sporting more docs) has its first argument changed to `*mut ()`. The second argument, the caller context, is still configured as `*mut VMContext` though because all functions are always called from wasm still. Eventually for component-to-component calls I think we'll probably "fake" the second argument as the same as the first argument, losing track of the original caller, as an intentional way of isolating components from each other. Along the way there are a few host locations which do actually assume that the first argument is indeed a `VMContext`. These are valid assumptions that are upheld from a correct implementation, but I opted to add a "magic" field to `VMContext` to assert this in debug mode. This new "magic" field is initialized during normal vmcontext initialization and it's checked whenever a `VMContext` is reinterpreted as an `Instance` (but only in debug mode). 
My hope here is to catch any future accidental mistakes, if ever. * Use a VMOpaqueContext wrapper * Fix typos	2022-06-01 11:00:43 -05:00
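The "magic" field idea from the commit above can be illustrated with a small standalone sketch. The `Context` type, the `CONTEXT_MAGIC` constant and its value, and the `as_opaque` / `from_opaque` names are hypothetical and chosen for this example only; they are not Wasmtime's real definitions.

```rust
/// Hypothetical tag value written into every context at initialization.
pub const CONTEXT_MAGIC: u32 = u32::from_le_bytes(*b"core");

/// Hypothetical context type standing in for `VMContext`.
#[repr(C)]
pub struct Context {
    magic: u32,
    pub payload: u64,
}

impl Context {
    /// Normal initialization always writes the magic value.
    pub fn new(payload: u64) -> Context {
        Context { magic: CONTEXT_MAGIC, payload }
    }

    /// Erases the type: callers only see an opaque `*mut ()`.
    pub fn as_opaque(&mut self) -> *mut () {
        self as *mut Context as *mut ()
    }

    /// Reinterprets an opaque pointer as a `Context`.
    ///
    /// # Safety
    /// `ptr` must have come from `as_opaque` on a `Context` that is still live
    /// and not otherwise borrowed for the duration of `'a`.
    pub unsafe fn from_opaque<'a>(ptr: *mut ()) -> &'a mut Context {
        let cx = unsafe { &mut *(ptr as *mut Context) };
        // Debug-only check: catches a pointer that is really some other kind
        // of context, at no cost in release builds.
        debug_assert_eq!(cx.magic, CONTEXT_MAGIC);
        cx
    }
}
```

The check cannot make a bad cast sound, but in debug builds it turns a silent misinterpretation into an immediate assertion failure.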
Alex Crichton	f4b9020913	Change wasm-to-host trampolines to take the values_vec size (#4192 ) * Change wasm-to-host trampolines to take the values_vec size This commit changes the ABI of wasm-to-host trampolines, which are only used right now for functions created with `Func::new`, to pass along the size of the `values_vec` argument. Previously the trampoline simply received `*mut ValRaw` and assumed that it was the appropriate size. By receiving a size as well we can thread through `&mut [ValRaw]` internally instead of `*mut ValRaw`. The original motivation for this is that I'm planning to leverage these trampolines for the component model for host-defined functions. Out of an abundance of caution of making sure that everything lines up I wanted to be able to write down asserts about the size received at runtime compared to the size expected. This overall led me to the desire to thread this size parameter through on the assumption that it would not impact performance all that much. I ran two benchmarks locally from the `call.rs` benchmark and got: * `sync/no-hook/wasm-to-host - nop - unchecked` - no change * `sync/no-hook/wasm-to-host - nop-params-and-results - unchecked` - 5% slower This is what I roughly expected in that if nothing actually reads the new parameter (e.g. no arguments) then threading through the parameter is effectively otherwise free. Otherwise though accesses to the `ValRaw` storage are now bounds-checked internally in Wasmtime instead of assuming it's valid, leading to the 5% slowdown (~9.6ns to ~10.3ns). If this becomes a performance bottleneck for a particular use case then we should be fine to remove the bounds checking here or otherwise only bounds check in debug mode, otherwise I plan on leaving this as-is. Of particular note this also changes the C API for `_unchecked` functions where the C callback now receives the size of the array as well. Add docs	2022-06-01 09:05:37 -05:00
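A rough sketch of the pointer-plus-length idea from the commit above: once the trampoline receives a length, it can build a slice and every later access is bounds-checked. `RawSlot`, `trampoline`, and `add_two_i32s` are hypothetical names for this example and do not reflect Wasmtime's actual trampoline signature.

```rust
/// Hypothetical 16-byte value slot, standing in for `ValRaw`.
#[derive(Copy, Clone, Debug, PartialEq)]
#[repr(C)]
pub struct RawSlot(pub u128);

/// Host-side logic. It only sees a slice, so indexing is bounds-checked
/// instead of trusting that the buffer is large enough.
fn add_two_i32s(values: &mut [RawSlot]) {
    // The kind of size assertion the commit wants to be able to write down.
    assert!(values.len() >= 2, "expected at least two value slots");
    let a = values[0].0 as i32;
    let b = values[1].0 as i32;
    // Store the result zero-extended into the first slot.
    values[0] = RawSlot(a.wrapping_add(b) as u32 as u128);
}

/// Trampoline entry point taking the buffer pointer and its length.
///
/// # Safety
/// `values_vec` must point to `values_vec_len` initialized, writable slots.
pub unsafe extern "C" fn trampoline(values_vec: *mut RawSlot, values_vec_len: usize) {
    // Rebuild a slice from the raw parts so nothing downstream handles the
    // raw pointer directly.
    let values = unsafe { std::slice::from_raw_parts_mut(values_vec, values_vec_len) };
    add_two_i32s(values);
}
```

The bounds checks on `values[0]` and `values[1]` are the sort of extra work the commit attributes its measured slowdown to when parameters and results are actually read.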
Andrew Brown	0bdd8e3510	Upgrade listenfd to v1.0.0 (#4197 ) Previously, `listenfd` depended on an old version of the `uuid` crate which caused cargo deny failures. https://github.com/mitsuhiko/listenfd/pull/13 upgrades the `uuid` dependency and a new version of `listenfd` is published. This change moves to the latest version of `listenfd`.	2022-05-31 08:44:44 -07:00
Alex Crichton	4d9e10dae1	Fix panics in the C API related to trap frames (#4196 ) The `wasmtime-cpp` test suite uncovered an issue where asking for the frames of a trap would fail immediately after the trap was created. In addition to fixing this issue I've also updated the documentation of `Trap::frames` to indicate when it returns `None`.	2022-05-31 10:39:11 -05:00
Alex Crichton	7d3639522e	Capture unresolved backtraces on traps (#4193 ) I was running tests recently and was surprised that the `--test all` test was taking more than a minute to run when I didn't recall it ever taking more than a minute historically. A bisection pointed out #4183 as the cause and after re-reviewing I realized I forgot that we capture unresolved backtraces by default (and don't actually resolve them anywhere yet but that's a problem for another day) rather than resolved backtraces. This means that it's intended that we use `Backtrace::new_unresolved` instead of `Backtrace::new` in the traphandlers crate. The reason that tests were running so slowly is that the tests which deal with deep stacks (e.g. stack overflow) would take forever in testing as the Rust-based decoding of DWARF information is egregiously slow in unoptimized mode. I did discover independently that optimizing these dependencies makes the tests ~6x faster, but that's irrelevant if we're not symbolicating in the first place.	2022-05-31 09:56:56 -05:00
Pat Hickey	bffce37050	make backtrace collection a Config field rather than a cargo feature (#4183 ) * sorta working in runtime * wasmtime-runtime: get rid of wasm-backtrace feature * wasmtime: factor to make backtraces recording optional. not configurable yet * get rid of wasm-backtrace features * trap tests: now a Trap optionally contains backtrace * eliminate wasm-backtrace feature * code review fixes * ci: no more wasm-backtrace feature * c_api: backtraces always enabled * config: unwind required by backtraces and ref types * plumbed * test that disabling backtraces works * code review comments * fuzzing generator: wasm_backtrace is a runtime config now * doc fix	2022-05-25 12:25:50 -07:00
Sam Parker	010e028d67	[AArch64] Port AtomicCAS to isle (#4140 ) Copyright (c) 2022, Arm Limited.	2022-05-25 09:19:24 +01:00
Alex Crichton	a02a609528	Make `ValRaw` fields private (#4186 ) * Make `ValRaw` fields private Force accessing to go through constructors and accessors to localize the knowledge about little-endian-ness. This is spawned since I made a mistake in #4039 about endianness. * Fix some tests * Component model changes	2022-05-24 19:14:29 -05:00
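To illustrate why private fields help localize endianness knowledge, here is a minimal sketch under the assumption that the raw value is stored as little-endian bytes. `RawVal` and its methods are hypothetical; the real `ValRaw` has a different representation and API.

```rust
/// Hypothetical raw value whose private storage is always little-endian.
#[derive(Copy, Clone, Debug, PartialEq)]
pub struct RawVal {
    bytes: [u8; 8],
}

impl RawVal {
    /// Constructor for a 32-bit integer: zero-extends, then stores as LE.
    pub fn i32(v: i32) -> RawVal {
        RawVal { bytes: u64::from(v as u32).to_le_bytes() }
    }

    /// Constructor for a 64-bit integer.
    pub fn i64(v: i64) -> RawVal {
        RawVal { bytes: v.to_le_bytes() }
    }

    /// Accessor: decodes the LE storage and truncates to 32 bits.
    pub fn get_i32(&self) -> i32 {
        u64::from_le_bytes(self.bytes) as i32
    }

    /// Accessor for the full 64-bit value.
    pub fn get_i64(&self) -> i64 {
        i64::from_le_bytes(self.bytes)
    }

    /// Exposes the in-memory layout, e.g. for handing to generated code.
    pub fn as_le_bytes(&self) -> [u8; 8] {
        self.bytes
    }
}
```

Because callers can only go through the constructors and accessors, the byte-order conversion lives in one place and behaves the same on big-endian and little-endian hosts.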
Alex Crichton	140b83597b	components: Implement the ability to call component exports (#4039 ) * components: Implement the ability to call component exports This commit is an implementation of the typed method of calling component exports. This is intended to represent the most efficient way of calling a component in Wasmtime, similar to what `TypedFunc` represents today for core wasm. Internally this contains all the traits and implementations necessary to invoke component exports with any type signature (e.g. arbitrary parameters and/or results). The expectation is that for results we'll reuse all of this infrastructure except in reverse (arguments and results will be swapped when defining imports). Some features of this implementation are: * Arbitrary type hierarchies are supported * The Rust-standard `Option`, `Result`, `String`, `Vec<T>`, and tuple types all map down to the corresponding type in the component model. * Basic utf-16 string support is implemented as proof-of-concept to show what handling might look like. This will need further testing and benchmarking. * Arguments can be behind "smart pointers", so for example `&Rc<Arc<[u8]>>` corresponds to `list<u8>` in interface types. * Bulk copies from linear memory never happen unless explicitly instructed to do so. The goal of this commit is to create the ability to actually invoke wasm components. This represents what is expected to be the performance threshold for these calls where it ideally should be optimal how WebAssembly is invoked. One major missing piece of this is a `#[derive]` of some sort to generate Rust types for arbitrary `.wit` types such as custom records, variants, flags, unions, etc. The current trait impls for tuples and `Result<T, E>` are expected to have fleshed out most of what such a derive would look like. There are some downsides and missing pieces to this commit and method of calling components, however, such as: Passing `&[u8]` to WebAssembly is currently not optimal. 
Ideally this compiles down to a `memcpy`-equivalent somewhere but that currently doesn't happen due to all the bounds checks of copying data into memory. I have been unsuccessful so far at getting these bounds checks to be removed. * There is no finalization at this time (the "post return" functionality in the canonical ABI). Implementing this should be relatively straightforward but at this time requires `wasmparser` changes to catch up with the current canonical ABI. * There is no guarantee that results of a wasm function will be validated. As results are consumed they are validated but this means that if function returns an invalid string which the host doesn't look at then no trap will be generated. This is probably not the intended semantics of hosts in the component model. * At this time there's no support for memory64 memories, just a bunch of `FIXME`s to get around to. It's expected that this won't be too onerous, however. Some extra care will need to ensure that the various methods related to size/alignment all optimize to the same thing they do today (e.g. constants). * The return value of a typed component function is either `T` or `Value<T>`, and it depends on the ABI details of `T` and whether it takes up more than one return value slot or not. This is an ABI-implementation detail which is being forced through to the API layer which is pretty unfortunate. For example if you say the return value of a function is `(u8, u32)` then it's a runtime type-checking error. I don't know of a great way to solve this at this time. Overall I'm feeling optimistic about this trajectory of implementing value lifting/lowering in Wasmtime. While there are a number of downsides none seem completely insurmountable. There's naturally still a good deal of work with the component model but this should be a significant step up towards implementing and testing the component model. 
* Review comments * Write tests for calling functions This commit adds a new test file for actually executing functions and testing their results. This is not written as a `.wast` test yet since it's not 100% clear if that's the best way to do that for now (given that dynamic signatures aren't supported yet). The tests themselves could all largely be translated to `.wast` testing in the future, though, if supported. Along the way a number of minor issues were fixed with lowerings with the bugs exposed here. * Fix an endian mistake * Fix a typo and the `memory.fill` instruction	2022-05-24 17:02:31 -05:00
Benjamin Bouvier	3a7910ecb0	Reuse Cranelift codegen contexts across wasmtime compilations (#4181 )	2022-05-24 11:03:01 +02:00
Chris Fallin	b830c3cf93	Pull in regalloc2 v0.2.0, with no more separate scratch registers. (#4182 ) RA2 recently removed the need for a dedicated scratch register for cyclic moves (bytecodealliance/regalloc2#51). This has moderate positive performance impact on function bodies that were register-constrained, as it means that one more register is available. In Sightglass, I measured +5-8% on `blake3-scalar`, at least among current benchmarks.	2022-05-23 12:51:04 -07:00
Benjamin Bouvier	6e828df632	Remove unused `SourceLoc` in many `Mach` data structures (#4180 ) * Remove unused srcloc in MachReloc * Remove unused srcloc in MachTrap * Use `into_iter` on array in bench code to suppress a warning * Remove unused srcloc in MachCallSite	2022-05-23 09:27:28 -07:00
Chris Fallin	32622b3e6f	Cranelift: fix use of pinned reg with SysV calling convention. (#4176 ) Previously, the pinned register (enabled by the `enable_pinned_reg` Cranelift setting and used via the `get_pinned_reg` and `set_pinned_reg` CLIF ops) was only used when Cranelift was embedded in SpiderMonkey, in order to support a pinned heap register. SpiderMonkey has its own calling convention in Cranelift (named after the integration layer, "Baldrdash"). However, the feature is more general, and should be usable with the default system calling convention too, e.g. SysV or Windows Fastcall. This PR fixes the ABI code to properly treat the pinned register as a globally allocated register -- and hence an implicit input and output to every function, not saved/restored in the prologue/epilogue -- for SysV on x86-64 and aarch64, and Fastcall on x86-64. Fixes #4170.	2022-05-23 09:18:51 -07:00
Saúl Cabrera	2d8ff7a9a9	docs: Remove regalloc entry from documentation index (#4179 ) This is a follow up to https://github.com/bytecodealliance/wasmtime/pull/4013 in which the outdated regalloc documentation was removed.	2022-05-22 15:40:38 -07:00
Alex Crichton	fcf6208750	Initial skeleton of some component model processing (#4005 ) * Initial skeleton of some component model processing This commit is the first of what will likely be many to implement the component model proposal in Wasmtime. This will be structured as a series of incremental commits, most of which haven't been written yet. My hope is to make this incremental and over time to make this easier to review and easier to test each step in isolation. Here much of the skeleton of how components are going to work in Wasmtime is sketched out. This is not a complete implementation of the component model so it's not all that useful yet, but some things you can do are: * Process the type section into a representation amenable for working with in Wasmtime. * Process the module section and register core wasm modules. * Process the instance section for core wasm modules. * Process core wasm module imports. * Process core wasm instance aliasing. * Ability to compile a component with core wasm embedded. * Ability to instantiate a component with no imports. * Ability to get functions from this component. This is already starting to diverge from the previous module linking representation where a `Component` will try to avoid unnecessary metadata about the component and instead internally only have the bare minimum necessary to instantiate the module. My hope is we can avoid constructing most of the index spaces during instantiation only for it to all get thrown away. Additionally I'm predicting that we'll need to see through processing where possible to know how to generate adapters and where they are fused. At this time you can't actually call a component's functions, and that's the next PR that I would like to make. * Add tests for the component model support This commit uses the recently updated wasm-tools crates to add tests for the component model added in the previous commit. This involved updating the `wasmtime-wast` crate for component-model changes. 
Currently the component support there is quite primitive, but enough to at least instantiate components and verify the internals of Wasmtime are all working correctly. Additionally some simple tests for the embedding API have also been added.	2022-05-20 15:33:18 -05:00
Alex Crichton	a75f383f96	Improve the `wasmtime` crate's README (#4174 ) * Improve the `wasmtime` crate's README This commit is me finally getting back to #2688 and improving the README of the `wasmtime` crate. Currently we have a [pretty drab README][drab] that doesn't really convey what we want about Wasmtime. While I was doing this I opted to update the feature list of Wasmtime as well in the main README (which is mirrored into the crate readme), namely adding a bullet point for "secure" which I felt was missing relative to how we think about Wasmtime. Naturally there's a lot of ways to paint this shed, so feedback is of course welcome on this! (I'm not the best writer myself) [drab]: https://crates.io/crates/wasmtime/0.37.0 * Expand the "Fast" bullet a bit more * Reference the book from the wasmtime crate * Update more security docs Also merge the sandboxing security page with the main security page to avoid the empty security page.	2022-05-20 15:33:00 -05:00
Chris Fallin	0824abbae4	Add a basic alias analysis with redundant-load elim and store-to-load forwarding opts. (#4163 ) This PR adds a basic alias analysis, and optimizations that use it. This is a "mid-end optimization": it operates on CLIF, the machine-independent IR, before lowering occurs. The alias analysis (or maybe more properly, a sort of memory-value analysis) determines when it can prove a particular memory location is equal to a given SSA value, and when it can, it replaces any loads of that location. This subsumes two common optimizations: * Redundant load elimination: when the same memory address is loaded two times, and it can be proven that no intervening operations will write to that memory, then the second load is redundant and its result must be the same as the first. We can use the first load's result and remove the second load. * Store-to-load forwarding: when a load can be proven to access exactly the memory written by a preceding store, we can replace the load's result with the store's data operand, and remove the load. Both of these optimizations rely on a "last store" analysis that is a sort of coloring mechanism, split across disjoint categories of abstract state. The basic idea is that every memory-accessing operation is put into one of N disjoint categories; it is disallowed for memory to ever be accessed by an op in one category and later accessed by an op in another category. (The frontend must ensure this.) Then, given this, we scan the code and determine, for each memory-accessing op, when a single prior instruction is a store to the same category. This "colors" the instruction: it is, in a sense, a static name for that version of memory. This analysis provides an important invariant: if two operations access memory with the same last-store, then no other store can alias in the time between that last store and these operations. 
This must-not-alias property, together with a check that the accessed address is exactly the same (same SSA value and offset), and other attributes of the access (type, extension mode) are the same, let us prove that the results are the same. Given last-store info, we scan the instructions and build a table from "memory location" key (last store, address, offset, type, extension) to known SSA value stored in that location. A store inserts a new mapping. A load may also insert a new mapping, if we didn't already have one. Then when a load occurs and an entry already exists for its "location", we can reuse the value. This will be either RLE or St-to-Ld depending on where the value came from. Note that this does work across basic blocks: the last-store analysis is a full iterative dataflow pass, and we are careful to check dominance of a previously-defined value before aliasing to it at a potentially redundant load. So we will do the right thing if we only have a "partially redundant" load (loaded already but only in one predecessor block), but we will also correctly reuse a value if there is a store or load above a loop and a redundant load of that value within the loop, as long as no potentially-aliasing stores happen within the loop.	2022-05-20 13:19:32 -07:00
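The table-driven scan described in the commit above can be sketched in a heavily simplified form. This sketch assumes a single basic block and a single memory category, and it omits the access type and extension mode from the key, so it is an illustration of the idea rather than Cranelift's implementation. `Inst`, `Value`, and `find_redundant_loads` are hypothetical names.

```rust
use std::collections::HashMap;

/// SSA value number (hypothetical representation).
type Value = u32;

/// A tiny instruction set with only the memory operations of interest.
#[derive(Copy, Clone, Debug, PartialEq)]
enum Inst {
    Load { dst: Value, addr: Value, offset: i32 },
    Store { addr: Value, offset: i32, data: Value },
}

/// Scans one basic block and returns a map from each removable load's result
/// to the already-known value that can replace it.
fn find_redundant_loads(block: &[Inst]) -> HashMap<Value, Value> {
    // Index of the most recent store: the "color" of the current memory state.
    let mut last_store: Option<usize> = None;
    // "Memory location" key (last store, address, offset) -> known SSA value.
    let mut known: HashMap<(Option<usize>, Value, i32), Value> = HashMap::new();
    let mut aliases = HashMap::new();

    for (idx, inst) in block.iter().enumerate() {
        match *inst {
            Inst::Store { addr, offset, data } => {
                // A store starts a new memory version and records what it wrote.
                last_store = Some(idx);
                known.insert((last_store, addr, offset), data);
            }
            Inst::Load { dst, addr, offset } => {
                let key = (last_store, addr, offset);
                match known.get(&key) {
                    // Same location under the same last store: reuse the value
                    // (redundant-load elimination or store-to-load forwarding).
                    Some(&value) => {
                        aliases.insert(dst, value);
                    }
                    // First access to this location: remember the loaded value.
                    None => {
                        known.insert(key, dst);
                    }
                }
            }
        }
    }
    aliases
}
```

Because the last store is part of the key, any intervening store changes the key and conservatively blocks reuse, even when it writes to a different address. The real pass refines this with disjoint categories and handles multiple blocks through a dataflow pass with dominance checks, as the commit message explains.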
Alex Crichton	08b7c87793	Refactor binary-compatible-builds for releases (#4171 ) * Refactor binary-compatible-builds for releases I was poking around this yesterday and noticed a few things that could be improved for our release builds: * The centos container for the x86_64 builds contained a bunch of extra tooling we no longer need such as python3 and a C++ compiler. Along with custom toolchain things this could all get removed since the C we include now is quite simple. * The aarch64 and s390x cross-compiled builds had relatively high glibc version requirements compared to the x86_64 build. This was because we don't use a container to build the cross-compiled binaries. I added containers here along the lines of the x86_64 build to use an older glibc to build the release binary to lower our version requirement. This lowers the aarch64 version requirement from glibc 2.28 to 2.17. Additionally the s390x requirement dropped from 2.28 to 2.16. * To make the containers a bit easier to read/write I added `Dockerfile`s for them in a new `ci/docker` directory instead of hardcoding install commands in JS. This isn't intended to be a really big change or anything for anyone, but it's intended to keep our Linux-based builds consistent at least as best we can. * Remove temporary change	2022-05-20 12:13:50 -05:00
Alex Crichton	985ed07c3f	Improve documentation around `ResourceLimiter` (#4173 ) * Improve documentation around `ResourceLimiter` This commit takes a pass through the `Store::limiter` method and related types/traits to improve the documentation with an example and soup up any recent developments in the documentation. Closes #4138 * Fix a broken doc link	2022-05-20 12:06:11 -05:00
Alex Crichton	6cf4c95585	Ensure `simd` is enabled for spectest fuzzing (#4172 ) This is required now that the simd specification has been merged into the upstream specification, so to run the spec tests this must always be enabled instead of being left to the whims of the fuzzer about whether to enable it or not.	2022-05-20 09:57:56 -05:00
wasmtime-publish	4e8d54836d	Update release date of Wasmtime 0.37.0 (#4168 ) [skip ci] Co-authored-by: Wasmtime Publish <wasmtime-publish@users.noreply.github.com>	2022-05-20 09:11:48 -05:00
Alex Crichton	89ccc56e46	Update the wasm-tools family of crates (#4165 ) * Update the wasm-tools family of crates This commit updates these crates as used by Wasmtime for the recently published versions to pull in changes necessary to support the component model. I've split this out from #4005 to make it clear what's impacted here and #4005 can simply rebase on top of this to pick up the necessary changes. * More test fixes	2022-05-19 14:13:04 -05:00
Alex Crichton	0a0c232a14	Fix CI for Rust 1.61.0 (#4164 ) A new version of rustc was released this morning and we have a few small breakages on our CI which need fixing: * A new warning was coming out of the c-api crate about an unneeded `unsafe` block. * The panic message of a task in `cranelift-object` needed updating since the standard library changed how it formats strings with the nul byte.	2022-05-19 10:44:45 -05:00
Alex Crichton	411f3d60f3	Tweak CLI fallback to the `run` command (#4161 ) I ran across a case in Wasmtime today where a poor error message came out of the CLI. For example before this commit you would get: $ cargo run wast --wasm-features component-model foo.wast error: Invalid value "wast" for '<MODULE>': module name cannot be the same as a subcommand and now after this commit you get: $ cargo run wast --wasm-features component-model foo.wast error: Invalid value "component-model" for '--wasm-features <FEATURE,FEATURE,...>': unsupported WebAssembly feature 'component-model' I believe this was an accidental regression from #4082 since Wasmtime 0.36.0 produces the error message as expected. I opted to invert the conditional logic for falling back to the `run` subcommand. Instead of having a small set of error kinds that print the first-level error a small set of error kinds are now used to fall back to the `run` subcommand by default. My hope is that as `ErrorKind` is extended over time with various sorts of errors of parsing arguments this'll be more robust because most of the time we want the CLI invocation to print out the normal CLI error, it's only in a select few cases that using `run` is likely to succeed.	2022-05-18 15:30:41 -05:00
Chris Fallin	02d5edc591	Upgrade to regalloc2 0.1.3. (#4157 ) * Upgrade to regalloc2 0.1.3. This pulls in bytecodealliance/regalloc2#49, which slightly improves codegen in some cases where a safepoint (for reference-typed values) occurs in the same liverange as a register-constrained use. For example, in bytecodealliance/wasmtime#3785, an extra move instruction appeared and a callee-save register was used (necessitating a more expensive prologue) because of suboptimal splitting heuristics, which this PR fixes. The updated RA2 heuristics appear to have no measured downsides in existing benchmarks and improve the manually-observed codegen issue. * Update filetests where regalloc2 improvement altered behavior with reftypes.	2022-05-18 11:48:40 -07:00
Anton Kirilov	ca106e9bcd	Update the WebAssembly spec testsuite (#4160 ) Copyright (c) 2022, Arm Limited.	2022-05-18 09:51:32 -05:00
Andrew Brown	e898cb750a	x64: remove TODO for i128 load (#4159 ) This work has already been finished in a previous PR.	2022-05-17 17:43:43 -07:00
Anton Kirilov	302bb5b213	Add notes for the Cranelift project meeting on 16.05.2022 (#4158 ) Copyright (c) 2022, Arm Limited.	2022-05-17 09:38:05 -07:00
Chris Fallin	cb13175c42	Add Cranelift meeting agendas for rest of 2022. (#4156 ) As per discussion today, when we have a holiday (affecting any regular attendee), we will push the meeting by a week. This does mean we sometimes have meetings in contiguous weeks, but given the number of topics we usually have to discuss, erring on the side of more discussion time (rather than just canceling) is probably not a bad thing. For the rest of this calendar year, given an otherwise regular biweekly-on-Mondays cadence, the holiday conflicts I am aware of are: US Memorial Day (falls on Mon May 28, pushed meeting to Mon Jun 6); US Labor Day (falls on Mon Sept 5, pushed to Mon Sept 12). If there are any other holidays in the below dates, I'm happy to update further!	2022-05-16 18:54:49 -05:00
Anton Kirilov	edf07a8da6	Cranelift AArch64: Migrate Bitselect and Vselect to ISLE (#4139 ) Copyright (c) 2022, Arm Limited.	2022-05-16 09:39:28 -07:00
Jonathan Coates	f19d8cc851	Run a callback when the interruption epoch is reached (#4152 ) * Run a callback when the interruption epoch is reached Adds Store::epoch_deadline_callback. This accepts a callback which, when invoked, can mutate the store's contents. The callback can either return an error (in which case we trap) or return a delta which we'll use to set the new epoch deadline. * Add a basic test for epoch interruption callback * Some small nits - Remove use of &mut in the pattern match - Return both yields and state from run_and_count_yields_or_trap in test code and assert on them separately. - Add a test for trapping on a state failure.	2022-05-16 07:28:23 -05:00
Olexiy Kulchitskiy	8d7bccefcb	Expose cranelift nan canonicalization config via C API (#4154 ) * Add cranelift_nan_canonicalization to c api header * Add cranelift_nan_canonicalization to capi/config.rs * Fix func name	2022-05-14 11:28:49 -07:00
Chris Fallin	2e14a0ecc5	ISLE: provide locations in errors in basic non-miette mode. (#4151 ) In #4143 we made ISLE compilation part of the normal build flow again, to avoid the issues with the checked-in source. To make this acceptably fast, we cut down dependencies of the ISLE compiler, so the "fancy" error printing is now optional. When not included, it just prints error messages to stderr in a list. However, this did not include file locations. It might be nice to have this without enabling the "fancy printing" and waiting for that to build. Fortunately most of the plumbing for this was already present (we had it at one point before switching to miette). This PR adds back locations to the basic error output. It now looks like: ``` Error building ISLE files: ISLE errors: src/isa/aarch64/inst.isle:1:1: parse error: Unexpected token Symbol("asdf") ```	2022-05-12 12:55:00 -07:00
bjorn3	c4eab2beb6	Avoid spurious build script runs (#4150 ) * Don't attempt to track the generated clif.isle in cargo This causes the build script to rerun every time for me. * Put build script debug messages on stderr instead of stdout This keeps stdout reserved for cargo build script directives	2022-05-12 11:49:20 -07:00
Andrew Brown	2111f7dba8	Slides for Wasmtime meeting on 2022-05-12 (#4149 )	2022-05-12 10:50:55 -07:00
Chris Fallin	42873111b0	Notes from Wasmtime meeting on 2022-05-12. (#4148 )	2022-05-12 10:38:44 -07:00
Ulrich Weigand	0243a16679	s390x: Fix bitwise operations (#4146 ) Current codegen had a number of logic errors confusing NAND with AND WITH COMPLEMENT, and NOR with OR WITH COMPLEMENT. Add support for the missing z15 instructions and fix logic.	2022-05-12 10:05:22 -07:00
bjorn3	9538336f82	Use HashMaps instead of BTreeMaps in isle where possible (#4147 ) The HashMap implementation is significantly simpler than the BTreeMap implementation. Because of this switching reduces compilation time of cranelift-isle by about 10%. # Before $ hyperfine --prepare "cargo clean" "cargo build" Benchmark 1: cargo build Time (mean ± σ): 5.221 s ± 0.094 s [User: 10.659 s, System: 0.734 s] Range (min … max): 5.151 s … 5.420 s 10 runs # After $ hyperfine --prepare "cargo clean" "cargo build" Benchmark 1: cargo build Time (mean ± σ): 4.746 s ± 0.150 s [User: 9.109 s, System: 0.721 s] Range (min … max): 4.630 s … 5.144 s 10 runs	2022-05-12 10:02:23 -07:00
Chris Fallin	5d671952ee	Cranelift: do not check in generated ISLE code; regenerate on every compile. (#4143 ) This PR fixes #4066: it modifies the Cranelift `build.rs` workflow to invoke the ISLE DSL compiler on every compilation, rather than only when the user specifies a special "rebuild ISLE" feature. The main benefit of this change is that it vastly simplifies the mental model required of developers, and removes a bunch of failure modes we have tried to work around in other ways. There is now just one "source of truth", the ISLE source itself, in the repository, and so there is no need to understand a special "rebuild" step and how to handle merge errors. There is no special process needed to develop the compiler when modifying the DSL. And there is no "noise" in the git history produced by constantly-regenerated files. The two main downsides we discussed in #4066 are: - Compile time could increase, by adding more to the "meta" step before the main build; - It becomes less obvious where the source definitions are (everything becomes more "magic"), which makes exploration and debugging harder. This PR addresses each of these concerns: 1. To maintain reasonable compile time, it includes work to cut down the dependencies of the `cranelift-isle` crate to nothing (only the Rust stdlib), in the default build. It does this by putting the error-reporting bits (`miette` crate) under an optional feature, and the logging (`log` crate) under a feature-controlled macro, and manually writing an `Error` impl rather than using `thiserror`. This completely avoids proc macros and the `syn` build slowness. The user can still get nice errors out of `miette`: this is enabled by specifying a Cargo feature `--features isle-errors`. 2. To allow the user to optionally inspect the generated source, which nominally lives in a hard-to-find path inside `target/` now, this PR adds a feature `isle-in-source-tree` that, as implied by the name, moves the target for ISLE generated source into the source tree, at `cranelift/codegen/isle_generated_source/`. It seems reasonable to do this when an explicit feature (opt-in) is specified because this is how ISLE regeneration currently works as well. To prevent surprises, if the feature is not specified, the build fails if this directory exists.	2022-05-11 22:25:24 -07:00
Chris Fallin	7c5a56b836	Cranelift: division/remainder CLIF ops are scalar-only. (#4141 ) In #4104 we discussed whether it makes sense for the division and remainder ops to support vector types. We concluded that because most hardware doesn't support it directly, it probably is not ideal to force all backends to polyfill it. In the future we can always reverse this decision, perhaps with a platform-independent legalization. This PR restricts the allowed types on the CLIF ops to integer types only.	2022-05-11 11:10:02 -07:00
Chris Fallin	67eb161d04	Cranelift: fix filetest now failing after merge to main. (#4120 ) This test was added between the last CI run on #4088 and its merge to main, and the changes in #4088 (use of constants directly in instruction via load from constant pool, rather than from a register initialized by a separate instruction) cause it to fail now. This PR alters the test to be invariant to regalloc and argument decisions during lowering, as the test is really checking (per the comment) that we get two cmoves without an intervening move. As such, it just matches the instruction opcodes, irrespective of the arguments.	2022-05-10 10:56:54 -07:00
Andrew Brown	c766c432b5	meeting: add shared memory agenda item (#4119 )	2022-05-10 10:21:53 -07:00
Chris Fallin	eb435f3057	x64: use constant pool for u64 constants rather than `movabs`. (#4088 ) * Allow emitting u64 constants into constant pool. * Use constant pool for constants on x64 that do not fit in a simm32 and are needed as a RegMem or RegMemImm. * Fix rip-relative addressing bug in pinsrd emission.	2022-05-10 09:21:05 -07:00
Conrad Watt	d3087487ea	enable multi-value in spec interpreter fuzzing (#4118 )	2022-05-10 10:33:07 -05:00
Saúl Cabrera	52524d258c	Expose `TrapCode::Interrupt` on epoch based interruption (#4105 )	2022-05-10 10:27:30 -05:00
Benjamin Bouvier	71fc16bbeb	Narrow `allow(dead_code)` declarations (#4116 ) * Narrow `allow(dead_code)` declarations Having module wide `allow(dead_code)` may hide some code that's really dead. In this commit I just narrowed the declarations to the specific enum variants that were not used (as it seems reasonable to keep them and their handling in all the matches, for future use). And the compiler found more dead code that I think we can remove safely in the short term. With this, the only files annotated with a module-wide `allow(dead_code)` are isle-generated files. * resurrect some functions as test helpers	2022-05-10 12:02:52 +02:00
Chris Fallin	2af8d1e93c	Cranelift/ISLE: re-apply prio-trie fix, this time with fixed fix. (#4117 ) * ISLE compiler: fix priority-trie interval bug. (#4093) This PR fixes a bug in the ISLE compiler related to rule priorities. An important note first: the bug did not affect the correctness of the Cranelift backends, either in theory (because the rules should be correct when applied in any order, even contrary to the stated priorities) or in practice (because the generated code actually does not change at all with the DSL compiler fix, only with a separate minimized bug example). The issue was a simple swap of `min` for `max` (see first commit). This is the minimal fix, I think, to get a correct priority-trie with the minimized bug example in this commit. However, while debugging this, I started to convince myself that the complexity of merging multiple priority ranges using the sort of hybrid interval tree / string-matching trie data structure was unneeded. The original design was built with the assumption we might have a bunch of different priority levels, and would need the efficiency of merging where possible. But in practice we haven't used priorities this way: the vast majority of lowering rules exist at the default (priority 0), and just a few overrides are explicitly at prio 1, 2 or (rarely) 3. So, it turns out to be a lot simpler to label trie edges with (prio, symbol) rather than (prio-range, symbol), and delete the whole mess of interval-splitting logic on insertion. It's easier (IMHO) to convince oneself that the resulting insertion algorithm is correct. I was worried that this might impact the size of the generated Rust code or its runtime, but in fact, to my initial surprise (but it makes sense given the above "rarely used" factor), the generated code with this compiler fix is exactly the same. I rebuilt with `--features rebuild-isle,all-arch` but... there were no diffs to commit! This is to me the simplest evidence that we didn't really need that complexity. 
* Fix earlier commit from #4093: properly sort trie. This commit fixes an in-hindsight-obvious bug in #4093: the trie's edges must be sorted recursively, not just at the top level. With this fix, the generated code differs only in one cosmetic way (a let-binding moves) but otherwise is the same. This includes @fitzgen's fix to the CI (from the revert in #4102) that deletes manifests to actually check that the checked-in source is consistent with the checked-in compiler. The force-rebuild step is now in a shell script for convenience: anyone hacking on the ISLE compiler itself can use this script to more easily rebuild everything. * Add note to build.rs to remind to update force-rebuild-isle.sh	2022-05-09 16:36:48 -07:00

9831 Commits