wasmtime

Author	SHA1	Message	Date
Chris Fallin	7421e1a65b	Merge pull request #3324 from dheaton-arm/implement-shuffle Implement `Shuffle` for the interpreter	2021-09-13 09:49:59 -07:00
Chris Fallin	9323762d71	Merge pull request #3314 from dheaton-arm/implement-bitops Implement bit operations for Cranelift interpreter	2021-09-13 09:29:10 -07:00
Chris Fallin	587f603018	Merge pull request #3316 from dheaton-arm/implement-uwiden Implement `UwidenLow` and `UwidenHigh` for the interpreter	2021-09-10 12:32:50 -07:00
Afonso Bordado	d31bdff7db	cranelift: Use bool args in simd tests	2021-09-10 15:10:51 +01:00
dheaton-arm	5824cca0f8	Fix test failures from old x86 backend Copyright (c) 2021, Arm Limited	2021-09-08 15:43:08 +01:00
dheaton-arm	f7a1b3f9bd	Implement `UwidenLow` and `UwidenHigh` for the interpreter Implemented `UwidenLow` and `UwidenHigh` for the Cranelift interpreter, doubling the width and halving the number of lanes preserving the low and high halves respectively. Conversions are performed using unsigned zero extension. Copyright (c) 2021, Arm Limited	2021-09-08 14:17:11 +01:00
dheaton-arm	bca3cb32ef	Implement `Shuffle` for the interpreter Implemented `Shuffle` for the Cranelift interpreter, to shuffle two SIMD vectors together based on an immediate mask of 16 bytes. Copyright (c) 2021, Arm Limited	2021-09-08 11:13:57 +01:00
dheaton-arm	9f647301ff	Implement bit operations for Cranelift interpreter Implemented for the Cranelift interpreter: - `Bitrev` to reverse the order of the bits in an integer. - `Cls` to count the leading bits which are the same as the sign bit in an integer, yielding one less than the size of the integer for 0 and -1. - `Clz` to count the number of leading zeros in the bitwise representation of the integer. - `Ctz` to count the number of trailing zeros in the bitwise representation of the integer. - `Popcnt` to count the number of ones in the bitwise representation of the integer. Copyright (c) 2021, Arm Limited	2021-09-08 11:07:22 +01:00
Damian Heaton	dd23a21b9b	Implement `Swizzle` and `Splat` for interpreter (#3268) * Implement `Swizzle` and `Splat` for interpreter Implemented for the Cranelift interpreter: - `Swizzle` to shuffle an `i8x16` SIMD vector based on the indices specified in another vector of the same size. - `Splat` to create a SIMD vector with all lanes having the same value. Copyright (c) 2021, Arm Limited * Fix old x86 backend failing test Copyright (c) 2021, Arm Limited * Represent i16x8 and above as hex Copyright (c) 2021, Arm Limited	2021-09-07 09:53:49 -07:00
Afonso Bordado	63e9a81deb	Implement `vany_true` and `vall_true` instructions in interpreter (#3304) * cranelift: Implement ZeroExtend for a bunch of types in interpreter * cranelift: Implement VConst on interpreter * cranelift: Implement VallTrue on interpreter * cranelift: Implement VanyTrue on interpreter * cranelift: Mark `v{all,any}_true` tests as machinst only * cranelift: Disable `vany_true` tests on aarch64 The `b64x2` case produces an illegal instruction. See #3305	2021-09-07 09:50:39 -07:00
Chris Fallin	ecd795f736	Merge pull request #3290 from dheaton-arm/implement-ssatarith Implement `SaddSat` and `SsubSat` for the Cranelift interpreter	2021-09-03 09:48:34 -07:00
Chris Fallin	e3ccff0249	Merge pull request #3283 from dheaton-arm/implement-umulhi Implement `Umulhi` for the interpreter	2021-09-03 09:29:21 -07:00
dheaton-arm	8f057e0482	Implement `SaddSat` and `SsubSat` for the interpreter Implemented `SaddSat` and `SsubSat` to add and subtract signed vector values, saturating at the type boundaries rather than overflowing. Changed the parser to allow signed `i8` immediates in vectors as part of this work; fixes #3276. Copyright (c) 2021, Arm Limited.	2021-09-03 11:35:39 +01:00
dheaton-arm	562947c678	Fix CI tests + rename tests - Fixed CI tests for AArch64 and old x86. - Rename `simd-umulhi.clif` to `umulhi.clif`. - Rename `simd-umulhi-aarch64.clif` to `simd-umulhi.clif`. Copyright (c) 2021, Arm Limited.	2021-09-03 10:37:24 +01:00
Chris Fallin	d6a77898ba	Merge pull request #3272 from dheaton-arm/implement-iaddpairwise Implement `IaddPairwise` for the interpreter	2021-09-02 10:52:47 -07:00
Chris Fallin	6e05b646a3	Merge pull request #3282 from afonso360/x64-fix-brtables cranelift: Fix `br_table` for `i64` types in x64 backend.	2021-09-02 09:58:42 -07:00
Chris Fallin	000a97f4ff	Merge pull request #3279 from dheaton-arm/implement-insertlane Implement `Insertlane` for the Cranelift interpreter	2021-09-02 09:44:59 -07:00
Afonso Bordado	f9ada24bcf	cranelift: Fix br_table for i64 inputs We still only support a maximum of u32::MAX entries, however we no longer crash when compiling 64 bit indexes. Fixes #3100	2021-09-02 15:31:48 +01:00
dheaton-arm	16b6a404e4	Implement `Umulhi` for the interpreter Implemented `Umulhi` for the Cranelift interpreter, performing unsigned integer multiplication and producing the high half of a double-length result. Fixed `ExtractUpper` conversion behaviour as part of this change, which was extracting from a 128-bit value regardless of the size of the original value. Copyright (c) 2021, Arm Limited.	2021-09-02 13:11:41 +01:00
Chris Fallin	91410aaddf	Merge pull request #3234 from dheaton-arm/implement-isubb Implement `IsubBin`, `IsubBout`, and `IsubBorrow` for Cranelift interpreter	2021-09-01 11:25:43 -07:00
dheaton-arm	d956d349d8	Implement `Insertlane` for the Cranelift interpreter Implemented `Insertlane` to insert a value in the lane specified by the immediate value, overwriting the existing value in that lane. Added `TernaryImm8` support for the `imm_value` function. Copyright (c) 2021, Arm Limited.	2021-09-01 16:21:27 +01:00
Afonso Bordado	f9f5ae59a6	cranelift: Merge interpreter tests with runtests (#3252) Almost all the tests in the interpreter are already in the runtests folder so that we can reuse them for the backends. The distinction between interpreter tests and runtests is no longer very clear, since they should both support the same clif code, and produce the same results. We only have two test files: * `add.clif` tests the add and jump instruction, both of which are already covered in other test files, so we remove that file. * `fibonacci.clif` does a recursive call which is currently not supported in the filetest environment, so we keep this test interpreter only for now.	2021-09-01 06:42:02 -07:00
dheaton-arm	7a5646c5f4	Implement `IaddPairwise` for the interpreter Implemented `IaddPairwise` for the Cranelift interpreter, to add pairs of adjacent values in two SIMD vectors, concatenating them at the end (preserving both lane size and number of lanes). Copyright (c) 2021, Arm Limited	2021-09-01 13:53:26 +01:00
Damian Heaton	4378ea8e01	Implement `IaddCin`, `IaddCout`, and `IaddCarry` for Cranelift interpreter (#3233) * Implement `IaddCin`, `IaddCout`, and `IaddCarry` for Cranelift interpreter Implemented the following Opcodes for the Cranelift interpreter: - `IaddCin` to add two scalar integers with an input carry flag. - `IaddCout` to add two scalar integers and report overflow with the carry flag. - `IaddCarry` to add two scalar integers with an input carry flag, reporting overflow with the output carry flag. Copyright (c) 2021, Arm Limited * Simplify carry check + add i64 `IaddCarry` tests Copyright (c) 2021, Arm Limited * Move tests to `runtests` Copyright (c) 2021, Arm Limited	2021-08-31 09:29:38 -07:00
dheaton-arm	d1fe72affa	Add `i64` tests to `IsubBorrow` and move tests. Copyright (c) 2021, Arm Limited	2021-08-31 11:47:26 +01:00
bjorn3	b79e59882d	Fix tests	2021-08-27 18:28:33 +02:00
bjorn3	8adb40b2b8	Add tests	2021-08-27 17:48:04 +02:00
Damian Heaton	02ef6a02b8	Implement `Extractlane`, `UaddSat`, and `UsubSat` for Cranelift interpreter (#3188) * Implement `Extractlane`, `UaddSat`, and `UsubSat` for Cranelift interpreter Implemented the `Extractlane`, `UaddSat`, and `UsubSat` opcodes for the interpreter, and added helper functions for working with SIMD vectors (`extractlanes`, `vectorizelanes`, and `binary_arith`). Copyright (c) 2021, Arm Limited * Re-use tests + constrict Vector assert - Re-use interpreter tests as runtests where supported. - Constrict Vector assertion. - Code style adjustments following feedback. Copyright (c) 2021, Arm Limited * Runtest `i32x4` vectors on AArch64; add `i64x2` tests Copyright (c) 2021, Arm Limited * Add `simd-` prefix to test filenames Copyright (c) 2021, Arm Limited * Return aliased `SmallVec` from `extractlanes` Using a `SmallVec<[i128; 4]>` allows larger-width 128-bit vectors (`i32x4`, `i64x2`, ...) to not cause heap allocations. Copyright (c) 2021, Arm Limited * Accept slice to `vectorizelanes` rather than `Vec` Copyright (c) 2021, Arm Limited	2021-08-25 09:03:19 -07:00
Afonso Bordado	2776074dfc	cranelift: Add stack support to the interpreter with virtual addresses (#3187 ) * cranelift: Add stack support to the interpreter We also change the approach for heap loads and stores. Previously we would use the offset as the address to the heap. However, this approach does not allow using the load/store instructions to read/write from both the heap and the stack. This commit changes the addressing mechanism of the interpreter. We now return the real addresses from the addressing instructions (stack_addr/heap_addr), and instead check if the address passed into the load/store instructions points to an area in the heap or the stack. * cranelift: Add virtual addresses to cranelift interpreter Adds a Virtual Addressing scheme that was discussed as a better alternative to returning the real addresses. The virtual addresses are split into 4 regions (stack, heap, tables and global values), and the address itself is composed of an `entry` field and an `offset` field. In general the `entry` field corresponds to the instance of the resource (e.g. table5 is entry 5) and the `offset` field is a byte offset inside that entry. There is one exception to this which is the stack, where due to only having one stack, the whole address is an offset field. The number of bits in entry vs offset fields is variable with respect to the `region` and the address size (32bits vs 64bits). This is done because with 32 bit addresses we would have to compromise on heap size, or have a small number of global values / tables. With 64 bit addresses we do not have to compromise on this, but we need to support 32 bit addresses. * cranelift: Remove interpreter trap codes * cranelift: Calculate frame_offset when entering or exiting a frame * cranelift: Add safe read/write interface to DataValue * cranelift: DataValue write full 128bit slot for booleans * cranelift: Use DataValue accessors for trampoline.	2021-08-24 09:29:11 -07:00
Afonso Bordado	f4ff7c350a	cranelift: Add heap support to filetest infrastructure (#3154) * cranelift: Add heap support to filetest infrastructure * cranelift: Explicit heap pointer placement in filetest annotations * cranelift: Add documentation about the Heap directive * cranelift: Clarify that heap filetests pointers must be laid out sequentially * cranelift: Use wrapping add when computing bound pointer * cranelift: Better error messages when invalid signatures are found for heap file tests.	2021-08-24 09:28:41 -07:00
Afonso Bordado	3f6b889067	cranelift: Prevent panics when dividing INT_MIN / -1 in interpreter	2021-08-24 09:27:54 -07:00
Anton Kirilov	a1b39276e1	Enable more CLIF tests on AArch64 The tests for the SIMD floating-point maximum and minimum operations require particular care because the handling of the NaN values is non-deterministic and may vary between platforms. There is no way to match several NaN values in a test, so the solution is to extract the non-deterministic test cases into a separate file that is subsequently replicated for every backend under test, with adjustments made to the expected results. Copyright (c) 2021, Arm Limited.	2021-08-17 13:27:58 +01:00
Chris Fallin	7c0948fe0b	Merge pull request #3102 from afonso360/fix-bool-trampolines cranelift: Fix trampoline args for b1 types	2021-08-14 15:50:30 -07:00
Afonso Bordado	8862499529	cranelift: Fix trampoline args for b1 types Our DataValues only have one size of booleans so we are always going to have this mismatch of sizes	2021-08-08 17:42:50 +01:00
Afonso Bordado	a2fb019ba7	cranelift: Add basic i128 support in interpreter	2021-07-23 11:22:07 -07:00
Afonso Bordado	6be4441bbf	cranelift: Resolve alias lookups in interpreter	2021-07-22 10:42:29 -07:00
Afonso Bordado	065190f975	cranelift: Implement br_table on the interpreter	2021-07-20 15:31:27 -07:00
Afonso Bordado	04033fe645	cranelift: Implement overflow flags for icmp in interpreter	2021-07-19 09:31:14 -07:00
Afonso Bordado	c42b725ce9	cranelift: Fix br_icmp in interpreter	2021-07-19 09:31:14 -07:00
Afonso Bordado	004af01a88	cranelift: Fix brz,brnz instructions in the interpreter	2021-07-19 09:31:14 -07:00
Afonso Bordado	db5566dadb	aarch64: Fix lowering amounts for shifts This commit addresses two issues: * A panic when shifting any non i128 type by i128 amounts (#3064) * Wrong results when lowering shifts with small types (i8, i16) In these types when shifting for amounts larger than the size of the type, we would not get the wrapping behaviour that we see on i32 and i64. This is because in these larger types, the wrapping behaviour is automatically implemented by using the appropriate instruction, however we do not have i8 and i16 specific instructions, so we have to manually wrap the shift amount with an AND instruction. This issue is also found on x86_64 and s390x, and a separate issue will be filed for those. Closes #3064	2021-07-16 22:08:02 +01:00
Afonso Bordado	eebae8d4c8	aarch64: Fix incorrect encoding of large const values in icmp. When encoding constants as immediates into an RSE Imm12 instruction we need to take special care to check if the value that we are trying to input does not overflow its type when viewed as a signed value. (i.e. iconst.i8 200) We cannot both put an immediate and sign extend it, so we need to lower it into a separate reg, and emit the sign extend into the instruction. For more details see the [cg_clif bug report](https://github.com/bjorn3/rustc_codegen_cranelift/issues/1184#issuecomment-873214796).	2021-07-03 22:42:15 +01:00
Afonso Bordado	a4770a7e28	cranelift: Prevent overflow errors in interpreter for add,sub,mul	2021-06-30 06:32:16 -07:00
Afonso Bordado	e85eb77c45	aarch64: Implement missing atomic rmw ops	2021-06-25 07:51:46 +01:00
Afonso Bordado	7a5948f729	aarch64: Implement lowering i128 select	2021-06-24 16:19:25 +01:00
Chris Fallin	4a6594c514	Merge pull request #3011 from cfallin/bint-x64 Fix `bint` on x64, and make `bextend` consistent with bool representation.	2021-06-22 11:26:20 -07:00
Chris Fallin	efe3930215	Fix `bint` on x64, and make `bextend` consistent with bool representation. There has been occasional confusion with the representation that we use for bool-typed values in registers, at least when these are wider than one bit. Does a `b8` store `true` as 1, or as all-ones (`0xff`)? We've settled on the latter because of some use-cases where the wide bool becomes a mask -- see #2058 for more on this. This is fine, and transparent, to most operations within CLIF, because the bool-typed value still has only two semantically-visible states, namely `true` and `false`. However, we have to be careful with bool-to-int conversions. `bint` on aarch64 correctly masked the all-ones value down to 0 or 1, as required by the instruction specification, but on x64 it did not. This PR fixes that bug and makes x64 consistent with aarch64. While staring at this code I realized that `bextend` was also not consistent with the all-ones invariant: it should do a sign-extend, not a zero-extend as it previously did. This is also rectified and tested. (Aarch64 also already had this case implemented correctly.) Fixes #3003.	2021-06-22 10:56:56 -07:00
Chris Fallin	fa1a04d002	Merge pull request #3005 from afonso360/aarch64-i128-extend aarch64: Implement uextend/sextend for i128 values	2021-06-22 10:24:30 -07:00
Afonso Bordado	f25f5b2732	aarch64: Implement lowering uextend/sextend for i128 values	2021-06-22 12:24:07 +01:00
Chris Fallin	18cd2f681c	Merge pull request #3002 from afonso360/aarch64-i128-br aarch64 implement brz,brnz,br_icmp for i128 values	2021-06-21 10:52:50 -07:00
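
Several of the interpreter commits listed above describe instruction semantics in prose (`Cls`, `Umulhi`, `SaddSat`). The following is a minimal Rust sketch of those semantics as the commit messages state them. It is not the Cranelift interpreter's code: the function names, the fixed `i32`/`u32` scalar widths, and the `[i8; 16]` lane layout are assumptions chosen for illustration.

```rust
/// Hypothetical helper: `Cls` on a 32-bit integer. Counts the leading bits
/// that match the sign bit, not counting the sign bit itself, so 0 and -1
/// both yield 31 (one less than the width), as the commit message describes.
fn cls_i32(x: i32) -> u32 {
    // XOR with the sign-smeared value turns every bit equal to the sign bit
    // into 0. The top bit of `y` is therefore always 0.
    let y = x ^ (x >> 31);
    y.leading_zeros() - 1
}

/// Hypothetical helper: `Umulhi` on 32-bit integers. Performs an unsigned
/// multiplication at double width and keeps the high half of the result.
fn umulhi_u32(a: u32, b: u32) -> u32 {
    ((a as u64 * b as u64) >> 32) as u32
}

/// Hypothetical helper: `SaddSat` on an `i8x16` vector. Adds lane by lane
/// and clamps at the `i8` boundaries rather than wrapping around.
fn sadd_sat_i8x16(a: [i8; 16], b: [i8; 16]) -> [i8; 16] {
    let mut out = [0i8; 16];
    for i in 0..16 {
        out[i] = a[i].saturating_add(b[i]);
    }
    out
}
```

For example, `cls_i32(1)` counts the 30 zero bits between the sign bit and the lowest set bit, and `sadd_sat_i8x16([127; 16], [1; 16])` leaves every lane at 127 rather than wrapping to -128.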


66 Commits
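
The aarch64 shift fix (db5566dadb) explains that `i8` and `i16` shifts need the shift amount wrapped manually with an AND, because only the 32-bit and 64-bit instructions wrap it automatically. A minimal Rust sketch of that wrapping behaviour follows. The helper names are hypothetical and this models only the intended result, not the backend's lowering code.

```rust
/// Hypothetical helper: shift-left on an 8-bit value, with the shift amount
/// wrapped to the type width by an explicit AND (8 - 1 = 0b111).
fn ishl_i8(x: u8, amt: u32) -> u8 {
    let wrapped = amt & (u8::BITS - 1);
    x << wrapped
}

/// Hypothetical helper: the same for a 16-bit value (mask is 16 - 1 = 0b1111).
fn ishl_i16(x: u16, amt: u32) -> u16 {
    let wrapped = amt & (u16::BITS - 1);
    x << wrapped
}
```

With this masking, shifting an 8-bit value by 9 behaves like shifting it by 1, which matches what Rust's own `u8::wrapping_shl` does.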