wasmtime

Author	SHA1	Message	Date
Jakob Stoklund Olesen	15461c1e4b	Add two new value types: iflags and fflags. These two value types represent the state of CPU flags after an integer comparison and a floating point comparison respectively. Instructions using these types TBD.	2017-10-12 19:05:24 -07:00
Jakob Stoklund Olesen	dbaa919ca9	Make room for SpecialType in the value type numbering. The value types are now classified into three groups: 1. Lane types are scalar types that can also be used to form vectors. 2. Vector types 2-256 copies of a lane type. 3. Special types. This is where the CPU flag types will go. The special types can't be used to form vectors. Change the numbering scheme for value types to make room for the special types and add `is_lane()` and `is_special()` classification methods. The VOID type still has number 0, but it can no longer appear as a vector lane. It classifies as special now.	2017-10-12 12:48:55 -07:00
Jakob Stoklund Olesen	89a24b2f13	Rename ScalarType to LaneType. The word "scalar" is a bit vague and tends to mean "non-vector". Since we are about to add new CPU flag value types that can't appear as vector lanes, make the distinction clear: LaneType represents value types that can appear as a vector lane. Also replace the Type::is_scalar() method with an is_vector() method.	2017-10-12 10:39:12 -07:00
Jakob Stoklund Olesen	994af598f5	Avoid interference on CFG edges. Track allocatable registers both locally and globally: Add a second AllocatableSet which tracks registers allocated to global values without accounting for register diversions. Since diversions are only local to an EBB, global values must be assigned un-diverted locations that don't interfere. Handle the third "global" interference domain in the constraint solver in addition to the existing "input" and "output" domains. Extend the solver error code to indicate when a global define just can't be allocated because there are not enough available global registers. Resolve this problem by replacing the instruction's global defines with local defines that are copied into their global destinations afterwards.	2017-10-11 15:38:30 -07:00
Jakob Stoklund Olesen	ba52a38597	Add a t8jccd_long encoding recipe for brz.b1 and brnz.b1 in 32-bit mode. The register allocator can't handle branches with constrained register operands, and the brz.b1/brnz.b1 instructions only have the t8jccd_abcd in 32-bit mode where no REX prefixes are possible. This adds a worst case encoding for those cases where a b1 value lives in a non-ABCD register.	2017-10-11 14:20:43 -07:00
Dan Gohman	733870aee4	Make FuncEnvironment independent from ModuleEnvironment (formerly WasmRuntime). This renames WasmRuntime to ModuleEnvironment, and makes several changes to allow for more flexible compilation. ModuleEnvironment no longer derives from FuncEnvironment, and no longer has the `begin_translation` and `next_translation` functions, so that independent `FuncEnvironment` instances can operate within the same module. Also, this obviates the rest of TranslationResult, as it moves processing of function bodies into the environment. The DummyEnvironment implementation gives an example of decoding the function bodies as they are parsed, however other implementation strategies are now possible.	2017-10-10 15:50:24 -07:00
Dan Gohman	653e8bb563	Declare exports via the WasmRuntime. Also, redo how functions are named in the DummyRuntime. Use the FunctionName field to just encode the wasm function index rather than trying to shoehorn a printable name into it. And to make up for that, teach the wasm printer to print export names as comments next to the function definitions. This also makes the fields of DummyRuntime public, in preparation for the DummyRuntime to have a more general-purpose debugging role, as well as possibly to allow it to serve as a base for other implementations.	2017-10-10 08:46:10 -07:00
Jakob Stoklund Olesen	90ed698e83	Add an unreachable code elimination pass. The register allocator doesn't even try to compile unreachable EBBs, so any values defined in such blocks won't be assigned registers. Since the dominator tree already has determined which EBBs are reachable, we should just eliminate any unreachable blocks instead o trying to do something with the dead code. Not that this is not a "dead code elimination" pass which would also remove individual instructions whose results are not used.	2017-10-09 15:26:27 -07:00
Dan Gohman	6aeeaebbd3	Disallow branching to the entry block. Functions that would otherwise start with a loop should start with a separate ebb which just branches to the header of the loop.	2017-10-09 15:02:17 -07:00
Jakob Stoklund Olesen	b3fa47cacc	Add support for emergency spill slots. - Create a new kind of stack slot: emergency_slot. - Add a get_emergency_slot() method which finds a suitable emergency slot given a list of slots already in use. - Use emergency spill slots when schedule_moves needs them.	2017-10-06 10:45:13 -07:00
Jakob Stoklund Olesen	30aeb57083	Add a value location verifier. This is a verification pass that can be run after register allocation. It verifies that value locations are consistent with constraints on their uses, and that the register diversions are consistent. Make it clear that register diversions are local to an EBB only. This affects what branch relaxation is allowed to do. The verify_locations() takes an optional Liveness parameter which is used to check that no diverted values are live across CFG edges.	2017-10-05 13:59:18 -07:00
Dan Gohman	e4ef2cbf22	Support ISA-specific settings in the `--isa` command-line option.	2017-10-05 10:06:40 -07:00
Jakob Stoklund Olesen	73d4bb47c0	Intel encodings for regspill and regfill. These are always SP-based.	2017-10-04 17:02:09 -07:00
Jakob Stoklund Olesen	826d4062fb	Apply register diversions during binemit tests. When "binemit" tests encode instructions, keep track of the current set of register diversions, and use the diverted locations to check operand constraints. This matches how constraints are applied during a real binemit phase.	2017-10-04 17:02:09 -07:00
Jakob Stoklund Olesen	dda3efcbdd	Add regspill and regfill instructions. These are parallels to the existing regmove instruction, but the divert the value to and from a stack slot. Like regmove diversions, this is a temporary diversion that must be local to the EBB.	2017-10-04 17:02:09 -07:00
Dan Gohman	196795017b	Tidy up handling of the DummyRuntime.	2017-10-04 16:57:39 -07:00
Jakob Stoklund Olesen	fb0999ce33	Check the top-level register class for available registers. Fixes #165. The constraint solver's schedule_move() function sometimes need to use an extra available register when the moves to be scheduled contains cycles. The pending moves have associated register classes that come from the constraint programming. Since the moves have hard-coded to and from registers, these register classes are only meant to indicate the register sizes. In particular, we can use the whole top-level register class when scavenging for a spare register to break a cycle.	2017-10-03 14:12:18 -07:00
Jakob Stoklund Olesen	c091a695e6	Fix coalescer bug exposed by the gvn-unremovable-phi test. When we detect interference between the values that have already been merged into the candidate virtual register and an EBB argument, we first try to resolve the conflict by splitting. We also check if the existing interfering value is fundamentally incompatible with the branch instruction so it needs to be removed from the virtual register, restarting the merge operation. However, this existing interfering value is not necessarily the only interference, so the split is not guaranteed to resolve the conflict. If it turns out that splitting didn't resolve the conflict, restart the merge after removing this second conflicting value.	2017-10-03 11:13:46 -07:00
Jakob Stoklund Olesen	ef048b8899	Allow for call args in incoming stack slots. A value passed as an argument to a function call may live in an incoming stack slot initially. Fix the call legalizer so it copies such an argument into the expected outgoing stack slot for the call.	2017-10-03 11:13:46 -07:00
Dan Gohman	d857aacec3	Sniff the wasm magic bytes, rather than relying on the filename extension.	2017-10-03 09:43:37 -07:00
Dan Gohman	ba14499fe9	Factor out the code for reading a file into a utility function.	2017-10-03 09:43:30 -07:00
Dan Gohman	f064418652	Refactor set/isa parsing into a utility function.	2017-10-03 09:43:27 -07:00
Dan Gohman	12ab4cd914	Add a cton-util compile command.	2017-10-03 09:42:50 -07:00
Dan Gohman	25b8b45a41	Avoid unnecessary BufReaders.	2017-10-03 09:34:41 -07:00
Dan Gohman	eba55cb1da	Make the "wat2wasm" error message not panic.	2017-10-03 09:31:42 -07:00
Dan Gohman	1efa670f60	Do a full compile in 'cton-util wasm'. This removes the `optimize` option, as one can do that with `--set`, eg. `--set opt_level=best`. And it adds an option to print the compilation output.	2017-10-03 09:30:45 -07:00
Jakob Stoklund Olesen	5f56f81251	Resolve all value aliases when computing live ranges. Value aliases are only in the way during register allocation, so make sure they are all dead as we enter the register allocation passes.	2017-09-29 15:54:06 -07:00
Jakob Stoklund Olesen	c82e68efea	Eliminate the ABCD register class constaint in REX encodings. Some REX-less encodings require an ABCD input because they are looking at 8-bit registers. This constraint doesn't apply with a REX prefix where the low 8 bits of all registers are addressable.	2017-09-29 15:29:25 -07:00
Jakob Stoklund Olesen	51a6901a7f	Implement coloring::iterate_solution(). It can happen that the currently live registers are blocking a smaller register class completely, so the only way of solving the allocation problem is to turn some of the live-through registers into solver variables. When the quick_solve attempt fails, try to free up registers in the critical register class by turning live-through values into solver variables.	2017-09-29 14:55:35 -07:00
Jakob Stoklund Olesen	45888ab84e	Reload for spilled call return values. When the return value from a call has been spilled, the reload pass needs to insert a spill instruction right after the call instruction which returns its results in registers.	2017-09-29 11:25:38 -07:00
Jakob Stoklund Olesen	53404a9387	Check for invalid special type constraints. The extend and reduce instructions have additional type constraints. Stop inserting sextend instructions after ctz, clz, and popcnt when translating from WebAssembly. The Cretonne instructions have the same signature as the WebAssembly equivalents.	2017-09-28 16:30:19 -07:00
Jakob Stoklund Olesen	8abcdac5a1	Legalize fcvt_to_sint and fcvt_to_uint for Intel64. We need to generate traps on NaN and overflow.	2017-09-28 12:00:38 -07:00
Jakob Stoklund Olesen	34146435e5	Legalize unsigned-to-float conversions for Intel 64. Also make sure we generate type checks for the controlling type variable in legalization patterns. This is not needed for encodings since the encoding tables are already keyed on the controlling type variable.	2017-09-28 11:39:19 -07:00
Jakob Stoklund Olesen	a274cdf275	Fix the Intel encoding of band_not. The andnps instruction inverts its first argument while band_not inverts is second argument. Use a swapped-operands "fax" encoding recipe.	2017-09-27 18:14:13 -07:00
Jakob Stoklund Olesen	de9b1d1004	Enable debug assertions for cretonne-tools. The cton-util executable is used in --release mode to run the file tests. We want assertions enabled for that. This doesn't affect the compiler flags used to build the Cretonne crate when it is used as a dependency in another project.	2017-09-27 12:55:34 -07:00
Jakob Stoklund Olesen	84471a8431	Add some very basic support for the Intel32 ABI. In 32-bit mode, all function arguments are passed on the stack, not in registers. This ABI support is not complete or properly tested, but at least it doesn't try to pass arguments in r8.	2017-09-27 12:55:34 -07:00
Jakob Stoklund Olesen	b6b474a8c9	Add Intel legalization for fmin and fmax. The native x86_fmin and x86_fmax instructions don't behave correctly for NaN inputs and when comparing +0.0 to -0.0, so we need separate branches for those cases.	2017-09-27 12:55:34 -07:00
Jakob Stoklund Olesen	44eab3e158	Add Intel regmove encodings for floating point types.	2017-09-27 12:49:54 -07:00
Jakob Stoklund Olesen	1fe7890700	Add x86_fmin and x86_fmax instructions. These Intel-specific instructions represent the semantics of the minss / maxss Intel instructions which behave more like a C ternary operator than the WebAssembly fmin and fmax instructions. They will be used as building blocks for implementing the WebAssembly semantics.	2017-09-27 09:17:09 -07:00
Jakob Stoklund Olesen	ac69f3bfdf	Add an Intel-specific x86_cvtt2si instruction. This is used to represent the non-trapping semantics of the cvttss2si and cvttsd2si instructions (and their vectorized counterparts). The overflow behavior of this instruction is specific to the Intel ISAs. There is no float-to-i64 instruction on the 32-bit Intel ISA.	2017-09-26 15:44:41 -07:00
Jakob Stoklund Olesen	6ff681a90d	Add general legalization for the select instruction.	2017-09-26 14:16:35 -07:00
Jakob Stoklund Olesen	ce767be703	Intel encodings for floating point copies.	2017-09-26 13:54:38 -07:00
Jakob Stoklund Olesen	7fb6159a85	Add Intel encodings for the fcmp instruction. Not all floating point condition codes are directly supported by the ucimiss/ucomisd instructions. Some inequalities need to be reversed and eq+ne require two separate tests.	2017-09-26 11:17:32 -07:00
Jakob Stoklund Olesen	79968a2325	Add standard expansions for fcopysign. This is also just a sign bit manipulation.	2017-09-25 15:17:32 -07:00
Jakob Stoklund Olesen	6bec5f8507	Intel encodings for nearest/floor/ceil/trunc. These floating point rounding operations all use the roundss/roundsd instructions that are available in SSE 4.1.	2017-09-25 15:08:04 -07:00
Jakob Stoklund Olesen	ac343ba92a	Add encodings for square root instructions.	2017-09-25 13:15:09 -07:00
Dan Gohman	14d6d1117d	Clean up unneeded '&'s.	2017-09-25 13:05:29 -07:00
Jakob Stoklund Olesen	8deca67968	Add legalization patterns for fabs and fneg. These sign bit manipulations need to use a -0.0 floating point constant which we didn't have a way of materializing previously. Add a ieee32.bits(0x...) syntax to the Python AST nodes that creates am f32 immediate value with the exact requested bitwise representation.	2017-09-25 12:15:33 -07:00
Jakob Stoklund Olesen	ba1c50d6c1	Test WebAssembly floating point constants. f64.const does not yet work on 32-bit Intel.	2017-09-25 11:06:18 -07:00
Jakob Stoklund Olesen	fdb97da21b	Implement a poor man's jump table. We will eventually support real jump tables, but for now just expand br_table into a sequence of conditional branches.	2017-09-25 10:56:14 -07:00

1 2 3 4 5 ...

630 Commits