wasmtime

Author	SHA1	Message	Date
Dan Gohman	4f53cc1dad	Align IntelGOTPCRel4 with R_X86_64_GOTPCREL. Add an addend field to reloc_external, and use it to move the responsibility for accounting for the difference between the end of an instruction (where the PC is considered to be in PC-relative on intel) and the beginning of the immediate field into the encoding code. Specifically, this makes IntelGOTPCRel4 directly correspond to R_X86_64_GOTPCREL, instead of also carrying an implicit `- 4`.	2017-12-15 16:17:32 -06:00
Dan Gohman	76e31cc1ad	Rename GotPCRel4 to GOTPCRel4. This emphasizes that GOT is being used as an abbreviation rather than the word "got".	2017-12-15 16:17:32 -06:00
Pat Hickey	d444044e9e	intel isa: comments to explain rip-relative addressing encoding	2017-12-12 19:29:52 -08:00
Pat Hickey	6d44debc18	intel: add PIC variants to recipes and encodings	2017-12-12 19:29:52 -08:00
Pat Hickey	88b30ff386	refactor Reloc to an enum of every architecture's reloc types https://github.com/stoklund/cretonne/pull/206#issuecomment-350905016	2017-12-12 13:57:10 -08:00
Jakob Stoklund Olesen	f03729d742	Fix generated code for ISA predicates on encoding recipes. The generated code had syntax errors and inverted logic. Add an SSE 4.1 requirement to the floating point rounding instructions.	2017-12-08 10:37:50 -08:00
Tyler McMullen	7988d0c54c	Add 8-bit variation of adjust_sp_imm for 32-bit and 64-bit Intel.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ced39f5186	Fix up adjust_sp_imm instruction. * Use imm64 rather than offset32 * Add predicate to enforce signed 32-bit limit to imm * Remove AdjustSpImm format * Add encoding tests for adjust_sp_imm * Adjust use of adjust_sp_imm in Intel prologue_epilogue to match	2017-12-05 11:49:12 -08:00
Tyler McMullen	1a11c351b5	Add tests and documentation for x86_(push\|pop). Fix up encoding issues revealed by tests.	2017-12-05 11:49:12 -08:00
Tyler McMullen	3b1b33e0ac	Add docs and tests for copy_special instruction. Fixes encoding issue that tests revealed.	2017-12-05 11:49:12 -08:00
Tyler McMullen	4eb9a54096	Convert x86_(push\|pop) operations to be explicitly limited to 32-bit and 64-bit values.	2017-12-05 11:49:12 -08:00
Tyler McMullen	6ec4bfc4ca	Fix up the encodings for new instructions, both expected and actual. Make the test more accurate.	2017-12-05 11:49:12 -08:00
Tyler McMullen	e6481bb4eb	Add 32-bit encodings for x86_push, x86_pop, copy_special, and adjust_sp_imm.	2017-12-05 11:49:12 -08:00
Tyler McMullen	c92d49963a	Simplify x86_(push\|pop) encodings.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ffab87318e	Add adjust_sp_imm instruction. Note: This enables using rsp and rbp as normal registers. Which is... wrong.	2017-12-05 11:49:12 -08:00
Tyler McMullen	32509ebacd	Fix push/pop encoding for extended registers. Add copy_special encoding.	2017-12-05 11:49:12 -08:00
Tyler McMullen	b8275f5713	Add (some) encodings for x86_push/pop instructions. Simple uses actually pass the legalizer now.	2017-12-05 11:49:12 -08:00
Tyler McMullen	8ed37e352e	Add x86_push and x86_pop instructions.	2017-12-05 11:49:12 -08:00
Dan Gohman	5f8b1b9f04	Fix a flake8 lint.	2017-10-31 13:05:26 -07:00
Dan Gohman	5d063eb8bc	Merge reloc_func and reloc_globalsym into reloc_external.	2017-10-31 12:26:33 -07:00
Dan Gohman	9c54c3fff0	Introduce globalsym_addr. This is an instruction used in legalization of GlobalVarData::Sym global variables.	2017-10-30 13:26:56 -07:00
Dan Gohman	cb805f704d	Put BaldrMonkey-specific behavior under a setting. BaldrMonkey will need to enable allones_funcaddrs.	2017-10-30 13:26:56 -07:00
Dan Gohman	fae5ffb556	Make generated code more consistent with current rustfmt.	2017-10-30 10:06:23 -07:00
Jakob Stoklund Olesen	02e81dd1d7	Fix build after flake8 update. There's a new version of flake8 out which doesn't like variables names i, l, I. No functional change intended.	2017-10-25 11:40:37 -07:00
Jakob Stoklund Olesen	620eb7effe	Add a "clobbers_flags" flag to encoding recipes. On some ISAs like Intel's, all arithmetic instructions set all or some of the CPU flags, so flag values can't be live across these instructions. On ISAs like ARM's Aarch32, flags are clobbered by compact 16-bit encodings but not necessarily by 32-bit encodings of the same instruction. The "clobbers_flags" bit on the encoding recipe is used to indicate if CPU flag values can be live across an instruction, or conversely whether the encoding can be used where flag values are live.	2017-10-16 14:40:28 -07:00
Jakob Stoklund Olesen	5d065c4d8f	Add encodings for CPU flags instructions. Branch on flags: brif, brff, Compare integers to flags: ifcmp Compare floats to flags: ffcmp Convert flags to b1: trueif, trueff	2017-10-16 13:07:23 -07:00
Jakob Stoklund Olesen	0f4f663584	Add register banks for CPU flags to Intel and ARM ISAs. The arm32 ISA technically has separate floating point and integer flags, but the only useful thing you can do with the floating point flags is to copy them ti the integer flags, so there is not need to model them. The arm64 ISA fixes this and the fcmp instruction writes the integer nzcv flags directly. RISC-V does not have CPU flags.	2017-10-13 14:02:09 -07:00
Jakob Stoklund Olesen	ba52a38597	Add a t8jccd_long encoding recipe for brz.b1 and brnz.b1 in 32-bit mode. The register allocator can't handle branches with constrained register operands, and the brz.b1/brnz.b1 instructions only have the t8jccd_abcd in 32-bit mode where no REX prefixes are possible. This adds a worst case encoding for those cases where a b1 value lives in a non-ABCD register.	2017-10-11 14:20:43 -07:00
Jakob Stoklund Olesen	ece09f2df2	Add encodings for spill.b1, fill.b1 etc. These spills and fills use 32-bit writes, knowing that the spill slot is minimum 4 bytes which makes it safe. Also simplify the definition of load/store encodings a bit by introducing loops.	2017-10-11 14:20:43 -07:00
Jakob Stoklund Olesen	ecd537ecd6	Avoid widening TailRecipe register constraints automatically. Most recipes with an ABCD constraint can handle the full GPR register class when a REX prefix is applied, but not all. The "icscc" macro recipe always generates a setCC instruction with no REX prefix, so it can only write the ABCD registers, even in its REX form. Don't automatically rewrite ABCD constraints to GPR constraints when applying a REX prefix to a tail recipe. Instead, allow individual ABCD recipes to specify a "when_prefixed" alternative recipe to use. This also eliminates the spurious Rex*abcd recipe names which didn't have an ABCD constraint. Also allow recipes to specify that a REX prefix is required by setting the prefix_required flag. This is used by recipes like t8jccb which explicitly accesses an 8-bit register with a GPR constraint which is only valid with a prefix.	2017-10-09 14:08:37 -07:00
Jakob Stoklund Olesen	73d4bb47c0	Intel encodings for regspill and regfill. These are always SP-based.	2017-10-04 17:02:09 -07:00
Jakob Stoklund Olesen	e10b3117cb	Rename enc_flt() to enc_both(). This encoding method is not only used for floating point instructions.	2017-10-03 13:27:00 -07:00
Jakob Stoklund Olesen	c82e68efea	Eliminate the ABCD register class constaint in REX encodings. Some REX-less encodings require an ABCD input because they are looking at 8-bit registers. This constraint doesn't apply with a REX prefix where the low 8 bits of all registers are addressable.	2017-09-29 15:29:25 -07:00
Jakob Stoklund Olesen	51a6901a7f	Implement coloring::iterate_solution(). It can happen that the currently live registers are blocking a smaller register class completely, so the only way of solving the allocation problem is to turn some of the live-through registers into solver variables. When the quick_solve attempt fails, try to free up registers in the critical register class by turning live-through values into solver variables.	2017-09-29 14:55:35 -07:00
Jakob Stoklund Olesen	86e22e7de5	Add long-range encodings for conditional branches. The brz and brnz instructions get support for 32-bit jump displacements for long range branches. Also change the way branch ranges are specified on tail recipes for the Intel instructions. All branch displacements are relative to the end of the instruction, so just compute the branch range origin as the instruction size instead of trying to specify it in the tail recipe definitions.	2017-09-29 13:18:29 -07:00
Jakob Stoklund Olesen	711e5cd644	Handle srem INT_MIN, -1 correctly. The x86_divmodx traps on integer overflow, but the srem instruction is not supposed to trap with a -1 divisor. Generate a legalization expansion for srem that special-cases the -1 divisor to simply return 0.	2017-09-29 08:53:49 -07:00
Jakob Stoklund Olesen	8abcdac5a1	Legalize fcvt_to_sint and fcvt_to_uint for Intel64. We need to generate traps on NaN and overflow.	2017-09-28 12:00:38 -07:00
Jakob Stoklund Olesen	34146435e5	Legalize unsigned-to-float conversions for Intel 64. Also make sure we generate type checks for the controlling type variable in legalization patterns. This is not needed for encodings since the encoding tables are already keyed on the controlling type variable.	2017-09-28 11:39:19 -07:00
Jakob Stoklund Olesen	a274cdf275	Fix the Intel encoding of band_not. The andnps instruction inverts its first argument while band_not inverts is second argument. Use a swapped-operands "fax" encoding recipe.	2017-09-27 18:14:13 -07:00
Jakob Stoklund Olesen	b6b474a8c9	Add Intel legalization for fmin and fmax. The native x86_fmin and x86_fmax instructions don't behave correctly for NaN inputs and when comparing +0.0 to -0.0, so we need separate branches for those cases.	2017-09-27 12:55:34 -07:00
Jakob Stoklund Olesen	384b04b411	Fix some misnamed TailRecipes and add a consistency check.	2017-09-27 12:55:34 -07:00
Jakob Stoklund Olesen	44eab3e158	Add Intel regmove encodings for floating point types.	2017-09-27 12:49:54 -07:00
Jakob Stoklund Olesen	1fe7890700	Add x86_fmin and x86_fmax instructions. These Intel-specific instructions represent the semantics of the minss / maxss Intel instructions which behave more like a C ternary operator than the WebAssembly fmin and fmax instructions. They will be used as building blocks for implementing the WebAssembly semantics.	2017-09-27 09:17:09 -07:00
Jakob Stoklund Olesen	ac69f3bfdf	Add an Intel-specific x86_cvtt2si instruction. This is used to represent the non-trapping semantics of the cvttss2si and cvttsd2si instructions (and their vectorized counterparts). The overflow behavior of this instruction is specific to the Intel ISAs. There is no float-to-i64 instruction on the 32-bit Intel ISA.	2017-09-26 15:44:41 -07:00
Jakob Stoklund Olesen	ce767be703	Intel encodings for floating point copies.	2017-09-26 13:54:38 -07:00
Jakob Stoklund Olesen	7fb6159a85	Add Intel encodings for the fcmp instruction. Not all floating point condition codes are directly supported by the ucimiss/ucomisd instructions. Some inequalities need to be reversed and eq+ne require two separate tests.	2017-09-26 11:17:32 -07:00
Jakob Stoklund Olesen	6bec5f8507	Intel encodings for nearest/floor/ceil/trunc. These floating point rounding operations all use the roundss/roundsd instructions that are available in SSE 4.1.	2017-09-25 15:08:04 -07:00
Jakob Stoklund Olesen	ac343ba92a	Add encodings for square root instructions.	2017-09-25 13:15:09 -07:00
Jakob Stoklund Olesen	29dfcf5dfb	Add spill/fill encodings for Intel ISAs. To begin with, these are catch-all encodings with a SIB byte and a 32-bit displacement, so they can access any stack slot via both the stack pointer and the frame pointer. In the future, we will add encodings for 8-bit displacements as well as EBP-relative references without a SIB byte.	2017-09-22 16:05:26 -07:00
Angus Holder	b003605132	Adapt intel to be able to correctly choose compressed instruction encodings: create a register class to identify the lower 8 registers, omit unnecessary REX prefixes, and fix the tests	2017-09-22 07:54:26 -07:00

1 2 3

129 Commits