wasmtime

Author	SHA1	Message	Date
Dan Gohman	56f11e76b4	Use PC-relative encodings for colocated functions on non-PIC. Colocated functions are expected to be defined within the PC-relative immediate range on x86-64, so allow this addressing for non-PIC as well as PIC.	2018-04-16 16:27:27 -07:00
Dan Gohman	0e57f3d0ea	Add a "colocated" flag to symbol references. (#298) This adds a "colocated" flag to function and symbolic global variables which indicates that they are defined along with the current function, so they can use PC-relative addressing. This also changes the function decl syntax; the name now always precedes the signature, and the "function" keyword is no longer included.	2018-04-13 15:00:09 -07:00
Dan Gohman	04746270b3	Rename X86Abs4/X86Abs8 to Abs4/Abs8. These relocation codes are for simple absolute addresses and aren't architecture-specific.	2018-04-13 09:11:14 -07:00
Dan Gohman	1c760ab179	Rename intel to x86. x86 is the more accurate name, as there are non-Intel x86 implementations. Fixes #263.	2018-04-12 10:02:16 -07:00
Dan Gohman	eab57c0a40	Use large-model addressing for calls when in non-PIC mode. The main use for non-PIC code at present is JIT code, and JIT code can live anywhere in memory and reference other symbols defined anywhere in memory, so it needs to use the "large" code model. func_addr and globalsym_addr instructions were already using `movabs` to support arbitrary 64-bit addresses, so this just makes calls be legalized to support arbitrary 64-bit addresses also.	2018-04-08 22:37:35 -07:00
Dan Gohman	2703b8ce6f	The current x86-32 encodings for symbolic addresses are non-PIC.	2018-04-08 22:30:55 -07:00
Dan Gohman	b0d414731c	The addend for a PCRel4 reloc should be -4 too.	2018-04-07 06:15:33 -07:00
Dan Gohman	6606b88136	Optimize immediates and compare and branch sequences (#286) * Add a pre-opt optimization to change constants into immediates. This converts 'iadd' + 'iconst' into 'iadd_imm', and so on. * Optimize away redundant `bint` instructions. Cretonne has a concept of "Testable" values, which can be either boolean or integer. When an instruction needing a "Testable" value receives the result of a `bint`, converting boolean to integer, eliminate the `bint`, as it's redundant. * Postopt: Optimize using CPU flags. This introduces a post-legalization optimization pass which converts compare+branch sequences to use flags values on CPUs which support it. * Define a form of x86's `urm` that doesn't clobber FLAGS. movzbl/movsbl/etc. don't clobber FLAGS; define a form of the `urm` recipe that represents this. * Implement a DCE pass. This pass deletes instructions with no side effects and no results that are used. * Clarify ambiguity about "32-bit" and "64-bit" in comments. * Add x86 encodings for icmp_imm. * Add a testcase for postopt CPU flags optimization. This covers the basic functionality of transforming compare+branch sequences to use CPU flags. * Pattern-match irsub_imm in preopt.	2018-03-30 12:30:07 -07:00
Tyler McMullen	951ff11f85	[WIP] Add a Trap sink to code generation (#279) * First draft of TrapSink implementation. * Add trap sink calls to 'trapif' and 'trapff' recipes. * Add SourceLoc to trap sink calls, and add trap sink calls to all loads and stores. * Add IntegerDivisionByZero trap to div recipe. * Only emit load/store traps if 'notrap' flag is not set on the instruction. * Update filetest machinery to add new trap sink functionality. * Update filetests to include traps in output. * Add a few more trap outputs to filetests. * Add trap output to CLI tool.	2018-03-28 22:48:03 -07:00
Dan Gohman	23ab07b54e	Support legalizing bconst instructions on x86.	2018-03-28 14:11:16 -07:00
Dan Gohman	79f02e42dd	Use movss/movsd rather than movd/movq for floating-point loads and stores. While there may be CPUs that have a domain crossing penalty here, this also helps the generated code look more like the code produced by other compilers.	2018-03-27 11:53:59 -07:00
Dan Gohman	ffe89cdc0a	Rename %eflags to %rflags. EFLAGS is a subregister of RFLAGS. For consistency with GPRs where we use the 64-bit names to refer to the registers, use the 64-bit name for RFLAGS as well.	2018-03-27 11:52:57 -07:00
Pat Hickey	80d2c5d9bf	Implement shift-immediate encodings for x86 (#283) * add x86 encodings for shift-immediate instructions implements encodings for ishl_imm, sshr_imm, and ushr_imm. uses 8-bit immediates. added tests for the encodings to intel/binary64.cton. Canonical versions come from llvm-mc. * translate test to use shift-immediates * shift immediate encodings: use enc_i32_i64 and note why the regular shift encodings can't use it above * add additional encoding tests for shift immediates this covers 32 bit mode, and 64 bit operations in 64 bit mode.	2018-03-26 16:48:20 -07:00
Dan Gohman	ca4582ae82	Rename the recipes for x86 spill/fill instructions. Both "sp" and "fi" have multiple meanings in this context, so use slightly longer but less ambiguous names.	2018-03-20 13:28:35 -07:00
Afnan Enayet	9a49bc2ec9	Rename `I32` -> `X86_32` and `I64` -> `X86_64` (#271) * Rename `I32` -> `X86_32` and `I64` -> `X86_64` * Format file to pass flake8 tests * Fix comment so lines are under 80 char limit * Remove trailing whitespace from comment * Renamed `enc_i64` to `enc_x86_64` as per suggestion from PR	2018-03-18 13:50:51 -07:00
Dan Gohman	b8a106adf0	Remove the "has_sse2" flag. Cretonne currently requires SSE2 support pervasively, so it's not meaningful to have a setting for it.	2018-03-12 12:38:01 -07:00
Dan Gohman	136d6f5c4b	Implement ireduce, sextend, and uextend between i8/i16 and i32/i64.	2018-03-05 15:13:59 -08:00
Dan Gohman	6e94e70f30	Use an https URL rather than http. Found by sphinx's linkcheck.	2018-03-05 06:55:27 -08:00
Dan Gohman	c59e9180de	Tidy up whitespace.	2018-03-05 06:55:27 -08:00
Julian Seward	7054f25abb	Adds support to transform integer div and rem by constants into cheaper equivalents. Adds support for transforming integer division and remainder by constants into sequences that do not involve division instructions. * div/rem by constant powers of two are turned into right shifts, plus some fixups for the signed cases. * div/rem by constant non-powers of two are turned into double length multiplies by a magic constant, plus some fixups involving shifts, addition and subtraction, that depend on the constant, the word size and the signedness involved. * The following cases are transformed: div and rem, signed or unsigned, 32 or 64 bit. The only un-transformed cases are: unsigned div and rem by zero, signed div and rem by zero or -1. * This is all incorporated within a new transformation pass, "preopt", in lib/cretonne/src/preopt.rs. * In preopt.rs, fn do_preopt() is the main driver. It is designed to be extensible to transformations of other kinds of instructions. Currently it merely uses a helper to identify div/rem transformation candidates and another helper to perform the transformation. * In preopt.rs, fn get_div_info() pattern matches to find candidates, both cases where the second arg is an immediate, and cases where the second arg is an identifier bound to an immediate at its definition point. * In preopt.rs, fn do_divrem_transformation() does the heavy lifting of the transformation proper. It in turn uses magic{S,U}{32,64} to calculate the magic numbers required for the transformations. * There are many test cases for the transformation proper: filetests/preopt/div_by_const_non_power_of_2.cton filetests/preopt/div_by_const_power_of_2.cton filetests/preopt/rem_by_const_non_power_of_2.cton filetests/preopt/rem_by_const_power_of_2.cton filetests/preopt/div_by_const_indirect.cton preopt.rs also contains a set of tests for magic number generation. * The main (non-power-of-2) transformation requires instructions that return the high word of a double-length multiply. For this, instructions umulhi and smulhi have been added to the core instruction set. These will map directly to single instructions on most non-intel targets. * intel does not have an instruction exactly like that. For intel, instructions x86_umulx and x86_smulx have been added. These map to real instructions and return both result words. The intel legaliser will rewrite {s,u}mulhi into x86_{s,u}mulx uses that throw away the lower half word. Tests: filetests/isa/intel/legalize-mulhi.cton (new file) filetests/isa/intel/binary64.cton (added x86_{s,u}mulx encoding tests)	2018-02-28 11:41:36 -08:00
Dan Gohman	ab9298eafa	Make the `fst` recipe use the deref-safe register class as well.	2018-02-28 10:12:40 -08:00
Jakob Stoklund Olesen	b9b1d0fcd5	Add a trapff instruction. This is the floating point equivalent of trapif: Trap when a given condition is in the floating-point flags. Define Intel encodings comparable to the trapif encodings.	2018-02-20 14:35:41 -08:00
Jakob Stoklund Olesen	a9e799debb	Add an avoid_div_traps setting. This enables code generation that never causes a SIGFPE signal to be raised from a division instruction. Instead, division and remainder calculations are protected by explicit traps.	2018-02-16 13:10:29 -08:00
Jakob Stoklund Olesen	3ccc3f4f9b	Add a stack_check instruction. This instruction loads a stack limit from a global variable and compares it to the stack pointer, trapping if the stack has grown beyond the limit. Also add an expand_flags transform group containing legalization patterns for ISAs with CPU flags. Fixes #234.	2018-02-13 10:48:06 -08:00
Jakob Stoklund Olesen	60e70da0e6	Add Intel encodings for ifcmp_imm. The instruction set has variants with 8-bit and 32-bit signed immediate operands. Add a TODO to use a TEST instruction for the special case ifcmp_imm x, 0.	2018-02-13 10:38:46 -08:00
Jakob Stoklund Olesen	788a78caf4	Add Intel encodings for ifcmp_sp. Also generate an Into<RegUnit> implementation for the RU enums.	2018-02-09 14:32:29 -08:00
Jakob Stoklund Olesen	69f70fc61d	Add Intel encodings for trapif. This is implemented as a macro with a conditional jump over a ud2. This way, we don't have to split up EBBs at every conditional trap.	2018-02-08 15:15:15 -08:00
Julian Seward	6f8a54b6a5	Adds support for legalizing CLZ, CTZ and POPCOUNT on baseline x86_64 targets. Changes: * Adds a new generic instruction, SELECTIF, that does value selection (a la conditional move) similarly to existing SELECT, except that it is controlled by condition code input and flags-register inputs. * Adds a new Intel x86_64 variant, 'baseline', that supports SSE2 and nothing else. * Adds new Intel x86_64 instructions BSR and BSF. * Implements generic CLZ, CTZ and POPCOUNT on x86_64 'baseline' targets using the new BSR, BSF and SELECTIF instructions. * Implements SELECTIF on x86_64 targets using conditional-moves. * new test filetests/isa/intel/baseline_clz_ctz_popcount.cton (for legalization) * new test filetests/isa/intel/baseline_clz_ctz_popcount_encoding.cton (for encoding) * Allow lib/cretonne/meta/gen_legalizer.py to generate non-snake-caseified Rust without rustc complaining. Fixes #238.	2018-02-06 09:43:00 -08:00
Tyler McMullen	ff16583c59	Remove RSP from deref safe register class as well.	2018-01-29 14:18:08 -08:00
Tyler McMullen	21f0fc39ad	Further restrict Intel register classes to prevent incorrect encoding of R12 derefs.	2018-01-29 13:42:11 -08:00
Tyler McMullen	850896f05e	The addend for a PLTRel4 reloc should be -4.	2018-01-18 14:23:00 -08:00
Tyler McMullen	eb85aa833c	Illegalize rbp/r13 for zero-offset loads on Intel x64 (#225) * Switch RegClass to a bitmap implementation. * Add special RegClass to remove r13 from 'ld' recipe. * Use MASK_LEN constant instead of magic number. * Enforce that RegClass slicing is only valid on contiguous classes. * Use Optional[int] for RegClass optional bitmask parameter. * Add comment explaining use of Intel ISA's GPR_NORIP register class.	2018-01-16 20:05:53 -08:00
Jakob Stoklund Olesen	85aab278dd	Add RISC-V encodings for b1 copy/spill/fill. We allow b1 values in general purpose registers, so we need to be able to move them around.	2018-01-16 09:19:22 -08:00
Dan Gohman	4f53cc1dad	Align IntelGOTPCRel4 with R_X86_64_GOTPCREL. Add an addend field to reloc_external, and use it to move the responsibility for accounting for the difference between the end of an instruction (where the PC is considered to be in PC-relative on intel) and the beginning of the immediate field into the encoding code. Specifically, this makes IntelGOTPCRel4 directly correspond to R_X86_64_GOTPCREL, instead of also carrying an implicit `- 4`.	2017-12-15 16:17:32 -06:00
Dan Gohman	76e31cc1ad	Rename GotPCRel4 to GOTPCRel4. This emphasizes that GOT is being used as an abbreviation rather than the word "got".	2017-12-15 16:17:32 -06:00
Pat Hickey	d444044e9e	intel isa: comments to explain rip-relative addressing encoding	2017-12-12 19:29:52 -08:00
Pat Hickey	6d44debc18	intel: add PIC variants to recipes and encodings	2017-12-12 19:29:52 -08:00
Pat Hickey	88b30ff386	refactor Reloc to an enum of every architecture's reloc types https://github.com/stoklund/cretonne/pull/206#issuecomment-350905016	2017-12-12 13:57:10 -08:00
Jakob Stoklund Olesen	f03729d742	Fix generated code for ISA predicates on encoding recipes. The generated code had syntax errors and inverted logic. Add an SSE 4.1 requirement to the floating point rounding instructions.	2017-12-08 10:37:50 -08:00
Tyler McMullen	7988d0c54c	Add 8-bit variation of adjust_sp_imm for 32-bit and 64-bit Intel.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ced39f5186	Fix up adjust_sp_imm instruction. * Use imm64 rather than offset32 * Add predicate to enforce signed 32-bit limit to imm * Remove AdjustSpImm format * Add encoding tests for adjust_sp_imm * Adjust use of adjust_sp_imm in Intel prologue_epilogue to match	2017-12-05 11:49:12 -08:00
Tyler McMullen	1a11c351b5	Add tests and documentation for x86_(push|pop). Fix up encoding issues revealed by tests.	2017-12-05 11:49:12 -08:00
Tyler McMullen	3b1b33e0ac	Add docs and tests for copy_special instruction. Fixes encoding issue that tests revealed.	2017-12-05 11:49:12 -08:00
Tyler McMullen	4eb9a54096	Convert x86_(push|pop) operations to be explicitly limited to 32-bit and 64-bit values.	2017-12-05 11:49:12 -08:00
Tyler McMullen	6ec4bfc4ca	Fix up the encodings for new instructions, both expected and actual. Make the test more accurate.	2017-12-05 11:49:12 -08:00
Tyler McMullen	e6481bb4eb	Add 32-bit encodings for x86_push, x86_pop, copy_special, and adjust_sp_imm.	2017-12-05 11:49:12 -08:00
Tyler McMullen	c92d49963a	Simplify x86_(push|pop) encodings.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ffab87318e	Add adjust_sp_imm instruction. Note: This enables using rsp and rbp as normal registers. Which is... wrong.	2017-12-05 11:49:12 -08:00
Tyler McMullen	32509ebacd	Fix push/pop encoding for extended registers. Add copy_special encoding.	2017-12-05 11:49:12 -08:00
Tyler McMullen	b8275f5713	Add (some) encodings for x86_push/pop instructions. Simple uses actually pass the legalizer now.	2017-12-05 11:49:12 -08:00
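
Note on commits b0d414731c, 850896f05e and 4f53cc1dad: the -4 addend on 4-byte PC-relative relocations follows from where x86 considers the PC to be. A rough Rust sketch of the arithmetic, assuming the usual ELF-style formula S + A - P and a 4-byte immediate that ends the instruction; the function names are hypothetical and are not Cretonne APIs.

```rust
/// Value a linker or JIT would store in a 4-byte PC-relative field,
/// using the ELF-style formula S + A - P, where S is the symbol address,
/// A is the addend and P is the address of the field being patched.
/// (Hypothetical helper; truncation to 32 bits is not range-checked.)
pub fn pcrel4_field_value(symbol: u64, addend: i64, field_addr: u64) -> i32 {
    (symbol as i64)
        .wrapping_add(addend)
        .wrapping_sub(field_addr as i64) as i32
}

/// Address the CPU actually reaches: x86 adds the displacement to the
/// address of the *next* instruction, which is the end of the 4-byte
/// field when the immediate is the last part of the instruction.
pub fn resolved_target(field_addr: u64, disp: i32) -> u64 {
    field_addr.wrapping_add(4).wrapping_add(disp as i64 as u64)
}
```

With an addend of -4, the stored displacement is S - (P + 4), so the displacement plus the end-of-instruction address lands exactly on the symbol. With an addend of 0 the result would be 4 bytes past it.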


162 Commits