wasmtime

Author	SHA1	Message	Date
Jakob Stoklund Olesen	788a78caf4	Add Intel encodings for ifcmp_sp. Also generate an Into<RegUnit> implementation for the RU enums.	2018-02-09 14:32:29 -08:00
Jakob Stoklund Olesen	69f70fc61d	Add Intel encodings for trapif. This is implemented as a macro with a conditional jump over a ud2. This way, we don't have to split up EBBs at every conditional trap.	2018-02-08 15:15:15 -08:00
Jakob Stoklund Olesen	11c721934c	Add a trapif instruction. This is a conditional trap controlled by integer CPU flags. Compare to brif.	2018-02-08 14:40:46 -08:00
Julian Seward	6f8a54b6a5	Adds support for legalizing CLZ, CTZ and POPCOUNT on baseline x86_64 targets. Changes: * Adds a new generic instruction, SELECTIF, that does value selection (a la conditional move) similarly to existing SELECT, except that it is controlled by condition code input and flags-register inputs. * Adds a new Intel x86_64 variant, 'baseline', that supports SSE2 and nothing else. * Adds new Intel x86_64 instructions BSR and BSF. * Implements generic CLZ, CTZ and POPCOUNT on x86_64 'baseline' targets using the new BSR, BSF and SELECTIF instructions. * Implements SELECTIF on x86_64 targets using conditional-moves. * new test filetests/isa/intel/baseline_clz_ctz_popcount.cton (for legalization) * new test filetests/isa/intel/baseline_clz_ctz_popcount_encoding.cton (for encoding) * Allow lib/cretonne/meta/gen_legalizer.py to generate non-snake-caseified Rust without rustc complaining. Fixes #238.	2018-02-06 09:43:00 -08:00
Jakob Stoklund Olesen	1bbc529ef9	Improve the variable ordering used by the coloring constraint solver. The fuzzer bugs #219 and #227 are both cases where the register allocator coloring pass "runs out of registers". What's really happening is that the constraint solver failed to find a solution, even when one existed. Suppose we have three solver variables: v0(GPR, out, global), v1(GPR, in), v2(GPR, in, out). And suppose registers %r0 and %r1 are available on both input and output sides of the instruction, but only %r1 is available for global outputs. A valid solution would be: v0 -> %r1, v1 -> %r1, v2 -> %r0. However, the solver would pick registers for the three values in numerical order because v1 and v2 have the same domain size (=2). This would assign v1 -> %r0 and then fail to find a free register for v2. Fix this by prioritizing in+out variables over single-sided variables even when their domains are equal. This means that v2 gets assigned a register before v1, and it gets a chance to pick a register that is still available on both in and out sides. Also try to avoid depending on value numbers in the solver. These bugs were hard to reproduce because a test case invariably would have different value numbers, causing the solver to order its variables differently and succeed. Throw in the previous solution and original register assignments as tie breakers which are stable and not dependent on value numbers. This is still not a substitute for a proper solver search algorithm that we will probably have to write eventually. Fixes #219. Fixes #227.	2018-01-19 13:31:26 -08:00
Tyler McMullen	14e39db428	Add filetest for statically out-of-bound heap addresses.	2018-01-18 15:49:10 -08:00
Tyler McMullen	df210bfdea	Fix the Intel x64 PIC 'call' test, adding correct addend.	2018-01-18 14:23:00 -08:00
Jakob Stoklund Olesen	1e49431804	Add test case from #216. The error exposed by this test case no longer happens after the coalescer was rewritten to follow the Budimlic paper. It's still a good coalescer test. Fixes #216 by including the test case.	2018-01-17 16:19:51 -08:00
Jakob Stoklund Olesen	dcad3fa339	Fix coloring bug with combined constraints and global values. The Intel instruction "v1 = ushr v2, v2" will implicitly fix the output register for v2 to %rcx because the output is tied to the first input operand and the second input operand is fixed to %rcx. Make sure we handle this transitive constraint when checking for interference with the globally live registers. Fixes #218	2018-01-17 15:51:08 -08:00
Jakob Stoklund Olesen	0a6500c99a	Avoid making solver variables for fixed input constraints. When the coloring pass sees an instruction with a fixed input register constraint that is already satisfied, make sure to tell the solver about it anyway. There are situations where the solver wants to convert a value to a solver variable, and we can't allow that if the same value is also used for a fixed register operand. Fixes #221.	2018-01-17 15:01:00 -08:00
Jakob Stoklund Olesen	13af22b46b	Track register pressure for dead EBB parameters. The spiller wasn't tracking register pressure correctly for dead EBB parameters in visit_ebb_header(). Make sure we free any dead EBB parameters. Fixes #223	2018-01-17 13:19:08 -08:00
Jakob Stoklund Olesen	d1f236b00a	Reimplement coalescer following the Budimlic paper. The old coalescing algorithm had some algorithmic complexity issues when dealing with large virtual registers. Reimplement to use a proper union-find algorithm so we only need one pass through the dominator forests for virtual registers that are interference free. Virtual registers that do have interference are split and new registers built. This pass is about twice as fast as the old one when dealing with complex virtual registers.	2018-01-16 12:32:04 -08:00
Jakob Stoklund Olesen	cacba1a58f	Don't allow EBB parameters to be ghost values. Ghost instructions and values are supposed to be stored as metadata alongside the compiled program such that the ghost values can be computed from the real register/stack values when the program is stopped for debugging or de-optimization. If we allow an EBB parameter to be a ghost value, we have no way of computing its real value using ghost instructions. We would need to know a complete execution trace of the stopped program to figure out which values were passed to the ghost parameter. Instead we require EBB parameters to be real values materialized in registers or on the stack. We use the regclass_for_abi_type() TargetIsa callback to determine the initial register class for these parameters. They can then be spilled later if needed. Fixes #215.	2018-01-11 16:48:02 -08:00
Jakob Stoklund Olesen	5e094034d4	Fix verifier bug in unreachable code. We want to disable dominance checks in unreachable code. The is_reachable() check for EBB parameter values was checking if the defining EBB was reachable, not the EBB using the value. This bug showed up in fuzzing and in #213.	2018-01-09 10:47:49 -08:00
Dan Gohman	4f53cc1dad	Align IntelGOTPCRel4 with R_X86_64_GOTPCREL. Add an addend field to reloc_external, and use it to move the responsibility for accounting for the difference between the end of an instruction (where the PC is considered to be in PC-relative on intel) and the beginning of the immediate field into the encoding code. Specifically, this makes IntelGOTPCRel4 directly correspond to R_X86_64_GOTPCREL, instead of also carrying an implicit `- 4`.	2017-12-15 16:17:32 -06:00
Dan Gohman	76e31cc1ad	Rename GotPCRel4 to GOTPCRel4. This emphasizes that GOT is being used as an abbreviation rather than the word "got".	2017-12-15 16:17:32 -06:00
Jakob Stoklund Olesen	febe8e0e51	Allow spilling of EBB arguments. When the spiller needs to make a register available for a conditional branch instruction, it can be necessary to spill some of the EBB arguments on the branch instruction. This is ok because EBB argument values belong to the same virtual register as the corresponding EBB parameter and we spill the whole virtreg to the same slot. Also make sure free_regs() can handle values that are killed by the current instruction and spilled.	2017-12-14 13:57:13 -06:00
Jakob Stoklund Olesen	d617d5e0f3	Use a domtree pre-order instead of a CFG RPO for coalescing. The stack implementation of the Budimlic dominator forest doesn't work correctly with a CFG RPO. It needs the domtree pre-order. Also handle EBB pre-order vs inst-level pre-order. Manage the stack according to EBB dominance. Look for a dominating value by searching the stack. This is different from the Budimlic algorithm because we're computing the dominator tree pre-order with EBB granularity only. Fixes #207.	2017-12-13 16:22:01 -06:00
Jakob Stoklund Olesen	a825427786	Avoid reloading spilled EBB arguments. The coalescer makes sure that matching EBB arguments and parameters are always in the same virtual registers, and therefore also in the same stack slot if they are spilled. This means that the reload pass should never rewrite an EBB argument if the argument value is spilled. This comes up in cases where the branch instruction needs the same value in a register: `brnz v9, ebb3(v9)`. If the virtual register containing v9 is spilled, the branch instruction must be reloaded like: `v52 = fill v9` followed by `brnz v52, ebb3(v9)`. The branch register argument must be rewritten, and the EBB argument must be referring to the original stack value. Fixes #208.	2017-12-13 15:22:05 -06:00
Pat Hickey	ed81bc21be	filetests: add filetests for intel PIC encodings	2017-12-12 19:29:52 -08:00
Jakob Stoklund Olesen	a888b2a6f1	Dominator tree pre-order. Add a DominatorTreePreorder data structure which can be initialized for a DominatorTree and used for queries involving a pre-order of the dominator tree. Print out the pre-order and send it through filecheck in "test domtree" file tests.	2017-12-08 17:43:15 -08:00
Jakob Stoklund Olesen	7d5f2f0404	Convert the CFG traversal tests to file tests. Add a "cfg_postorder:" printout to the "test domtree" file tests and use that to check the computed CFG post-order instead of doing it manually with Rust code.	2017-12-08 13:58:18 -08:00
Jakob Stoklund Olesen	a7eb13a151	Expand unknown instructions to runtime library calls.	2017-12-08 10:37:50 -08:00
Jakob Stoklund Olesen	f03729d742	Fix generated code for ISA predicates on encoding recipes. The generated code had syntax errors and inverted logic. Add an SSE 4.1 requirement to the floating point rounding instructions.	2017-12-08 10:37:50 -08:00
Tyler McMullen	7988d0c54c	Add 8-bit variation of adjust_sp_imm for 32-bit and 64-bit Intel.	2017-12-05 11:49:12 -08:00
Tyler McMullen	5783ea2c9a	Account for return address when reserving stack space for CSRs.	2017-12-05 11:49:12 -08:00
Tyler McMullen	a75248d2cf	Move the initial stack pointer adjustment to after the CSR pushes.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ebcbd54f61	Add 'compile' test and confirm the pro/epilogue is added. Fix regression this revealed.	2017-12-05 11:49:12 -08:00
Tyler McMullen	ced39f5186	Fix up adjust_sp_imm instruction. * Use imm64 rather than offset32 * Add predicate to enforce signed 32-bit limit to imm * Remove AdjustSpImm format * Add encoding tests for adjust_sp_imm * Adjust use of adjust_sp_imm in Intel prologue_epilogue to match	2017-12-05 11:49:12 -08:00
Tyler McMullen	1a11c351b5	Add tests and documentation for x86_(push|pop). Fix up encoding issues revealed by tests.	2017-12-05 11:49:12 -08:00
Tyler McMullen	3b1b33e0ac	Add docs and tests for copy_special instruction. Fixes encoding issue that tests revealed.	2017-12-05 11:49:12 -08:00
Tyler McMullen	6ec4bfc4ca	Fix up the encodings for new instructions, both expected and actual. Make the test more accurate.	2017-12-05 11:49:12 -08:00
Tyler McMullen	fdfe24760a	Add missing newline to prologue epilogue test	2017-12-05 11:49:12 -08:00
Tyler McMullen	d4311d2b1d	Add prologue-epilogue test that exercises new instructions and binary emission.	2017-12-05 11:49:12 -08:00
Pat Hickey	b5601d57c8	filetests: change hex function names to user function numbers	2017-11-23 14:08:47 -08:00
Dan Gohman	e213c2654f	Fix branch_destination/analyze_branch for BranchInt/BranchFloat.	2017-11-08 10:58:03 -08:00
Dan Gohman	5d063eb8bc	Merge reloc_func and reloc_globalsym into reloc_external.	2017-10-31 12:26:33 -07:00
Dan Gohman	9c54c3fff0	Introduce globalsym_addr. This is an instruction used in legalization of GlobalVarData::Sym global variables.	2017-10-30 13:26:56 -07:00
Dan Gohman	cb805f704d	Put BaldrMonkey-specific behavior under a setting. BaldrMonkey will need to enable allones_funcaddrs.	2017-10-30 13:26:56 -07:00
Dan Gohman	6fc45b070a	Add a new kind of GlobalVar for symbolic addresses. These addresses will allow referencing C/C++/Rust-style global variables by name directly.	2017-10-30 13:26:56 -07:00
Jakob Stoklund Olesen	91b1566aca	Use "test regalloc" for the register allocator tests. These tests were only using "test compile" because it doesn't require any filecheck directives to be present, so just stop requiring filecheck directives for "test regalloc" and other filecheck-based test drivers.	2017-10-25 18:31:14 -07:00
Jakob Stoklund Olesen	d37126565e	Also consider fixed outputs for replace_global_defines. Fixes #178. When an instruction with a fixed output operand defines a globally live SSA value, we need to check if the fixed register is available in the `regs.global` set of registers that can be used across EBB boundaries. If the fixed output register is not available in regs.global, set the replace_global_defines flag so the output operands are rewritten as local values.	2017-10-25 14:28:30 -07:00
Jakob Stoklund Olesen	1b71285b34	Return bools in GPR registers. Boolean types are returned in %rax, so regclass_for_abi_type() should return GPR. Fixes #179.	2017-10-25 13:34:55 -07:00
Jakob Stoklund Olesen	e8ecf1f809	Add a FixedTied constraint kind for operand constraints. Fixes #175. The Intel division instructions have fixed input operands that are clobbered by fixed output operands, so the value passed as an input will be clobbered just like a tied operand. The FixedTied operand constraint is used to indicate a fixed input operand that has a corresponding output operand with the same fixed register. Teach the spiller to treat a FixedTied operand the same as a Tied operand constraint and make sure that the input value is killed by the instruction.	2017-10-25 11:22:20 -07:00
Jakob Stoklund Olesen	921bcc6c25	Use the term "EBB parameter" everywhere. Add EBB parameter and EBB argument to the langref glossary to clarify the distinction between formal EBB parameter values and arguments passed to branches. - Replace "ebb_arg" with "ebb_param" in function names that deal with EBB parameters. - Rename the ValueDef variants to Result and Param. - A bunch of other small langref fixes. No functional changes intended.	2017-10-19 16:17:09 -07:00
Dan Gohman	7c9b9e3d27	Mark spill and fill as can_store and can_load. This allows GVN to avoid hoisting them. These will be too coarse for things that want more precise dependence information; however, we can work that out when we build such things.	2017-10-19 13:11:33 -07:00
Dan Gohman	cc0bb70c5d	Make GVN aware of instructions that write to CPU flags.	2017-10-19 12:59:10 -07:00
Jakob Stoklund Olesen	b948de1693	Add a verifier pass for CPU flags. Only one CPU flags value can be live at a time, and some instructions clobber the flags.	2017-10-18 15:07:19 -07:00
Jakob Stoklund Olesen	5d065c4d8f	Add encodings for CPU flags instructions. Branch on flags: brif, brff. Compare integers to flags: ifcmp. Compare floats to flags: ffcmp. Convert flags to b1: trueif, trueff.	2017-10-16 13:07:23 -07:00
Jakob Stoklund Olesen	1f98fc491c	Add instructions using CPU flags. Add integer and floating comparison instructions that return CPU flags: ifcmp, ifcmp_imm, and ffcmp. Add conditional branch instructions that check CPU flags: brif, brff Add instructions that check a condition in the CPU flags and return a b1: trueif, trueff.	2017-10-12 19:12:28 -07:00
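
The variable-ordering fix described in commit 1bbc529ef9 can be illustrated with a small sketch. This is not the Cretonne solver: the `Var` struct, the `solve` function, the bitmask domains, and the greedy lowest-register choice are all hypothetical simplifications made up for this example. It only shows why sorting in+out variables ahead of single-sided ones with the same domain size lets a greedy assignment succeed on the three-variable case from the commit message.

```rust
/// Hypothetical solver variable (not Cretonne's real type).
/// `domain` is a bitmask of allowed registers: bit 0 = %r0, bit 1 = %r1.
#[derive(Clone, Copy, Debug)]
struct Var {
    id: u32,
    is_input: bool,
    is_output: bool,
    domain: u8,
}

/// Greedy assignment: sort the variables, then give each one the
/// lowest-numbered register still free on every side it occupies.
/// Returns (variable id, register number) pairs, or None on failure.
fn solve(vars: &[Var], prioritize_two_sided: bool) -> Option<Vec<(u32, u8)>> {
    let mut order = vars.to_vec();
    order.sort_by_key(|v| {
        let two_sided = v.is_input && v.is_output;
        (
            v.domain.count_ones(), // smallest domain first
            // `false` sorts before `true`, so in+out variables go first.
            if prioritize_two_sided { !two_sided } else { false },
            v.id, // assumed tie breaker for this sketch only
        )
    });

    let (mut used_in, mut used_out) = (0u8, 0u8);
    let mut assignment = Vec::new();
    for v in &order {
        let mut avail = v.domain;
        if v.is_input {
            avail &= !used_in;
        }
        if v.is_output {
            avail &= !used_out;
        }
        if avail == 0 {
            return None; // the greedy pass "ran out of registers"
        }
        let reg = avail.trailing_zeros() as u8;
        if v.is_input {
            used_in |= 1u8 << reg;
        }
        if v.is_output {
            used_out |= 1u8 << reg;
        }
        assignment.push((v.id, reg));
    }
    Some(assignment)
}

/// The three variables from the commit message. v0 is a global output,
/// so its domain is restricted to %r1 only.
fn example_vars() -> Vec<Var> {
    vec![
        Var { id: 0, is_input: false, is_output: true, domain: 0b10 },
        Var { id: 1, is_input: true, is_output: false, domain: 0b11 },
        Var { id: 2, is_input: true, is_output: true, domain: 0b11 },
    ]
}
```

Calling `solve(&example_vars(), false)` orders v1 before v2 and fails, while `solve(&example_vars(), true)` handles v2 first and yields v0 -> %r1, v2 -> %r0, v1 -> %r1, matching the valid solution given in the commit message.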


305 Commits