regalloc2

Author	SHA1	Message	Date
Chris Fallin	0cb08095e6	Make multiple defs of one vreg possible on one instruction. (#38 ) Currently, if this is done, overlapping liveranges are created, and we hit an assert that ensures our non-overlapping and built-in-reverse-order invariants during liverange construction. One could argue that multiple defs of a single vreg don't make a ton of sense -- which def's value is valid after the instruction? -- but if they all get the same alloc, then the answer is "whatever was put in that alloc", and this is just a case of an instruction being a bit over-eager when listing its registers. This can arise in practice when operand lists come from combinations or concatenations: for example, in Cranelift's s390x backend, there is a "Loop" pseudo-instruction, and the operands of the Loop are the operands of all the sub-instructions. It seems more logically cohesive overall to say that one can state an operand as many times as one likes; so this PR makes it so.	2022-04-04 16:22:05 -07:00
Chris Fallin	433e8b3776	Early defs reserve a register for whole instruction. (#32 ) The `Operand` abstraction allows a def to be positioned at the "early" point of an instruction, before its effect and alongside its normal uses. This is intended to allow the embedder to express that a def may be written before all uses are read, so it should not conflict with the uses. It's also convenient to use early defs to express temporaries, which should be available throughout a regalloc-level instruction's emitted sequence. In such a case, the register should not be used again after the instruction, so it is dead following the instruction. Strictly speaking, and according to regalloc2 prior to this PR, then the temp will only conflict with the uses at the early-point, and not the defs at the late-point (after the instruction), because it's dead past its point of definition. But for a temp we really want it to register conflicts not just with the normal uses but with the normal defs as well. This PR changes the semantics so that an early def builds a liverange that spans the early- and late-point of an instruction when the vreg is dead flowing down from the instruction, giving the semantics we want for temps.	2022-03-18 10:32:49 -07:00
Chris Fallin	4f1161d9e4	Generalize debug-info support a bit. (#34 ) * Generalize debug-info support a bit. Previously, debug value-label support required each vreg to have a disjoint sequence of instruction ranges, each with one label. Unfortunately, it's entirely possible for multiple values at the program level to map to one vreg at the IR level, leading to multiple labels. This PR generalizes the debug-info generation support to allow for arbitrary (label, range, vreg) tuples, as long as they are sorted by vreg, with no other requirements. The lookup is a little more costly when we generate the debuginfo, but in practice we shouldn't have more than a few debug value labels per vreg, so in practice the constants should be small. * Typo fix from Amanieu Co-authored-by: Amanieu d'Antras <amanieu@gmail.com> Co-authored-by: Amanieu d'Antras <amanieu@gmail.com>	2022-03-18 10:32:27 -07:00
Chris Fallin	bf92e7c02f	Allow pinned-vregs to be implicit liveins. (#30 ) Previously, the regalloc required all liveins to be defined by a pseudoinstruction at the start of the function body. The regalloc.rs compatibility shim did this, but it's slightly inconvenient when using the API directly. This change allows pinned vregs to be implicit liveins to the function body instead.	2022-03-18 10:13:56 -07:00
Chris Fallin	fe021ad6d4	Simplify pinned-vreg API: don't require slice of all pinned vregs. (#28 ) Simplify pinned-vreg API: don't require slice of all pinned vregs. Previously, we kept a bool flag `is_pinned` in the `VRegData`, and we required a `&[VReg]` of all pinned vregs to be provided by `Function::pinned_vregs()`. This was (I think) done for convenience, but it turns out not to really be necessary, as we can just query `is_pinned_vreg` where needed (and in the likely implementation, e.g. in Cranelift, this will be a `< NUM_PINNED_VREGS` check that can be inlined). This adds convenience for the embedder (the main benefit), and also reduces complexity, removes some state, and avoids some work initializing the regalloc state for a run.	2022-03-04 15:12:16 -08:00
Chris Fallin	14442df3fc	Support for debug-labels. (#27 ) Support for debug-labels. If the client adds labels to vregs across ranges of instructions in the input program, the regalloc will provide metadata in the `Output` that describes the `Allocation`s in which each such vreg is stored for those ranges. This allows the client to emit debug metadata telling a debugger where to find program values at each point in the program.	2022-03-03 16:58:33 -08:00
Chris Fallin	ccd6b4fc2c	Remove DefAlloc -- no longer needed.	2022-01-19 23:57:31 -08:00
Amanieu d'Antras	6b1a5e8b1b	Address review feedback	2022-01-11 22:27:15 +00:00
Amanieu d'Antras	ee4de54240	Guard trace! behind cfg!(debug_assertions) Even if the trace log level is disabled, the presence of the trace! macro still has a significant impact on performance because it is present in the inner loops of the allocator. Removing the trace! calls at compile-time reduces instruction count by ~7%.	2022-01-11 13:30:13 +00:00
Amanieu d'Antras	693fb6a975	Only emit DefAlloc edits when the "checker" feature is enabled. This reduces instruction counts by ~2% when disabled.	2022-01-11 13:03:24 +00:00
Amanieu d'Antras	74928b83fa	Replace all assert! with debug_assert! This results in a ~6% reduction in instruction count.	2022-01-11 03:54:08 +00:00
Amanieu d'Antras	8ab44c383e	Add a helper to iterate over insts and edits of a block in order	2021-12-27 22:08:36 +01:00
Amanieu d'Antras	51493ab03a	Apply review feedback	2021-12-12 00:33:30 +00:00
Amanieu d'Antras	38ffc479c2	Simplify the internal representation of PReg	2021-12-11 22:39:19 +00:00
Amanieu d'Antras	8f435243e0	Properly handle fixed stack slots during multi-fixed-reg fixup	2021-12-11 22:39:14 +00:00
Amanieu d'Antras	77e6a9e0d7	Add support for fixed stack slots This works by allowing a PReg to be marked as being a stack location instead of a physical register.	2021-12-11 22:31:58 +00:00
Amanieu d'Antras	0cb3a8019f	Rework the API for outgoing blockparams	2021-12-01 01:43:20 +00:00
Amanieu d'Antras	a516e6d6f3	Return safepoint_slots as Allocations instead of SpillSlots This enables us to support reftype vregs in register locations in the future.	2021-11-16 00:47:43 +00:00
Amanieu d'Antras	358c831b31	Remove regs from MachineEnv It isn't exactly clear what purpose it serves.	2021-09-19 16:40:27 +01:00
Amanieu d'Antras	af527aca88	Fix PReg indexing with >32 pregs	2021-09-19 16:39:56 +01:00
Amanieu d'Antras	9e2ab3d5f7	Address review feedback	2021-09-14 13:12:52 +01:00
Amanieu d'Antras	35ed2109b1	Adjust Operand encoding The encoding for OperandConstraint is adjusted to free up 2 bits which allows for 2^21 vregs and 2^6 pregs.	2021-09-13 08:33:17 +01:00
Amanieu d'Antras	a243c4e575	Remove Function::is_call The documentation says that this is only used for heuristics, but it is never actually called. This should be removed for now and perhaps added back later if we find an actual use for it.	2021-09-09 11:16:11 +01:00
Chris Fallin	6f0893d69d	Address review comments.	2021-08-31 17:56:06 -07:00
Chris Fallin	6389071e09	Address review comments.	2021-08-31 17:42:50 -07:00
Chris Fallin	b19fa4857f	Rename operand positions to Early and Late, and make weights f16/f32 values.	2021-08-31 17:31:23 -07:00
Chris Fallin	3a18564e98	Addressed more review comments.	2021-08-30 17:51:55 -07:00
Chris Fallin	6d313f2b56	Address review comments: more doc comments and some minor refactorings.	2021-08-30 17:15:37 -07:00
Chris Fallin	ffc06b2099	Debug output for Operands: omit default/most common positions.	2021-08-12 14:49:42 -07:00
Chris Fallin	c071e44fc0	Derive PartialOrd/Ord/Hash for Operand.	2021-08-12 14:43:13 -07:00
Chris Fallin	7652b4b109	Review feedback.	2021-08-12 14:27:20 -07:00
Chris Fallin	2f856435f4	Review feedback.	2021-08-12 14:08:10 -07:00
Chris Fallin	3e1e0f39b6	Convert all log::debug to log::trace.	2021-08-12 12:05:19 -07:00
Chris Fallin	84285c26fb	Rename OperandPolicy to OperandConstraint as per feedback from @julian-seward1.	2021-08-12 11:17:52 -07:00
Chris Fallin	245c212289	Revert "Add fixed-non-allocatable operand support." This feature needs more thought; for now we will of course continue to support pinned vregs, but perhaps we can do better for "pass-through-and-forget" operands that are given non-allocatable registers. This reverts commit `736f636c36`.	2021-06-20 23:03:44 -07:00
Chris Fallin	22eed0a6ae	Make bitvec public; it is used by regalloc.rs shim too.	2021-06-19 12:47:02 -07:00
Chris Fallin	736f636c36	Add fixed-non-allocatable operand support.	2021-06-19 12:17:18 -07:00
Chris Fallin	50eb6fc42f	Keep internal modules private, but re-export under fuzzing feature flag	2021-06-19 12:08:37 -07:00
Chris Fallin	36975b8b6f	Add doc-comment note on Edit that stack-to-stack moves are never generated.	2021-06-19 11:34:05 -07:00
Chris Fallin	6ec6207717	Add design document.	2021-06-18 13:59:12 -07:00
Chris Fallin	1bd1248cb5	Avoid stack-to-stack moves by allocating an extra spillslot and re-using the scratch reg instead.	2021-06-10 22:36:02 -07:00
Chris Fallin	c6bcd3c941	WIP: redundant-move elimination.	2021-06-07 21:15:32 -07:00
Chris Fallin	2a5f571b80	WIP: Handle moves between realregs (pregs) and vregs somewhat specially, by converting into operand constraints Still has a fuzzbug in interaction between R->R and V->R moves. Will likely rework to make pinned-vreg handling more general but want to save a checkpoint here; idea for rework: - set allocs immediately if an Operand is a pinned vreg; - reserve preg ranges; - then, in rest of liveness computation / LR construction, convert pinned-vregs to operands with constraints, but otherwise do not special-case as we do in this commit.	2021-05-20 19:53:19 -07:00
Chris Fallin	f1c6dfe807	Optionally show annotations in final allocation/program dump based on RegallocOptions flag	2021-05-19 16:36:36 -07:00
Chris Fallin	e1f67e860f	Pinned VRegs for use with regalloc.rs shim to support RealRegs.	2021-05-18 22:40:43 -07:00
Chris Fallin	f0fbf3aa0c	Rework data structures: bundles have a SmallVec of ranges, and ranges a SmallVec of uses. Appears to be a small speed improvement on the highly-artificial fuzz-generator test inputs; Cranelift tests TBD.	2021-05-17 22:44:10 -07:00
Chris Fallin	5b55948feb	Check branch-args for conflicts with edge-move placement.	2021-05-13 17:25:11 -07:00
Chris Fallin	1f9258bea5	Detect undefined liveins.	2021-05-12 01:06:27 -07:00
Chris Fallin	37fa3ec763	Improve prog-move handling: no use/def records, just directly connect the LRs. Also requires some metadata in edit output to properly hook up the checker in regalloc.rs to track user-moves without seeing the original insts with operands.	2021-05-11 23:59:12 -07:00
Chris Fallin	4f26b1c78f	Properly handle prog-moves with fixed srcs or dests	2021-05-09 13:35:38 -07:00

1 2

64 Commits