When AVX512VL and AVX512F are available, use a single instruction (`VCVTUDQ2PS`) instead of a length 9-instruction sequence. This optimization is a port from the legacy x86 backend.
This crate contains the core Cranelift code generator. It translates code from an intermediate representation into executable machine code.