Adds an SSE2 lowering for i32x4 multiplication; the native instruction for this operation only became available in SSE4.1.
use_egraphs
iadd_cout
isub_bout
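
For readers unfamiliar with why i32x4 multiplication needs a special lowering on SSE2, here is a rough scalar model of one well-known way to build it from SSE2-era operations. It is only a sketch: the source does not show the actual lowering, so the sequence and all function names below (`pmuludq_model`, `shift_lanes_down`, `imul_i32x4_sse2_model`) are assumptions for illustration, not the code this change adds. The idea is that SSE2 can multiply the even 32-bit lanes into 64-bit products, so the odd lanes are shifted into even positions, multiplied separately, and the low halves of all four products are interleaved.

```rust
/// Scalar model of an SSE2-style widening multiply (in the spirit of
/// `pmuludq`): multiplies lanes 0 and 2 as unsigned 32-bit values and
/// returns the two full 64-bit products.
fn pmuludq_model(a: [u32; 4], b: [u32; 4]) -> [u64; 2] {
    [a[0] as u64 * b[0] as u64, a[2] as u64 * b[2] as u64]
}

/// Scalar model of shifting the vector down by one 32-bit lane, so that the
/// odd lanes (1 and 3) land in the even positions (0 and 2).
fn shift_lanes_down(v: [u32; 4]) -> [u32; 4] {
    [v[1], v[2], v[3], 0]
}

/// Hypothetical lane-wise wrapping i32x4 multiply built only from the two
/// helpers above. The low 32 bits of a product are the same for signed and
/// unsigned inputs, so the unsigned widening multiply is sufficient.
fn imul_i32x4_sse2_model(a: [i32; 4], b: [i32; 4]) -> [i32; 4] {
    let au = a.map(|x| x as u32);
    let bu = b.map(|x| x as u32);

    // Products of lanes 0 and 2.
    let even = pmuludq_model(au, bu);
    // Products of lanes 1 and 3, after moving them into even positions.
    let odd = pmuludq_model(shift_lanes_down(au), shift_lanes_down(bu));

    // Keep the low 32 bits of each product and interleave them back into
    // lane order 0, 1, 2, 3.
    [
        even[0] as u32 as i32,
        odd[0] as u32 as i32,
        even[1] as u32 as i32,
        odd[1] as u32 as i32,
    ]
}
```

Calling `imul_i32x4_sse2_model(a, b)` should agree with applying `i32::wrapping_mul` to each pair of lanes, which is the behavior a single SSE4.1 instruction provides directly.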