Tags / Equivalents
_mm_mul_ps() on Intel 64-bit - SSE4.2
Multiply packed single-precision (32-bit) floating-point elements in a and b, and store the results in output.
_mm_mullo_epi16() on Intel 64-bit - SSE4.2
Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in output.
_m_pmullw() on Intel 64-bit - SSE4.2
Multiply the packed 16-bit integers in "a" and "b", producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in "dst".
_mm_mullo_epi32() on Intel 64-bit - SSE4.2
Multiply the packed 32-bit integers in a and b, producing intermediate 64-bit integers, and store the low 32 bits of the intermediate integers in output.
_mm_mul_pd() on Intel 64-bit - SSE4.2
Multiply packed double-precision (64-bit) floating-point elements in a and b, and store the results in output.