Tags / Equivalents
_mm_mullo_epi16() on Intel 64-bit - SSE4.2
Multiply the packed 16-bit integers in "a" and "b", producing intermediate 32-bit integers, and store the low 16 bits of each intermediate integer in "dst".
_m_pmullw() on Intel 64-bit - SSE4.2
Multiply the packed 16-bit integers in "a" and "b", producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in "dst".