Tags / Equivalents

_mm256_mullo_epi16() on Intel 64-bit - AVX2

Multiply the packed signed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in output.

 Intel 64-bit

vmulq_s16() on Arm 64-bit - NEON

VMUL multiplies corresponding elements in two vectors. Elements in the result vector and input vectors have the same width.

 Arm 64-bit

vmul_s16() on Arm 64-bit - NEON

Multiply (vector). This instruction multiplies corresponding elements in the vectors of the two source SIMD&FP registers, places the results in a vector, and writes the vector to the destination SIMD&FP register.

 Arm 64-bit

_mm_mullo_epi16() on Intel 64-bit - SSE4.2

Multiply the packed 16-bit integers in a and b, producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in output.

 Intel 64-bit

_m_pmullw() on Intel 64-bit - SSE4.2

Multiply the packed 16-bit integers in "a" and "b", producing intermediate 32-bit integers, and store the low 16 bits of the intermediate integers in "dst".

 Intel 64-bit

vec_mul() on IBM Power 9 64-bit - VSX

Compute the products of corresponding elements of two vectors.

 IBM Power 9 64-bit
 
Some data for your search? Something else? whatever.