Tags / Equivalents
vmulq_f32() on Arm 64-bit - NEON
VMUL multiplies corresponding elements in two vectors. Elements in the result vector and input vectors have the same width.
vmul_f32() on Arm 64-bit - NEON
Multiply (vector). This instruction multiplies corresponding elements in the vectors of the two source SIMD&FP registers, places the results in a vector, and writes the vector to the destination SIMD&FP register.
_mm256_mul_ps() on Intel 64-bit - AVX2
Multiply packed single-precision (32-bit) floating-point elements in a and b, and store the results in output.
_mm_mul_ps() on Intel 64-bit - SSE4.2
Multiply packed single-precision (32-bit) floating-point elements in a and b, and store the results in output.
_mm512_mul_ps() on Intel 64-bit - AVX512
Multiply packed single-precision (32-bit) floating-point elements in "a" and "b", and store the results in "dst".
vec_mul() on IBM Power 9 64-bit - VSX
Compute the products of corresponding elements of two vectors.