Tags / Equivalents
_mm_sqrt_ps() on Intel 64-bit - SSE4.2
Compute the square root of packed single-precision (32-bit) floating-point elements in a, and store the results in output.
vec_sqrt() on IBM Power 9 64-bit - VSX
Purpose: Returns a vector containing the square root of each element in the source vector.
Result value: Each element of output is the square root of the corresponding element of a.
Endian considerations: None.
vsqrtq_f32() on Arm 64-bit - NEON
Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.
_mm256_sqrt_ps() on Intel 64-bit - AVX2
Compute the square root of packed single-precision (32-bit) floating-point elements in a, and store the results in output.
vec_sqrt() on IBM-Z 64-bit - IBM-Z
Returns a vector containing the square root of each element in the given vector. Minimum Arch: Z12.
_mm512_sqrt_ps() on Intel 64-bit - AVX512
Compute the square root of packed single-precision (32-bit) floating-point elements in "a", and store the results in "dst".
vsqrt_f32() on Arm 64-bit - NEON
Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.