Tags / Equivalents
_mm512_sqrt_ph() on Intel 64-bit - AVX512
Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".
_mm256_sqrt_ph() on Intel 64-bit - AVX512
Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".
vsqrth_f16() on Arm 64-bit - NEON
Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.
vsqrt_f16() on Arm 64-bit - NEON
Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.
_mm_sqrt_ph() on Intel 64-bit - AVX512
Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".
vsqrtq_f16() on Arm 64-bit - NEON
Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.