Tags / Equivalents
vrsqrteq_f64() on Arm 64-bit - NEON
VRSQRTE finds an approximate reciprocal square root of each element in a vector, and places it in the return vector.
vec_rsqrt() on IBM Power 9 64-bit - VSX
Purpose: Returns a vector containing a refined approximation of the reciprocal square roots of the corresponding elements of the source vector. This function provides an implementation-dependent greater precision than vec_rsqrte.
Result value: Each element of output contains a refined approximation of the reciprocal square root of the corresponding element of a.
Endian considerations: None.
Notes:
-
The example implementations assume that a register h initially contains the floating-point value 0.5 in each element (single- or double-precision as appropriate).
-
For finite square roots, this intrinsic guarantees at least 23 bits of accuracy for single-precision floating point, and at least 52 bits of accuracy for double-precision floating point.
vrsqrte_f64() on Arm 64-bit - NEON
Floating-point Reciprocal Square Root Estimate. This instruction calculates an approximate square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.
vrsqrted_f64() on Arm 64-bit - NEON
Floating-point Reciprocal Square Root Estimate. This instruction calculates an approximate square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.
_mm512_rsqrt14_pd() on Intel 64-bit - AVX512
_mm512_rsqrt28_pd() on Intel 64-bit - AVX512