| SIMD.info

_mm512_sqrt_ph() on Intel 64-bit - AVX512

Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".

Intel 64-bit

_mm256_sqrt_ph() on Intel 64-bit - AVX512

Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".

Intel 64-bit

vsqrth_f16() on Arm 64-bit - NEON

Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.

Arm 64-bit

vsqrt_f16() on Arm 64-bit - NEON

Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.

Arm 64-bit

_mm_sqrt_ph() on Intel 64-bit - AVX512

Compute the square root of packed half-precision (16-bit) floating-point elements in "a", and store the results in "dst".

Intel 64-bit

vsqrtq_f16() on Arm 64-bit - NEON

Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.

Arm 64-bit

Tags / Equivalents

_mm512_sqrt_ph() on Intel 64-bit - AVX512

_mm256_sqrt_ph() on Intel 64-bit - AVX512

vsqrth_f16() on Arm 64-bit - NEON

vsqrt_f16() on Arm 64-bit - NEON

_mm_sqrt_ph() on Intel 64-bit - AVX512

vsqrtq_f16() on Arm 64-bit - NEON