vsqrt_f16
ADD TO COMPARE ADDED TO COMPARE

Arm 64-bit (64 bits)/ NEON View official documentation

Location: >
Supported Architectures: A64

Purpose:

Floating-point Square Root (vector). This instruction calculates the square root for each vector element in the source SIMD&FP register, places the result in a vector, and writes the vector to the destination SIMD&FP register.

Result:

float16x4_t

Example:

#include <arm_neon.h>
#include <stdio.h>
int main() {
 float16x4_t a = {
  16.0f16, 9.0f16, 4.0f16, 1.0f16
 };
 float16x4_t result = vsqrt_f16(a);
 float16 res[4];
 vst1_f16(res, result);
 printf("%f %f %f %f\n", res[0], res[1], res[2], res[3]);

 return 0;
}

Prototypes

Assembly Instruction:

FSQRT

Usage:


									
										float16x4_t result =
									
									vsqrt_f16(
									
										float16x4_t a
									)

Performance Metrics:

📊 Unlock Performance Insights

Get access to detailed performance metrics including latency, throughput, and CPU-specific benchmarks for this intrinsic.

SIMD Intrinsics Summary

SIMD Engines:	6
C Intrinsics:	10444
NEON:	4353
AVX2:	405
AVX512:	4717
SSE4.2:	598
VSX:	192
IBM-Z:	179

Vector Square Root 16-bit half-precision floats

vsqrt_f16ADD TO COMPARE ADDED TO COMPARE

Prototypes

📊 Unlock Performance Insights

SIMD Intrinsics Summary

vsqrt_f16
ADD TO COMPARE ADDED TO COMPARE