_mm256_sqrt_psADD TO COMPARE ADDED TO COMPARE
Intel 64-bit (64 bits)/ AVX2
View official documentation
Location:
Math Functions
>
Vector Square Root
Purpose:
Compute the square root of packed single-precision (32-bit) floating-point elements in a, and store the results in output.
Result:
__m256
Example:
#include <immintrin.h>
#include <stdio.h>
int main() {
__m256 a = _mm256_set_ps(64.0f, 49.0f, 36.0f, 25.0f, 16.0f, 9.0f, 4.0f, 1.0f);
__m256 result = _mm256_sqrt_ps(a);
float res[8];
_mm256_storeu_ps(res, result);
printf("%f %f %f %f %f %f %f %f\n", res[0], res[1], res[2], res[3], res[4], res[5], res[6], res[7]);
return 0;
}
Prototypes
Assembly Instruction:
vsqrtps
Usage:
__m256 output =
_mm256_sqrt_ps(
__m256 a
)
DB statistics
SIMD Engines: | 5 |
C Intrinsics: | 10702 |
NEON: | 4232 |
AVX2: | 462 |
AVX512: | 4955 |
SSE4.2: | 652 |
VSX: | 401 |