vaddq_p8ADD TO COMPARE ADDED TO COMPARE
Arm 64-bit (64 bits)/ NEON
View official documentation
Purpose:
This instruction adds corresponding unsigned 8-bit integer elements of two vectors.
Result:
poly8x16_t
Example:
#include <arm_neon.h>
#include <stdio.h>
int main() {
poly8x16_t a = {
1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16
};
poly8x16_t b = {
16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1
};
poly8x16_t result = vaddq_p8(a, b);
uint8_t res[16];
for (int i = 0; i < 16; i++) {
res[i] = result[i];
}
printf("%u %u %u %u %u %u %u %u %u %u %u %u %u %u %u %u\n", res[0], res[1], res[2], res[3], res[4], res[5], res[6], res[7], res[8], res[9], res[10], res[11], res[12], res[13], res[14], res[15]);
return 0;
}
Prototypes
Assembly Instruction:
EOR
Usage:
poly8x16_t result =
vaddq_p8(
poly8x16_t a, poly8x16_t b
)
Performance Metrics:
📊 Unlock Performance Insights
Get access to detailed performance metrics including latency, throughput, and CPU-specific benchmarks for this intrinsic.
SIMD Intrinsics Summary
| SIMD Engines: | 6 |
| C Intrinsics: | 10444 |
| NEON: | 4353 |
| AVX2: | 405 |
| AVX512: | 4717 |
| SSE4.2: | 598 |
| VSX: | 192 |
| IBM-Z: | 179 |