vaddq_p8

Arm 64-bit / NEON

Supported Architectures: v7, A32, A64

Purpose:

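This intrinsic adds corresponding 8-bit polynomial elements of two vectors. Polynomial addition over GF(2) is carry-less, so each result lane is the bitwise XOR of the two input lanes, not their integer sum.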
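To make the arithmetic concrete, here is a minimal scalar sketch contrasting polynomial addition with ordinary unsigned addition on a single 8-bit lane. The function names poly8_add and uint8_add are hypothetical helpers for illustration and are not part of arm_neon.h; the sketch assumes each lane is treated as a polynomial with coefficients in GF(2).

```c
#include <stdint.h>

/* Hypothetical helper: adds two 8-bit polynomials over GF(2).
   Coefficients are added modulo 2, which is a bitwise XOR, so no
   carry ever moves from one bit position to the next. */
static uint8_t poly8_add(uint8_t a, uint8_t b)
{
    return (uint8_t)(a ^ b);
}

/* Hypothetical helper: ordinary unsigned 8-bit addition, shown only
   for contrast. Carries propagate and the result wraps modulo 256. */
static uint8_t uint8_add(uint8_t a, uint8_t b)
{
    return (uint8_t)(a + b);
}
```

For example, poly8_add(3, 1) yields 2 while uint8_add(3, 1) yields 4; the two only agree when the operands share no set bits, as with 1 and 16.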

Result:

poly8x16_t

Example:

#include <arm_neon.h>
#include <stdio.h>

int main(void) {
    /* Each lane holds the coefficients of an 8-bit polynomial over GF(2). */
    const poly8_t a_in[16] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16};
    const poly8_t b_in[16] = {16, 15, 14, 13, 12, 11, 10, 9, 8, 7, 6, 5, 4, 3, 2, 1};
    poly8x16_t a = vld1q_p8(a_in);
    poly8x16_t b = vld1q_p8(b_in);

    /* Polynomial addition is a lane-wise XOR, not an integer add. */
    poly8x16_t result = vaddq_p8(a, b);

    /* Store the vector to memory instead of subscripting it directly. */
    poly8_t res[16];
    vst1q_p8(res, result);
    for (int i = 0; i < 16; i++) {
        printf("%u%s", (unsigned)res[i], i < 15 ? " " : "\n");
    }
    /* Prints: 17 13 13 9 9 13 13 1 1 13 13 9 9 13 13 17 */
    return 0;
}

Prototypes

Assembly Instruction:

EOR
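EOR is the bitwise exclusive-OR instruction, which is what carry-less polynomial addition reduces to. As a sketch of that equivalence, the hypothetical helper xor16 below reaches the same operation through veorq_u8 on plain byte data, which may be useful if your toolchain's arm_neon.h does not provide vaddq_p8. The scalar fallback is an assumption added so the snippet also builds on targets without NEON.

```c
#include <stddef.h>
#include <stdint.h>
#if defined(__ARM_NEON)
#include <arm_neon.h>
#endif

/* Hypothetical helper: XORs 16 bytes of a with 16 bytes of b into out.
   On NEON targets this uses veorq_u8, which computes the same lane
   values that vaddq_p8 would for the same bit patterns. */
static void xor16(const uint8_t *a, const uint8_t *b, uint8_t *out)
{
#if defined(__ARM_NEON)
    uint8x16_t va = vld1q_u8(a);
    uint8x16_t vb = vld1q_u8(b);
    vst1q_u8(out, veorq_u8(va, vb));
#else
    /* Portable fallback with the same result, one byte at a time. */
    for (size_t i = 0; i < 16; i++) {
        out[i] = (uint8_t)(a[i] ^ b[i]);
    }
#endif
}
```

Called with the byte sequences 1..16 and 16..1, xor16 fills out with 17 13 13 9 9 13 13 1 1 13 13 9 9 13 13 17.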

Usage:

poly8x16_t result = vaddq_p8(poly8x16_t a, poly8x16_t b)



Vector Add 8-bit polynomial elements
