vadd_p16
ADD TO COMPARE ADDED TO COMPARE

Arm 64-bit (64 bits)/ NEON View official documentation

Location: >
Supported Architectures: v7, A32, A64

Purpose:

Floating-point Add (vector). This instruction adds corresponding vector elements in the two source SIMD&FP registers, writes the result into a vector, and writes the vector to the destination SIMD&FP register. All the values in this instruction are floating-point values.

Result:

poly16x4_t

Example:

#include <arm_neon.h>
#include <stdio.h>
int main() {
 poly16x4_t a = {
  1, 2, 3, 4
 };
 poly16x4_t b = {
  5, 6, 7, 8
 };
 poly16x4_t result = vadd_p16(a, b);
 printf("%d %d %d %d\n", vget_lane_s16(result, 0), vget_lane_s16(result, 1), vget_lane_s16(result, 2), vget_lane_s16(result, 3));

 return 0;
}

Prototypes

Assembly Instruction:

EOR

Usage:


									
										poly16x4_t result =
									
									vadd_p16(
									
										poly16x4_t a, poly16x4_t b
									)

Performance Metrics:

📊 Unlock Performance Insights

Get access to detailed performance metrics including latency, throughput, and CPU-specific benchmarks for this intrinsic.

SIMD Intrinsics Summary

SIMD Engines:	6
C Intrinsics:	10444
NEON:	4353
AVX2:	405
AVX512:	4717
SSE4.2:	598
VSX:	192
IBM-Z:	179

Vector Add 16-bit polynomial elements

vadd_p16ADD TO COMPARE ADDED TO COMPARE

Prototypes

📊 Unlock Performance Insights

SIMD Intrinsics Summary

vadd_p16
ADD TO COMPARE ADDED TO COMPARE