_mm_haddq_epi16
Visual Studio 2010 SP1 is required.
Microsoft Specific
Generates the XOP instruction vphaddwq to perform a horizontal add of the 16-bit signed integers in its source, producing two 64-bit sums.
__m128i _mm_haddq_epi16 (
   __m128i src
);
Parameters
- [in] src
A 128-bit parameter that contains eight 16-bit signed integers.
Return value
A 128-bit result r that contains two 64-bit signed integers.
r[i] := src[4*i] + src[4*i+1] + src[4*i+2] + src[4*i+3];
Requirements
Intrinsic | Architecture |
---|---|
_mm_haddq_epi16 | XOP |
Header file <intrin.h>
Remarks
Each of the two groups of four consecutive 16-bit signed integer values in src is summed to produce a 64-bit signed integer, which is stored as the corresponding value in the destination. Overflow cannot occur, because the sum of four 16-bit values always fits in 64 bits.
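The grouping described above can be modeled in portable scalar C. The following is only a sketch for checking expected results on machines without XOP support: the name haddq_epi16_ref and its array-based interface are hypothetical and are not part of <intrin.h>.

```c
#include <stdint.h>

/* Hypothetical helper (not part of <intrin.h>): scalar model of the
   horizontal add. src holds eight 16-bit signed integers; r receives
   the two 64-bit signed sums. */
static void haddq_epi16_ref(const int16_t src[8], int64_t r[2])
{
    for (int i = 0; i < 2; i++) {
        /* Widen each element to 64 bits before adding, so the sum
           cannot wrap. */
        r[i] = (int64_t)src[4*i]     + (int64_t)src[4*i + 1]
             + (int64_t)src[4*i + 2] + (int64_t)src[4*i + 3];
    }
}
```

Even with the extreme inputs, four values of -32768 or four values of 32767, the sums are -131072 and 131068, which are far inside the 64-bit range.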
The vphaddwq instruction is part of the XOP family of instructions. Before you use this intrinsic, you must ensure that the processor supports this instruction. To determine hardware support for this instruction, call the __cpuid intrinsic with InfoType = 0x80000001 and check bit 11 of CPUInfo[2] (ECX). This bit is 1 when the instruction is supported, and 0 otherwise.
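The support check described above might be written along the following lines. This is a minimal sketch, not a definitive implementation: the helper names xop_bit_is_set and cpu_supports_xop are hypothetical, and the wrapper first queries function 0x80000000 to confirm that 0x80000001 is available, a precaution this page does not mention. The part that calls __cpuid is compiled only with the Microsoft compiler.

```c
#ifdef _MSC_VER
#include <intrin.h>
#endif

/* Hypothetical helper: given the four values returned for CPUID
   function 0x80000001 (EAX, EBX, ECX, EDX, in that order), report
   whether bit 11 of ECX is set. */
static int xop_bit_is_set(const int cpuInfo[4])
{
    return (int)(((unsigned int)cpuInfo[2] >> 11) & 1u);
}

#ifdef _MSC_VER
/* Hypothetical wrapper: query the processor and apply the check above. */
static int cpu_supports_xop(void)
{
    int cpuInfo[4];

    /* Assumption beyond this page: confirm that extended function
       0x80000001 exists before querying it. */
    __cpuid(cpuInfo, 0x80000000);
    if ((unsigned int)cpuInfo[0] < 0x80000001u)
        return 0;

    __cpuid(cpuInfo, 0x80000001);
    return xop_bit_is_set(cpuInfo);
}
#endif
```

A program would typically call cpu_supports_xop() once at startup and fall back to a scalar code path when it returns 0.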
Example
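#include <stdio.h>
#include <intrin.h>

int main()
{
    __m128i a, d;
    int i;

    /* Low four words: -128 to -125; high four words: 127 to 124. */
    for (i = 0; i < 4; i++) {
        a.m128i_i16[i] = i-128;
        a.m128i_i16[i+4] = 127-i;
    }

    /* Sum each group of four 16-bit values into one 64-bit value. */
    d = _mm_haddq_epi16(a);

    for (i = 0; i < 2; i++) printf_s(" %I64d", d.m128i_i64[i]);
    printf_s("\n");
    return 0;
}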
Output
 -506 502