_mm_maccd_epi16
Visual Studio 2010 SP1 is required.
Microsoft Specific
Generates the XOP instruction vpmacswd to perform an integer multiply-add of its sources.
__m128i _mm_maccd_epi16 (
__m128i src1,
__m128i src2,
__m128i src3
);
Parameters
[in] src1
A 128-bit parameter that contains four 16-bit signed integers in its even-indexed words.[in] src2
A 128-bit parameter that contains four 16-bit signed integers in its even-indexed words.[in] src3
A 128-bit parameter that contains four 32-bit signed integers.
Return value
A 128-bit result r that contains four 32-bit signed integers.
r[i] := src1[2*i] * src2[2*i] + src3[i];
Requirements
Intrinsic |
Architecture |
---|---|
_mm_maccd_epi16 |
XOP |
Header file <intrin.h>
Remarks
Each even-indexed 16-bit signed integer value in src1 is multiplied by the corresponding 16-bit signed integer value in src2, the 32-bit signed integer product is added to the corresponding 32-bit signed integer value in src3, and the signed 32-bit integer result is stored as the corresponding value in the destination. The odd-indexed values in src1 and src2 are ignored.
Overflow is ignored.
The vpmacswd instruction is part of the XOP family of instructions. Before you use this intrinsic, you must ensure that the processor supports this instruction. To determine hardware support for this instruction, call the __cpuid intrinsic with InfoType = 0x80000001 and check bit 11 of CPUInfo[2] (ECX). This bit is 1 when the instruction is supported, and 0 otherwise.
Example
#include <stdio.h>
#include <intrin.h>
int main()
{
__m128i a, b, c, d;
int i;
for (i = 0; i < 8; i++) {
a.m128i_i16[i] = 30000;
b.m128i_i16[i] = 8000*(i-4);
}
for (i = 0; i < 4; i++) {
c.m128i_i32[i] = 1300000000*i - 2000000000;
}
d = _mm_maccd_epi16(a, b, c);
for (i = 0; i < 4; i++) printf_s(" %d", d.m128i_i32[i]);
printf_s("\n");
}
1334967296 -1180000000 600000000 -1914967296