SSE4.1 implementation of some alpha-processing functions

The DispatchAlpha* functions are hard to speed up beyond their SSE2 versions.
ExtractAlpha, however, sees a ~15% speed-up.

Change-Id: I8715c2defecbc832f469eed7e6ffd012146b52de
Pascal Massimino
2015-06-19 14:23:38 +02:00
committed by James Zern
parent 7f9c98f21d
commit bfc300c7ff
6 changed files with 102 additions and 0 deletions

@@ -189,6 +189,7 @@ DSP_DEC_OBJS = \
 $(DIROBJ)\dsp\alpha_processing.obj \
 $(DIROBJ)\dsp\alpha_processing_mips_dsp_r2.obj \
 $(DIROBJ)\dsp\alpha_processing_sse2.obj \
+$(DIROBJ)\dsp\alpha_processing_sse41.obj \
 $(DIROBJ)\dsp\cpu.obj \
 $(DIROBJ)\dsp\dec.obj \
 $(DIROBJ)\dsp\dec_clip_tables.obj \