SSE4.1 implementation of some alpha-processing functions

DispatchAlpha* functions are hard to speed up, compared to SSE2.
ExtractAlpha sees a ~15% speed-up though.

Change-Id: I8715c2defecbc832f469eed7e6ffd012146b52de
This commit is contained in:
Pascal Massimino
2015-06-19 14:23:38 +02:00
committed by James Zern
parent 7f9c98f21d
commit bfc300c7ff
6 changed files with 102 additions and 0 deletions

View File

@ -129,6 +129,7 @@ DSP_DEC_OBJS = \
src/dsp/alpha_processing.o \
src/dsp/alpha_processing_mips_dsp_r2.o \
src/dsp/alpha_processing_sse2.o \
src/dsp/alpha_processing_sse41.o \
src/dsp/cpu.o \
src/dsp/dec.o \
src/dsp/dec_clip_tables.o \