libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-14 21:09:55 +02:00

Author	SHA1	Message	Date
Pascal Massimino	c7129da5b6	Merge "4-5% faster encoding using SSE2 for GetResidualCost"	2015-02-18 04:46:53 -08:00
Djordje Pesut	94380d00d9	MIPS: dspr2: added optimizaton for functions VE4 and DC4 Change-Id: I118adc6d3872742d8b1f9dbac438cba6fc90b7a9	2015-02-18 11:25:08 +01:00
Pascal Massimino	2a407092ab	4-5% faster encoding using SSE2 for GetResidualCost new file: cost_sse2.c Change-Id: I4896c07f5ff2443ef743f4435fe2758d95a672ed	2015-02-18 09:41:02 +01:00
James Zern	17e1986214	Merge "MIPS: dspr2: added optimization for simple filtering functions"	2015-02-17 14:57:05 -08:00
pascal massimino	3ec404c47a	Merge "dsp: normalize WEBP_TSAN_IGNORE_FUNCTION usage"	2015-02-14 01:57:08 -08:00
James Zern	b969f5dfac	dsp: normalize WEBP_TSAN_IGNORE_FUNCTION usage the attribute is only necessary in one location; remove it from the prototypes. Change-Id: I3820a3c34fbb029fd7ac69a1b0a9b76091bdbde2	2015-02-13 15:23:40 -08:00
Djordje Pesut	d7b8e71126	MIPS: dspr2: added optimization for simple filtering functions affected functions: SimpleVFilter16, SimpleHFilter16, SimpleVFilter16i and SimpleHFilter16i noticed bug in FilterLoop26 (fix included in this patch) Change-Id: I72d9c1e45cbac6393eba52bb549b04924d463e30	2015-02-13 11:18:43 +01:00
Djordje Pesut	42a8a6280c	MIPS: dspr2: Added optimization for function VP8LTransformColorInverse_C Change-Id: I8b60e22c9f6c0badab6267a33751dfc28750f457	2015-02-13 08:57:20 +01:00
James Zern	3030f11525	Merge "dsp/mips: add some missing TSan annotations"	2015-02-12 14:55:32 -08:00
pascal massimino	dfcf4593fe	Merge "MIPS: dspr2: Added optimization for function VP8LAddGreenToBlueAndRed_C"	2015-02-12 14:48:17 -08:00
James Zern	55c75a25f0	dsp/mips: add some missing TSan annotations Change-Id: I3c832aefdeac26c6c75c35b19b45c1a2f67493c5	2015-02-12 14:36:33 -08:00
Djordje Pesut	2cb879f0c6	MIPS: dspr2: Added optimization for function VP8LAddGreenToBlueAndRed_C Change-Id: If897c6c2f1c4b8405789298e135d6a1e4bf13012	2015-02-12 09:06:49 +01:00
James Zern	e15560107c	move some cost tables from enc/ to dsp/ removes circular dependency between dsp and enc. since: `a987fae` MIPS: dspr2: added optimization for function GetResidualCost Change-Id: Ifeb8fc02de89e2ba982ed7ffacd925d649bfec3c	2015-02-11 16:10:06 -08:00
pascal massimino	39537d7cfe	Merge "VP8LDspInitMIPSdspR2: add missing TSan annotation"	2015-02-10 00:02:41 -08:00
James Zern	43fd3543df	VP8LDspInitMIPSdspR2: add missing TSan annotation Change-Id: Ic0d84e95daf063976b40fb5ba1e94d3547e2afba	2015-02-09 23:55:30 -08:00
pascal massimino	c7233dfcdc	Merge "VP8LDspInit: remove memcpy"	2015-02-09 23:48:44 -08:00
James Zern	35579a4902	VP8LDspInit: remove memcpy without this change the TSan annotation is useless Change-Id: Ief511379f3aad75889815d4fe8362aed5c1abac7	2015-02-09 23:41:24 -08:00
James Zern	97f6aff874	VP8YUVInit: add missing TSan annotation Change-Id: I7f8868de425e1aac3721b3e328844725104d14db	2015-02-09 22:50:31 -08:00
James Zern	f9016d6662	dsp/enc::InitTables: add missing TSan annotation Change-Id: I262b9071417a0ec502c7c0380f27da6413cc74e4	2015-02-09 22:40:45 -08:00
James Zern	e3d9771aa1	VP8EncDspCostInit*: add missing TSan annotations Change-Id: I4cdb84bc8c9a8c6aa34b5773c8fb69e5810a9809	2015-02-09 22:39:14 -08:00
Djordje Pesut	309b790867	MIPS: mips32: Added optimization for function SetResidualCoeffs Change-Id: If67c10285df71ba7dd1aff6c24c2145c280dd2bf	2015-02-09 13:17:49 +01:00
Pascal Massimino	a987faedfa	MIPS: dspr2: added optimization for function GetResidualCost set/get residual C functions moved to new file in src/dsp mips32 version of GetResidualCost moved to new file Change-Id: I7cebb7933a89820ff28c187249a9181f281081d2	2015-02-07 02:13:26 -08:00
James Zern	c24d8f144f	cosmetics: upsampling_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: Ia6df6690c85f580b20f19ce85cc6ec7b52620aee	2015-02-05 23:51:57 -08:00
James Zern	1829c42c58	cosmetics: lossless_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: I2405b18c6bb829b76c3a9814057ccbe6e14220d9	2015-02-05 23:51:44 -08:00
James Zern	183168f332	cosmetics: enc_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: Ib85d63abbb9fc33096f893c2524d3ce8ae3ebd03	2015-02-05 23:51:29 -08:00
James Zern	860badcacc	cosmetics: dec_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: I77decb55f1382bea4b646a11b77dfa40bf1ef94d	2015-02-05 23:51:16 -08:00
James Zern	0254db9793	cosmetics: argb_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: I87fa2de11dafcc77767aab64e13b8c5585ebf5cd	2015-02-05 23:51:07 -08:00
James Zern	1aadf856c9	cosmetics: alpha_processing_sse2: add const to some casts source pointers are often cast to __m128*, retain the const in those cases Change-Id: I00ba15af2f43d125ceb2620e82fd43d420fbb9d3	2015-02-05 23:50:39 -08:00
Pascal Massimino	19f0ba0eb9	Implement true-motion prediction in SSE2 (along with DC/HE/VE for chroma/luma16) Overall effect is ~1% faster decoding. Change-Id: I90917e050d61874cbc8da0e88f26b5dd6131c265	2015-02-04 17:02:22 +01:00
Pascal Massimino	774d4cb758	make VP8PredLuma16[] array non-const Change-Id: I0ce7e4e847f9fffefb6544db9636068442a2d264	2015-02-04 17:00:22 +01:00
Djordje Pesut	6ce296da12	MIPS: dspr2: Added optimization for function CollectHistogram Change-Id: Id6b87ea1c9d21fee9494ad6c53ffc84ef60d5974	2015-02-03 14:11:20 +01:00
James Zern	b7de794622	Merge "lossless_neon: enable subtract green for aarch64"	2015-02-02 16:01:40 -08:00
Pascal Massimino	77724f70e9	SSE2 version of GradientUnfilter somewhat 1-2% faster decoder for lossy+alpha Change-Id: Ib317e26e9fcb8d37af02668ffbfccc4664e659fe	2015-01-31 23:18:00 +01:00
James Zern	416e1cea9b	lossless_neon: enable subtract green for aarch64 similar to: `1ba61b0` enable NEON intrinsics in aarch64 builds vtbl1_u8 is available everywhere but Xcode-based iOS arm64 builds, use vtbl1q_u8 there. performance varies based on the input, 1-3% on encode was observed Change-Id: Ifec35b37eb856acfcf69ed7f16fa078cd40b7034	2015-01-31 11:32:05 -08:00
Pascal Massimino	022d2f886c	add SSE2 variants for alpha filtering functions The 'inverse' variants are harder to parallelize, since the result of filtering is used for prediction. The 'direct' way is relatively easier. The heavy bottleneck left for optimization is still GradientUnfilter() Change-Id: I358008f492a887e8fff6600cb27857b18dee86e9	2015-01-29 08:46:22 +01:00
Pascal Massimino	7afdaf8496	Alpha coding: reorganize the filter/unfiltering code Move the filtering code to their own dsp/ spot New function: VP8FiltersInit() Change-Id: I0b2041eab42346c59b972f2575b05509e6a8f7b1	2015-01-28 08:02:41 +01:00
Djordje Pesut	da0912126b	MIPS: dspr2: Added optimization for function FTransformWHT Change-Id: I918366cd1908304068c66da9965efb0aa63320cd	2015-01-19 10:15:13 +01:00
pascal massimino	daeb276a2b	Merge "MIPS: dspr2: Added optimization for MultARGBRow function"	2015-01-17 08:56:01 -08:00
James Zern	0de5f33e31	dsp/cpu: (msvs) add include for __cpuidex and only use it on x86 / x64 where it's available. has the side-effect of quieting a msvs /analyze warning: C6001: Using uninitialized memory 'cpu_info'. Change-Id: Iae51be3b22b2ee949cfc473eeea9fd9fb6b3c2cb	2015-01-16 18:16:10 -08:00
Djordje Pesut	7d850f7b9a	MIPS: dspr2: Added optimization for MultARGBRow function Change-Id: Ide549ae0d80413bea8c19fe091d97bffe8b17985	2015-01-16 15:56:34 +01:00
Djordje Pesut	5487529368	MIPS: dspr2: added optimization for function QuantizeBlock Change-Id: Id217116890b7408d23464216608ce67ae545688a	2015-01-16 12:51:13 +01:00
James Zern	4fbe9cf202	dsp/cpu: (msvs) avoid immintrin.h on _M_ARM _xgetgv() isn't relevant there anyway broken since: `279e661` Merge "dsp/cpu: add include for _xgetbv() w/MSVS" Change-Id: Iaa7bc0c5be9c06bfffab39e194c64c09bf5b5a27	2015-01-15 23:04:08 -08:00
Pascal Massimino	3fd59039bd	simplify/reorganize arguments for CollectColorBlueTransforms and other various call sites too. Change-Id: Icb8f828dfe25672662de18d0e48e7d3144b1f38d	2015-01-15 18:12:12 -08:00
Djordje Pesut	a7e7caa486	MIPS: dspr2: added optimization for function TransformColorRed added new function CollectColorRedTransforms to C, which calls TransformColorRed and it is realized via pointer to function Change-Id: Ia68d73bfcf1ca2cb443dc2825910946221f87835	2015-01-15 09:32:09 +01:00
pascal massimino	2cb39180cc	Merge "MIPS: dspr2: added optimization for function TransformColorBlue"	2015-01-15 00:06:01 -08:00
pascal massimino	279e66138d	Merge "dsp/cpu: add include for _xgetbv() w/MSVS"	2015-01-15 00:05:35 -08:00
James Zern	b6c0428e8c	dsp/cpu: add include for _xgetbv() w/MSVS explicitly add immintrin.h instead of transitively picking it up via windows.h presumably. makes the code easier to move around. Change-Id: If70d5143ac94fc331da763ce034358858e460e06	2015-01-14 23:31:35 -08:00
Djordje Pesut	7b16197361	MIPS: dspr2: added optimization for function TransformColorBlue added new function CollectColorBlueTransforms to C, which calls TransformColorBlue and it is realized via pointer to function Change-Id: Ia488b7a7a689223b5d33aae9724afab89b97fced	2015-01-13 10:39:38 +01:00
James Zern	d7c4b02a57	cpu: fix AVX2 detection for gcc/clang targets ecx needs to be set to 0; the visual studio builds were already doing this. https://software.intel.com/en-us/articles/how-to-detect-new-instruction-support-in-the-4th-generation-intel-core-processor-family Change-Id: I95efb115b4d50bbdb6b14fca2aa63d0a24974e55	2015-01-12 17:58:57 -08:00
Pascal Massimino	d581ba40ba	follow-up: clean up WebPRescalerXXX dsp function by removing redundant RFIX macros and using a plain-C fallback. Change-Id: I52436c672bf20780b6fe3bcf43fe73e1abac10ff	2015-01-12 15:26:55 -08:00

... 2 3 4 5 6 ...

579 Commits