libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-18 23:09:52 +02:00

Author	SHA1	Message	Date
Pascal Massimino	1dc82a6bba	Merge "introduce a generic GetCoeffs() function pointer"	2017-01-18 07:44:36 +00:00
Pascal Massimino	8074b89eb3	introduce a generic GetCoeffs() function pointer We can switch at run-time between the standard GetCoeffs() critical function, that uses a fast variant of VP8GetBit(). However, some platforms have slow instructions that make standard VP8GetBit() slow. GetCoeffs() is the right level of branching to switch to GetCoeffsAlt() that avoids these slow instructions in some not-frequent cases. Next patch will upgrade VP8GetBit() to use clz, after this one is proved to be neutral speed-wise. Change-Id: Ia6cef5de9de6131574d2202bbc0bea8559c9b693	2017-01-17 16:24:00 +01:00
Pascal Massimino	749a45a520	Merge "NEON: implement alpha-filters (horizontal/vertical/gradient)"	2017-01-17 15:13:08 +00:00
Pascal Massimino	74c053b57d	Merge "NEON: fix overflow in SSE NxN calculation"	2017-01-17 15:10:54 +00:00
Pascal Massimino	0a3aeff75b	Merge "dsp: WebPExtractGreen function for alpha decompression"	2017-01-17 15:08:20 +00:00
Pascal Massimino	1de931c669	NEON: implement alpha-filters (horizontal/vertical/gradient) gradient-filter code is not much faster, but maybe improvable in the future. Change-Id: Ia16070e409fe8703b02276166f19526917df6b35	2017-01-17 15:44:46 +01:00
Pascal Massimino	9b3aca404d	NEON: fix overflow in SSE NxN calculation vmlal_u8() is prone to overflow during the accumulation. There was a mismatch happening at low q mostly. Because in this case the distortion is important and the accumulated sum was larger than 16bit-unsigned. Change-Id: I1a08a2f744bcdf0b26647e61b9ee92a0c2e28fe8	2017-01-17 11:47:36 +01:00
Pascal Massimino	1c07a3c639	dsp: WebPExtractGreen function for alpha decompression + NEON implementation Change-Id: I67204f99d6e4c5974718bdf21dad30381978f72c	2017-01-17 09:33:25 +00:00
Pascal Massimino	9ed5e3e5dd	use pointers for WebPRescaler's in WebPDecParams This makes the structure more generic, without the hard-coded internal structure. This is a borderline incompatible ABI change, even if WebPIDecoder structure is opaque. Change-Id: I518765c3f76fc17a136cef045a5a8aa70ed70e85	2017-01-16 22:30:29 -08:00
James Zern	db013a8d5c	Merge "ARM: don't use USE_GENERIC_TREE"	2017-01-13 22:15:04 +00:00
Pascal Massimino	fcd4784dcd	use a 8b table for C-version for clz() 30% faster on x86, 5% faster on N5. New generic function: WebPLog2FloorC() This function is called as fallback for BitsLog2Floor() when there's no clz() available. Change-Id: Ica15c6092112e514c0e200fab89c434de48d4b19	2017-01-13 15:36:26 +01:00
Pascal Massimino	fbb5c473b4	ARM: don't use USE_GENERIC_TREE It's 1-2% faster to use hard-coded tree on ARM Change-Id: I54403a70f6c692e50148c33f36833588957c20ee	2017-01-13 10:05:21 +01:00
Pascal Massimino	8fda56126e	Merge "add a kSlowSSSE3 feature for CPUInfo"	2017-01-13 07:01:48 +00:00
Pascal Massimino	86bbd24552	add a kSlowSSSE3 feature for CPUInfo This is meant to be used for run-time detection of slow platforms regarding instructions like pshufb and bsr. Adapted from libvpx patch: https://chromium-review.googlesource.com/#/c/367731 Change-Id: I2c22fbb9aae699d87a041393ba1ad5f1f21ff640	2017-01-13 06:19:27 +00:00
Vincent Rabaud	7c2779e95a	Get code to fully compile in C++. Change-Id: I6d8490c8c9b955d90dcc89ee8a9cf29ca0f93b08	2017-01-12 18:03:55 +01:00
Vincent Rabaud	250c358662	Merge "When compiling as C++, avoid narrowing warnings."	2017-01-12 13:00:56 +00:00
Vincent Rabaud	c0648ac2ae	When compiling as C++, avoid narrowing warnings. The gcc compilation warning was: narrowing conversion from ‘int’ to ‘int8_t’ Change-Id: I4803dd60ad04060cdb5d61a1aa98b25215b9d4eb	2017-01-12 13:39:22 +01:00
Pascal Massimino	0d55f60c91	40% faster ApplyAlphaMultiply_SSE2 process four pixels at a time Change-Id: I1dee7f70772be4915654fc6638ef4729a1a239d4	2017-01-12 02:33:09 -08:00
Pascal Massimino	49d0280df1	NEON: implement several alpha-processing functions - ApplyAlphaMultiply - DispatchAlpha - DispatchAlphaToGreen - ExtractAlpha Decoding to Argb / rgbA / ... is 10-15% faster (measured on N4) new file: alpha_processing_neon.c Change-Id: I40f1a809e9885d1031ff0bc886d8d001efa66bca	2017-01-11 17:39:29 +01:00
Pascal Massimino	48b1e85fbe	SSE2: 15% faster alpha-processing functions ApplyAlphaMultiply / MultARGBRow / MultRow we use now: x/255 = (x * 0x8081) >> (16 + 7) and x/255 + .5 = ((x + 128) * 0x0101) >> 16 Change-Id: I8931091316ffc8bbf65aa3402f2e7d2b800e1971	2017-01-11 15:35:16 +01:00
Pascal Massimino	e3b8abbc9b	fix warning from static analysis. "-1 cannot be represented in type 'unsigned int'" Change-Id: I05abcb44af68f702ead5a7f24dc14aab31a2e4d9	2017-01-10 22:59:47 -08:00
Pascal Massimino	28fe054e73	SSE2: 30% faster ApplyAlphaMultiply() and 15% faster MultARGBRow() by switching to formulae: X / 255 = (X + 1 + (X >> 8)) >> 8 for any 16bit value X. (X / 255 + .5) = (XX + (XX >> 8)) >> 8, with XX = X + 128 Change-Id: Ia4a7408aee74d7f61b58f5dff304d05546c04e81	2017-01-10 23:34:22 +01:00
Vincent Rabaud	f44acd253b	Merge "Properly compute the optimal color cache size."	2017-01-10 21:14:16 +00:00
Vincent Rabaud	527844fee0	Properly compute the optimal color cache size. The previous optimization was performing dichotomy on a function that is anything in practice, hence a bit of randomness. Also, two magic constants were used, one for an extra constant cost, one for an extra linear cost. Both values/models were empirical. A brute force search for the best cache size is now performed. To have less CPU impact, a speed optimization is also made by not inserting a value again and again. This makes sense but it's also the most common case of when LZ77 is useful hence an overall improvement sometimes. Change-Id: I57de5750ad2313b2feecbcd15cd6e4feeb98e5c8	2017-01-10 21:44:53 +01:00
Pascal Massimino	be0ef6395f	fix a comment typo Change-Id: I0fabd08cd8abd3cea7ddfd2e498507adb0d3c67e	2017-01-10 21:17:13 +01:00
Vincent Rabaud	8874b16275	Fix a non-deterministic color cache size computation. In case of impossible allocation, some value was returned while computation should be stopped. Change-Id: I5f85e264575be825e4261ab6fa63840c157cf5c2	2017-01-10 18:53:19 +01:00
Vincent Rabaud	d712e20de0	Do not allow a color cache size bigger than the number of colors. This is purely for speed optimization. Change-Id: Ie4b4380df8a5afa90574012bacdb1ddad03f320e	2017-01-10 09:25:02 +01:00
Vincent Rabaud	ecff04f625	re-introduce some comments in Huffman Cost. Change-Id: I2396bbc58628dd12a2d36068f7193e2a6eb4d166	2017-01-06 13:17:14 +01:00
Pascal Massimino	00b08c88c0	Merge "NEON: 5% faster conversion to RGB565 and RGBA4444"	2016-12-22 08:39:01 +00:00
Pascal Massimino	0e7f444702	Merge "NEON: faster fancy upsampling"	2016-12-21 14:53:24 +00:00
Pascal Massimino	b016cb91c5	NEON: faster fancy upsampling 2-3% faster decoding overall Change-Id: I2c53e50dc7e0ade5245cff8cc5d7b96a14062955	2016-12-21 15:23:54 +01:00
Vincent Rabaud	1cb638010c	Call the C function to finish off lossless SSE loops only when necessary. Change-Id: I4e221d80879dc9c90c24d69a40bc5811d73787ad	2016-12-21 14:25:54 +01:00
Vincent Rabaud	875fafc191	Implement BundleColorMap in SSE2. Change-Id: I44cd23647bd0a49330b6b2b3ed08050a5500e58e	2016-12-21 10:44:31 +01:00
James Zern	f04eb37603	Merge tag 'v0.5.2' libwebp-0.5.2 - 12/13/2016: version 0.5.2 This is a binary compatible release. This release covers CVE-2016-8888 and CVE-2016-9085. * further security related hardening in the tools; fixes to gif2webp/AnimEncoder (issues #310, #314, #316, #322), cwebp/libwebp (issue #312) * full libwebp (encoder & decoder) iOS framework; libwebpdecoder WebP.framework renamed to WebPDecoder.framework (issue #307) * CMake support for Android Studio (2.2) * miscellaneous build related fixes (issue #306, #313) * miscellaneous documentation improvements (issue #225) * minor lossy encoder fixes and improvements * tag 'v0.5.2': (54 commits) update ChangeLog anim_util: quiet implicit conv warnings in 32-bit jpegdec: correct ContextFill signature Remove some errors when compiling the code as C++. vwebp: clear canvas during resize w/o animation tiffdec: restore libtiff 3.9.x compatibility update NEWS AnimEncoder: avoid freeing uninitialized memory pointer. WebPAnimEncoder: If 'minimize_size' and 'allow_mixed' on, try lossy + lossless. fix a potential overflow with MALLOC_LIMIT bump version to 0.5.2 update AUTHORS & .mailmap iosbuild.sh: add WebPDecoder.framework + encoder AnimEncoder: Correctly skip a frame when sub-rectangle is empty. Fix assertions in WebPRescalerExportRow() fix a typo in WebPPictureYUVAToARGB's doc systematically call WebPDemuxReleaseIterator() on dec->prev_iter_ doc: use two's complement explicitly for uint8->int8 conversion Anim_encoder: correctly handle enc->prev_candidate_undecided_ WebPPictureDistortion(): free() -> WebPSafeFree() ... Change-Id: I16bcf54af41ce8fad98d4fbc8aa1df58f338fc23	2016-12-20 20:14:55 -08:00
Pascal Massimino	341d711c43	NEON: 5% faster conversion to RGB565 and RGBA4444 We use the magic 'shift and insert' instruction instead of the multiple shifts and or's. Change-Id: I48df0320668b502a91792defc0423a9441669d19	2016-12-20 17:01:48 +01:00
Vincent Rabaud	24eb39401b	Remove some errors when compiling the code as C++. This fixes some cases from https://bugs.chromium.org/p/webp/issues/detail?id=137 Change-Id: I58f3a617bf973dbe4c5794004a01e2aea39ba53a (cherry picked from commit `28ce304344`)	2016-12-15 11:50:44 -08:00
Pascal Massimino	a4bbe4b38b	fix indentation Change-Id: I5593fb2441f253c6b8cc43949c11909f19184b55	2016-12-13 22:50:29 -08:00
hui su	5ab6d9de1f	AnimEncoder: avoid freeing uninitialized memory pointer. In GenerateCandidates(), when candidate_ll->evaluate_ and candidate_lossy->evaluate_ are both true, if lossless encoding exits on error, candidate_ll->evaluate_ would not be correctly reset. This will cause freeing uninitialized memory pointer in SetFrame(). BUG=webp:322 Change-Id: I481b49a186e4fa3607ce71b4543a481083edf444 (cherry picked from commit `3ebe1c0003`)	2016-12-13 18:18:57 -08:00
Urvang Joshi	f29bf582df	WebPAnimEncoder: If 'minimize_size' and 'allow_mixed' on, try lossy + lossless. This improves compression by ~5% at default quality. If only 'allow_mixed' is on (but 'minimize_size' isn't), we continue to use a heuristic to try one of the two or both. Change-Id: Ia573a73ea26ad25f9debff759eed69d2b0449e82 (cherry picked from commit `3f4042b52a`)	2016-12-13 18:18:48 -08:00
hui su	3ebe1c0003	AnimEncoder: avoid freeing uninitialized memory pointer. In GenerateCandidates(), when candidate_ll->evaluate_ and candidate_lossy->evaluate_ are both true, if lossless encoding exits on error, candidate_ll->evaluate_ would not be correctly reset. This will cause freeing uninitialized memory pointer in SetFrame(). BUG=webp:322 Change-Id: I481b49a186e4fa3607ce71b4543a481083edf444	2016-12-13 17:39:16 -08:00
Pascal Massimino	df780e0eac	fix a potential overflow with MALLOC_LIMIT BUG=webp:321 Change-Id: Iab89dfe167fb394fcdffd3b2732d4ac9bef764b0 (cherry picked from commit `76bbcf2ed6`)	2016-12-13 16:15:28 -08:00
Pascal Massimino	58fc507842	Merge "PredictorSub: implement fully-SSE2 version"	2016-12-13 11:03:13 +00:00
Pascal Massimino	9cc421675b	PredictorSub: implement fully-SSE2 version and inline the C-version too. Predictor #13 is still a hard one. Change-Id: Iedecfb5cbf216da4e28ccfdd0810286133f42331	2016-12-13 02:19:35 -08:00
Pascal Massimino	827d3c5038	Merge "fix a potential overflow with MALLOC_LIMIT"	2016-12-13 06:10:56 +00:00
James Zern	218460cdd7	bump version to 0.5.2 libwebp{,decoder} - 0.5.2 libwebp libtool - 6.2.0 libwebpdecoder libtool - 2.2.0 mux - 0.3.2 libtool - 2.2.0 demux - 0.3.1 libtool - 2.1.0 Change-Id: Idf199415c325e6e9d157459a4e016ebba88c3f34	2016-12-12 17:36:12 -08:00
Pascal Massimino	76bbcf2ed6	fix a potential overflow with MALLOC_LIMIT BUG=webp:321 Change-Id: Iab89dfe167fb394fcdffd3b2732d4ac9bef764b0	2016-12-12 13:40:40 -08:00
James Zern	2423017a28	dsp/lossless.c,cosmetics: fix indent after: `fbba5bc` optimize predictor #1 in plain-C For some reason, gcc has hard time inlining this one... Change-Id: I2e2416593acd4c9d14958d8757bfd284d999100b	2016-12-12 12:53:23 -08:00
Pascal Massimino	fbba5bc2c1	optimize predictor #1 in plain-C For some reason, gcc has hard time inlining this one... Also optimize predictor #0 and #1 for encoding, so we don't have to call the generic pointers VP8LPredictors[...] Change-Id: I1ff31e3b83874b53f84fe23487f644619fd61db9	2016-12-12 17:41:36 +01:00
Pascal Massimino	9ae0b3f65a	Merge "SSE2: slightly (~2%) faster Predictor #1 "	2016-12-12 14:46:21 +00:00
Pascal Massimino	c1f97bd758	SSE2: slightly (~2%) faster Predictor #1 by removing a load from memory Change-Id: If6c4aa7fb99309d09f943393ec772891449971f0	2016-12-12 02:24:38 -08:00
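
The division-by-255 formulae quoted in commits 28fe054e73 and 48b1e85fbe can be checked in scalar C. The block below is a minimal sketch, not the SSE2 code from those commits: the function names are hypothetical, and the checked range 0..255*255 (a product of two 8-bit values, as in alpha multiplication) is an assumption made for this example.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical names: scalar versions of the formulae from the commit
 * messages. They are not taken from libwebp's sources. */

/* X / 255 = (X + 1 + (X >> 8)) >> 8 */
static uint32_t Div255FloorShiftAdd(uint32_t x) {
  return (x + 1 + (x >> 8)) >> 8;
}

/* X / 255 + .5 = (XX + (XX >> 8)) >> 8, with XX = X + 128 */
static uint32_t Div255RoundShiftAdd(uint32_t x) {
  const uint32_t xx = x + 128;
  return (xx + (xx >> 8)) >> 8;
}

/* x / 255 = (x * 0x8081) >> (16 + 7) */
static uint32_t Div255FloorMult(uint32_t x) {
  return (x * 0x8081u) >> (16 + 7);
}

/* x / 255 + .5 = ((x + 128) * 0x0101) >> 16 */
static uint32_t Div255RoundMult(uint32_t x) {
  return ((x + 128) * 0x0101u) >> 16;
}

/* Compares every formula against plain integer division over the assumed
 * input range 0..255*255. Returns 1 if all assertions hold. */
static int CheckDiv255(void) {
  uint32_t x;
  for (x = 0; x <= 255u * 255u; ++x) {
    const uint32_t floor_ref = x / 255;
    /* Rounding to nearest: floor(x / 255 + 1/2) == (2x + 255) / 510. */
    const uint32_t round_ref = (2 * x + 255) / 510;
    assert(Div255FloorShiftAdd(x) == floor_ref);
    assert(Div255FloorMult(x) == floor_ref);
    assert(Div255RoundShiftAdd(x) == round_ref);
    assert(Div255RoundMult(x) == round_ref);
  }
  return 1;
}
```

Calling CheckDiv255() from a test program runs the exhaustive comparison. The multiply-based variants map naturally to a 16-bit high multiply on SIMD hardware, which is presumably why the later commit switched to them.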

2432 Commits
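
Commit fcd4784dcd describes an 8-bit lookup table used as a fallback for computing floor(log2(v)) when no clz() instruction is available. A minimal sketch of that general technique follows. The names Log2FloorSketch, InitLogTable8 and log_table8 are hypothetical, the table is filled at run time for brevity, and the code is not claimed to match WebPLog2FloorC().

```c
#include <stdint.h>

/* log_table8[i] holds floor(log2(i)) for i in 1..255. Entry 0 is unused
 * because log2(0) is undefined. Hypothetical name, not libwebp's table. */
static uint8_t log_table8[256];

/* Fills the table using floor(log2(i)) == floor(log2(i / 2)) + 1, i >= 2. */
static void InitLogTable8(void) {
  int i;
  log_table8[0] = 0;
  log_table8[1] = 0;
  for (i = 2; i < 256; ++i) {
    log_table8[i] = (uint8_t)(log_table8[i >> 1] + 1);
  }
}

/* Returns floor(log2(v)) for v > 0. Whole bytes are skipped until the
 * remaining value fits in 8 bits, then the table resolves the rest.
 * InitLogTable8() must have been called first. */
static int Log2FloorSketch(uint32_t v) {
  int log = 0;
  while (v >= 256) {
    log += 8;
    v >>= 8;
  }
  return log + log_table8[v];
}
```

After InitLogTable8(), Log2FloorSketch(256) returns 8 and Log2FloorSketch(0xffffffffu) returns 31. The loop runs at most three times for a 32-bit input, so the cost is a few shifts and one table load.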