libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-06-30 16:14:29 +02:00

Author	SHA1	Message	Date
Pascal Massimino	ea664b8995	SSE2: 10% faster Predictor #11 Change-Id: I14ae5f6603071b86dfdbe8e6f7dfdbe5d8510185	2016-12-12 02:20:41 -08:00
Hui Su	be7dcc088c	AnimEncoder: Correctly skip a frame when sub-rectangle is empty. Change-Id: I0d288bd9561b48cf5a1eae92a1b7106ba44c664e (cherry picked from commit 1cc79e92ac74337aa4102a3128fa9451ef4b5fd0)	2016-12-09 20:22:31 -08:00
Hui Su	408858308a	Fix assertions in WebPRescalerExportRow() Change-Id: I25711dd54e71c90a25f7b18e0ef9155e8151a15e (cherry picked from commit 27b5d991e2a3d87bd45610765af6f2a9a3530d69)	2016-12-09 20:22:25 -08:00
Pascal Massimino	8f38c72e11	fix a typo in WebPPictureYUVAToARGB's doc method -> colorspace Change-Id: I5c9a2ccc909c967a936758dde2cfce92eb95462a (cherry picked from commit dc789ada44691f18d3334581d887a922ea702a41)	2016-12-09 17:27:59 -08:00
Pascal Massimino	33ca93f909	systematically call WebPDemuxReleaseIterator() on dec->prev_iter_ Change-Id: I4a767134dcc52a7ee7c3bc5deb91012eaf7b6512 (cherry picked from commit aaf2a6a69884d0f9abfa2f97d252e6d568e9c191)	2016-12-09 17:27:54 -08:00
hui su	f91ba96306	Anim_encoder: correctly handle enc->prev_candidate_undecided_ Set enc->prev_candidate_undecided_ as 0 when a frame is not chosen as a possible keyframe, so that the dispose method can be dispose-to-background. Change-Id: If2899f5dbc06fb53705fb8240072ab6440a6de12 (cherry picked from commit 29fedbf58b9c0d7641e9e42505199ed5ad295325)	2016-12-09 16:58:28 -08:00
Pascal Massimino	25d74e652e	WebPPictureDistortion(): free() -> WebPSafeFree() missed one! Change-Id: I643170451b3ac07c748b70a9abfe8af17a716b24 (cherry picked from commit 32dead4ee384afa8dc4da9a8a9d49ab944318c35)	2016-12-09 16:58:19 -08:00
James Zern	03f1c00877	mux/Makefile.am: add missing -lm + libwebpmux.pc anim_encode.c relies on functions from math.h BUG=webp:306 Change-Id: I3a8eb48febfd52bfbeb04f4dc615ccbed72926f7 (cherry picked from commit aaf2530cc38a69a46f0e612a281ddcbda566663a)	2016-12-09 15:03:08 -08:00
Pascal Massimino	58410cd6dc	fix bug in RefineUsingDistortion() When try_both_modes=0 (that is: -m 0 or -m 1), and the mode is i4, we were still sometimes falling back to (unexplored, uninitialized) i16 mode, which resulted in a enc/dec mismatch. This was mainly occurring for large images (when bit_limit is low enough) We disable the fall-back by disabling bit_limit using a large MAX_COST threshold. Change-Id: I0c60257595812bd813b239ff4c86703ddf63cbf8 (cherry picked from commit 0a3838ca77c515ace2c49738f6976dc8aa3e136c)	2016-12-08 15:48:16 -08:00
Pascal Massimino	e168af8c6c	fix filtering auto-adjustment the min-distortion was quite too low. And we were also considering the fully skipped macroblocks (nz=0) in the stats. We need to have at least some non-zero dc coeffs (nz=0x100XXXX). Fix also two typos in StoreMaxDelta: the v0/v1 comparison was wrong, and the DCs[] coeffs are actually already in ZigZag order. Change-Id: I602aaa74b36f7ce80017e506212c7d6fd9deba1f (cherry picked from commit e4cd4daf746a03aec9fd709ece756e6d39740aff)	2016-12-08 15:48:08 -08:00
Pascal Massimino	ed9dec41a5	fix doc and code snippet for WebPINewDecoder() doc Change-Id: I1a75fdf60f0b9f1816be28f22613438bfe21752b (cherry picked from commit e715285611c0975366965e9b6ddfc880d06f3bda)	2016-12-08 15:48:04 -08:00
Pascal Massimino	3c49178f7d	prevent 32b overflow for very large canvas_width / height some multiplies here and there needed some extra checks and error reporting. Even if width * height is guaranteed to be < 2**32, we were multiplying by num_channels and triggering a 32b overflow. Some multiplies were not using size_t or uint64_t, additionally. Change-Id: If2a35b94c8af204135f4b88a7fd63850aa381bbf (cherry picked from commit 1c36440094c7a34ae315035e16b8ed2275247556)	2016-12-08 15:27:51 -08:00
Pascal Massimino	b3fb8bb602	slightly faster Predictor #11 in NEON (+some slight modifications on Predictor #12) Change-Id: Ic2132dcd83d961cd069fa01ca1670e35e35274e2	2016-12-08 07:32:51 -08:00
James Zern	a0d2753fcb	lower WEBP_MAX_ALLOCABLE_MEMORY default restrict to 2^34 for 64-bit targets, < 2^32 for 32-bit Change-Id: Iff4ce40ae2c3c7fc119f018c2128dbe8f744341f (cherry picked from commit b8384b53d63fd193917076a727a262fc005263f8)	2016-12-07 18:30:44 -08:00
Pascal Massimino	31fe11a57a	fix infinite loop in case of PARTITION0 overflow max_i4_header_bits_ could drop to zero for difficult image and trigger a loop. Surprisingly, StatLoop() didn't have this bug. Change-Id: Idc0f9eadef30a2b2f02041b994f25def30901e36 (cherry picked from commit 21e7537abeb01ad8d5d05c7d27b3f3b22dc85a62)	2016-12-07 18:30:39 -08:00
hui su	532215dd29	Change the rule of picking UV mode in MBAnalyzeBestUVMode() Pick the mode with the smallest alpha. It only affects m0, in which case the mode decision is not re-examined later in VP8Decimate(). Tests on some natural content png images show PSNR increase as well as visual quality improvement. Change-Id: Iea997e718cd7477160fa05eb7cfb35f4cec2fa9a (cherry picked from commit 1377ac2ec1cb81e4a74fa6294ff30a9e4cc584aa)	2016-12-07 18:30:33 -08:00
hui su	7416280d75	Fix an unsigned integer overflow error in enc/cost.h Change-Id: I9774b59c417c185f09a61a115364b9642976a100 (cherry picked from commit 0b2c58a91cee8a8bdefa07c8b561f91ed4c96c47)	2016-12-07 18:29:51 -08:00
hui su	13cf1d2e41	Do token recording and counting in a single loop Change-Id: I8afd3c486b210bd67888de03e91dde7f78276f89 (cherry picked from commit 0c0fb83211f79df8694e6e344fdd0ad07d62be6f)	2016-12-07 18:29:44 -08:00
hui su	eb9a4b97c5	Reset segment id if we decide not to update segment map This avoids potential encoder and decoder mismatch. Change-Id: I5282d3e168afc6193033ad3fce8fbc35618ab2f5 (cherry picked from commit 386e4ba2f0c5e95bf2ad6042cae46a9ba07a5141)	2016-12-07 18:25:06 -08:00
Pascal Massimino	76ebbfff28	NEON: implement predictor #13 ~5-7% faster Change-Id: I3361b0bbc978f3721168db15778a67337309c18a	2016-12-07 14:58:49 -08:00
Vincent Rabaud	95b12a08ae	Merge "Revert Average3 and Average4"	2016-12-07 15:38:56 +00:00
Vincent Rabaud	54ab2e758f	Revert Average3 and Average4 Average3 created a slowdown of 1-2% in lossless decoding. Average4 created a slowdown of 2-3% in lossless decoding. Change-Id: Ic2e62cdd83fc897887ec2bf41ea7cadbada84fe5	2016-12-07 15:32:33 +01:00
Pascal Massimino	fe12330c81	3-5% faster Predictor #5 , #6 , #7 and #10 for NEON Change-Id: Ica48c7088d4384f0888dd171a47e68ebd25729b2	2016-12-07 15:25:33 +01:00
Pascal Massimino	fbfb3bef7b	~2% faster predictor #10 for NEON Change-Id: Icd9cff90c227d702c3ba319131996c5475094520	2016-12-06 13:47:35 +00:00
Pascal Massimino	d4b7d801db	lossless_sse2: use the local functions ...instead of the pointers stored in the array. Should be faster (inlined) and safer. Also: suffix explicitly the functions with _SSE2 Change-Id: Ie7de4b8876caea15067fdbe44abfedd72b299a90	2016-12-06 14:20:41 +01:00
Vincent Rabaud	a5e3b22574	Lossless decoder SSE2 improvements. Change-Id: Ia901014ac63156a2e278b81e035256c30bdf8706	2016-12-06 13:45:09 +01:00
Pascal Massimino	58a1f124c2	~2% faster predictor #12 in NEON. Change-Id: I6772bb865d0f72720a65561eb55028e538df236d	2016-12-06 10:24:27 +01:00
Pascal Massimino	906c3b6392	Merge "Implement lossless transforms in NEON."	2016-12-03 16:55:14 +00:00
Vincent Rabaud	d23abe4e9f	Implement lossless transforms in NEON. Change-Id: I2172b1a763eb9dfe25d2b9bf1fb6501d7e192e55	2016-12-03 11:20:22 +00:00
Vincent Rabaud	2e6cb6f34e	Give more flexibility to the predictor generating macro. Change-Id: Ia651afa8322cb5c5ae87128340d05245c0f6a900	2016-12-02 12:33:12 -08:00
Vincent Rabaud	28e0bb7088	Merge "Fix race condition in multi-threading initialization."	2016-12-02 17:45:10 +00:00
Vincent Rabaud	647045305a	Fix race condition in multi-threading initialization. Before, a first thread could enter VP8LDspInitSSE2, set VP8LPredictorsAdd to an SSE2 version BEFORE another thread would do the memcpy from VP8LPredictorsAdd to VP8LPredictorsAdd_C thus leading to a C version actually being the SSE2 one (which would then create an infinite recursion in the SSE2 predictors at execution). Change-Id: I224f4ceab31d38f77a1375a7e2636a6014080e3a	2016-12-02 18:28:57 +01:00
Hui Su	1cc79e92ac	AnimEncoder: Correctly skip a frame when sub-rectangle is empty. Change-Id: I0d288bd9561b48cf5a1eae92a1b7106ba44c664e	2016-12-02 11:50:13 +01:00
Pascal Massimino	ea72cd60cb	add missing 'extern' keyword for predictor dcl Change-Id: Ibf3db9b6dae91e53524c31cdfccf4678b3fa1135	2016-12-01 08:15:14 +01:00
Vincent Rabaud	67879e6d48	SSE implementation of decoding predictors. Change-Id: I5c9ae63afc98013cb45ce8a91f051203ac68402c	2016-11-30 12:00:07 +01:00
Vincent Rabaud	a41296aef5	Fix potentially uninitialized value. Change-Id: I721695e22474992db3094942b1ad4754ae7c0a02	2016-11-29 13:19:32 +01:00
Vincent Rabaud	4239a1489c	Make the lossless predictors work on a batch of pixels. Change-Id: Ieaee34f1f97c375b9e97ef7e9df60aed353dffa1	2016-11-28 17:12:10 +01:00
Pascal Massimino	bc18ebad2e	fix extra 'const's in signatures Change-Id: Ie433d0defbc0c6feae2eb2f11e70082f1affada8	2016-11-25 09:45:52 +01:00
Vincent Rabaud	71e2f5cadf	Remove memcpy in lossless decoding. Change-Id: Iba694b306486d67764e2fc5576c98a974c9b886c	2016-11-24 17:45:24 +01:00
Vincent Rabaud	7474d46e45	Do not use a register array in SSE. Change-Id: I79cf95bdac1164fc4de899828e9380c23df8d141	2016-11-24 13:06:44 +01:00
Owen Rodley	67748b41db	Improve latency of FTransform2. Benchmarks from vrabaud@: 8BIT/GRAY corpus speed: faster: -4.3 % , corpus size: unchanged skal/sources_png_skal corpus speed: faster: -5.2 % , corpus size: unchanged images/png_rgb corpus speed: faster: -5.1 % , corpus size: unchanged images/lpcb corpus speed: unchanged, corpus size: unchanged images/png_big corpus speed: faster: -1.7 % , corpus size: unchanged images/png_doc corpus speed: unchanged, corpus size: unchanged images/png_1bit corpus speed: faster: -1.2 % , corpus size: unchanged images/jpeg_small corpus speed: unchanged, corpus size: unchanged images/icip_core1 corpus speed: unchanged, corpus size: unchanged images/png_gray corpus speed: faster: -2.5 % , corpus size: unchanged images/jpeg_high_quality corpus speed: faster: -4.0 % , corpus size: unchanged images/jpeg corpus speed: faster: -2.3 % , corpus size: unchanged images/png_translucent corpus speed: faster: -2.8 % , corpus size: unchanged images/gif corpus speed: faster: -1.4 % , corpus size: unchanged images/png_opaque corpus speed: faster: -2.8 % , corpus size: unchanged images/png_rgb_opaque corpus speed: unchanged, corpus size: unchanged images/png_indexed corpus speed: faster: -2.0 % , corpus size: unchanged images/all corpus speed: faster: -1.5 % , corpus size: unchanged images/png_small corpus speed: unchanged, corpus size: unchanged images/png corpus speed: unchanged, corpus size: unchanged images/gif_still corpus speed: faster: -1.6 % , corpus size: unchanged Change-Id: I69fe11baa188c5d32cbc77a84b8c0deae13d792b	2016-11-24 07:09:50 +00:00
Vincent Rabaud	6540cd0eeb	Provide an SSE implementation of ConvertBGRAToRGB Change-Id: Ida11b079077a47fe3b92754f08aa30d81c301fcf	2016-11-23 16:25:51 +01:00
Pascal Massimino	3c2a61b099	remove some unneeded casts Change-Id: Ie68788c77f016ed11446a55142b1bd8d96261452	2016-11-16 22:54:40 -08:00
Pascal Massimino	9ac063c37f	add dsp functions for SmartYUV + SSE2 implementation Change-Id: I5cfdb62d68b5a95899241a097d3a2f697fbc590e	2016-11-16 14:23:06 +00:00
Pascal Massimino	22efabddb4	Merge "smart_yuv: switch to planar instead of packed r/g/b processing"	2016-11-15 14:55:17 +00:00
Pascal Massimino	1d6e7bf39f	smart_yuv: switch to planar instead of packed r/g/b processing avoiding triplets of data should make it easier to write SSE2 versions. FilterRow() can now filter all input in one single pass -> conversion is 15-20% faster (but still overall slow compared to -pre 0) Change-Id: I14c3215e672fdecde7ec80394e814bdc7445019f	2016-11-15 14:51:34 +01:00
Pascal Massimino	0a3838ca77	fix bug in RefineUsingDistortion() When try_both_modes=0 (that is: -m 0 or -m 1), and the mode is i4, we were still sometimes falling back to (unexplored, uninitialized) i16 mode, which resulted in a enc/dec mismatch. This was mainly occurring for large images (when bit_limit is low enough) We disable the fall-back by disabling bit_limit using a large MAX_COST threshold. Change-Id: I0c60257595812bd813b239ff4c86703ddf63cbf8	2016-11-12 02:15:28 -08:00
James Zern	83cbfa09a1	Import: use relative pointer offsets avoids int rollover when working with large input BUG=webp:312 Change-Id: I6ad9f93b6c4b665c559bff87716a7b847f66a20d (cherry picked from commit 342e15f0ce1336c94c84afec48d14bbc606779a0)	2016-11-09 15:50:57 -08:00
James Zern	a1ade40ed8	PreprocessARGB: use relative pointer offsets avoids int rollover when working with large input BUG=webp:312 Change-Id: I2881bec2884b550c966108beeff1bf0d8ef9f76b (cherry picked from commit 1147ab4ee7ff33c418279944aa17b5a43c6ec706)	2016-11-09 15:24:16 -08:00
James Zern	fd4d090fd1	ConvertWRGBToYUV: use relative pointer offsets avoids int rollover when working with large input BUG=webp:312 Change-Id: I693cbb295df9cf94aa89294b19c0496bdbe84d18 (cherry picked from commit de9fa5074ebc51ca59c435da3a05cd108d06a7bf)	2016-11-09 12:57:03 -08:00

... 2 3 4 5 6 ...

2432 Commits