libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-06-28 07:04:30 +02:00

Author	SHA1	Message	Date
Jyrki Alakuijala	90fcfcd905	Insert less hash chain entries from the beginnings of long copies. This makes the chains more efficient and a larger variety of data is tested. 0.02 % compression gain at q 100, 0.05 % at default quality. 0.8 % speedup by callgrind. 0.16 % compression gain for lossy alpha ?! Change-Id: I888120133352799eb14f5f602c7f40ab404bd665	2015-08-18 18:44:03 -07:00
skal	c5f00621c7	incorporate bzero() into WebPRescalerInit() instead of call site Change-Id: I9ebb83e643e24bc685a1a1cb6836cb54e34a0ec8	2015-08-14 19:37:22 -07:00
James Zern	b918724280	utils/rescaler: add WebPRescalerGetScaledDimensions + use it in WebPPictureRescale() Change-Id: I491bea8cd56f0eb1ac8bf0829b9f36c77804219a	2015-08-13 20:50:38 -07:00
skal	56a2e9f5e7	WebPPictureDistortion: support ARGB format for 'pic' when computing distortion. using a *tmp_plane buffer to split a/r/g/b planes up appeared to be the easiest route, compared to copy-pasting the whole code and making it x_stride aware... Change-Id: I0898ef1df62bd3e1713b77187b31b5eeef3832fe	2015-08-11 17:28:29 -07:00
Jyrki Alakuijala	b969f888ab	Reduce magic in palette reordering Slightly faster on -m 0 -q 0, particularly for small images (50 x 75 image was 0.1 % faster on callgrind measurement). Increases compression density by 0.005 % for the 1000 images, but small images can improve even 0.5 % (about 4 bytes, depending on the characteristics of the palette). Change-Id: I94f568d396ac62a054a829abeeef3eb0af6b3f94	2015-08-10 19:06:07 -07:00
Pascal Massimino	7df93893dc	fix rescaling bug (uninitialized read, see bug #254 ). the x_add/x_sub increments were wrong for u/v in the upscaling case. They shouldn't be left to the caller's discretion, but set up by WebPRescalerInit to their exact necessary values. -> Cleaned-up WebPRescalerInit() param list. -> added safety asserts -> removed the mips32/mips_r2 variant of "ImportRow" which were buggy prior Change-Id: I347c75804d835811e7025de92a0758d7929dfc09	2015-08-05 23:00:00 -07:00
Jyrki Alakuijala	01d61fd9c6	lossless: ~20 % speedup 0.28 % byte size increase on lossless, 0.18 % increase on lossy alpha Change-Id: I1e001a56831a8f996ac522aa646f9ae587c80d12	2015-07-20 17:13:44 -07:00
Jyrki Alakuijala	f722c8f0bd	lossless: Speed up ComputeCacheEntropy by 40 % a total impact of 1 % on encoding speed This allows for performance neutral removal of the binary search in cache bits selection. This will give a small improvement in compression density. Change-Id: If5d4d59460fa1924ce71af977320834a47c2054a	2015-07-20 17:13:44 -07:00
Jyrki Alakuijala	17eb609916	lossless: Allow copying from prev row in rle-mode. 0.21 % compression density improvement for 1000 png corpus in lossless mode 0.50 % compression density improvement for 1000 png corpus in lossy mode Change-Id: I14ee8c427ae5d3e116b0ee6695fcdea3321a319d	2015-07-20 17:13:43 -07:00
Jyrki Alakuijala	52931fd548	lossless: combine the Huffman code with extra bits gives 2 % speedup 24.9 -> 25.5 MP/s for a photo with -q 0 -m 0 Change-Id: If9ae04683a86dd7b1fced2183cf79b9349a24a9e	2015-07-07 20:24:28 -07:00
Jyrki Alakuijala	c4855ca249	lossless: Inlining add literal this is a simple speedup of about 1-2 % Change-Id: I0c7b01c0a69f4aeaf363ffda05a28871f1def696	2015-07-07 20:24:28 -07:00
Jyrki Alakuijala	8e9c94dedb	lossless: simplify HashChainFindCopy heuristics for small speedup 0.0003 % worse compression Change-Id: Ic4b6b21e5279231c6321f2cec1c79f7e17e56afa	2015-07-07 20:24:27 -07:00
Jyrki Alakuijala	888429f409	lossless: 0.5 % compression density improvement do not do length 2 matches far away speedup for non compressible data by inserting two literals at a time when no matches are found Change-Id: Ia8e033071f4186bb8148bb2bf13ca37586734aa3	2015-07-07 20:24:27 -07:00
Jyrki Alakuijala	7b23b19808	lossless: Add zeroes into the predicted histograms. Increases compression density by 0.03 % for lossy. Speeds up at least one of the lossy alpha images by 20 %. Palette entropy 'kludge' seems to save 1-2 % on alpha images. Change-Id: I2116b8d81593ac8173bfba54a7c833997fca0804	2015-07-07 20:24:27 -07:00
Jyrki Alakuijala	85b44d8a69	lossless: encoding, don't compute unnecessary histo share the computation between different modes 3-5 % speedup for lossless alpha 1 % for lossy alpha no change in compression density Change-Id: I5e31413b3efcd4319121587da8320ac4f14550b2	2015-07-07 20:24:26 -07:00
Jyrki Alakuijala	d92453f381	lossless: Remove about 25 % of the speed degradation introduced in: "lossless: 0.37 % compression density improvement" Uses the statistics of red and blue histograms to decide if to run cross color correction at all. Improves compression density by 0.02 % or so. Change-Id: I47429557e9cdbd9fa90c584696f241b17427d73f	2015-07-07 20:24:26 -07:00
Jyrki Alakuijala	2cce031704	Faster alpha coding for webp No significant size degradation (+0.001 %) for 1000 image corpus Fixes the 8 ms vs 2 ms degradation from: "lossless: 0.37 % compression density improvement" Change-Id: Id540169a305d9d5c6213a82b46c879761b3ca608	2015-07-07 20:24:25 -07:00
Jyrki Alakuijala	5e75642efd	lossless: rle mode not to accept lengths smaller than 4. Gives a compression gain of 0.22 % Change-Id: I0f3b8dad6b4c1bfb16eab095a467f34466b9e3b7	2015-07-07 20:24:25 -07:00
Jyrki Alakuijala	84326e4ab0	lossless: Less code for the entropy selection Tested: 1000 png corpus gives same results Change-Id: Ief5ea7727290743b9bd893b08af7aa7951f556cb	2015-07-07 20:24:25 -07:00
Jyrki Alakuijala	16ab951abf	lossless: 0.37 % compression density improvement counting the entropy expectation for five different configurations: palette non-predicted non-predicted with subtract green predicted predicted with subtract green and choose the strategy with the smallest expected entropy Change-Id: Iaaf209c0d565660a54a4f9b3959067afb9951960	2015-07-07 20:24:24 -07:00
skal	ac76801159	introduce FTransform2 to perform two transforms at a time. FTransform goes from ~12.0% to 11.5% total CPU time. Change-Id: Ibcb23155324f4fd8b235563f80668531c781f624	2015-05-18 21:06:15 -07:00
James Zern	dbba67d1e7	histogram.h: cosmetics: remove unnecessary includes Change-Id: Ia8277d3587534c2a1af05d3df57a6973a68be16d	2015-04-17 12:23:06 -07:00
Pascal Massimino	7fa67c9b9e	change GetPixPairHash64() return type to uint32_t Change-Id: Ibb61c1631d7a4bcda5417b5a85864d5e2c3f3858	2015-04-16 00:55:25 -07:00
Pascal Massimino	7073bfb3ee	Merge "split 64-mult hashing into two 32-bit multiplies"	2015-04-15 23:04:47 -07:00
Pascal Massimino	7fe357b8c0	split 64-mult hashing into two 32-bit multiplies Speed-wise equivalent on x86 and ARM (maybe a tad faster, hard to tell). Note that the two 32-bit multiples are not strictly equivalent to the 64-bit one, since we're missing one carry propagation. In practice, no observable difference was seen because of this slightly different hashing result. Change-Id: I8f2381175eae1cb20dabf149e6b27e1768fba6ab	2015-04-15 17:45:19 +02:00
Pascal Massimino	6121413415	remove VP8Residual::cost unused field Change-Id: Id494475b05c540b40fd104594acbcaa783b88d77	2015-04-15 01:56:31 -07:00
James Zern	db12250fd1	cosmetics: vp8enci.h: break long line Change-Id: Ib7c7ef6171506e826ed5f7df20c5644f240fd645	2015-04-06 16:11:02 -07:00
Pascal Massimino	82d980209b	add a dec/common.h header to collect common enc/dec #defines had to rename few structs. -> we can now include both vp8i.h and vp8enci.h without naming conflicts. Change-Id: Ib41b498f1b57aab3d6b796361afc45210ec75174	2015-03-31 22:17:58 -07:00
James Zern	553051f741	dsp/lossless: split enc/dec functions adds lossless_enc*.c; reduces the size of the decode-only so: ~78K w/gcc-4.8.2 on x86_64. Change-Id: If5e4610b67d05eba5896bc64bab79e9df92b2092	2015-03-23 22:57:50 -07:00
James Zern	92a5da9c8c	sync versions with 0.4.3 libwebp{,decoder} - 0.4.3 libwebp libtool - 5.3.0 libwebpdecoder libtool - 1.3.0 mux/demux - 0.2.2 (unchanged) libtool - 1.2.0 (unchanged) (cherry picked from commit bd852f5d81edbcf201a4f6a1567689c9b95444d1) Change-Id: Ie8c35ffc20c1bfd782bdafd99da6c6b1373022c1	2015-03-11 17:29:23 -07:00
Pascal Massimino	b1bdbbabfb	~30% faster smart-yuv (-pre 4) with early-out criterion we look at average global improvement and stop when things are moving slow, or when we had a quite good first iteration already (means: the picture is "not difficult") Change-Id: I8ab7d100353039b5b32bb5fac3fe03c8440c78d5	2015-03-11 00:42:12 -07:00
Pascal Massimino	44bd95612e	fix signature for VP8RecordCoeffTokens() Change-Id: Ia2fe764b7280931335237ced8190604129fae565	2015-03-02 23:38:20 -08:00
Pascal Massimino	c9b8ea0eef	small cosmetics on TokenBuffer. Change-Id: I7c33651ed8e3a151aef44247db5fb1e8bf41f8ba	2015-03-03 00:48:28 +01:00
Vikas Arora	ef98750027	Speedup method StoreImageToBitMask by 5%. Speedup method StoreImageToBitMask by replacing the code to find histogram index and Huffman tree codes at every iteration to a more optimal code that updates these only when the current pixel (to write) crosses the histogram tile-row boundary. This change speeds up the StoreImageToBitMask method by 5%. Change-Id: If01a1ccd7820f9a3a3e5bc449d070defa51be14b	2015-02-20 09:46:19 -08:00
Pascal Massimino	2382050748	1-2% faster encoding by removing an indirection in GetResidualCost() The MIPS code for cost is not updated yet, that's why i keep Residual::cost around for now. Should be removed in favor of costs later. Change-Id: Id1d09a8c37ea8c5b34ad5eb8811d6a3ec6c4d89f	2015-02-19 08:44:35 +01:00
James Zern	9bc0f922aa	ApplyFiltersAndEncode: only copy lossless stats this avoids a race with multi-threaded lossy + alpha compression Change-Id: Ie437105f5a899ed28b9c8885b6ca5431092ce8f5	2015-02-12 19:44:25 -08:00
James Zern	e15560107c	move some cost tables from enc/ to dsp/ removes circular dependency between dsp and enc. since: a987fae MIPS: dspr2: added optimization for function GetResidualCost Change-Id: Ifeb8fc02de89e2ba982ed7ffacd925d649bfec3c	2015-02-11 16:10:06 -08:00
pascal massimino	c3a031686a	Merge "picture_csp: fix build w/USE_GAMMA_COMPRESSION undefined"	2015-02-10 00:09:03 -08:00
James Zern	1dd419ced5	picture_csp: fix build w/USE_GAMMA_COMPRESSION undefined kGammaFix is now only defined with USE_GAMMA_COMPRESSION; fixes: use of undeclared identifier 'kGammaFix' Change-Id: Ib1e2f410eff9b83be065894f88181f91dd2776e1	2015-02-09 23:57:14 -08:00
James Zern	0ec4da960d	picture_csp::InitGammaTables*: add missing TSan annotations Change-Id: I66ca5b3e7b1614f861a9b68bd437f58b24cb1ebb	2015-02-09 23:44:47 -08:00
Pascal Massimino	a987faedfa	MIPS: dspr2: added optimization for function GetResidualCost set/get residual C functions moved to new file in src/dsp mips32 version of GetResidualCost moved to new file Change-Id: I7cebb7933a89820ff28c187249a9181f281081d2	2015-02-07 02:13:26 -08:00
James Zern	3b77e5a735	VP8TBufferClear: remove some misleading const's the input to the function is non-const and the pointer being operated is being free'd; removes an unnecessary cast in the process Change-Id: Ic515ed672ddf7f8e4e36eeac696ff7aa8a3652f7	2015-02-05 23:56:26 -08:00
James Zern	aa139c8f1a	VP8EmitTokens: remove unnecessary param void cast 'final_pass' is used within the function Change-Id: I81be1a6e18cafaa6ae685ed8ad2b107fa7ed29cf	2015-02-05 23:56:26 -08:00
Vikas Arora	4c82284d2e	Updated the near-lossless level mapping. Updated the near-lossless level mapping and make it correlated to lossy quality i.e 100 => minimum loss (in-fact no-loss) and the visual-quality loss increases with decrease in near-lossless level (quality) till value 0. The new mapping implies following (PSNR) loss-metric: -near_lossless 100: No-loss (bit-stream same as -lossless). -near_lossless 80: Very very high PSNR (around 54dB). -near_lossless 60: Very high PSNR (around 48dB). -near_lossless 40: High PSNR (around 42dB). -near_lossless 20: Moderate PSNR (around 36dB). -near_lossless 0: Low PSNR (around 30dB). Change-Id: I930de4b18950faf2868c97d42e9e49ba0b642960	2015-02-05 11:17:14 -08:00
James Zern	c86b40cca0	enc/near_lossless.c: fix alignment Change-Id: Ifd1b1b88c375abf655d94e2ba7d52087110294a5	2015-02-02 19:35:12 -08:00
Vikas Arora	72831f6b28	Speedup AnalyzeAndInit for low effort compression. AnalyzeSubtractGreen constitutes about 8-10% of the comression CPU cycles. Statistically, subtract-green is proved to be useful for most of the non-palette compression. So instead of evaluating the entropy (by calling AnalyzeSubtractGreen) apply subtract-green transform for the low-effort compression. This changes speeds up the compression at m=0 by 8-10% (with very slight loss of 0.07% in the compression density). Change-Id: I9797dc39437ae089716acb14631bbc77d367acf4	2015-01-30 10:37:31 -08:00
Vikas Arora	a6597483af	Speedup Analyze methods for lossless compression. Speed up AnalyzeSubtractGreen by looping through the image pixel once to compute the two histograms. AnalyzeEntropy code cleanup. Removed some 'if' conditions and pointer indirections inside pixel iterate loop. Change-Id: Ia65e3033988ff67df8e3ecce19d6e34cfc76358e	2015-01-30 09:16:31 -08:00
Vikas Arora	98c8138663	Enable Near-lossless feature. Enable the WebP near-lossless feature by pre-processing the image to smoothen the pixels. On a 1000 PNG image corpus, for which WebP lossless (default settings) gets 25% compression gains, following is the performance of near-lossless feature at various '-near_lossless' levels: -near_lossless 90: 30% (very very high PSNR 54-60dB) -near_lossless 75: 38% (very high PSNR 48-54dB) -near_lossless 50: 45% (high PSNR 42-48dB) -near_lossless 25: 48% (moderate PSNR 36-42dB) -near_lossless 10: 50% (PSNR 30-36dB) WebP near-lossless is specifically useful for discrete-tone images like line-art, icons etc. Change-Id: I7d12a2c9362ccd076d09710ea05c85fa64664c38	2015-01-29 16:10:20 -08:00
Urvang Joshi	2db15a9583	Temporarily disable encoding of alpha plane with color cache. This is to avoid triggering the related decoder bug. Change-Id: I8fa074a5393bcd62aa4a2232cd4e02935e927a89	2015-01-28 15:28:02 -08:00
James Zern	cafa1d882f	Merge "Simplify backward refs calculation for low-effort."	2015-01-27 23:32:21 -08:00

1 2 3 4 5 ...

588 Commits