libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-08-11 02:20:33 +02:00

Author	SHA1	Message	Date
Urvang Joshi	2db15a9583	Temporarily disable encoding of alpha plane with color cache. This is to avoid triggering the related decoder bug. Change-Id: I8fa074a5393bcd62aa4a2232cd4e02935e927a89	2015-01-28 15:28:02 -08:00
James Zern	cafa1d882f	Merge "Simplify backward refs calculation for low-effort."	2015-01-27 23:32:21 -08:00
Pascal Massimino	7afdaf8496	Alpha coding: reorganize the filter/unfiltering code Move the filtering code to their own dsp/ spot New function: VP8FiltersInit() Change-Id: I0b2041eab42346c59b972f2575b05509e6a8f7b1	2015-01-28 08:02:41 +01:00
Vikas Arora	4d6d7285b0	Simplify backward refs calculation for low-effort. Simplify and speedup backward references for low-effort settings by evaluating LZ77 references only. This change speeds up compression by 10-25% at lower (q <= 25) quality range with a slight drop (0.2%) in the compression density. Change-Id: Ibd6f03b1a062d8ab9191786c2a425e9132e4779f	2015-01-27 09:36:14 -08:00
Vikas Arora	ec0d1be577	Cleaup Near-lossless code. Cleaup Near-lossless code - Simplified and refactored the code. - Removed the requirement (TODO) to allocate the buffer of size WxH and work with buffer of size 3xW. - Disabled the Near-lossless prr-processing for small icon images (W < 64 and H < 64). Change-Id: Id7ee90c90622368d5528de4dd14fd5ead593bb1b	2015-01-26 15:29:59 -08:00
Vikas Arora	9814ddb601	Remove the post-transform near-lossless heuristic. Remove the post-transform (prediction, subtract green & cross-color) near-lossless heuristic, that's not ready yet and produces unacceptable visual (banding) artifacts. Change-Id: I9b606a790ce0344c588f2ef83a09c57ac19c2fc1	2015-01-26 14:19:00 -08:00
Pascal Massimino	0f027a72bf	simplify smart RGB->YUV conversion code * use the same TFIX == YFIX precision (2bits) * use int instead of float in LinearToGammaF() output is visually equivalent. Code is a little faster. Change-Id: Ie3cfebca351dbcbd924b3d00801d6523dca6981f	2015-01-23 14:42:32 +01:00
Pascal Massimino	0d5b334ee8	BackwardReferencesHashChainFollowChosenPath: remove unused variable Change-Id: I8dc4622dbacca03a7876f8856a0db5b9b9ec2fbd	2015-01-22 23:22:58 -08:00
Pascal Massimino	f480d1a7ef	Fix to near lossless artefacts on palettized images. Don't rely on palette not being there before the palette colors are counted. Change-Id: I988286675d3398f2da8f6d2fb6462db08af8028d	2015-01-22 17:47:50 +01:00
Pascal Massimino	cb4a18a7ba	rename HashChainInit into HashChainReset this avoids the confusion with "VP8LHashChainInit" Change-Id: Ia1686828c138729e5bda3cc5c8246d99c80915ef	2015-01-20 00:38:07 -08:00
Pascal Massimino	f079e487ae	use uint16_t for chosen_path[] len is MAX_LENGTH (4096) at max. This reduce memory for path by a half. Change-Id: I399fda4093d93b1e9d956397b7b210956c5b948f	2015-01-20 00:34:09 -08:00
James Zern	f0e0677b87	VP8LEncodeStream: add an assert check enc->argb_ to quiet an msvs /analyze warning: C6387: 'enc->argb_+y*width' could be '0': this does not adhere to the specification for the function 'memcpy'. Change-Id: I87544e92ee0d3ea38942a475c30c6d552f9877b7	2015-01-16 18:16:40 -08:00
Vikas Arora	b9e356b998	Disable costly TraceBackwards for method=0. Disable costly TraceBackwards heuristic for computing the backward references for low_effort (method=0) compression. The TraceBackwards heuristic is already disabled for lower (q < 25) quality range. Following is the compression data for 1000 image corpus for q >= 25. This speeds up compression (q >= 25) by a factor of 2.5-3X with slight loss of compression density (0.7% for lower quality range and 1.2% for higher qualities). Change-Id: I256c9e2137c7de4083f423ea32ee12d3b0f46253	2015-01-15 09:01:40 -08:00
Vikas Arora	ea08466d34	Tune BackwardReferencesLz77 for low_effort (m=0). - Lower the threshold parameters for HashChainFindCopy. For 1000 image PNG corpus (m=0), this change yields speedup of 15-20% at lower quality range (0.25% drop in compression density) and about 10% for higher quality range without any drop in the compression density. Following is the compression stats (before/after) for method = 0: Before After bpp/MPs bpp/MPs q=0 2.8615/18.000 2.8651/18.631 q=5 2.8615/18.216 2.8650/20.517 q=10 2.8572/18.070 2.8650/21.992 q=15 2.8519/18.371 2.8584/21.747 q=20 2.8454/18.975 2.8515/20.448 q=25 2.8230/8.531 2.8253/9.585 // Compression density remains same for q-range [30-100] q=30 2.7310/7.706 2.7310/8.028 q=35 2.7253/6.855 2.7253/7.184 q=40 2.7231/6.364 2.7231/6.604 q=45 2.7216/5.844 2.7216/6.223 q=50 2.7196/5.210 2.7196/5.731 q=55 2.7208/4.766 2.7208/4.970 q=60 2.7195/4.495 2.7195/4.602 q=65 2.7185/4.024 2.7185/4.236 q=70 2.7174/3.699 2.7174/3.861 q=75 2.7164/3.449 2.7164/3.605 q=80 2.7161/3.222 2.7161/3.038 q=85 2.7153/2.919 2.7153/2.946 q=90 2.7145/2.766 2.7145/2.771 q=95 2.7124/2.548 2.7124/2.575 q=100 2.6873/2.253 2.6873/2.335 Change-Id: I0e17581fb71f6094032ad06c6203350bd502f9a1	2015-01-08 00:30:21 -08:00
Vikas Arora	b0b973c39b	Speedup VP8LGetHistoImageSymbols for low effort (m=0) mode. - Do light weight entropy based histogram combine and leave out CPU intensive stochastic and greedy heuristics for combining the histograms. For 1000 image PNG corpus (m=0), this change yields speedup of 10% at lower quality range (1% drop in compression density) and about 5% for higher quality range (1% drop in compression density). Following is the compression stats (before/after) for method = 0: Before After bpp/MPs bpp/MPs q=0 2.8336/16.577 2.8615/18.000 q=5 2.8336/16.504 2.8615/18.216 q=10 2.8293/16.419 2.8572/18.070 q=15 2.8242/17.582 2.8519/18.371 q=20 2.8182/16.131 2.8454/18.975 q=25 2.7924/7.670 2.8230/8.531 q=30 2.7078/6.635 2.7310/7.706 q=35 2.7028/6.203 2.7253/6.855 q=40 2.7005/6.198 2.7231/6.364 q=45 2.6989/5.570 2.7216/5.844 q=50 2.6970/5.087 2.7196/5.210 q=55 2.6963/4.589 2.7208/4.766 q=60 2.6949/4.292 2.7195/4.495 q=65 2.6940/3.970 2.7185/4.024 q=70 2.6929/3.698 2.7174/3.699 q=75 2.6919/3.427 2.7164/3.449 q=80 2.6918/3.106 2.7161/3.222 q=85 2.6909/2.856 2.7153/2.919 q=90 2.6902/2.695 2.7145/2.766 q=95 2.6881/2.499 2.7124/2.548 q=100 2.6873/2.253 2.6873/2.285 Change-Id: I0567945068f8dc7888041e93d872f9def91f50ba	2015-01-08 00:29:57 -08:00
Pascal Massimino	72d573f693	simplify the PackARGB signature Change-Id: I51570e362126b2681f93211a4f59a3fedb5fd4b5	2015-01-05 02:10:04 -08:00
Pascal Massimino	1f4b8642e8	move VP8EncDspARGBInit() call closer to where it's needed Change-Id: I0d5121b456918f0ee6646903a8d71d4384deafe3	2014-12-23 16:04:14 +01:00
Djordje Pesut	7ce8788b06	MIPS: dspr2: added optimization for function MakeARGB32 inline function MakeARGB32 calls changed to call via pointers to functions which make (a)rgb for entire row Change-Id: Ia4bd4be171a46c1e1821e408b073ff5791c587a9	2014-12-22 12:31:36 +01:00
Pascal Massimino	24284459c7	replace unneeded calls to HistogramCopy() by swaps most of the time, we don't need to actually move the data. Compression is randomly slightly different, because HistogramCompactBins() changed. Timing is about the same. Change-Id: Ia6af8e9780581014d6860f2b546189ac817cfad1	2014-12-17 15:32:36 +01:00
Pascal Massimino	31a9cf6417	Speedup WebP lossless compression for low effort (m=0) mode with following: - Disable Cross-Color transform. - Evaluate predictors #11 (paeth), #12 and #13 only. Change-Id: I857264c85c61c3957d4fb45ae32d261d947c8bed	2014-12-17 11:52:11 +01:00
Pascal Massimino	bad775715a	simplify the Histogram struct, to only store max_value and last_nz we don't need to store the whole distribution in order to compute the alpha Later, we can incorporate the max_value / last_non_zero bookkeeping in SSE2 directly. Change-Id: I748ccea4ac17965d7afcab91845ef01be3aa3e15	2014-12-10 10:44:57 +01:00
James Zern	9475bef4d7	PickBestUV: fix VP8Copy16x8 invocation param order is src, dst broken in: `66ad372` factorize BPS definition in dsp.h and add VP8Copy16x8 Change-Id: I761f618e3fe31ae7f58953256381f4f16bdb238e	2014-12-04 23:12:30 -08:00
Pascal Massimino	66ad372500	factorize BPS definition in dsp.h and add VP8Copy16x8 Change-Id: Id73a1e968c96455808755df4d131d74e3e2e135d	2014-12-04 13:45:14 +01:00
Pascal Massimino	57606047ec	encoder: switch BPS to 32 instead of 16 this is a first step to unifying encoding/decoding cache stride and possibly sharing the prediction functions in dsp/ With this layout, there's a little (~7%) space lost with unused samples. But no speed change was observed. Change-Id: I016df8cad41bde5088df3579e6ad65d884ee711e	2014-12-04 09:17:18 +01:00
James Zern	c6d0f9e758	histogram: cosmetics fix indent + other minor spelling / whitespace changes Change-Id: I6e4462b75c98994e3c53c115de07047dbe71ce3c	2014-11-25 15:53:19 -08:00
Vikas Arora	e0c809ad23	Move Entropy methods to lossless.c Move all the Entropy evaluation methods to lossless.c (from histogram.c). There's slight difference in the way entropy is computed for evaluating entropy in prediction methods and histogram (literal) for huffman trees. Plan (later) to merge few (static) methods and reduce the code size. This change has no impact on the compression speed/density. Change-Id: Ife3d96a3c4a8d78a91723d9e0a8d1b78c0256a15	2014-11-20 13:48:05 -08:00
Vikas Arora	a0df55104e	Remove handling for WEBP_HINT_GRAPH Remove handling for WEBP_HINT_GRAPH w.r.t use_palette flag. The WEBP_HINT_GRAPH is now used at one place, to set the initial size of the Bit Writer as bpp for photo images are generally larger than the graphical images. Change-Id: I1b9c4436c85a8f69da74c0dbcd292397323f2696	2014-11-13 15:49:23 -08:00
Vikas Arora	413dfc0c4b	Move static method definition before its usage. Change-Id: Id766c2bea92e7ebf0de65046f73429b74b4fdda4	2014-11-13 13:18:30 -08:00
Vikas Arora	0f23566558	Update BackwardRefsWithLocalCache. Update BackwardRefsWithLocalCache to do in-place update of backward references w.r.t local color cache index. No impact on the compression density or compression speed. Change-Id: Ie066251464c3928c044e037b43df3af28b48ca30	2014-11-13 11:54:26 -08:00
Vikas Arora	d69e36ec59	Remove TODOs from lossless encoder code. histogram.c: - Verified (earlier) that there's low correlation between Red & Blue colors (particularly after applying Cross-color transform). The Bin based histogram merge, bins on three entropies viz literal, red & blue symbols. Removing either of blue or red increases the compression density. So keeping the bins for red & blue sybmols. - Keeping the compact bins method as-is. This way it's simpler to read. huffman_encode.h: Added field comments for struct HuffmanTree and removed the TODO. Change-Id: Ia76f7bc730079d1b3b644038c5d9931db3797f0e	2014-11-12 16:10:16 -08:00
Vikas Arora	fdaac8e0ca	Optmize VP8LGetBackwardReferences LZ77 references. Use the refs_lz77 computed (with cache_bits=0) in the method 'CalculateBestCacheSize' to regenerate the LZ77 references corresponding to the optimum cache_bits and avoid calling costly 'BackwardReferencesLz77' one extra time. This change leaves the compression density unchanged and speeds up compression by 10-15%. Change-Id: I5a92e11788d3c3f656aa7e1fba54fb5d96ee0027	2014-11-12 14:50:04 -08:00
pascal massimino	a3e79a46f6	Merge "WebPEncode: Support encoding same pic twice (even if modified)"	2014-11-06 22:20:01 -08:00
Urvang Joshi	e4f4dddba3	WebPEncode: Support encoding same pic twice (even if modified) This wasn't working for this specific scenario: - Encode an RGBA 'pic' (with trivial alpha) using lossy encoding. (so that pic->a == NULL after import happens). - Modify the 'pic->argb' so that it has non-trivial alpha. - Encode the same 'pic' again. This used to fail to encode alpha data as pic->a == NULL. Change-Id: Ieaaa7bd09825c42f54fbd99e6781d98f0b19cc0c	2014-11-06 13:52:48 -08:00
Vikas Arora	95a9bd85c4	Updated VP8LGetBackwardReferences and color cache. - The optimal cache bits is evaluated inside the method 'VP8LGetBackwardReferences'. - The input cache_bits to 'VP8LGetBackwardReferences' sets the maximum cache bits to use (passing 0 implies disabling the local color cache). - The local color cache is disabled for lowerf (<= 25) quality levels (as before). - Enabled local color cache for palette images as well. This saves additional 0.017% bytes with a slight (2-3%) improvement in the compression speed. - Removed 'use_2d_locality' parameter from method VP8LGetBackwardReferences, as this option is not an option now (after we freeze the lossless bit-stream). Change-Id: I33430401e465474fa1be899f330387cd2b466280	2014-11-06 13:14:05 -08:00
James Zern	4171b6724e	backward_references.c: reindent after `c8581b0` Change-Id: Icfc0fe8e266c0f67a70b8cb095e5aaee155290b6	2014-11-04 17:40:04 +01:00
Vikas Arora	c8581b06e1	Optimize BackwardReferences for RLE encoding. Updated BackwardReferencesRle method by utilizing the local color cache. Also changed the name of method BackwardReferencesHashChain to BackwardReferencesLz77 to reflect the LZ77 coding. For the 1000 image corpus, this change saves 0.2% bytes (at default settings) and is 2-5% faster to encode. Change-Id: Ic3f288253b3bbb101a69945a80994c3fd0917f8b	2014-11-04 08:12:07 -08:00
Vikas Arora	4167a3f5f7	Optimize backwardreferences Optimize backwardreferences (about 0.1% byte savings) with almost same compression speed (3% faster on defaut compression settings). 1.) Simplified iteration logic for HashChainFindCopy. - Remapped the iter_max constant. 2.) Simplified main for loop for BackwardReferencesHashChain - Removed 'if' conditions for corner cases in the main loop. - Refactored the method(AddSingleLiteral) for adding one pixel. Change-Id: I1bc44832fd81f11e714868a13e606c8f83157e64	2014-10-31 18:08:38 -07:00
Vikas Arora	77bdddf016	Speed up BackwardReferences Speed up BackwardReferencesHashChainDistanceOnly method by: 1.) Remove for loop for shortmax code path. 2.) Execute the shortmax code path after regular call to HashChainFindCopy, only if HashChainFindCopy() returns length > 2 (MIN_LENGTH). 3.) Also for shortmax, call method HashChainFindOffset (for length = 2), instead of expensive method HashChainFindCopy(). 4.) Handling first pixel (i==0) outside main loop and removing one if condition (i > 0) per pixel. 5.) Handle the last pixel outside the main 'for' loop. Overall compression speedup observed is around 5% (+/- noise). Change-Id: Ifa30c4035f8d26e6e43e3c4881244d777961c22b	2014-10-30 10:58:24 -07:00
Vikas Arora	abf04205b3	Enable entropy based merge histo for (q<100) Enable bin-partition entropy based heuristic for merging histograms for higher (q >= 90) qualities as well. Keep the old behavior at the maximum quality level (q==100). This speeds up the compression between Q=90-99 (method=4) by factor 5-7X and with loss of 0.5-0.8% in the compression density. Change-Id: I011182cb8ae5403c565a150362bc302630b3f330	2014-10-30 03:59:36 -07:00
Vikas Arora	653ace55c3	Increase the MAX_COLOR_CACHE_BITS from 9 to 10. The Maximum allowed limit is 11. The Q=25 and below is not impacted as cache bits are forced to 0. This saves 0.05% - 0.1% bytes for other quality with almost same compression speed (+/- 2-3%, that's more of a noise). Change-Id: Icf972a98f298c89e140e37a627baf709134be9a0	2014-10-27 14:19:04 -07:00
Vikas Arora	919220c7e6	Change the logic adjusting the Histogram bits. Updated the logic to limit the Histogram size to a constant, instead of computing the same based on the Histogram size (that's variable size based on the cache bits) for the maximum possible cache bits. The actual cache bits may be lower than the maximum. Note: The constant 2600 is 16MB/Sizeof(HistogramSize(MAX_COLOR_CACHE_BITS)). The compression density remains the same with this change, with little faster compression speed. Change-Id: I3149894962852e9dad2501b9aa16bb847a20fd86	2014-10-27 09:57:17 -07:00
Vikas Arora	e912bd55be	Fix bug in VP8LCalculateEstimateForCacheSize. The method VP8LCalculateEstimateForCacheSize is not evaluating the all possible range for cache_bits. Also added a small penality for choosing the larger cache-size. This is done to strike a balance between additional memory/CPU cost (with larger cache-size) and byte savings from smaller WebP lossless files. This change saves about 0.07% bytes and speeds up compression by 8% (default settings). There's small speedup at Q=50 along with byte savings as well. Compression at Quality=25 is not effected by this change. Change-Id: Id8f87dee6b5bccb2baa6dbdee479ee9cda8f4f77	2014-10-26 20:05:48 -07:00
Pascal Massimino	5b90d8fe42	Unify the API between VP8BitWriter and VP8LBitWriter BitReader will be next... Change-Id: Icd9e7ab2e3890131e664c0523627d9b8c5399a74	2014-10-23 15:35:16 +02:00
Vikas Arora	c2b5a0396a	Modify CostModel to allocate optimal memory. Change-Id: I7d52675d28bfc109d4e901581fc24cd36fcb79ee	2014-10-22 13:30:33 -07:00
Vikas Arora	139142e440	Optimize BackwardReferenceHashChainFollowPath. Instead of calling HashChainFindMethod, call a new (subset) method HashChainFindOffset to get the offset/distance for a given length. The encoding is tad faster at default compression Before After bpp/rate bpp/rate 442 Palette 0.2720/5.270 MP/s 0.2720/5.790 MP/s 558 non-palette 3.7607/0.797 MP/s 3.7607/0.816 MP/s Change-Id: If4041a9c18f7e972f49fcbab8c3e2f013d8bf1cf	2014-10-21 10:04:27 -07:00
James Zern	5f36b68d22	enc/backward_references.c: fix indent reindent after `c24f895` Change-Id: I55adcbef21ea3fdaded84b138745515596191a09	2014-10-20 11:35:20 +02:00
James Zern	e0e9960dd1	Merge "sync version numbers to 0.4.2 release"	2014-10-17 11:47:30 -07:00
James Zern	64ac51446d	sync version numbers to 0.4.2 release libwebp{,decoder} - 0.4.2 libwebp libtool - 5.2.0 libwebpdecoder libtool - 1.2.0 mux/demux - 0.2.2 libtool - 1.2.0 (cherry picked from commit `eec5f5f121`) (cherry picked from commit `857578a811`) Change-Id: Ie9d10c68e28083674a8865ad8447b1a70dcea95d	2014-10-17 19:50:21 +02:00
Vikas Arora	c24f8954be	Simplify and speedup Backward refs computation. Updated VP8LGetBackwardReferences and HashChainFindCopy method with following: - Remove the recursive CostModelBuild. - Reuse the lz77 backward refs in CostModelBuild, instead of evaluating it again (as it was done for recursion_level=0). - Consolidated the Match-length logic inside FindMatchLength method. - Removed the logic for altering best_length/val based on the 2D distance. The additional 162 value (+= 9 * 9 + 9 * 9 - y * y - x * x) can't change the best_val eval computation to choose a different curr_length, as best_val was set to 'curr_length << 16'. Following is the impact on the compression speed/density at default & max quality, overall this speeds up compression by 5-15% (q=100 -> 75) with a tad drop (0.02-0.03%) in compression density for the non-palette images. Before After bpp/Rate(MP/s) bpp/Rate(MP/s) q=75 (def) All 1000 2.4492/1.049 MP/s 2.4498/1.230 MP/s Palette 0.2719/5.060 MP/s 0.2719/6.110 MP/s non-Palette 3.7597/0.732 MP/s 3.7607/0.840 MP/s q=100 All 1000 2.4134/0.125 MP/s 2.4142/0.131 MP/s Palette 0.2692/2.585 MP/s 0.2692/2.885 MP/s non-Palette 3.7040/0.079 MP/s 3.7053/0.083 MP/s Change-Id: I27a5eff3356d876c3e949fd32262244b25678b7a	2014-10-17 09:21:30 -07:00
Pascal Massimino	6c6736816c	Improved near-lossless mode. Compared to previous mode it gives another 10-30% improvement in compression keeping comparable PSNR on corresponding quality settings. Still protected by the WEBP_EXPERIMENTAL_FEATURES flag. Change-Id: I4821815b9a508f4f38c98821acaddb74c73c60ac	2014-10-15 10:57:21 -07:00

1 2 3 4 5 ...

540 Commits