libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-03 01:24:30 +02:00

Author	SHA1	Message	Date
Pascal Massimino	7fa67c9b9e	change GetPixPairHash64() return type to uint32_t Change-Id: Ibb61c1631d7a4bcda5417b5a85864d5e2c3f3858	2015-04-16 00:55:25 -07:00
Pascal Massimino	7073bfb3ee	Merge "split 64-mult hashing into two 32-bit multiplies"	2015-04-15 23:04:47 -07:00
Pascal Massimino	7fe357b8c0	split 64-mult hashing into two 32-bit multiplies Speed-wise equivalent on x86 and ARM (maybe a tad faster, hard to tell). Note that the two 32-bit multiples are not strictly equivalent to the 64-bit one, since we're missing one carry propagation. In practice, no observable difference was seen because of this slightly different hashing result. Change-Id: I8f2381175eae1cb20dabf149e6b27e1768fba6ab	2015-04-15 17:45:19 +02:00
Pascal Massimino	6121413415	remove VP8Residual::cost unused field Change-Id: Id494475b05c540b40fd104594acbcaa783b88d77	2015-04-15 01:56:31 -07:00
James Zern	db12250fd1	cosmetics: vp8enci.h: break long line Change-Id: Ib7c7ef6171506e826ed5f7df20c5644f240fd645	2015-04-06 16:11:02 -07:00
Pascal Massimino	82d980209b	add a dec/common.h header to collect common enc/dec #defines had to rename few structs. -> we can now include both vp8i.h and vp8enci.h without naming conflicts. Change-Id: Ib41b498f1b57aab3d6b796361afc45210ec75174	2015-03-31 22:17:58 -07:00
James Zern	553051f741	dsp/lossless: split enc/dec functions adds lossless_enc*.c; reduces the size of the decode-only so: ~78K w/gcc-4.8.2 on x86_64. Change-Id: If5e4610b67d05eba5896bc64bab79e9df92b2092	2015-03-23 22:57:50 -07:00
James Zern	92a5da9c8c	sync versions with 0.4.3 libwebp{,decoder} - 0.4.3 libwebp libtool - 5.3.0 libwebpdecoder libtool - 1.3.0 mux/demux - 0.2.2 (unchanged) libtool - 1.2.0 (unchanged) (cherry picked from commit bd852f5d81edbcf201a4f6a1567689c9b95444d1) Change-Id: Ie8c35ffc20c1bfd782bdafd99da6c6b1373022c1	2015-03-11 17:29:23 -07:00
Pascal Massimino	b1bdbbabfb	~30% faster smart-yuv (-pre 4) with early-out criterion we look at average global improvement and stop when things are moving slow, or when we had a quite good first iteration already (means: the picture is "not difficult") Change-Id: I8ab7d100353039b5b32bb5fac3fe03c8440c78d5	2015-03-11 00:42:12 -07:00
Pascal Massimino	44bd95612e	fix signature for VP8RecordCoeffTokens() Change-Id: Ia2fe764b7280931335237ced8190604129fae565	2015-03-02 23:38:20 -08:00
Pascal Massimino	c9b8ea0eef	small cosmetics on TokenBuffer. Change-Id: I7c33651ed8e3a151aef44247db5fb1e8bf41f8ba	2015-03-03 00:48:28 +01:00
Vikas Arora	ef98750027	Speedup method StoreImageToBitMask by 5%. Speedup method StoreImageToBitMask by replacing the code to find histogram index and Huffman tree codes at every iteration to a more optimal code that updates these only when the current pixel (to write) crosses the histogram tile-row boundary. This change speeds up the StoreImageToBitMask method by 5%. Change-Id: If01a1ccd7820f9a3a3e5bc449d070defa51be14b	2015-02-20 09:46:19 -08:00
Pascal Massimino	2382050748	1-2% faster encoding by removing an indirection in GetResidualCost() The MIPS code for cost is not updated yet, that's why i keep Residual::cost around for now. Should be removed in favor of costs later. Change-Id: Id1d09a8c37ea8c5b34ad5eb8811d6a3ec6c4d89f	2015-02-19 08:44:35 +01:00
James Zern	9bc0f922aa	ApplyFiltersAndEncode: only copy lossless stats this avoids a race with multi-threaded lossy + alpha compression Change-Id: Ie437105f5a899ed28b9c8885b6ca5431092ce8f5	2015-02-12 19:44:25 -08:00
James Zern	e15560107c	move some cost tables from enc/ to dsp/ removes circular dependency between dsp and enc. since: a987fae MIPS: dspr2: added optimization for function GetResidualCost Change-Id: Ifeb8fc02de89e2ba982ed7ffacd925d649bfec3c	2015-02-11 16:10:06 -08:00
pascal massimino	c3a031686a	Merge "picture_csp: fix build w/USE_GAMMA_COMPRESSION undefined"	2015-02-10 00:09:03 -08:00
James Zern	1dd419ced5	picture_csp: fix build w/USE_GAMMA_COMPRESSION undefined kGammaFix is now only defined with USE_GAMMA_COMPRESSION; fixes: use of undeclared identifier 'kGammaFix' Change-Id: Ib1e2f410eff9b83be065894f88181f91dd2776e1	2015-02-09 23:57:14 -08:00
James Zern	0ec4da960d	picture_csp::InitGammaTables*: add missing TSan annotations Change-Id: I66ca5b3e7b1614f861a9b68bd437f58b24cb1ebb	2015-02-09 23:44:47 -08:00
Pascal Massimino	a987faedfa	MIPS: dspr2: added optimization for function GetResidualCost set/get residual C functions moved to new file in src/dsp mips32 version of GetResidualCost moved to new file Change-Id: I7cebb7933a89820ff28c187249a9181f281081d2	2015-02-07 02:13:26 -08:00
James Zern	3b77e5a735	VP8TBufferClear: remove some misleading const's the input to the function is non-const and the pointer being operated is being free'd; removes an unnecessary cast in the process Change-Id: Ic515ed672ddf7f8e4e36eeac696ff7aa8a3652f7	2015-02-05 23:56:26 -08:00
James Zern	aa139c8f1a	VP8EmitTokens: remove unnecessary param void cast 'final_pass' is used within the function Change-Id: I81be1a6e18cafaa6ae685ed8ad2b107fa7ed29cf	2015-02-05 23:56:26 -08:00
Vikas Arora	4c82284d2e	Updated the near-lossless level mapping. Updated the near-lossless level mapping and make it correlated to lossy quality i.e 100 => minimum loss (in-fact no-loss) and the visual-quality loss increases with decrease in near-lossless level (quality) till value 0. The new mapping implies following (PSNR) loss-metric: -near_lossless 100: No-loss (bit-stream same as -lossless). -near_lossless 80: Very very high PSNR (around 54dB). -near_lossless 60: Very high PSNR (around 48dB). -near_lossless 40: High PSNR (around 42dB). -near_lossless 20: Moderate PSNR (around 36dB). -near_lossless 0: Low PSNR (around 30dB). Change-Id: I930de4b18950faf2868c97d42e9e49ba0b642960	2015-02-05 11:17:14 -08:00
James Zern	c86b40cca0	enc/near_lossless.c: fix alignment Change-Id: Ifd1b1b88c375abf655d94e2ba7d52087110294a5	2015-02-02 19:35:12 -08:00
Vikas Arora	72831f6b28	Speedup AnalyzeAndInit for low effort compression. AnalyzeSubtractGreen constitutes about 8-10% of the comression CPU cycles. Statistically, subtract-green is proved to be useful for most of the non-palette compression. So instead of evaluating the entropy (by calling AnalyzeSubtractGreen) apply subtract-green transform for the low-effort compression. This changes speeds up the compression at m=0 by 8-10% (with very slight loss of 0.07% in the compression density). Change-Id: I9797dc39437ae089716acb14631bbc77d367acf4	2015-01-30 10:37:31 -08:00
Vikas Arora	a6597483af	Speedup Analyze methods for lossless compression. Speed up AnalyzeSubtractGreen by looping through the image pixel once to compute the two histograms. AnalyzeEntropy code cleanup. Removed some 'if' conditions and pointer indirections inside pixel iterate loop. Change-Id: Ia65e3033988ff67df8e3ecce19d6e34cfc76358e	2015-01-30 09:16:31 -08:00
Vikas Arora	98c8138663	Enable Near-lossless feature. Enable the WebP near-lossless feature by pre-processing the image to smoothen the pixels. On a 1000 PNG image corpus, for which WebP lossless (default settings) gets 25% compression gains, following is the performance of near-lossless feature at various '-near_lossless' levels: -near_lossless 90: 30% (very very high PSNR 54-60dB) -near_lossless 75: 38% (very high PSNR 48-54dB) -near_lossless 50: 45% (high PSNR 42-48dB) -near_lossless 25: 48% (moderate PSNR 36-42dB) -near_lossless 10: 50% (PSNR 30-36dB) WebP near-lossless is specifically useful for discrete-tone images like line-art, icons etc. Change-Id: I7d12a2c9362ccd076d09710ea05c85fa64664c38	2015-01-29 16:10:20 -08:00
Urvang Joshi	2db15a9583	Temporarily disable encoding of alpha plane with color cache. This is to avoid triggering the related decoder bug. Change-Id: I8fa074a5393bcd62aa4a2232cd4e02935e927a89	2015-01-28 15:28:02 -08:00
James Zern	cafa1d882f	Merge "Simplify backward refs calculation for low-effort."	2015-01-27 23:32:21 -08:00
Pascal Massimino	7afdaf8496	Alpha coding: reorganize the filter/unfiltering code Move the filtering code to their own dsp/ spot New function: VP8FiltersInit() Change-Id: I0b2041eab42346c59b972f2575b05509e6a8f7b1	2015-01-28 08:02:41 +01:00
Vikas Arora	4d6d7285b0	Simplify backward refs calculation for low-effort. Simplify and speedup backward references for low-effort settings by evaluating LZ77 references only. This change speeds up compression by 10-25% at lower (q <= 25) quality range with a slight drop (0.2%) in the compression density. Change-Id: Ibd6f03b1a062d8ab9191786c2a425e9132e4779f	2015-01-27 09:36:14 -08:00
Vikas Arora	ec0d1be577	Cleaup Near-lossless code. Cleaup Near-lossless code - Simplified and refactored the code. - Removed the requirement (TODO) to allocate the buffer of size WxH and work with buffer of size 3xW. - Disabled the Near-lossless prr-processing for small icon images (W < 64 and H < 64). Change-Id: Id7ee90c90622368d5528de4dd14fd5ead593bb1b	2015-01-26 15:29:59 -08:00
Vikas Arora	9814ddb601	Remove the post-transform near-lossless heuristic. Remove the post-transform (prediction, subtract green & cross-color) near-lossless heuristic, that's not ready yet and produces unacceptable visual (banding) artifacts. Change-Id: I9b606a790ce0344c588f2ef83a09c57ac19c2fc1	2015-01-26 14:19:00 -08:00
Pascal Massimino	0f027a72bf	simplify smart RGB->YUV conversion code * use the same TFIX == YFIX precision (2bits) * use int instead of float in LinearToGammaF() output is visually equivalent. Code is a little faster. Change-Id: Ie3cfebca351dbcbd924b3d00801d6523dca6981f	2015-01-23 14:42:32 +01:00
Pascal Massimino	0d5b334ee8	BackwardReferencesHashChainFollowChosenPath: remove unused variable Change-Id: I8dc4622dbacca03a7876f8856a0db5b9b9ec2fbd	2015-01-22 23:22:58 -08:00
Pascal Massimino	f480d1a7ef	Fix to near lossless artefacts on palettized images. Don't rely on palette not being there before the palette colors are counted. Change-Id: I988286675d3398f2da8f6d2fb6462db08af8028d	2015-01-22 17:47:50 +01:00
Pascal Massimino	cb4a18a7ba	rename HashChainInit into HashChainReset this avoids the confusion with "VP8LHashChainInit" Change-Id: Ia1686828c138729e5bda3cc5c8246d99c80915ef	2015-01-20 00:38:07 -08:00
Pascal Massimino	f079e487ae	use uint16_t for chosen_path[] len is MAX_LENGTH (4096) at max. This reduce memory for path by a half. Change-Id: I399fda4093d93b1e9d956397b7b210956c5b948f	2015-01-20 00:34:09 -08:00
James Zern	f0e0677b87	VP8LEncodeStream: add an assert check enc->argb_ to quiet an msvs /analyze warning: C6387: 'enc->argb_+y*width' could be '0': this does not adhere to the specification for the function 'memcpy'. Change-Id: I87544e92ee0d3ea38942a475c30c6d552f9877b7	2015-01-16 18:16:40 -08:00
Vikas Arora	b9e356b998	Disable costly TraceBackwards for method=0. Disable costly TraceBackwards heuristic for computing the backward references for low_effort (method=0) compression. The TraceBackwards heuristic is already disabled for lower (q < 25) quality range. Following is the compression data for 1000 image corpus for q >= 25. This speeds up compression (q >= 25) by a factor of 2.5-3X with slight loss of compression density (0.7% for lower quality range and 1.2% for higher qualities). Change-Id: I256c9e2137c7de4083f423ea32ee12d3b0f46253	2015-01-15 09:01:40 -08:00
Vikas Arora	ea08466d34	Tune BackwardReferencesLz77 for low_effort (m=0). - Lower the threshold parameters for HashChainFindCopy. For 1000 image PNG corpus (m=0), this change yields speedup of 15-20% at lower quality range (0.25% drop in compression density) and about 10% for higher quality range without any drop in the compression density. Following is the compression stats (before/after) for method = 0: Before After bpp/MPs bpp/MPs q=0 2.8615/18.000 2.8651/18.631 q=5 2.8615/18.216 2.8650/20.517 q=10 2.8572/18.070 2.8650/21.992 q=15 2.8519/18.371 2.8584/21.747 q=20 2.8454/18.975 2.8515/20.448 q=25 2.8230/8.531 2.8253/9.585 // Compression density remains same for q-range [30-100] q=30 2.7310/7.706 2.7310/8.028 q=35 2.7253/6.855 2.7253/7.184 q=40 2.7231/6.364 2.7231/6.604 q=45 2.7216/5.844 2.7216/6.223 q=50 2.7196/5.210 2.7196/5.731 q=55 2.7208/4.766 2.7208/4.970 q=60 2.7195/4.495 2.7195/4.602 q=65 2.7185/4.024 2.7185/4.236 q=70 2.7174/3.699 2.7174/3.861 q=75 2.7164/3.449 2.7164/3.605 q=80 2.7161/3.222 2.7161/3.038 q=85 2.7153/2.919 2.7153/2.946 q=90 2.7145/2.766 2.7145/2.771 q=95 2.7124/2.548 2.7124/2.575 q=100 2.6873/2.253 2.6873/2.335 Change-Id: I0e17581fb71f6094032ad06c6203350bd502f9a1	2015-01-08 00:30:21 -08:00
Vikas Arora	b0b973c39b	Speedup VP8LGetHistoImageSymbols for low effort (m=0) mode. - Do light weight entropy based histogram combine and leave out CPU intensive stochastic and greedy heuristics for combining the histograms. For 1000 image PNG corpus (m=0), this change yields speedup of 10% at lower quality range (1% drop in compression density) and about 5% for higher quality range (1% drop in compression density). Following is the compression stats (before/after) for method = 0: Before After bpp/MPs bpp/MPs q=0 2.8336/16.577 2.8615/18.000 q=5 2.8336/16.504 2.8615/18.216 q=10 2.8293/16.419 2.8572/18.070 q=15 2.8242/17.582 2.8519/18.371 q=20 2.8182/16.131 2.8454/18.975 q=25 2.7924/7.670 2.8230/8.531 q=30 2.7078/6.635 2.7310/7.706 q=35 2.7028/6.203 2.7253/6.855 q=40 2.7005/6.198 2.7231/6.364 q=45 2.6989/5.570 2.7216/5.844 q=50 2.6970/5.087 2.7196/5.210 q=55 2.6963/4.589 2.7208/4.766 q=60 2.6949/4.292 2.7195/4.495 q=65 2.6940/3.970 2.7185/4.024 q=70 2.6929/3.698 2.7174/3.699 q=75 2.6919/3.427 2.7164/3.449 q=80 2.6918/3.106 2.7161/3.222 q=85 2.6909/2.856 2.7153/2.919 q=90 2.6902/2.695 2.7145/2.766 q=95 2.6881/2.499 2.7124/2.548 q=100 2.6873/2.253 2.6873/2.285 Change-Id: I0567945068f8dc7888041e93d872f9def91f50ba	2015-01-08 00:29:57 -08:00
Pascal Massimino	72d573f693	simplify the PackARGB signature Change-Id: I51570e362126b2681f93211a4f59a3fedb5fd4b5	2015-01-05 02:10:04 -08:00
Pascal Massimino	1f4b8642e8	move VP8EncDspARGBInit() call closer to where it's needed Change-Id: I0d5121b456918f0ee6646903a8d71d4384deafe3	2014-12-23 16:04:14 +01:00
Djordje Pesut	7ce8788b06	MIPS: dspr2: added optimization for function MakeARGB32 inline function MakeARGB32 calls changed to call via pointers to functions which make (a)rgb for entire row Change-Id: Ia4bd4be171a46c1e1821e408b073ff5791c587a9	2014-12-22 12:31:36 +01:00
Pascal Massimino	24284459c7	replace unneeded calls to HistogramCopy() by swaps most of the time, we don't need to actually move the data. Compression is randomly slightly different, because HistogramCompactBins() changed. Timing is about the same. Change-Id: Ia6af8e9780581014d6860f2b546189ac817cfad1	2014-12-17 15:32:36 +01:00
Pascal Massimino	31a9cf6417	Speedup WebP lossless compression for low effort (m=0) mode with following: - Disable Cross-Color transform. - Evaluate predictors #11 (paeth), #12 and #13 only. Change-Id: I857264c85c61c3957d4fb45ae32d261d947c8bed	2014-12-17 11:52:11 +01:00
Pascal Massimino	bad775715a	simplify the Histogram struct, to only store max_value and last_nz we don't need to store the whole distribution in order to compute the alpha Later, we can incorporate the max_value / last_non_zero bookkeeping in SSE2 directly. Change-Id: I748ccea4ac17965d7afcab91845ef01be3aa3e15	2014-12-10 10:44:57 +01:00
James Zern	9475bef4d7	PickBestUV: fix VP8Copy16x8 invocation param order is src, dst broken in: 66ad372 factorize BPS definition in dsp.h and add VP8Copy16x8 Change-Id: I761f618e3fe31ae7f58953256381f4f16bdb238e	2014-12-04 23:12:30 -08:00
Pascal Massimino	66ad372500	factorize BPS definition in dsp.h and add VP8Copy16x8 Change-Id: Id73a1e968c96455808755df4d131d74e3e2e135d	2014-12-04 13:45:14 +01:00
Pascal Massimino	57606047ec	encoder: switch BPS to 32 instead of 16 this is a first step to unifying encoding/decoding cache stride and possibly sharing the prediction functions in dsp/ With this layout, there's a little (~7%) space lost with unused samples. But no speed change was observed. Change-Id: I016df8cad41bde5088df3579e6ad65d884ee711e	2014-12-04 09:17:18 +01:00

1 2 3 4 5 ...

616 Commits