752 Commits

Vincent Rabaud
d963775859 Compute the hash chain once and for all for lossless compression.
In some cases, the hash chain is filled several times along
different call paths:
- GetBackwardReferences -> CalculateBestCacheSize ->
BackwardReferencesLz77 that computes the hash chain
- GetBackwardReferences ->
(not always) BackwardReferencesTraceBackwards ->
BackwardReferencesHashChainDistanceOnly that computes the hash
chain in a slightly different way

Speed and compression performance change slightly (+ or -)
but will be homogenized in a later patch.
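
A minimal, self-contained sketch of the target design (names and constants are illustrative, not libwebp's): the chain is filled once up front, and every pass (cache-size estimation, LZ77, trace-backwards) then reads it as-is.

    #include <stdint.h>
    #include <stdlib.h>

    #define HASH_BITS 18
    #define HASH_SIZE (1 << HASH_BITS)

    // Hash two consecutive ARGB pixels into HASH_BITS bits.
    static uint32_t HashPix(const uint32_t* const argb, int pos) {
      const uint64_t v = ((uint64_t)argb[pos + 1] << 32) | argb[pos];
      return (uint32_t)((v * 0x9e3779b97f4a7c15ull) >> (64 - HASH_BITS));
    }

    // chain[i] = previous position with the same two-pixel hash, or -1.
    // Filled exactly once; all backward-reference passes share it read-only.
    static int FillHashChainOnce(int32_t* const chain,
                                 const uint32_t* const argb, int size) {
      int i;
      int32_t* const head = (int32_t*)malloc(HASH_SIZE * sizeof(*head));
      if (head == NULL) return 0;
      for (i = 0; i < HASH_SIZE; ++i) head[i] = -1;
      if (size > 0) chain[size - 1] = -1;  // last pixel has no pair to hash
      for (i = 0; i + 1 < size; ++i) {
        const uint32_t h = HashPix(argb, i);
        chain[i] = head[h];
        head[h] = i;
      }
      free(head);
      return 1;
    }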

Change-Id: I43f0ecc7a9312c2ed6cdba1c0fabc6c5ad91c953
2016-06-03 11:42:13 +02:00
Pascal Massimino
ca8d951980 remove some obsolete TODOs
Change-Id: Ied77b2dd7e3e5bb65524c0ac7b9a3fb6585cac57
2016-06-01 16:23:16 +02:00
Pascal Massimino
fdd29a3d3f speed-up MapToPalette() with binary search
goes from ~2% CPU to ~0.7% for large images
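
The idea, as a hedged self-contained sketch: with the palette kept sorted, each pixel's palette index is found by binary search rather than a linear scan.

    #include <stdint.h>

    // Returns the index of 'color' in the sorted palette, assuming it is
    // present (which MapToPalette can guarantee for palettized images).
    static int SearchColor(const uint32_t sorted[], uint32_t color, int num) {
      int low = 0, hi = num;
      while (low < hi) {
        const int mid = (low + hi) >> 1;
        if (sorted[mid] < color) low = mid + 1; else hi = mid;
      }
      return low;
    }
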
Change-Id: Ibd8a0fde9ba553f93157a49dcb7da0426209e404
2016-05-23 15:39:54 +02:00
Pascal Massimino
2ec2de1450 Merge "Speed-up BackwardReferencesHashChainDistanceOnly." 2016-05-19 05:17:03 +00:00
Vincent Rabaud
3e023c17cd Speed-up BackwardReferencesHashChainDistanceOnly.
Instead of comparing all the following pixels over the match length
len (which can frequently reach the maximum MAX_LENGTH = 4096 for some
images), intervals are stored and compared.

Change-Id: I0dafef6cc988dde3c1c03ae07305ac48901d60ee
2016-05-19 04:51:13 +00:00
Marcin Kowalczyk
f2e1efbeb7 Improve near lossless compression when a prediction filter is used.
The old implementation in enc/near_lossless.c, which performs a
separate preprocessing step, is now used only when a prediction filter
is not used; otherwise, a new implementation integrated into
lossless_enc.c is used.

It retains the same logic for converting near lossless quality into max
number of bits dropped, and for adjusting the number of bits based on
the smoothness of the image at a given pixel. As before, borders are not
changed.

Then, instead of quantizing raw component values, the residual after
subtract green and after prediction is quantized according to the
resulting number of bits, taking care to not cross the boundary between
255 and 0 after decoding. Ties are resolved by moving closer to the
prediction instead of by bankers’ rounding.

This results in about 15% size decrease for the same quality.
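
A much-simplified, hypothetical sketch of the quantization step (the real code additionally keeps the reconstructed value from wrapping across the 255/0 boundary in mod-256 arithmetic):

    // Quantize a post-prediction residual to the allowed number of bits;
    // truncation toward zero moves the reconstructed value toward the
    // prediction, in the spirit of the tie-breaking described above.
    static int QuantizeResidual(int residual, int num_bits_dropped) {
      const int step = 1 << num_bits_dropped;
      return (residual / step) * step;  // C division truncates toward zero
    }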

Change-Id: If3e9c388158c2e3e75ef88876703f40b932f671f
2016-05-18 20:59:02 +00:00
James Zern
9629f4bcda SimplifySegments: quiet -Warray-bounds warning
the number of segments is validated previously, but an explicit check
is needed to avoid a warning under gcc-4.9
this is similar to the changes made in:
c8a87bb AssignSegments: quiet -Warray-bounds warning
3e7f34a AssignSegments: quiet array-bounds warning
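
The shape of such a fix, as a hedged sketch (NUM_MB_SEGMENTS is 4 in libwebp's vp8enci.h): the clamp is never taken in practice, it only makes the bound visible to the compiler.

    #define NUM_MB_SEGMENTS 4

    static int ClampNumSegments(int num_segments) {
      // Already validated by the caller; the explicit check quiets gcc-4.9.
      return (num_segments > NUM_MB_SEGMENTS) ? NUM_MB_SEGMENTS : num_segments;
    }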

Change-Id: Iec7d470be424390c66f769a19576021d0cd9a2fd
2016-05-02 12:17:49 -07:00
James Zern
ec1b2407a4 WebPPictureImport*: check output pointer
fixes crash with NULL output pointer in calls to simple encode api
(WebPEncodeRGB, etc.)

Change-Id: I91e7a1c0e070ea842b0a2a4ac54e981cac8629bf
2016-03-25 18:21:13 -07:00
Pascal Massimino
c07687699b Merge "Revert "Re-enable encoding of alpha plane with color cache for next release."" 2016-03-25 07:29:07 +00:00
James Zern
41f14bcbc5 WebPPictureImport*: check src pointer
fixes crash with NULL source pointer in calls to simple encode api
(WebPEncodeRGB, etc.)

Change-Id: I706d670c80298da5176aaa5ba0eb2238dd71a8f0
2016-03-24 22:52:01 -07:00
Pascal Massimino
97934e2447 Revert "Re-enable encoding of alpha plane with color cache for next release."
This avoids generating files that would trigger a decoding bug
found in libwebp versions 0.4.0 -> 0.4.3.

This reverts commit 6ecd72f845d0c3604f2cccd2b31dc650627d3b2f.

Change-Id: I4667cc8f7b851ba44479e3fe2b9d844b2c56fcf4
2016-03-18 11:01:54 +01:00
Pascal Massimino
e88c4ca013 fix -m 2 mode-cost evaluation (causing partition0 overflow)
The mode bits were not taken into account, which is fine in most cases.
But for super large images with 'easy' content, their overhead starts
to matter a lot and we were failing to optimize for these.
Now, these mode bits have their own associated lambda values, limiting
the jerkiness. We also limit (for -m 2 only) the individual number of
bits to something that will prevent the partition 0 overflow.

Removed the I4_PENALTY constant, which was a rather crude
approximation. It is replaced by a q-dependent expression.

fixes issue #289
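
Schematically (a hedged sketch, not libwebp's actual cost function), the score now charges the mode's own header bits with a dedicated lambda, so 'easy' macroblocks with near-zero coefficient cost still pay for the mode signaling:

    // Rate-distortion score with a separate multiplier for mode bits.
    static long RdScore(long distortion, long coeff_bits, long mode_bits,
                        long lambda, long lambda_mode) {
      return distortion + lambda * coeff_bits + lambda_mode * mode_bits;
    }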

Change-Id: I956ae2d2308c339adc4706d52722f0bb61ccf18c
2016-03-11 20:34:45 +01:00
James Zern
71e856cf84 GetMBSSIM,cosmetics: fix alignment
Change-Id: I884b3361484b48917fa4cba33cd1217ac51685f9
2016-03-08 23:26:10 -08:00
Pascal Massimino
423ecaf484 move some SSIM-accumulation functions to dsp/
This is in preparation for some SSE2 code.

And generally speaking, the whole SSIM code needs some
revamp: we're not averaging the SSIM value at each pixel
but just computing the overall SSIM value once for the whole
plane. The former might be better than the latter.

Change-Id: I935784a917f84a18ef08dc5ec9a7b528abea46a5
2016-03-08 07:50:09 +01:00
Marcin Kowalczyk
e8feb20e39 Fix FindClosestDiscretized in near lossless:
- The result is now indeed the closest among possible results for all
  inputs, which was not the case for bits > 4, where the mapping was not
  even monotonic because GetValAndDistance was correct only if the
  significant part of the initial value fit in a byte at most twice.

- The set of results for a larger number of bits dropped is a subset of values
  for a smaller number of bits dropped. This implies that subsequent
  discretizations for a smaller number of bits dropped do not change already
  discretized pixels, which improves the quality (changes do not accumulate)
  and compression density (values tend to repeat more often).

- Errors are more fairly distributed between upwards and downwards thanks to
  bankers’ rounding, which avoids images getting darker or lighter overall.

- Deltas between discretized values are more repetitive. This improves
  compression density if delta encoding is used.

Also, the implementation is much shorter now.
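
A sketch along the lines of the new implementation: drop 'bits' low bits with round-half-to-even (bankers' rounding) and clamp at 255.

    #include <stdint.h>

    static uint32_t FindClosestDiscretized(uint32_t a, int bits) {
      const uint32_t mask = (1u << bits) - 1;
      // '(a >> bits) & 1' breaks exact ties toward the even multiple.
      const uint32_t biased = a + (mask >> 1) + ((a >> bits) & 1);
      return (biased > 0xff) ? 0xff : (biased & ~mask);
    }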

Change-Id: I0a98e7d5255e91a7b9c193a156cf5405d9701f16
2016-03-02 12:47:22 +01:00
James Zern
a5193774b0 Merge "Near lossless feature: fix some comments." 2016-02-20 03:51:15 +00:00
Urvang Joshi
3335713169 Near lossless feature: fix some comments.
Change-Id: I2c5fc2a3b3fe5123d66b42bf148e361b4862dfb9
2016-02-19 19:26:39 -08:00
James Zern
0beed01aa5 cosmetics: fix indent after 2f5e898
2f5e898 fix multiple allocation for transform buffer

Change-Id: Ied5c89c0040671e2eddf23c8b7a78e0d817dd18e
2016-02-19 19:22:34 -08:00
James Zern
db86088426 Merge "remove useless #include" 2016-02-17 23:17:12 +00:00
Pascal Massimino
2f5e8986cf fix multiple allocation for transform buffer
We were not updating current_width_, which is usually not a problem
unless we use delta palette with a small number of colors.
-> Addressed this re-entrancy problem by checking that we have
enough capacity for the transform buffer.

The problem is not currently visible until we restrict
the number of gradients used in delta-palette to less than 16.
Then the buffers have different current_width_ and the problem
surfaces.
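
The shape of the fix, as a hedged sketch with hypothetical names: check the buffer's capacity before reuse instead of assuming the allocation from a previous pass is large enough.

    #include <stdint.h>
    #include <stdlib.h>

    static int EnsureTransformBuffer(uint32_t** const mem,
                                     size_t* const capacity, size_t needed) {
      if (needed > *capacity) {
        uint32_t* const new_mem =
            (uint32_t*)realloc(*mem, needed * sizeof(**mem));
        if (new_mem == NULL) return 0;  // out of memory
        *mem = new_mem;
        *capacity = needed;
      }
      return 1;
    }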

Change-Id: Icd84b919905d7789014bb6668bfb6813c93fb36e
2016-02-17 06:14:39 +01:00
Pascal Massimino
75f4af4d54 remove useless #include
Change-Id: Id34f12ec94d8be6853fabd67609a6006ac99f152
2016-01-25 09:34:10 -08:00
Vincent Rabaud
8ce975ac82 SSE optimization for vector mismatch.
Change-Id: I564b822033b59d86635230f29ed6197e306a2c4f
2016-01-07 18:23:45 +01:00
James Zern
71100500a8 bump version to 0.5.0
libwebp{,decoder} - 0.5.0
libwebp libtool - 6.0.0
libwebpdecoder libtool - 2.0.0

mux/demux - 0.3.0
libtool - 2.0.0

Change-Id: I5346d13eb827fb5890efbb63ff3f28cea9d0c55f
2015-12-17 19:45:14 -08:00
James Zern
99a01f4f8b Merge "Unify some entropy functions." 2015-12-17 22:35:29 +00:00
James Zern
4b025f10f7 Merge "configure: disable asserts by default" 2015-12-17 22:28:37 +00:00
James Zern
92cbddf89c Merge "fix PrintBlockInfo()" 2015-12-17 21:00:57 +00:00
Vincent Rabaud
ca509a3362 Unify some entropy functions.
The code and logic are unified when computing bit entropy + Huffman cost.

Speed-wise, we gain 8% for lossless encoding.
Logic-wise, the beginning/end of the distributions are handled properly
and the compression ratio does not change much.
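
For reference, a textbook sketch of the quantity being unified: the Shannon bit entropy of a symbol histogram, i.e. a lower bound on the cost of Huffman-coding that distribution.

    #include <math.h>
    #include <stdint.h>

    static double BitsEntropy(const uint32_t* const counts, int n) {
      double sum = 0., entropy = 0.;
      int i;
      for (i = 0; i < n; ++i) sum += counts[i];
      if (sum == 0.) return 0.;
      for (i = 0; i < n; ++i) {
        if (counts[i] > 0) entropy -= counts[i] * log2((double)counts[i]);
      }
      return entropy + sum * log2(sum);  // = -sum_i c_i * log2(c_i / sum)
    }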

Change-Id: Ifa91d7d3e667c9a9a421faec4e845ecb6479a633
2015-12-17 17:00:08 +01:00
Pascal Massimino
367bf903b3 fix PrintBlockInfo()
... which has gone out of sync since the last block-cache layout change.

Change-Id: Ic441ec07b0198b508ce3fd34ab582cb60b1daabc
2015-12-17 15:47:25 +01:00
Lode Vandevenne
fb4c7832f1 lossless: simpler alpha cleanup preprocessing
Set all transparent pixels to black rather than using the "flatten" method.

0.3% smaller filesize on the 1000 PNGs if alpha cleanup is used (before: 18685774, after: 18622472)
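
A hedged sketch of the simpler preprocessing, assuming 8-bit ARGB with alpha in the top byte:

    #include <stdint.h>

    static void SetTransparentPixelsToBlack(uint32_t* const argb,
                                            int num_pixels) {
      int i;
      for (i = 0; i < num_pixels; ++i) {
        if ((argb[i] >> 24) == 0) argb[i] = 0x00000000u;  // -> black
      }
    }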

Change-Id: Ib0db9e7ccde55b36e82de07855f2dbb630fe62b1
2015-12-17 15:04:50 +01:00
Vincent Rabaud
47ddd5a4cc Move some codec logic out of ./dsp.
The functions containing magic constants are moved out of ./dsp.
VP8LPopulationCost got put back in ./enc
VP8LGetCombinedEntropy is now unrefined (refinement happening in ./enc)
VP8LBitsEntropy is now unrefined (refinement happening in ./enc)
VP8LHistogramEstimateBits got put back in ./enc
VP8LHistogramEstimateBitsBulk got deleted.

Change-Id: I09c4101eebbc6f174403157026fe4a23a5316beb
2015-12-17 07:03:25 +00:00
James Zern
b9d80fa4e8 configure: disable asserts by default
--enable-asserts can be used to avoid defining NDEBUG

Change-Id: I6216668e3f79f69bd8c453f0b36cecb3b585688e
2015-12-16 13:15:53 -08:00
Pascal Massimino
7badd3da4a cosmetic fix: sizeof(type) -> sizeof(*var)
Change-Id: I1a39fccfdcb9f0a4b9b025d3c9b522e8edfe7fd6
2015-12-16 18:29:14 +01:00
Pascal Massimino
e0c0bb3480 remove TODO about unused ref_lf_delta[]
Change-Id: I54983c0dfc6927564143bad56bd2e4c4cdfefc0e
2015-12-15 22:57:53 -08:00
Pascal Massimino
9cf1cc2bd6 remove a few TODOs:
* 256 -> RD_DISTO_MULT
* don't use TDisto for UV mode picking

Change-Id: I243148c716fe688b5c1b1fb9b7a6e58d0b5e6835
2015-12-15 22:52:12 -08:00
James Zern
d4f9c2efd4 enc/Makefile.am: add missing headers
Change-Id: Ic29497f425909eda1a7f23e6c8e92bd4ca17d44b
2015-12-14 23:07:54 -08:00
Pascal Massimino
e6c9351918 add disto-based refinement for UV mode (if method = 1 or 2)
This doesn't slow down much and gives some quality improvement.

Change-Id: I5afbe62b9c3922b3ec1bf6538c68dcdb0f25d2e4
2015-12-11 03:15:59 -08:00
Vincent Rabaud
d3d163972f Optimize the heap usage in HistogramCombineGreedy.
The previous priority system used a heap that was too heavy to
maintain (what was gained from insertions / deletions was lost
due to a linear scan that still happened on the heap for invalidation).
The new structure is a priority queue where only the head is
ordered.
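
A hedged sketch of the "only the head is ordered" structure: candidate pairs live in an unordered array except that the best one is kept at index 0, so insertion is O(1) and invalidating arbitrary entries needs no re-heapify.

    typedef struct { int idx1, idx2; double cost_diff; } PairInfo;

    static void QueuePush(PairInfo* const queue, int* const size,
                          PairInfo pair) {
      queue[(*size)++] = pair;                    // append at the end
      if (pair.cost_diff < queue[0].cost_diff) {  // better than the head?
        queue[*size - 1] = queue[0];              // demote the old head
        queue[0] = pair;                          // keep the best pair first
      }
    }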

Change-Id: Id13f8694885a934fe2b2f115f8f84ada061b9016
2015-12-10 12:44:11 +01:00
Pascal Massimino
14d27a46be improve method #2 by merging DistoRefine() and SimpleQuantize()

It's now a single function that reconstructs the intra4x4 block during
the scan. The I4_PENALTY had to be adjusted.

Overall, the result is better quality-wise (esp. at q < 50), and a tad faster too.

Methods #0, #1 and #3+ are unchanged.

Change-Id: If262aeb552397860b3dd532df8df6b1357779222
2015-12-10 08:04:04 +01:00
Pascal Massimino
7eb01ff3e8 Merge "Improved alpha cleanup for the webp encoder when prediction transform is used." 2015-12-08 11:32:37 +00:00
Pascal Massimino
fb8c9106c7 Merge "introduce WebPMemToUint32 and WebPUint32ToMem for memory access" 2015-12-08 11:32:05 +00:00
Vincent Rabaud
6c702b81ac Speed up hash chain initialization using memset.
That gains 1% on lossy compression.
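
The trick, as a hedged sketch: head entries are initialized to -1, and 0xff bytes spell -1 in two's complement, so a single memset replaces the per-entry loop.

    #include <stdint.h>
    #include <string.h>

    static void InitHashHeads(int32_t* const heads, size_t n) {
      memset(heads, 0xff, n * sizeof(*heads));  // every entry becomes -1
    }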

Change-Id: Ib9aa210194ed2f17eaff85b499b55cc4eb99ff11
2015-12-07 11:54:50 +01:00
Lode Vandevenne
6938111357 Improved alpha cleanup for the webp encoder when prediction transform is used.
Gives 0.9% smaller (2.4% compared to before alpha cleanup) size on the 1000 PNGs dataset:
Alpha cleanup before: 18856614
Alpha cleanup after: 18685802
For reference, with no alpha cleanup: 19159992

Note: WebPCleanupTransparentArea is still also called in WebPEncode. This
cleanup still helps preprocessing in the encoder, and the cases where the
prediction transform is not used.

Change-Id: I63e69f48af6ddeb9804e2e603c59dde2718c6c28
2015-12-04 13:50:56 +00:00
Pascal Massimino
2c08aac81a introduce WebPMemToUint32 and WebPUint32ToMem for memory access
they use memcpy() when unaligned memory access is tricky
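
The helpers are along these lines: memcpy() compiles down to a plain load/store on architectures that allow unaligned access, and stays correct (if slower) everywhere else.

    #include <stdint.h>
    #include <string.h>

    static uint32_t MemToUint32(const uint8_t* const ptr) {
      uint32_t A;
      memcpy(&A, ptr, sizeof(A));
      return A;
    }

    static void Uint32ToMem(uint8_t* const ptr, uint32_t val) {
      memcpy(ptr, &val, sizeof(val));
    }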

Change-Id: I5d966ca9d19e9b43ac90140fa487824116982874
2015-12-04 13:43:01 +00:00
Vincent Rabaud
010ca3d10d Fix FindMatchLength with non-aligned buffers.
The 32-bit buffers are actually rarely 64-bit aligned.
The new solution uses memcmp and is alignment agnostic.
It is also slightly faster.
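
A hedged sketch of the alignment-agnostic approach: memcmp() over pairs of 32-bit pixels never assumes 64-bit alignment, yet lets the compiler emit wide comparisons where the platform allows them.

    #include <stdint.h>
    #include <string.h>

    // Returns the number of leading pixels on which 'a' and 'b' agree.
    static int FindMatchLength(const uint32_t* const a,
                               const uint32_t* const b, int max_len) {
      int i = 0;
      while (i + 2 <= max_len && !memcmp(&a[i], &b[i], 2 * sizeof(*a))) i += 2;
      while (i < max_len && a[i] == b[i]) ++i;  // finish pixel by pixel
      return i;
    }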

Change-Id: I863003e9ee4ee8a3eed25b7b2478cb82a0ddbb20
2015-12-04 10:19:58 +01:00
Scott Hancher
5ae220bef6 backward_references.c: Fixed compiler warning
"Implicit conversion loses integer precision: 'long' to 'int'."

Change-Id: I1aec7431f84123e5280447883eb80b84a3821d91
2015-12-02 23:51:06 -08:00
Vincent Rabaud
a141178255 Optimization in hash chain comparison for 64-bit
Arrays were compared 32 bits at a time; it is now done 64 bits at a time.
The overall encoding speed-up is only 0.2% on @skal's small PNG corpus,
but it is 3% on my initial 1.3 Mp desktop screenshot image.

Change-Id: I1acb32b437397a7bf3dcffbecbcd4b06d29c05e1
2015-12-01 13:01:57 +01:00
Lode Vandevenne
239421c5ef lossless: make prediction in encoder work per scanline
instead of per block. This prepares for a next CL that can make the
predictors alter RGB value behind transparent pixels for denser
encoding. Some predictors depend on the top-right pixel, and it must
have been already processed to know its new RGB value, so requires per
scanline instead of per block.

Running the encode speed test on 1000 PNGs 10 times with default
settings:
Before:
Compression (output/input): 2.3745/3.2667 bpp, Encode rate (raw data): 1.497 MP/s
After:
Compression (output/input): 2.3745/3.2667 bpp, Encode rate (raw data): 1.501 MP/s

Same but with quality 0, method 0 and 30 iterations:
Before:
Compression (output/input): 2.9120/3.2667 bpp, Encode rate (raw data): 36.379 MP/s
After:
Compression (output/input): 2.9120/3.2667 bpp, Encode rate (raw data): 36.462 MP/s

No effect on compressed size: this produces exactly the same files. No
significant measured effect on speed either: the speed-up expected from
the better memory layout of scanline processing and the slow-down from
fetching the predictor mode per pixel may compensate each other.
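
Schematically (a hedged sketch with hypothetical helpers), the predictor mode now has to be fetched per pixel inside a scanline loop instead of being hoisted out of a per-block loop:

    #include <stdint.h>

    // Per-channel subtraction mod 256, as used by lossless prediction.
    static uint32_t Subtract(uint32_t pix, uint32_t pred) {
      uint32_t out = 0;
      int shift;
      for (shift = 0; shift < 32; shift += 8) {
        out |= (((pix >> shift) - (pred >> shift)) & 0xffu) << shift;
      }
      return out;
    }

    // Stand-in predictor so the sketch is self-contained: 'left' pixel.
    static uint32_t Predict(int mode, const uint32_t* const argb,
                            int x, int y, int w) {
      (void)mode;
      if (x > 0) return argb[y * w + x - 1];
      return (y > 0) ? argb[(y - 1) * w] : 0xff000000u;
    }

    static void PredictScanlines(const uint32_t* const argb,
                                 uint32_t* const out, const int* const modes,
                                 int tiles_per_row, int bits,
                                 int width, int height) {
      int x, y;
      for (y = 0; y < height; ++y) {
        for (x = 0; x < width; ++x) {
          const int mode = modes[(y >> bits) * tiles_per_row + (x >> bits)];
          out[y * width + x] =
              Subtract(argb[y * width + x], Predict(mode, argb, x, y, width));
        }
      }
    }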

Change-Id: I40f766f1c1c19f87b62c1e2a1c4cd7627a2c3334
2015-11-25 00:38:27 -08:00
Pascal Massimino
3770f3bbb6 Merge "cleanup the YFIX/TFIX difference by removing some code and #define" 2015-11-23 20:47:42 +00:00
Pascal Massimino
997e103871 cleanup the YFIX/TFIX difference by removing some code and #define
no speed or output difference

Change-Id: I50bfb44f357e19431457b1cf9504a5a6bcce1945
2015-11-21 23:51:58 -08:00
Lode Vandevenne
1f9be97c22 Make discarding invisible RGB values (cleanup alpha) the default.
Rename the flag to 'exact' instead of the opposite 'cleanup_alpha'. Add the
flag to WebPConfig. Do the cleanup in the webp encoder library rather than in
the cwebp binary; this will be needed for the next stage: smarter alpha
cleanup for better compression, which cannot be done as preprocessing since
it depends on predictor choices in the encoder.
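
Usage sketch with the new flag (the 'exact' field added to WebPConfig by this change): opting back into keeping the RGB values of transparent pixels.

    #include "webp/encode.h"

    static int SetupConfig(WebPConfig* const config) {
      if (!WebPConfigInit(config)) return 0;  // ABI version check + defaults
      config->exact = 1;  // keep RGB under transparent pixels
                          // (cleanup is now the default)
      return WebPValidateConfig(config);
    }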

Change-Id: I2fbf57f918a35f2da6186ef0b5d85e5fd0020eef
2015-11-21 12:32:32 -08:00