libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2026-03-02 21:22:19 +01:00

Author	SHA1	Message	Date
James Zern	201894ef24	dsp/dec*: use WEBP_RESTRICT qualifier A minor improvement for arm targets with ndk r27/gcc-13 in H/VFilter8 (a couple fewer moves w/aarch64) and much better vectorization of DitherCombine8x8_C in most targets. This only affects non-vector pointers; any vector pointers are left as a follow up. Change-Id: I03e73e6d6404261bb8408a9ae76a4b6ef142f8f0	2024-10-02 14:55:14 -07:00
James Zern	02eac8a741	dsp/cost: use WEBP_RESTRICT qualifier on SetResidualCoeffs_. This results in some minor code reordering when targeting armv7 with ndk r27 and other recent versions of clang. No changes in the x86 compilations with clang-16 / gcc-13. This only affects non-vector pointers; any vector pointers are left as a follow up. Change-Id: I7c3554ece848fafbc5ac9c4944f1dc85129f6fd8	2024-10-02 14:55:14 -07:00
Vincent Rabaud	220ee52967	Search for best predictor transform bits This is useful in cruncher mode. Change-Id: I8586bdbf464daf85db381ab77a18bf63dd48f323	2024-09-24 10:44:22 +02:00
Vincent Rabaud	7861947813	Try to reduce the sampling for the entropy image This offers minor compression improvements. Change-Id: I4b3b1bb11ee83273c0e4c9f47e53b21cf7cd5f76	2024-09-24 10:28:43 +02:00
Vincent Rabaud	a78c5356ba	Remove a useless malloc for entropy image histogram_symbols is converted to uint32_t and <<8 into histogram_argb. Using a uint32_t buffer from the start prevents copying and converting the data. Change-Id: I245003a6a0f048c31519afa25a600d4479e762e3	2024-09-18 22:38:11 +02:00
Vincent Rabaud	367ca938f1	Refactor predictor finding This is useful for a forward change that will improve compression. It splits the residual computation and the best predictor selection. The only downside is that more memory is allocated: we had 2 histograms before, we now have 14, but this is necessary for the later change. Still, this is nothing compared to what is done later in the pipeline in HistogramSetTotalSize where the number of histograms created is the number of pixels in the subsampled image. Change-Id: If03501a26f00462dd1809daa6e9314abd180945d	2024-09-17 09:49:43 +02:00
James Zern	f888291359	anim_encode.c: fix function ref in comment WebPCleanupTransparentAreaLossless() was renamed to WebPReplaceTransparentPixels() in: `55a080e5` Add WebPReplaceTransparentPixels() in dsp Change-Id: I91e32574e6add2748c0655146f100eb2b40498b2	2024-09-09 19:28:12 -07:00
Vincent Rabaud	2e81017c7a	Convert predictor_enc.c to fixed point Also remove the last float in histogram_enc.c Change-Id: I6f647a5fc6dd34a19292820817472b4462c94f49	2024-08-30 09:22:48 +02:00
Vincent Rabaud	8e0cc14c3e	Fix static overflow warning. In practice, this can never happen because: - 'streak' is at most as long as a histogram - 'count' counts the number of streaks 'streak' and 'count' are therefore at most as big as the histogram length which is at most the max of VP8LHistogramNumCodes, which is 256+24+(1<<10). Change-Id: I31c8834543479c8a9260732313ea26b045519515	2024-08-28 10:23:54 +02:00
James Zern	615e58744f	Merge "make VP8LPredictor[01]_C() static" into main	2024-08-22 17:35:52 +00:00
James Zern	233e86b91f	Merge changes Ie43dc5ef,I94cd8bab into main * changes: DoFilter_: remove row & num_rows parameters Do*Filter_C: remove dead 'inverse' code paths	2024-08-19 18:51:06 +00:00
James Zern	1a29fd2fc3	make VP8LPredictor[01]_C() static Only predictors 2-13 are reused in lossless_enc.c. Change-Id: Ia3a7342fccfb44b9ad5297f48d6be2d96af68ec8	2024-08-16 10:58:45 -07:00
James Zern	dd9d3770d7	DoFilter_: remove row & num_rows parameters The row parameter became a constant in: `2102ccd` update the Unfilter API in dsp to process one row independently num_rows is always equal to height. Change-Id: Ie43dc5ef222e442ce8c92766da0b9824ccbca236	2024-08-12 19:36:31 -07:00
James Zern	ab451a495c	Do*Filter_C: remove dead 'inverse' code paths The inverse parameter became a constant in: `2102ccd` update the Unfilter API in dsp to process one row independently The row parameter to these functions is in a similar state; it will be removed in a follow up. Change-Id: I94cd8babe0e42474ff794ba5fa29dd48039de5f8	2024-08-08 18:13:48 -07:00
James Zern	f9a480f7c3	{TrueMotion,TM16}_NEON: remove zero extension Replace vmovl_u8 -> s16 + signed vaddq with unsigned vaddw. No change in assembly with clang-16 (armv7 & aarch64) and gcc-13 (aarch64). armv7 gcc-13 had kept the vmovl instructions, those are now gone. Change-Id: Ibb4fbdd5680d3e9dd06933c100528a6f363de472	2024-08-07 16:43:14 -07:00
James Zern	04834acae7	Merge changes I25c30a9e,I0a192fc6,I4cf89575 into main * changes: WASM: Enable VP8L_USE_FAST_LOAD WASM: don't use USE_GENERIC_TREE WASM: Enable 64-bit BITS caching	2024-08-01 18:36:34 +00:00
Vincent Rabaud	74be8e22d9	Fix implicit conversion issues Change-Id: If2cc8a137371ef365cf4a9c55f1b6ab131fba564	2024-07-25 22:30:15 +02:00
Vincent Rabaud	f2d6dc1eef	Increase the transform bits if possible. This brings minor size improvements because repetitive values in the transform images are easily explainable through LZ77. Still, it makes an upcoming pull request a bit more stable. This is a rollforward of `7ec51c5916` `ee26766a89` Change-Id: I254ab3ccd5053344f89099280e8d994ecd55aee0	2024-07-19 23:22:27 +02:00
wrv	8a7c8dc662	WASM: Enable VP8L_USE_FAST_LOAD It is 2-5% faster to use VP8L fast load on WASM Bug: webp:643 Change-Id: I25c30a9e6bcfc7cadd640122579eeebcb37e6fc0	2024-07-15 14:41:36 -05:00
wrv	f0c53cd966	WASM: don't use USE_GENERIC_TREE It is 2-4% faster to use hard-coded tree on WASM Bug: webp:643 Change-Id: I0a192fc6af210c79814a81084cd1f199714bf46c	2024-07-15 14:41:14 -05:00
wrv	eef903d04a	WASM: Enable 64-bit BITS caching Bug: webp:643 Change-Id: I4cf89575e0ebcfeaf9d84be8e188863657893a07	2024-07-15 14:40:45 -05:00
James Zern	6296cc8d0d	iterator_enc: make VP8IteratorReset() static This function is unused outside of iterator_enc.c. Change-Id: I0f1ecedeb9ed4d9f51d0135f04b8ef00424f24cc	2024-07-12 15:23:10 -07:00
James Zern	fbd93896a6	histogram_enc: make VP8LGetHistogramSize static This function is unused outside of histogram_enc.c. Change-Id: I527f54408383d0bc9d04878ca397a3d044b350de	2024-07-12 15:23:10 -07:00
James Zern	cc7ff5459a	cost_enc: make VP8CalculateLevelCosts[] static This table is unused outside of cost_enc.c. Change-Id: I0aa46554b8470fb09a7ffeae0e98d2356b40b671	2024-07-12 15:23:10 -07:00
James Zern	4e2828bae8	vp8l_dec: make VP8LClear() static This function is unused outside of vp8l_dec.c. Change-Id: I16733a44ea024ca9601c098641a3cd464bed2b53	2024-07-12 15:22:20 -07:00
James Zern	d742b24a88	Intra16Preds_NEON: fix truemotion saturation This needs to be done with signed saturation as the sum may be negative. fixes mismatch with C code after: `3bfb05e3` Add AArch64 Neon implementation of Intra16Preds Change-Id: I017e939d7155cc3489ceb76fc8ad50ac9917f23d	2024-07-11 13:37:06 -07:00
James Zern	c7bb4cb585	Intra4Preds_NEON: fix truemotion saturation This needs to be done with signed saturation as the sum may be negative. fixes mismatch with C code after: `baa93808` Add AArch64 Neon implementation of Intra4Preds Change-Id: I190c3d7f78cfd2c7ae83fb7059de41e307abda36	2024-07-11 13:37:06 -07:00
Vincent Rabaud	dde11574b0	Remove TODO now that log is using fixed point. Bug: webp:499 Change-Id: I39ab340ec6b5932db7535c6b7f31843c28de8415	2024-07-11 20:11:03 +00:00
James Zern	3bd9420289	Merge changes Iff6e47ed,I24c67cd5,Id781e761 into main * changes: Use QuantizeBlock_NEON for VP8EncQuantizeBlockWHT on Arm Add AArch64 Neon implementation of Intra16Preds Add AArch64 Neon implementation of Intra4Preds	2024-07-11 02:04:42 +00:00
Vincent Rabaud	d27d246e42	Merge "Convert VP8LFastSLog2 to fixed point" into main	2024-07-10 21:52:39 +00:00
Istvan Stefan	314a142a34	Use QuantizeBlock_NEON for VP8EncQuantizeBlockWHT on Arm Use the Neon implementation instead of falling back to QuantizeBlock_C. Change-Id: Iff6e47eda353cbaa9766f75040fa63aa34607816	2024-07-10 14:48:38 +01:00
Istvan Stefan	3bfb05e38c	Add AArch64 Neon implementation of Intra16Preds Add a Neon implementation of Intra16Preds for use on 64-bit Arm platforms. (This implementation cannot be used on 32-bit Arm platforms as it makes use of a number of AArch64-only Neon instructions.) Change-Id: I24c67cd54b66307e3924fd332c2795fd7422f082	2024-07-10 14:48:38 +01:00
Istvan Stefan	baa93808d9	Add AArch64 Neon implementation of Intra4Preds Add Neon implementation of Intra4Preds for use on 64-bit Arm platforms. (The same implementation cannot be used for 32-bit Arm platforms as it uses a number of AArch64-only Neon instructions.) Change-Id: Id781e7614f4e8e876dfeecd95cfc85e04611d8c6	2024-07-10 14:48:26 +01:00
Vincent Rabaud	41a5e582c2	Fix errors when compiling code as C++ Change-Id: Iba94e24e764038640f39d61fb2bc9cfb3434cc8f	2024-07-10 10:30:48 +02:00
Vincent Rabaud	fb444b692b	Convert VP8LFastSLog2 to fixed point Speedups: 1% with '-lossless', 2% with '-lossless -q 100 -m6' Change-Id: I1d79ea8e3e9e4bac7bcea4d7cbcc1bd56273988e	2024-07-09 16:42:21 +02:00
Vincent Rabaud	c1c89f5189	Fix WEBP_NODISCARD comment and C++ version Change-Id: I8b94974a46b7ac7d9bce179a48655ba8d9700edf	2024-07-09 14:24:00 +02:00
Vincent Rabaud	66408c2c7c	Switch the histogram_enc.h API to fixed point Speedups: 4% with '-lossless', 8% with '-lossless -q 100 -m6' Change-Id: I8f1c244b290d48132c1edc6a1c9fc3f79fef68ec	2024-07-09 13:39:45 +02:00
Vincent Rabaud	0a9f1c19f8	Convert VP8LFastLog2 to fixed point The lossless encoding speed-ups are: - up to 1% with default parameters - up to 4% in cruncher mode: -q 100 -m 6 Change-Id: Id92d4bad0b0a2c28c8aa9ff5280eea5717017f30	2024-07-02 10:29:38 +02:00
Vincent Rabaud	c4af79d053	Put 0 at the end of a palette and do not store it. This only applies to kSortedDefault and kMinimizeDelta. Change-Id: I9d4178406ed2ef91c5c55f0a1919cfc6605fedf9	2024-06-25 14:46:05 +02:00
Vincent Rabaud	0ec80aef3d	Delete last references to delta palettization Change-Id: I1f931d3aa587d2ae82895ae7c7f4c94fb82fbfb1	2024-06-25 10:53:43 +02:00
James Zern	27731afd47	make VP8I4ModeOffsets & VP8MakeIntra4Preds static These are unused outside of quant_enc.c. Change-Id: I2c5cd0df28c25f279cd05667b327fea14f3fa50e	2024-06-06 19:07:48 -07:00
wrv	5e5b8f0c95	Fix SSE2 Transform_AC3 function name Change-Id: I5fda3221612beafc3548d2abfa7c1e3f686aaaf0	2024-05-29 21:41:14 -05:00
James Zern	45129ee027	Revert "Check all the rows." This reverts commit `ee26766a89`. This change also reverts the parent. Revert "Increase the transform bits if possible." This reverts commit `7ec51c5916`. These changes result in non-lossless encodes. Bug: oss-fuzz:69231, oss-fuzz:69109, oss-fuzz:69208 Bug: b:341475869, b:342743143 Change-Id: Ia28f558992e0aa6f024af1ff66da52e0a5e26fa3	2024-05-25 11:00:32 -07:00
Vincent Rabaud	ee26766a89	Check all the rows. A 3 by 1 image would not have its 1st and 3rd lines compared at the second iteration. BUG=oss-fuzz:69208 Change-Id: I9213e73995d31907f358310a0b7d5ebb21c1f8b2	2024-05-24 23:11:20 +02:00
Vincent Rabaud	7ec51c5916	Increase the transform bits if possible. This brings minor size improvements because repetitive values in the transform images are easily explainable through LZ77. Still, it makes an upcoming pull request a bit more stable. This is `971a03d820` with a fix to not forget to analyze the end of the line. A const has also been added to match VP8LColorSpaceTransform's signature. Change-Id: Iae03216fef298c7abc96a766f8a799552b05ade5	2024-05-23 14:04:34 +02:00
James Zern	3cd16fd3e2	Revert "Increase the transform bits if possible." This reverts commit `971a03d820`. Reason for revert: This creates non-lossless encodes. Original change's description: > Increase the transform bits if possible. > > This brings minor size improvements because repetitive values in > the transform images are easily explainable through LZ77. Still, > it makes an upcoming pull request a bit more stable. > > Change-Id: I1c7135675cb59b5e27ca960738d74465f10d0deb Bug: oss-fuzz:69109, b:341475869 Change-Id: I3b9f21a5498735eb3681e62fb35bf9f9c2ed4f9f	2024-05-20 22:25:57 +00:00
Vincent Rabaud	971a03d820	Increase the transform bits if possible. This brings minor size improvements because repetitive values in the transform images are easily explainable through LZ77. Still, it makes an upcoming pull request a bit more stable. Change-Id: I1c7135675cb59b5e27ca960738d74465f10d0deb	2024-05-17 15:19:03 +02:00
Vincent Rabaud	1bf198a22b	Allow transform_bits to be different during encoding. The spec allows it but it is currently forced to the same value for simplicity. Change-Id: I26197dbf3342f7a72115cc7f7805c154313a2afb	2024-05-13 16:56:19 +02:00
Vincent Rabaud	1e462ca80e	Define MAX_TRANSFORM_BITS according to the specification. Change-Id: I0d575aa84e143bea56b55deb8f42b44e13dd5f1e	2024-05-07 09:16:02 +02:00
Vincent Rabaud	64d1ec23ac	Use (MIN/NUM)_(TRANSFORM/HUFFMAN)_BITS where appropriate Change-Id: I849ff8864f7abcc723dfe2b7ee0f290c8ee89c3f	2024-05-06 22:46:44 +02:00
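
The two truemotion saturation fixes listed above (`d742b24a88`, `c7bb4cb585`) state that the narrowing must use signed saturation because the sum may be negative. The scalar sketch below illustrates why. It is not taken from the libwebp sources: it assumes the usual TrueMotion formula clamp(top + left - top_left) and a 16-bit intermediate, and every function name in it is hypothetical.

```c
#include <assert.h>
#include <stdint.h>

/* Reference sample: clamp(top + left - top_left) to [0, 255]. */
static uint8_t TrueMotionRef(uint8_t top, uint8_t left, uint8_t top_left) {
  const int v = (int)top + (int)left - (int)top_left;
  return (uint8_t)(v < 0 ? 0 : v > 255 ? 255 : v);
}

/* The same sum held in a 16-bit lane; negative results wrap modulo 2^16. */
static uint16_t Sum16(uint8_t top, uint8_t left, uint8_t top_left) {
  return (uint16_t)((int)top + (int)left - (int)top_left);
}

/* Unsigned saturation: the lane is read as uint16_t, so a wrapped
 * negative sum looks like a large positive value and clamps to 255. */
static uint8_t NarrowUnsignedSat(uint16_t lane) {
  return (uint8_t)(lane > 255 ? 255 : lane);
}

/* Signed saturation: the lane is read as a two's complement int16_t,
 * so a negative sum clamps to 0 and only sums above 255 clamp to 255. */
static uint8_t NarrowSignedSat(uint16_t lane) {
  const int s = (lane >= 0x8000) ? (int)lane - 0x10000 : (int)lane;
  return (uint8_t)(s < 0 ? 0 : s > 255 ? 255 : s);
}

/* Hypothetical self-check: 10 + 20 - 200 = -170 must clamp to 0. */
static void CheckTrueMotionSaturation(void) {
  const uint16_t lane = Sum16(10, 20, 200);
  assert(TrueMotionRef(10, 20, 200) == 0);
  assert(NarrowSignedSat(lane) == 0);      /* matches the reference */
  assert(NarrowUnsignedSat(lane) == 255);  /* mismatches the reference */
}
```

With top = 10, left = 20 and top_left = 200 the unsigned narrowing yields 255 while the reference yields 0, which is the kind of mismatch with the C code that the commit messages describe. For sums in [0, 65535 >> 1] the two narrowings agree, so the difference only shows up on negative sums.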


2934 Commits