libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2026-02-13 05:19:30 +01:00

Author	SHA1	Message	Date
James Zern	41f14bcbc5	WebPPictureImport*: check src pointer fixes crash with NULL source pointer in calls to simple encode api (WebPEncodeRGB, etc.) Change-Id: I706d670c80298da5176aaa5ba0eb2238dd71a8f0	2016-03-24 22:52:01 -07:00
Pascal Massimino	64eed38779	Pass stride parameter to WebPDequantizeLevels() and also pass 'VP8Io* io' extra param to VP8DecompressAlphaRows() This is somehow in preparation for some memory optimizations in the 'cropping' case. For now, only the easy crop_bottom case is optimized. Change-Id: Ib54531ba057bf62b98422dbb6c181dda626c72c2	2016-03-18 15:36:58 +01:00
Pascal Massimino	e88c4ca013	fix -m 2 mode-cost evaluation (causing partition0 overflow) The mode's bits were not taken into account, which is ok for most of cases. But in case of super large image, with 'easy' content, their overhead starts mattering a lot and we were omitting to optimize for these. Now, these mode bits have their own lambda values associated, limiting the jerkiness. We also limit (for -m 2 only) the individual number of bits to something that will prevent the partition 0 overflow. removed the I4_PENALTY constant, which was a rather crude approximation. Replaced by some q-dependent expression. fixes issue #289 Change-Id: I956ae2d2308c339adc4706d52722f0bb61ccf18c	2016-03-11 20:34:45 +01:00
Pascal Massimino	4562e83dc2	Merge "add extra meaning to WebPDecBuffer::is_external_memory"	2016-03-09 08:24:19 +00:00
Pascal Massimino	abdb109f3b	add extra meaning to WebPDecBuffer::is_external_memory If value is '2', it means the buffer is a 'slow' one, like GPU-mapped memory. This change is backward compatible (setting is_external_memory to 2 will be a no-op in previous libraries) dwebp: add flags to force a particular colorspace format new flags is: -pixel_format {RGB,RGBA,BGR,BGRA,ARGB,RGBA_4444,RGB_565, rgbA,bgrA,Argb,rgbA_4444,YUV,YUVA} and also,external_memory {0,1,2} These flags are mostly for debuggging purpose, and hence are not documented. Change-Id: Iac88ce1e10b35163dd7af57f9660f062f5d8ed5e	2016-03-09 09:00:13 +01:00
James Zern	875aec7044	enc_neon,cosmetics: break long comment Change-Id: I88dff0271fef1cc6dd5888572bfe0f09f467b028	2016-03-08 23:33:21 -08:00
James Zern	71e856cf84	GetMBSSIM,cosmetics: fix alignment Change-Id: I884b3361484b48917fa4cba33cd1217ac51685f9	2016-03-08 23:26:10 -08:00
Pascal Massimino	a90edffb7e	fix missing 'extern' for SSIM function in dsp/ Change-Id: Id8143120f01065dc088f4e90bd930f8ea7c3ae5a	2016-03-08 10:27:46 -08:00
Pascal Massimino	423ecaf484	move some SSIM-accumulation function for dsp/ This is in preparation for some SSE2 code. And generally speaking, the whole SSIM code needs some revamp: we're not averaging the SSIM value at each pixels but just computing the overall SSIM value once, for the whole plane. The former might be better than the latter. Change-Id: I935784a917f84a18ef08dc5ec9a7b528abea46a5	2016-03-08 07:50:09 +01:00
Pascal Massimino	f08e66245a	Merge "Fix FindClosestDiscretized in near lossless:"	2016-03-04 09:34:16 +00:00
James Zern	0d40cc5ea3	enc_neon,Disto4x4: remove an unnecessary transpose based on the sse2 change in: `9960c31` Remove an unnecessary transposition in TTransform. ~9-10.5% faster at the function-level, < 1% overall Change-Id: I44413369b230b250fb0dbc51ff2f17cfeda609b7	2016-03-03 16:18:59 -08:00
Marcin Kowalczyk	e8feb20e39	Fix FindClosestDiscretized in near lossless: - The result is now indeed closest among possible results for all inputs, which was not the case for bits>4, where the mapping was not even monotonic because GetValAndDistance was correct only if the significant part of initial fit in a byte at most twice. - The set of results for a larger number of bits dropped is a subset of values for a smaller number of bits dropped. This implies that subsequent discretizations for a smaller number of bits dropped do not change already discretized pixels, which improves the quality (changes do not accumulate) and compression density (values tend to repeat more often). - Errors are more fairly distributed between upwards and downwards thanks to bankers’ rounding, which avoids images getting darker or lighter in overall. - Deltas between discretized values are more repetitive. This improves compression density if delta encoding is used. Also, the implementation is much shorter now. Change-Id: I0a98e7d5255e91a7b9c193a156cf5405d9701f16	2016-03-02 12:47:22 +01:00
James Zern	a6f23c49b2	Merge "AnimEncoder: Support progress hook and user data."	2016-02-20 04:25:57 +00:00
James Zern	a5193774b0	Merge "Near lossless feature: fix some comments."	2016-02-20 03:51:15 +00:00
Urvang Joshi	da98d31ced	AnimEncoder: Support progress hook and user data. Pass them along to internal 'pic' object, so that progress can be reported back and user data can also be inspected. Change-Id: Idb5d0d4a76d07283d704a86c5892e1ad7bda09fa	2016-02-19 19:50:30 -08:00
Urvang Joshi	3335713169	Near lossless feature: fix some comments. Change-Id: I2c5fc2a3b3fe5123d66b42bf148e361b4862dfb9	2016-02-19 19:26:39 -08:00
James Zern	0beed01aa5	cosmetics: fix indent after `2f5e898` `2f5e898` fix multiple allocation for transform buffer Change-Id: Ied5c89c0040671e2eddf23c8b7a78e0d817dd18e	2016-02-19 19:22:34 -08:00
Pascal Massimino	6753f35cac	Merge "FTransformWHT optimization."	2016-02-19 09:38:04 +00:00
Vincent Rabaud	6583bb1a42	Improve SSE4.1 implementation of TTransform. SSE4.1 is slower than the SSE2 implementation and this seems to be due to a slow _mm_loadl_epi64 implementation by gcc (hence a bug with my gcc 4.8) and a very slow _mm_hadd_epi32. Both got confirmed by IACA and experiments. Change-Id: I05607f66b7ccd8f4f42e000693aea583ffd5768f	2016-02-19 09:11:53 +01:00
Vincent Rabaud	7561d0c338	FTransformWHT optimization. Data is packed sooner in the functions. Change-Id: I018cfeca43f015ac755c7f209f9a97984cc0517b	2016-02-18 17:44:05 +01:00
Pascal Massimino	7ccdb734c2	fix indentation after patch #328220 Change-Id: Iccfcf4deaed6b383b9f80ae84b5b0575a4e94b5f	2016-02-18 15:14:41 +01:00
Pascal Massimino	6ec0d2a946	clarify the logic of the error path when decoding fails. Change-Id: I2f86751ddafa4708dac3ffc9d6ec1f156e027a83	2016-02-18 14:58:18 +01:00
Vincent Rabaud	8aa352b256	Merge "Remove an unnecessary transposition in TTransform."	2016-02-18 08:15:10 +00:00
James Zern	db86088426	Merge "remove useless #include"	2016-02-17 23:17:12 +00:00
Vincent Rabaud	9960c31685	Remove an unnecessary transposition in TTransform. Change-Id: Ib715c2d5ba659cb2db9c6832875ba508cc2fca3e	2016-02-17 21:41:28 +01:00
Vincent Rabaud	6e36b51188	Small speedup in FTransform. It removes two _mm_unpacklo_epi32 and two _mm_sub_epi16. Change-Id: Icdf86259f796ba855d1cda5e9c0e99cb396cb351	2016-02-17 21:26:36 +01:00
Pascal Massimino	2b4fe33e00	Merge "fix multiple allocation for transform buffer"	2016-02-17 07:27:23 +00:00
Pascal Massimino	2f5e8986cf	fix multiple allocation for transform buffer We were not updating the current_width_, which is usually not a problem, unless we use Delta Palette with small number of colors -> Addressed this re-entrancy problem by checking we have enough capacity for transform buffer. The problem is not currently visible, until we restrict the number of gradient used in delta-palette to less than 16. Then the buffers have different current_width_ and the problem surfaces. Change-Id: Icd84b919905d7789014bb6668bfb6813c93fb36e	2016-02-17 06:14:39 +01:00
Vincent Rabaud	bf2b4f114f	Regroup common SSE code + optimization. The transpose refactoring will help removing a transpose in a later CL. The horizontal add function helps removing a _mm_sad_epu8 in DC8uv => the latency/throughput went from 29/25 to 23/19 Change-Id: I5f3dfd4aad614eb079b1e83631e6a7cef49a3766	2016-02-16 18:34:34 +01:00
Nico Weber	3ef1ce98b9	yuv_sse2: fix -Wconstant-conversion warning 'implicit conversion from 'int' to 'short' changes value from 33050 to -32486' original patch: https://codereview.chromium.org/1657313003/ Make libwebp build with -Wconstant-conversion from newer clangs. After http://llvm.org/viewvc/llvm-project?rev=259271&view=rev, clang points out that _mm_set1_epi16(33050) causes an overflow in the short argument to _mm_set1_epi16(). Since there's no version that takes an unsigned short, add an explicit cast to tell the compiler that this is intentional. No behavior change. Change-Id: I6b4e3401b15cfbcc895f9e81b5c2dc59d43ffb9b	2016-02-02 14:52:11 -08:00
James Zern	ab3c2583aa	anim_encode,DefaultEncoderOptions: init verbose default to disabled broken since: `c13245c` AnimEncoder: Add a GetError() method. Change-Id: I51ccb85d5df338570512ec1d7430ad3229f93a9f	2016-02-01 17:00:19 -08:00
Pascal Massimino	75f4af4d54	remove useless #include Change-Id: Id34f12ec94d8be6853fabd67609a6006ac99f152	2016-01-25 09:34:10 -08:00
Pascal Massimino	6c1d763119	avoid Yoda style for comparison Change-Id: I8ff9f96951e5e8a619f7132455dd281cbf91aa4d	2016-01-15 23:52:29 -08:00
Vincent Rabaud	8ce975ac82	SSE optimization for vector mismatch. Change-Id: I564b822033b59d86635230f29ed6197e306a2c4f	2016-01-07 18:23:45 +01:00
Pascal Massimino	7e7b6ccc7f	faster rgb565/rgb4444/argb output SSE2 and NEON implementation. Change-Id: I342a1c3d84937b8497f0aaecb7ce9bdb7f50296b	2015-12-17 23:38:58 -08:00
James Zern	71100500a8	bump version to 0.5.0 libwebp{,decoder} - 0.5.0 libwebp libtool - 6.0.0 libwebpdecoder libtool - 2.0.0 mux/demux - 0.3.0 libtool - 2.0.0 Change-Id: I5346d13eb827fb5890efbb63ff3f28cea9d0c55f	2015-12-17 19:45:14 -08:00
James Zern	d48e427b1d	Merge "demux: accept raw bitstreams"	2015-12-17 22:52:10 +00:00
James Zern	99a01f4f8b	Merge "Unify some entropy functions."	2015-12-17 22:35:29 +00:00
James Zern	4b025f10f7	Merge "configure: disable asserts by default"	2015-12-17 22:28:37 +00:00
James Zern	92cbddf89c	Merge "fix PrintBlockInfo()"	2015-12-17 21:00:57 +00:00
Vincent Rabaud	ca509a3362	Unify some entropy functions. The code and logic is unified when computing bit entropy + Huffman cost. Speed-wise, we gain 8% for lossless encoding. Logic-wise, the beginning/end of the distributions are handled properly and the compression ratio does not change much. Change-Id: Ifa91d7d3e667c9a9a421faec4e845ecb6479a633	2015-12-17 17:00:08 +01:00
Pascal Massimino	367bf903b3	fix PrintBlockInfo() ... which has gone out of sync since the last block-cache layout change. Change-Id: Ic441ec07b0198b508ce3fd34ab582cb60b1daabc	2015-12-17 15:47:25 +01:00
Pascal Massimino	b0547ff0b4	move back common constants for lossless_enc*.c into the .h Change-Id: I11bc979db691f6518d85e2e1c3ac7f05d69681b0	2015-12-17 15:11:56 +01:00
Lode Vandevenne	fb4c7832f1	lossless: simpler alpha cleanup preprocessing setting all transparent pixels to black rather than the "flatten" method. 0.3% smaller filesize on the 1000 PNGs if alpha cleanup is used (before: 18685774, after: 18622472) Change-Id: Ib0db9e7ccde55b36e82de07855f2dbb630fe62b1	2015-12-17 15:04:50 +01:00
Vincent Rabaud	47ddd5a4cc	Move some codec logic out of ./dsp . The functions containing magic constants are moved out of ./dsp . VP8LPopulationCost got put back in ./enc VP8LGetCombinedEntropy is now unrefined (refinement happening in ./enc) VP8LBitsEntropy is now unrefined (refinement happening in ./enc) VP8LHistogramEstimateBits got put back in ./enc VP8LHistogramEstimateBitsBulk got deleted. Change-Id: I09c4101eebbc6f174403157026fe4a23a5316beb	2015-12-17 07:03:25 +00:00
James Zern	357f455dec	yuv_sse2: fix 32-bit visual studio build src\dsp\yuv_sse2.c : C2719: 'in': formal parameter with __declspec(align('16')) won't be aligned src\dsp\yuv_sse2.c : C2719: 'out': formal parameter with __declspec(align('16')) won't be aligned Change-Id: Ifd79e33b35c70748faff19cd64eba4a8ffce5a5a	2015-12-16 15:04:36 -08:00
James Zern	b9d80fa4e8	configure: disable asserts by default --enable-asserts can be used to avoid defining NDEBUG Change-Id: I6216668e3f79f69bd8c453f0b36cecb3b585688e	2015-12-16 13:15:53 -08:00
Pascal Massimino	7badd3da4a	cosmetic fix: sizeof(type) -> sizeof(*var) Change-Id: I1a39fccfdcb9f0a4b9b025d3c9b522e8edfe7fd6	2015-12-16 18:29:14 +01:00
Vincent Rabaud	80ce27d34e	Speed up 24-bit packing / unpacking in YUV / RGB conversions. This implementation brings: - an SSE implementation of packing / unpacking - bigger buffers processed at the same time The speedup is of 4% on lossy decoding (YUV to RGB), 0.5% on lossy encoding (RGB to YUV was already optimized). Change-Id: Iec677ee17f91c08614d1adab67c6df551925767f	2015-12-16 11:06:42 +01:00
Pascal Massimino	68eebcb0ff	remove a TODO about rotation (won't happen yet) Change-Id: Ibb4ceccd1d7af0f76594e71062983dc311ba9aa2	2015-12-15 23:36:12 -08:00

1 2 3 4 5 ...

2012 Commits