libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-07 03:24:30 +02:00

Author	SHA1	Message	Date
Pascal Massimino	abdb109f3b	add extra meaning to WebPDecBuffer::is_external_memory If value is '2', it means the buffer is a 'slow' one, like GPU-mapped memory. This change is backward compatible (setting is_external_memory to 2 will be a no-op in previous libraries) dwebp: add flags to force a particular colorspace format new flags is: -pixel_format {RGB,RGBA,BGR,BGRA,ARGB,RGBA_4444,RGB_565, rgbA,bgrA,Argb,rgbA_4444,YUV,YUVA} and also,external_memory {0,1,2} These flags are mostly for debuggging purpose, and hence are not documented. Change-Id: Iac88ce1e10b35163dd7af57f9660f062f5d8ed5e	2016-03-09 09:00:13 +01:00
James Zern	875aec7044	enc_neon,cosmetics: break long comment Change-Id: I88dff0271fef1cc6dd5888572bfe0f09f467b028	2016-03-08 23:33:21 -08:00
James Zern	71e856cf84	GetMBSSIM,cosmetics: fix alignment Change-Id: I884b3361484b48917fa4cba33cd1217ac51685f9	2016-03-08 23:26:10 -08:00
Pascal Massimino	a90edffb7e	fix missing 'extern' for SSIM function in dsp/ Change-Id: Id8143120f01065dc088f4e90bd930f8ea7c3ae5a	2016-03-08 10:27:46 -08:00
Pascal Massimino	423ecaf484	move some SSIM-accumulation function for dsp/ This is in preparation for some SSE2 code. And generally speaking, the whole SSIM code needs some revamp: we're not averaging the SSIM value at each pixels but just computing the overall SSIM value once, for the whole plane. The former might be better than the latter. Change-Id: I935784a917f84a18ef08dc5ec9a7b528abea46a5	2016-03-08 07:50:09 +01:00
Pascal Massimino	f08e66245a	Merge "Fix FindClosestDiscretized in near lossless:"	2016-03-04 09:34:16 +00:00
James Zern	0d40cc5ea3	enc_neon,Disto4x4: remove an unnecessary transpose based on the sse2 change in: 9960c31 Remove an unnecessary transposition in TTransform. ~9-10.5% faster at the function-level, < 1% overall Change-Id: I44413369b230b250fb0dbc51ff2f17cfeda609b7	2016-03-03 16:18:59 -08:00
Marcin Kowalczyk	e8feb20e39	Fix FindClosestDiscretized in near lossless: - The result is now indeed closest among possible results for all inputs, which was not the case for bits>4, where the mapping was not even monotonic because GetValAndDistance was correct only if the significant part of initial fit in a byte at most twice. - The set of results for a larger number of bits dropped is a subset of values for a smaller number of bits dropped. This implies that subsequent discretizations for a smaller number of bits dropped do not change already discretized pixels, which improves the quality (changes do not accumulate) and compression density (values tend to repeat more often). - Errors are more fairly distributed between upwards and downwards thanks to bankers’ rounding, which avoids images getting darker or lighter in overall. - Deltas between discretized values are more repetitive. This improves compression density if delta encoding is used. Also, the implementation is much shorter now. Change-Id: I0a98e7d5255e91a7b9c193a156cf5405d9701f16	2016-03-02 12:47:22 +01:00
James Zern	a6f23c49b2	Merge "AnimEncoder: Support progress hook and user data."	2016-02-20 04:25:57 +00:00
James Zern	a5193774b0	Merge "Near lossless feature: fix some comments."	2016-02-20 03:51:15 +00:00
Urvang Joshi	da98d31ced	AnimEncoder: Support progress hook and user data. Pass them along to internal 'pic' object, so that progress can be reported back and user data can also be inspected. Change-Id: Idb5d0d4a76d07283d704a86c5892e1ad7bda09fa	2016-02-19 19:50:30 -08:00
Urvang Joshi	3335713169	Near lossless feature: fix some comments. Change-Id: I2c5fc2a3b3fe5123d66b42bf148e361b4862dfb9	2016-02-19 19:26:39 -08:00
James Zern	0beed01aa5	cosmetics: fix indent after 2f5e898 2f5e898 fix multiple allocation for transform buffer Change-Id: Ied5c89c0040671e2eddf23c8b7a78e0d817dd18e	2016-02-19 19:22:34 -08:00
Pascal Massimino	6753f35cac	Merge "FTransformWHT optimization."	2016-02-19 09:38:04 +00:00
Vincent Rabaud	6583bb1a42	Improve SSE4.1 implementation of TTransform. SSE4.1 is slower than the SSE2 implementation and this seems to be due to a slow _mm_loadl_epi64 implementation by gcc (hence a bug with my gcc 4.8) and a very slow _mm_hadd_epi32. Both got confirmed by IACA and experiments. Change-Id: I05607f66b7ccd8f4f42e000693aea583ffd5768f	2016-02-19 09:11:53 +01:00
Vincent Rabaud	7561d0c338	FTransformWHT optimization. Data is packed sooner in the functions. Change-Id: I018cfeca43f015ac755c7f209f9a97984cc0517b	2016-02-18 17:44:05 +01:00
Pascal Massimino	7ccdb734c2	fix indentation after patch #328220 Change-Id: Iccfcf4deaed6b383b9f80ae84b5b0575a4e94b5f	2016-02-18 15:14:41 +01:00
Pascal Massimino	6ec0d2a946	clarify the logic of the error path when decoding fails. Change-Id: I2f86751ddafa4708dac3ffc9d6ec1f156e027a83	2016-02-18 14:58:18 +01:00
Vincent Rabaud	8aa352b256	Merge "Remove an unnecessary transposition in TTransform."	2016-02-18 08:15:10 +00:00
James Zern	db86088426	Merge "remove useless #include"	2016-02-17 23:17:12 +00:00
Vincent Rabaud	9960c31685	Remove an unnecessary transposition in TTransform. Change-Id: Ib715c2d5ba659cb2db9c6832875ba508cc2fca3e	2016-02-17 21:41:28 +01:00
Vincent Rabaud	6e36b51188	Small speedup in FTransform. It removes two _mm_unpacklo_epi32 and two _mm_sub_epi16. Change-Id: Icdf86259f796ba855d1cda5e9c0e99cb396cb351	2016-02-17 21:26:36 +01:00
Pascal Massimino	2b4fe33e00	Merge "fix multiple allocation for transform buffer"	2016-02-17 07:27:23 +00:00
Pascal Massimino	2f5e8986cf	fix multiple allocation for transform buffer We were not updating the current_width_, which is usually not a problem, unless we use Delta Palette with small number of colors -> Addressed this re-entrancy problem by checking we have enough capacity for transform buffer. The problem is not currently visible, until we restrict the number of gradient used in delta-palette to less than 16. Then the buffers have different current_width_ and the problem surfaces. Change-Id: Icd84b919905d7789014bb6668bfb6813c93fb36e	2016-02-17 06:14:39 +01:00
Vincent Rabaud	bf2b4f114f	Regroup common SSE code + optimization. The transpose refactoring will help removing a transpose in a later CL. The horizontal add function helps removing a _mm_sad_epu8 in DC8uv => the latency/throughput went from 29/25 to 23/19 Change-Id: I5f3dfd4aad614eb079b1e83631e6a7cef49a3766	2016-02-16 18:34:34 +01:00
Nico Weber	3ef1ce98b9	yuv_sse2: fix -Wconstant-conversion warning 'implicit conversion from 'int' to 'short' changes value from 33050 to -32486' original patch: https://codereview.chromium.org/1657313003/ Make libwebp build with -Wconstant-conversion from newer clangs. After http://llvm.org/viewvc/llvm-project?rev=259271&view=rev, clang points out that _mm_set1_epi16(33050) causes an overflow in the short argument to _mm_set1_epi16(). Since there's no version that takes an unsigned short, add an explicit cast to tell the compiler that this is intentional. No behavior change. Change-Id: I6b4e3401b15cfbcc895f9e81b5c2dc59d43ffb9b	2016-02-02 14:52:11 -08:00
James Zern	ab3c2583aa	anim_encode,DefaultEncoderOptions: init verbose default to disabled broken since: c13245c AnimEncoder: Add a GetError() method. Change-Id: I51ccb85d5df338570512ec1d7430ad3229f93a9f	2016-02-01 17:00:19 -08:00
Pascal Massimino	75f4af4d54	remove useless #include Change-Id: Id34f12ec94d8be6853fabd67609a6006ac99f152	2016-01-25 09:34:10 -08:00
Pascal Massimino	6c1d763119	avoid Yoda style for comparison Change-Id: I8ff9f96951e5e8a619f7132455dd281cbf91aa4d	2016-01-15 23:52:29 -08:00
Vincent Rabaud	8ce975ac82	SSE optimization for vector mismatch. Change-Id: I564b822033b59d86635230f29ed6197e306a2c4f	2016-01-07 18:23:45 +01:00
Pascal Massimino	7e7b6ccc7f	faster rgb565/rgb4444/argb output SSE2 and NEON implementation. Change-Id: I342a1c3d84937b8497f0aaecb7ce9bdb7f50296b	2015-12-17 23:38:58 -08:00
James Zern	71100500a8	bump version to 0.5.0 libwebp{,decoder} - 0.5.0 libwebp libtool - 6.0.0 libwebpdecoder libtool - 2.0.0 mux/demux - 0.3.0 libtool - 2.0.0 Change-Id: I5346d13eb827fb5890efbb63ff3f28cea9d0c55f	2015-12-17 19:45:14 -08:00
James Zern	d48e427b1d	Merge "demux: accept raw bitstreams"	2015-12-17 22:52:10 +00:00
James Zern	99a01f4f8b	Merge "Unify some entropy functions."	2015-12-17 22:35:29 +00:00
James Zern	4b025f10f7	Merge "configure: disable asserts by default"	2015-12-17 22:28:37 +00:00
James Zern	92cbddf89c	Merge "fix PrintBlockInfo()"	2015-12-17 21:00:57 +00:00
Vincent Rabaud	ca509a3362	Unify some entropy functions. The code and logic is unified when computing bit entropy + Huffman cost. Speed-wise, we gain 8% for lossless encoding. Logic-wise, the beginning/end of the distributions are handled properly and the compression ratio does not change much. Change-Id: Ifa91d7d3e667c9a9a421faec4e845ecb6479a633	2015-12-17 17:00:08 +01:00
Pascal Massimino	367bf903b3	fix PrintBlockInfo() ... which has gone out of sync since the last block-cache layout change. Change-Id: Ic441ec07b0198b508ce3fd34ab582cb60b1daabc	2015-12-17 15:47:25 +01:00
Pascal Massimino	b0547ff0b4	move back common constants for lossless_enc*.c into the .h Change-Id: I11bc979db691f6518d85e2e1c3ac7f05d69681b0	2015-12-17 15:11:56 +01:00
Lode Vandevenne	fb4c7832f1	lossless: simpler alpha cleanup preprocessing setting all transparent pixels to black rather than the "flatten" method. 0.3% smaller filesize on the 1000 PNGs if alpha cleanup is used (before: 18685774, after: 18622472) Change-Id: Ib0db9e7ccde55b36e82de07855f2dbb630fe62b1	2015-12-17 15:04:50 +01:00
Vincent Rabaud	47ddd5a4cc	Move some codec logic out of ./dsp . The functions containing magic constants are moved out of ./dsp . VP8LPopulationCost got put back in ./enc VP8LGetCombinedEntropy is now unrefined (refinement happening in ./enc) VP8LBitsEntropy is now unrefined (refinement happening in ./enc) VP8LHistogramEstimateBits got put back in ./enc VP8LHistogramEstimateBitsBulk got deleted. Change-Id: I09c4101eebbc6f174403157026fe4a23a5316beb	2015-12-17 07:03:25 +00:00
James Zern	357f455dec	yuv_sse2: fix 32-bit visual studio build src\dsp\yuv_sse2.c : C2719: 'in': formal parameter with __declspec(align('16')) won't be aligned src\dsp\yuv_sse2.c : C2719: 'out': formal parameter with __declspec(align('16')) won't be aligned Change-Id: Ifd79e33b35c70748faff19cd64eba4a8ffce5a5a	2015-12-16 15:04:36 -08:00
James Zern	b9d80fa4e8	configure: disable asserts by default --enable-asserts can be used to avoid defining NDEBUG Change-Id: I6216668e3f79f69bd8c453f0b36cecb3b585688e	2015-12-16 13:15:53 -08:00
Pascal Massimino	7badd3da4a	cosmetic fix: sizeof(type) -> sizeof(*var) Change-Id: I1a39fccfdcb9f0a4b9b025d3c9b522e8edfe7fd6	2015-12-16 18:29:14 +01:00
Vincent Rabaud	80ce27d34e	Speed up 24-bit packing / unpacking in YUV / RGB conversions. This implementation brings: - an SSE implementation of packing / unpacking - bigger buffers processed at the same time The speedup is of 4% on lossy decoding (YUV to RGB), 0.5% on lossy encoding (RGB to YUV was already optimized). Change-Id: Iec677ee17f91c08614d1adab67c6df551925767f	2015-12-16 11:06:42 +01:00
Pascal Massimino	68eebcb0ff	remove a TODO about rotation (won't happen yet) Change-Id: Ibb4ceccd1d7af0f76594e71062983dc311ba9aa2	2015-12-15 23:36:12 -08:00
Pascal Massimino	2dee2966df	remove few obsolete TODO about aligned loads in SSE2 Change-Id: I3628602942ea2ce34dbcb85975d15afc1041f76c	2015-12-15 23:00:41 -08:00
Pascal Massimino	e0c0bb3480	remove TODO about unused ref_lf_delta[] Change-Id: I54983c0dfc6927564143bad56bd2e4c4cdfefc0e	2015-12-15 22:57:53 -08:00
Pascal Massimino	9cf1cc2bd6	remove few TODO: * 256 -> RD_DISTO_MULT * don't use TDisto for UV mode picking Change-Id: I243148c716fe688b5c1b1fb9b7a6e58d0b5e6835	2015-12-15 22:52:12 -08:00
James Zern	791896455a	Merge changes from topic 'demux-fragment-cleanup' * changes: demux: remove GetFragment() demux: remove dead fragment related TODO demux, Frame: remove is_fragment_ field demux,WebPIterator: remove fragment_num/num_fragments demux: remove WebPDemuxSelectFragment	2015-12-16 06:45:00 +00:00

... 2 3 4 5 6 ...

2158 Commits