libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-04 01:54:30 +02:00

Author	SHA1	Message	Date
skal	248f3aed22	remove br->error_ field it's somewhat redundant with br->eos_ also make the status-check coherent. Change-Id: I98e755e037d45acb0760baf2344bf11fb5fb5cda	2014-09-23 00:04:58 -07:00
Vikas Arora	f9ced95a9b	Optimize lossless decoding for trivial(ARB) codes. Optimize the decoding for region that have trivial literal codes. The trivial literal is defined as huffman image with Red, Blue and Alpha huffman trees with only single code values. This speeds up lossless decoding by 3% Change-Id: I0204949917836f74c0eb4ba5a7f4052a4797833b	2014-09-12 09:08:08 -07:00
Pascal Massimino	d3242aee16	make VP8LSetBitPos() set br->eos_ flag ReadSymbol() finishes with a VP8LSetBitPos() call only and could miss an eos_ during the decode loop. Things are faster because of inlining too. Change-Id: I2d2a275f38834ba005bc767d45c5de72d032103e	2014-09-06 08:40:20 +02:00
Pascal Massimino	a9decb5584	Lossless decoding: fix eos_ flag condition eos_ needs to be set only when superfluous bits have actually been requested. Earlier, we were assuming pre-mature end-of-stream to be an error. Now, more precisely, we mark error when we have encountered end-of-stream and we attempt to read more bits after that. This handles cases where image data requires no bits to be read Change-Id: I628e2c39c64f10c443fb51f86b1f5919cc9fd299	2014-09-05 20:21:50 +02:00
skal	77d4c7e337	address cosmetic comments from patch #71380 Change-Id: Iaba301b9e77aa4febe0efe1e6016fab42d5589f3	2014-08-28 18:08:00 -07:00
skal	f75dfbf23d	Speed up Huffman decoding for lossless speed-up is ~1.6% for photographic image to 10% for graphical image (1000 images corpus was sped up by 5.8 %) Code by akramarz@google.com and jyrki@google.com Change-Id: Iceb2e50e6cc761b9315a3865d22ec9d19b8011c6	2014-08-28 12:28:04 -07:00
James Zern	8323a9038d	dsp.h: collect gcc/clang version test macros endian_inl.h already relies on dsp.h, grab the definitions from there. Change-Id: I445f7d0631723043c55da1070498f89965bec7b1	2014-08-27 19:33:09 -07:00
Djordje Pesut	b4dc4069a2	MIPS: dspr2: added optimization for (un)filters HorizontalFilter VerticalFilter GradientFilter HorizontalUnfilter VerticalUnfilter GradientUnfilter Change-Id: I54055b4767c37719691811072e95bf79c1f627b1	2014-08-14 11:55:19 -07:00
Djordje Pesut	98c54107df	MIPS: mips32r2: added optimization for BSwap32 gcc < 4.8.3 doesn't translate bswap optimally. use optimized version always Change-Id: I979ea26ad6dc0166d3d2f39c4148eb8adfb7ddec	2014-08-12 09:29:13 +02:00
pascal massimino	bb07022b66	Merge "cosmetics"	2014-08-06 12:30:08 -07:00
James Zern	e300c9d819	cosmetics fix some indent/whitespace, remove a few duplicate includes, extra semi-colons Change-Id: If937182b40a21e0f2028496e7b4b06c6e8a41352	2014-08-06 12:10:59 -07:00
James Zern	4c6dde37b9	bit_writer: cosmetics: rename kFlush() -> Flush() Change-Id: I8907927974188bee85ffade1d75d2e50817aa115	2014-08-05 22:14:29 -07:00
James Zern	0524d9e5e8	dsp: detect mips64 & disable mips32 code Change-Id: Icf68dafd5cf0614ca25b36a0252caa1784ac8059	2014-08-01 21:18:53 -07:00
James Zern	46fd44c104	thread: remove harmless race on status_ in End() if a thread was still doing work when End() was called there'd be a race on worker->status_. in these cases, however, the specific value is meaningless as it would be >= OK and the thread would have been shut down properly, but we'll check 'impl_' instead to avoid any potential TSan/DRD reports. Change-Id: Ib93cbc226a099f07761f7bad765549dffb8054b1	2014-07-08 20:32:29 -07:00
James Zern	6781423b7d	configure: check for __builtin_bswapXX() defines HAVE_BUILTIN_BSWAP16/32/64 updated endian_inl.h to have a non-configure fallback for gcc and clang BSwap16() now uses __builtin_bswap16 if available Change-Id: Ia04ee07b39303c4b247df96d84f298fb8a81f389	2014-07-05 12:35:13 -07:00
James Zern	6422e683af	VP8LFillBitWindow: enable fast path for 32-bit builds also reduce the load size from 64 to 32 bits as the top 32 bits are being shifted away in the operation. the change is neutral speed-wise on x86_64 as is the change in load size on x86, but it gives a slight improvement on 32-bit arm. x86 is improved ~13%, 32-bit arm ~3.7% aarch64 is untested but will likely benefit as well. Change-Id: Ibcb02a70f46f2651105d7ab571afe352673bef48	2014-07-04 14:42:47 -07:00
James Zern	4f7f52b2a1	VP8LFillBitWindow: respect WEBP_FORCE_ALIGNED Change-Id: I23eddf01590de002efc21d8c7acc545a08fc3e48	2014-07-04 13:53:29 -07:00
James Zern	e458badcc3	endian_inl.h: implement htoleXX with BSwapXX + s/htole(16\|32)/HToLE$1/ to avoid any name conflicts Change-Id: Ic1c84711557e50f73d83ca5aa2b3992ac6738216	2014-07-04 12:16:36 -07:00
James Zern	f2664d1aab	endian_inl.h: add BSwap16 + use it in VP8LoadNewBytes() Change-Id: I701d3652dc0cbd553852978702ef68c2657bca1c	2014-07-04 12:16:28 -07:00
James Zern	dc0f479d6a	configure: add --enable-aligned forces aligned memory reads (via memcpy) in the VP8 bit reader, useful for platforms that don't support unaligned loads. Change-Id: Ifa44a9a1677fbdc6a929520f9340b7e3fcbd6692	2014-07-03 23:30:09 -07:00
James Zern	380cca4f2c	configure.ac: add AC_C_BIGENDIAN this defines WORDS_BIGENDIAN, replacing uses of __BIG_ENDIAN__/__BYTE_ORDER__ with it + fixes lossless BGRA output with big-endian toolchains that do not define __BIG_ENDIAN__ (codesourcery mips gcc) Change-Id: Ieaccd623292d235343b5e34b7a720fc251c432d7	2014-07-03 18:15:50 -07:00
James Zern	ee70a90187	endian_inl.h: add BSwap64 Change-Id: I66672b770500294b8f4ee8fa4bf1dfff1119dbe6	2014-07-03 13:35:34 -07:00
James Zern	47779d46c8	endian_inl.h: add BSwap32 Change-Id: I96e3ae49659307024415d64587e6312888a0070f	2014-07-03 13:28:13 -07:00
James Zern	d5104b1ff6	utils: add endian_inl.h moves the following to this header: - htole*() definitions from bit_writer.c - __BIG_ENDIAN__ fallback define from bit_reader_inl.h Change-Id: I7fff59543f08a70bf8f9ddac849b72ed290471b1	2014-07-03 13:07:14 -07:00
James Zern	1da3d46138	VP8LoadNewBytes: use __builtin_bswap32 if available mostly to balance the use of bswap64, some gcc platforms are already interpreting the default case the same Change-Id: Icf860f55b3f16bea349a7d721e6d6abeeb4e5cf3	2014-06-24 23:24:36 -07:00
Pascal Massimino	25aaddc84c	rename interface -> winterface to avoid name clash on win32 Change-Id: Ia4ad3d4528df4652bab803f9a9544e21e6c4b177	2014-06-23 14:21:22 -07:00
skal	5584d9d2fc	make WebPSetWorkerInterface() check its arguments Change-Id: I522c58cfe05e864a50cacb58bdfa14d5369c6d60	2014-06-23 07:39:56 +02:00
James Zern	a9cf31913c	cosmetics: update thread.h comments WebPWorker*() are now part of WebPWorkerInterface; refer to them with unadorned names. Change-Id: Iae1dd59f1e545cba6dd8c18f26ba60eb9a84419b	2014-06-19 19:31:10 -07:00
skal	bbe32df1e3	add alpha dithering for lossy new options: dwebp -alpha_dither vwebp -noalphadither When the source was marked as quantized, we use a threshold-averaging filter to smooth the decoded alpha plane. Note: this option forces the decoding of alpha data in one pass, and might slow the decoding a bit. The new field in WebPDecoderOptions struct is 'alpha_dithering_strength' (0 by default, means: off). Max strength value is '100'. Change-Id: I218e21af96360d4781587fede95f8ea4e2b7287a	2014-06-14 00:06:16 +02:00
James Zern	7a93c000ee	**/Makefile.am: remove unused AM_CPPFLAGS only 1 of <lib>_CPPFLAGS and AM_CPPFLAGS is used, with the former getting precedence when it's defined. configure's DEFAULT_INCLUDES is covering what's necessary given the include paths are all source relative. Change-Id: I7d14076acd266b28a88a3d92bcc3d7165284d5f3	2014-06-12 11:59:05 -07:00
skal	24e3080571	Add an interface abstraction to the WebP worker thread implementation This allows custom implementations of threading mechanism. Patch by Leonhard Gruenschloss. Change-Id: Id8ea5917acd2f24fa8bce79748d1747de2751614	2014-06-12 11:35:44 +02:00
James Zern	32b3137936	configure: move config.h to src/webp/config.h this change has the side-effect of using directory names in the include, silencing a lint warning. Change-Id: Ib91cf63a90534e32fadfa5c2372bfdb29f854d02	2014-06-10 23:42:00 -07:00
James Zern	73fee88c4a	VP8RandomBits2: prevent signed int overflow 'diff' at its largest may be INT_MAX; << 1 of anything at or above 1 << 30 will overflow. Change-Id: Idb2b5a9b55acc2f6d5e32be8baaebee3f89919ad	2014-06-04 23:19:03 -07:00
skal	a2ac8a420e	restore original value_/range_ field order no speed change, just for coherency Change-Id: Iaa395bca24f33a14b68ba6920b838ef87d0d0db6	2014-06-03 09:36:56 +02:00
Pascal Massimino	42c447aeb0	Merge "lossy bit-reader clean-up:"	2014-06-02 23:53:00 -07:00
Pascal Massimino	09545eeadc	lossy bit-reader clean-up: * remove LEFT/RIGHT_JUSTIFY distinction. It's all RIGHT_JUSTIFY now. * simplify VP8GetSigned(), and add some masking branch-less code. Much faster on ARM (~13% speed-up). 8% on x86-64, 5% on MacBook. * split critical implementation into separate bit_reader_inl.h file that is only included where needed (vp8.c / tree.c / bit_reader.c) * bumped BITS value from 16 to 24 for x86-32b too, since it's a bit faster. Change-Id: If41ca1da3e5c3dadacf2379d1ba419b151e7fce8	2014-06-03 07:46:55 +02:00
skal	6679f8996f	Optimize VP8SetResidualCoeffs. Brings down WebP lossy encoding timings by 5% Change-Id: Ia4a2fab0a887aaaf7841ce6d9ee16270d3e15489	2014-06-03 06:44:04 +02:00
skal	399b916d27	lossy decoding: correct alpha-rescaling for YUVA format The luminance needs to be pre- and post- multiplied by the alpha value in case of rescaling, for proper averaging. Also: - removed util/alpha_processing and moved it to dsp/ - removed WebPInitPremultiply() which was mostly useless and merged it with the new function WebPInitAlphaProcessing() Change-Id: If089cefd4ec53f6880a791c476fb1c7f7c5a8e60	2014-05-27 15:27:13 -07:00
Pascal Massimino	fdbcd44dd3	simplify VP8LInitBitReader() gcc was generating very complex code, one for each case of br->len_ values! also, pretty-fy the mask constants Change-Id: If62b1e8266f3fe5334517305113038d2ea8a6b42	2014-05-22 21:44:16 -07:00
skal	176fda2650	fix the bit-writer for lossless in 32bit mode Sometimes, we can write 18bit or more at time, and it would overflow the 32bit accumulator. Also clarified the num-bits limitations (and exposed VP8L_MAX_NUM_BIT_READ in bit_reader.h) fixes http://code.google.com/p/webp/issues/detail?id=200 Seems a bit faster (use of local fields for bits_ / used_) also: added the __QNX__ bswap while at it. Change-Id: I876db93a931db15b083cf1d838c70105effa7167	2014-05-22 07:19:22 +02:00
skal	54bfffcabc	move RemapBitReader() from idec.c to bit_reader code mostly for coherency and later patch. Change-Id: Ica8352d67845b6c5b3153435edfb4646c6f24341	2014-05-14 07:07:08 +02:00
skal	f948d08c81	memory debug: allow setting pre-defined malloc failure points MALLOC_FAIL_AT flag can be used to set-up a pre-determined failure point during malloc calls. The counter value is retrieved using getenv(). Example usage: export MALLOC_FAIL_AT=37 && cwebp input.png will make 'cwebp' report a memory allocation error the 37th time malloc() or calloc() is called. MALLOC_MEM_LIMIT can be used similarly to prevent allocating more than a given amount of memory. This is usually less convenient to use than MALLOC_FAIL_AT since one has to know in advance the typical memory size allocated. Both these flags are meant to be used for debugging only! Also: added a 'total_mem_allocated' to record the overall memory allocated Change-Id: I9d408095ee7d76acba0f3a31b1276fc36478720a	2014-05-05 14:01:33 -07:00
Vikas Arora	9383afd5c7	Reduce number of memory allocations while decoding lossless. This change reduces the number of calls to WebPSafeMalloc from 200 to 100. The overall memory consumption is down 3% for Lenna image. Change-Id: I1b351a1f61abf2634c035ef1ccb34050b7876bdd	2014-05-02 01:01:43 -07:00
Pascal Massimino	2aa187360d	instrument memory allocation routines for debugging Some tracing code is activated by PRINT_MEM_INFO flag. For debugging only! (not thread-safe, and slow). Change-Id: I282c623c960f97d474a35b600981b761ef89ace9	2014-05-02 00:19:55 -07:00
Pascal Massimino	b3a616b356	make HistogramAdd() a pointer in dsp * merged the two HistogramAdd/AddEval() into a single call (with detection of special case when b==out) * added a SSE2 variant * harmonize the histogram type to 'uint32_t' instead of just 'int'. This has a lot of ripples on signatures. * 1-2% faster Change-Id: I10299ff300f36cdbca5a560df1ae4d4df149d306	2014-04-28 10:09:34 -07:00
Vikas Arora	0b896101b4	Reduce memory footprint for encoding WebP lossless. Reduce calls to Malloc (WebPSafeMalloc/WebPSafeCalloc) for: - Building HashChain data-structure used in creating the backward references. - Creating Backward references for LZ77 or RLE coding. - Creating Huffman tree for encoding the image. For the above mentioned code-paths, allocate memory once and re-use it subsequently. Reduce the footprint of VP8LHistogram struct by changing the Struct field 'literal_' from an array of constant size to dynamically allocated buffer based on the input parameter cache_bits. Initialize BitWriter buffer corresponding to 16bpp (2WH). There are some hard-files that are compressed at 12 bpp or more. The realloc is costly and can be avoided for most of the WebP lossless images by allocating some extra memory at the encoder initialization. Change-Id: I1ea8cf60df727b8eb41547901f376c9a585e6095	2014-04-26 01:14:33 -07:00
Urvang Joshi	8c7cd722f6	Bugfix: Incremental decode of lossy-alpha When remapping buffer, br->eos_ was wrongly being set to true for certain images. Also, refactored the end-of-stream detection as a function. Reported in http://crbug.com/364830 Change-Id: I716ce082ef2b505fe24246b9c14912d8e97b5d84	2014-04-22 16:06:32 -07:00
Djordje Pesut	2772b8bd98	MIPS: fix assembler error revealed by clang's debug build .set at - Indicates that macro expansions may clobber the assembler temporary ($at or $28) register. Some macros may not be expanded without this and will generate an error message if noat is in effect. "at" also added to the clobber list. Change-Id: I67feebbd9f2944fc7f26c28496e49e1e2348529d	2014-04-18 18:10:52 +02:00
James Zern	1a05dfa7f5	windows: fix dll builds WebPSafe* need to be marked external to allow mux/demux to access them through libwebp.dll Change-Id: Ib6620e00d376f7aa5a0550e1e244f759977f97a0	2014-03-31 17:46:12 -07:00
skal	af93bdd6bc	use WebPSafe[CM]alloc/WebPSafeFree instead of [cm]alloc/free there's still some malloc/free in the external example This is an encoder API change because of the introduction of WebPMemoryWriterClear() for symmetry reasons. The MemoryWriter object should probably go in examples/ instead of being in the main lib, though. mux_types.h still contains some inlined free()/malloc() that are harder to remove (we need to put them in the libwebputils lib and make sure link is ok). Left as a TODO for now. Also: WebPDecodeRGB*() functions are still returning a pointer that needs to be free()'d. We should call WebPSafeFree() on these, but it means exposing the whole mechanism. TODO(later). Change-Id: Iad2c9060f7fa6040e3ba489c8b07f4caadfab77b	2014-03-27 15:50:59 -07:00
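
Several commits in this log (6781423b7d, 47779d46c8, 1da3d46138) describe a byte swap that uses `__builtin_bswap32` when it is available and a fallback otherwise. The following is a minimal sketch of that pattern, not the code in endian_inl.h. The names `SketchBSwap32` and `SketchBSwap32Portable` are hypothetical, and the `__GNUC__` check is an assumption standing in for the configure-time HAVE_BUILTIN_BSWAP32 test mentioned in the log.

```c
#include <stdint.h>

// Portable fallback: reverse the four bytes with shifts and masks.
static uint32_t SketchBSwap32Portable(uint32_t x) {
  return (x >> 24) | ((x >> 8) & 0x0000ff00u) |
         ((x << 8) & 0x00ff0000u) | (x << 24);
}

// Hypothetical wrapper: prefer the compiler builtin when the compiler
// identifies itself as gcc-compatible (assumption: the builtin exists there),
// otherwise fall back to the portable version.
static uint32_t SketchBSwap32(uint32_t x) {
#if defined(__GNUC__)
  return __builtin_bswap32(x);
#else
  return SketchBSwap32Portable(x);
#endif
}
```

Both functions should agree on every input; swapping twice returns the original value.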

302 Commits