libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-18 23:09:52 +02:00

Author	SHA1	Message	Date
Pascal Massimino	09545eeadc	lossy bit-reader clean-up: * remove LEFT/RIGHT_JUSTIFY distinction. It's all RIGHT_JUSTIFY now. * simplify VP8GetSigned(), and add some masking branch-less code. Much faster on ARM (~13% speed-up). 8% on x86-64, 5% on MacBook. * split critical implementation into separate bit_reader_inl.h file that is only included where needed (vp8.c / tree.c / bit_reader.c) * bumped BITS value from 16 to 24 for x86-32b too, since it's a bit faster. Change-Id: If41ca1da3e5c3dadacf2379d1ba419b151e7fce8	2014-06-03 07:46:55 +02:00
skal	c9b340a279	fix missing WebPInitAlphaProcessing call for premultiplied colorspace output (lossless only) Change-Id: Ic2d01c8cf9bc1082f07f348733461eb2ee30288a	2014-05-28 10:44:05 +02:00
skal	399b916d27	lossy decoding: correct alpha-rescaling for YUVA format The luminance needs to be pre- and post- multiplied by the alpha value in case of rescaling, for proper averaging. Also: - removed util/alpha_processing and moved it to dsp/ - removed WebPInitPremultiply() which was mostly useless and merged it with the new function WebPInitAlphaProcessing() Change-Id: If089cefd4ec53f6880a791c476fb1c7f7c5a8e60	2014-05-27 15:27:13 -07:00
skal	a05dc1402c	SSE2: yuv->rgb speed-up for point-sampling - use statically initialized tables (if WEBP_YUV_USE_SSE2_TABLES is defined) - use SSE2 row conversion for yuv->ARGB / RGBA / ABGR / RGB / BGR - clean-up and harmonize the WebpUpsamplers[] usage. Change-Id: Ic5f3659a995927bd7363defac99c1fc03a85a47d	2014-05-22 09:56:47 +02:00
Pascal Massimino	a2f8b28905	revamp the point-sampling functions by processing a full plane -nofancy is slower than fancy upsampler, because the latter has SSE2 optim. Change-Id: Ibf22e5a8ea1de86a54248d4a4ecc63d514c01b88	2014-05-20 15:13:44 -07:00
skal	54bfffcabc	move RemapBitReader() from idec.c to bit_reader code mostly for coherency and later patch. Change-Id: Ica8352d67845b6c5b3153435edfb4646c6f24341	2014-05-14 07:07:08 +02:00
Pascal Massimino	f1e771735a	remove all unused layer code Change-Id: I220590162b24c70f404fe3087f19dd3e6cac3608	2014-05-08 22:37:38 -07:00
Vikas Arora	9383afd5c7	Reduce number of memory allocations while decoding lossless. This change reduces the number of calls to WebPSafeMalloc from 200 to 100. The overall memory consumption is down 3% for Lenna image. Change-Id: I1b351a1f61abf2634c035ef1ccb34050b7876bdd	2014-05-02 01:01:43 -07:00
James Zern	e69a1df4b7	dec/vp8l: prevent signed int overflow in left shift ops force unsigned when shifting by 24. Change-Id: I6f9ca5fa2109e59b1d46a909136384fc6dc8ca0b	2014-04-29 14:12:38 -07:00
skal	af93bdd6bc	use WebPSafe[CM]alloc/WebPSafeFree instead of [cm]alloc/free there's still some malloc/free in the external example This is an encoder API change because of the introduction of WebPMemoryWriterClear() for symmetry reasons. The MemoryWriter object should probably go in examples/ instead of being in the main lib, though. mux_types.h still contains some inlined free()/malloc() that are harder to remove (we need to put them in the libwebputils lib and make sure link is ok). Left as a TODO for now. Also: WebPDecodeRGB*() functions are still returning a pointer that needs to be free()'d. We should call WebPSafeFree() on these, but it means exposing the whole mechanism. TODO(later). Change-Id: Iad2c9060f7fa6040e3ba489c8b07f4caadfab77b	2014-03-27 15:50:59 -07:00
skal	207d03b484	fix out-of-bound read during alpha-plane decoding With -bypass_filter switched on, the lossless-compressed data is decoded ahead of time (before being transformed and displayed). Hence, the last row was called twice. http://code.google.com/p/webp/issues/detail?id=193 Change-Id: I9e13f495f6bd6f75fa84c4a21911f14c402d4b10	2014-03-26 22:45:03 +01:00
Pascal Massimino	b7685d73fe	Rescale: let ImportRow / ExportRow be pointer-to-function Separate the C version from the MIPS32 version and have run-time initialization during RescalerInit() Change-Id: I93cfa5691c073a099fe62eda1333ad2bb749915b	2014-02-17 00:58:17 -08:00
Urvang Joshi	1684f4ee37	WebP Decoder: Mark some truncated bitstreams as invalid Specifically, check for truncated RIFF and/or VP8(L) chunks. For more context, see: https://code.google.com/p/webp/issues/detail?id=185 Change-Id: I91ca2dbf05080660fbc513244fc53adc57fc04b5	2014-02-10 16:35:27 -08:00
Vikas Arora	1d1cd3bbd6	Fix decode bug for rgbA_4444/RGBA_4444 color-modes. The WEBP_SWAP_16BIT_CSP flag needs to be honored while filling the Alpha (4 bits) data in the destination buffer and while pre-multiplying the alpha to RGB colors. Change-Id: I3b07307d60963db8d09c3b078888a839cefb35ba	2014-02-03 09:20:54 -08:00
Djordje Pesut	53520911c3	Added support for calling sampling functions via pointers. Change-Id: Ic4d72e6b175a6b27bcdcc8cd97828e44ea93e743	2014-01-29 15:32:35 +01:00
skal	bbc23ff34c	parse one row of intra modes altogether (instead of per-macroblock) speed unchanged. simplified the context-saving for incremental decoding Change-Id: I301be581bab581ff68de14c4ffe5bc0ec63f34be	2014-01-28 21:40:40 +01:00
James Zern	c5a5b0286f	decode mt+incremental: fix segfault in debug builds VP8GetThreadMethod() may be called with a NULL headers param; correct an assert. broken since: `8a2fa09` Add a second multi-thread method Change-Id: If7b6d1b8f4ec874d343a806cee5f5e6bb6438620	2014-01-27 20:33:05 -08:00
skal	5da185522b	add a decoding option to flip image vertically New API options: WebPDecoderOptions.flip and 'dwebp -flip ...' it uses negative stride trick. Also changed the decoder code to support user-supplied buffers with negative stride, independently of the WebPDecoderOptions.flip value. Change-Id: I4dc0d06f0c87e51a3f3428be4fee2d6b5ad76053	2014-01-16 15:48:43 +01:00
James Zern	8c524db84c	bump version to 0.4.0 libwebp{,decoder} - 0.4.0 libwebp libtool - 5.0.0 libwebpdecoder libtool - 1.0.0 mux/demux - 0.2.0 libtool - 1.0.0 Change-Id: Idbd067f95a6af2f0057d6a63ab43176fcdbb767d	2013-12-18 19:20:00 -08:00
James Zern	c72e08119a	Merge "dec/webp.c: don't wait for data before reporting w/h"	2013-12-18 18:47:43 -08:00
James Zern	5ad653145a	dec/frame.c: fix formatting since `26d842e` NEON speed up + drop a duplicate and from a comment Change-Id: I710f46f83b80161064910c7efc16788b88c089fe	2013-12-18 17:16:31 -08:00
James Zern	f7fc4bc89b	dec/webp.c: don't wait for data before reporting w/h this partially reverts `f626fe2` Detect canvas and image size mismatch in decoder. the original change would cause calls to e.g., WebPGetInfo to fail until a portion of the image chunk was available. With lossy+alpha this meant waiting for the entire ALPH chunk to be received. this change restores the original behavior -- reporting the values from VP8X if available -- while retaining some of the added canvas/image size checks if the image data is available Change-Id: I6295b00a2e2d0d4d8847371756af347e4a80bc0e	2013-12-18 17:09:04 -08:00
skal	66a32af5e1	Merge "NEON speed up"	2013-12-18 14:17:19 -08:00
skal	26d842eb8f	NEON speed up add TransformDC special case, and make the switch function inlined. Recovers a few of the CPU lost during the addition of TransformAC3 (only on ARM) Change-Id: I21c1f0c6a9cb9d1dfc1e307b4f473a2791273bd6	2013-12-18 22:32:58 +01:00
James Zern	605a712701	simplify __cplusplus ifdef drop c_plusplus which is from a quite ancient pre-standard compiler Change-Id: I9e357b3292a6b52b14c2641ba11f4f872c04b7fb	2013-12-16 20:16:02 -08:00
James Zern	5227d99146	drop: ifdef __cplusplus checks from C files the prototypes are already marked in the headers Change-Id: I172fe742200c939ca32a70a2299809b8baf9b094	2013-12-13 11:42:13 -08:00
James Zern	c12e2369d8	cosmetics: fix a few typos Change-Id: I73b1900b2d960c4c57ef7078df137c776b321a1b	2013-12-03 22:36:29 -08:00
James Zern	cc55790e37	Merge changes I8bb7a4dc,I2c180051,I021a014f,I8a224a62 * changes: mux: add some missing casts enc/vp8l: add a missing cast idec: add some missing casts ErrorStatusLossless: correct return type	2013-11-27 17:06:35 -08:00
James Zern	c536afb57b	Merge "cosmetics: fix some typos"	2013-11-27 17:04:00 -08:00
skal	cbdd3e6e53	add a -dither dithering option to the decoder Even at high quality setting, the U/V quantizer step is limited to 4 which can lead to banding on gradient. This option allows to selectively apply some randomness to potentially flattened-out U/V blocks and attenuate the banding. This option is off by default in 'dwebp', but set to -dither 50 by default in 'vwebp'. Note: depending on the number of blocks selectively dithered, we can have up to a 10% slow-down in decoding speed it seems. Change-Id: Icc2446007f33ddacb60b3a80a9e63f2d5ad162de	2013-11-27 00:57:51 -08:00
James Zern	4931c3294b	cosmetics: fix some typos Change-Id: I0d6efebd817815139db5ae87236fd8911df4d53c	2013-11-26 19:21:14 -08:00
James Zern	46db286572	idec: add some missing casts Change-Id: I021a014f1f3a597f37ba8d9f4006c8cb4100723c	2013-11-25 20:47:35 -08:00
James Zern	b524e3369c	ErrorStatusLossless: correct return type int -> VP8StatusCode Change-Id: I8a224a622e98d401a6e401047780fa6b25cb0ae4	2013-11-25 20:45:53 -08:00
skal	df3649a287	remove all disabled code related to P-frames it's drifting out of sync, and won't be used anyway... Change-Id: I931b288e1c8480bf3bccd685b3a356bb6bd8e10b	2013-11-04 11:52:05 +01:00
skal	43148b6cd2	filtering: precompute ilimit and hev_threshold no speed change, just simplifying the logic Change-Id: I518800494428596733d4fbae69072049828aec3c	2013-10-28 13:37:33 +01:00
Pascal Massimino	18f992ec0f	simplify f_inner calculation a little by incorporating the is_4x4 flag at init Change-Id: I042e04aacb15181db0bf86f3212c880087519189	2013-10-28 01:49:09 -07:00
Pascal Massimino	241d11f141	add missing const Change-Id: Id1c767d21d52197ed2e4497005eb9c4795c602f0	2013-10-25 20:34:14 +02:00
Pascal Massimino	86c0031eb2	add a 'format' field to WebPBitstreamFeatures Change-Id: I79a688e4c34fb77527127bbdf4bc844efa6aa9a4	2013-10-25 20:34:06 +02:00
skal	0b2b05049f	Use deterministic random-dithering during RGB->YUV conversion -> helps debanding (sky, gradients, etc.) This dithering can only be triggered when using -preset photo or -pre 2 (as a preprocessing). Everything is unchanged otherwise. Note that this change is likely to make the perceived PSNR/SSIM drop since we're altering the input internally. Change-Id: Id8d4326245d9b828141de162c94ba381b1fa5813	2013-10-17 22:36:49 +02:00
skal	8a2fa099cc	Add a second multi-thread method method 1 grouping: [parse + reconstruction] // [filtering + output] method 2 grouping: [parse] // [reconstruction+filtering + output] Depending on some heuristics (see VP8ThreadMethod()), we can pick one or the other when -mt flag (or option.use_threads) is selected. Conservatively, we always use method #2 for now until the heuristic is refined (so, timing should be the same as before this patch) + replace 'use_threads' by 'mt_method' + define MIN_WIDTH_FOR_THREADS constant + fix comment alignment Change-Id: I11a756dea9070d6e21b1a9481d357a1e8aa0663e	2013-10-15 23:58:31 +02:00
skal	0532149c8a	up to 20% faster multi-threaded decoding Mostly visible for large images. Reconstruction+filtering is now done in parallel to bitstream-parsing. Change-Id: I4cc4483d803b255f4d97a2fcd9158b1c291dd900	2013-10-15 00:25:21 +02:00
skal	cb22155201	Decode a full row of bitstream before reconstructing Needs more memory but allows for future parallelization. Noticeably faster on ARM, slightly faster on x86 also: remove dec->filter_row_ unnecessary field Change-Id: I044a808839b4e000c838a477e3e8688820436d9a	2013-10-10 21:29:58 +02:00
skal	0b28d7ab08	use a macrofunc for setting NzCoeffs bits (avoids code dup) Change-Id: I776f065538e562673ca08f3bc43c7167d13254d9	2013-10-09 11:46:32 +02:00
skal	f9bbc2a034	Special-case sparse transform If the number of non-zero coeffs is <= 3, use a simplified transform for luma. Change-Id: I78a1252704228d21720d4bc1221252c84338d9c8	2013-10-08 22:05:38 +02:00
skal	63f9aba4b3	special-case WHT transform when there's only DC happens surprisingly often at low quality, so we might as well hard-code a simplified TransformWHT() directly. Change-Id: Ib7a858ef74e8f334bd59d6512bf5bd3e455c5459	2013-10-02 14:27:36 +02:00
skal	2a98136667	7-8% faster decoding by rewriting GetCoeffs() Change-Id: Ib7c27e985d3b5222e8fa1f98cec462458caa9541	2013-09-30 23:17:34 +02:00
skal	92d47e4ca9	improve VP8L signature detection by checking the version bits too Change-Id: I20bea00b9582d7ea8c7b643616c78f717ce1bdf2	2013-09-27 18:17:23 +02:00
Pascal Massimino	40ae3520b1	fix memleak in WebPIDelete() happens when decoding is partial (past Partition0), without error and interrupted by calling WebPIDelete() WebPIDelete() needs to call VP8ExitCritical() to free in-flight resources Change-Id: Id4faef1b92f7edd8c17d642c58860e70dd570506	2013-09-17 15:45:43 -07:00
Pascal Massimino	6a44550a8c	break down the proba 4D-array into some handy structs Makes it easy to later add derived satellite fields... Change-Id: I445767ea78cc788d11aec367479e74e485fdabe5	2013-09-14 03:12:09 -07:00
skal	c13fecf908	remove the PACK() bit-packing tricks was too smart for its own good :) This is more ARM-friendly, since it removes a mult. Change-Id: If146034c8efa2e71e3eaaf1230cb553884a42ebb	2013-09-05 08:53:36 +02:00
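
Commit e69a1df4b7 in the list above ("prevent signed int overflow in left shift ops", "force unsigned when shifting by 24") names a general C pitfall. The snippet below is a minimal sketch of that pitfall and is not taken from the libwebp sources: the helper name `PackARGB` and its parameters are hypothetical, chosen only for illustration.

```c
#include <stdint.h>

/* Hypothetical helper (not libwebp code): packs four 8-bit channels
 * into one 32-bit word, with alpha in the top byte. */
static uint32_t PackARGB(uint8_t a, uint8_t r, uint8_t g, uint8_t b) {
  /* In `a << 24`, `a` would be promoted to a signed int. With a 32-bit
   * int, any a >= 128 shifts a 1 into the sign bit, which is undefined
   * behavior in C. Casting to uint32_t first keeps the shift defined. */
  return ((uint32_t)a << 24) |
         ((uint32_t)r << 16) |
         ((uint32_t)g << 8)  |
         (uint32_t)b;
}
```

With the casts in place, a fully opaque pixel such as `PackARGB(0xff, 0x12, 0x34, 0x56)` packs to `0xff123456` without relying on signed-overflow behavior.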

424 Commits