libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-18 23:09:52 +02:00

Author	SHA1	Message	Date
Vikas Arora	8eae188a62	WebP-Lossless encoding improvements. Lossy (with Alpha) image compression gets 2.3X speedup. Compressing lossless images is 20%-40% faster now. Change-Id: I41f0225838b48ae5c60b1effd1b0de72fecb3ae6	2013-05-08 17:22:11 -07:00
skal	b7eaa85d6a	inline VP8LFastLog2() and VP8LFastSLog2 for small values larger values are still dealt with in the .cc ~5% faster encoding Output size is slightly different (variably), because of different floating-point calculation ordering. Change-Id: I6ede18b09c753997cf78aa1199a807d9ddb5d4b4	2013-02-25 22:46:52 +01:00
skal	943386db4b	disable SSE2 for now (until proper run-time detection is ready) Change-Id: I7b8eee52b23fce2f1612ad7d4ed603ffb02620a2	2013-02-20 08:20:47 +01:00
skal	9479fb7d2d	lossless encoding speedup * add SSE2 variant for lossless * speed-up TransformColor calls using specialized TransformColorBlue/Red * Fuse the Shannon Entropy calls to compute it for X and X+Y simultaneously. This latter changes the output size a little bit. Change-Id: Ie5df94da78bf51a58da859c9099b56340da9ec89	2013-02-20 08:13:12 +01:00
skal	b7490f8553	introduce WEBP_REFERENCE_IMPLEMENTATION compile option This flag will make the code use no uint64, no asm, and no fancy trick, but instead aim at being as simple and straightforward as possible. Main use is to help emscripten generate proper JS code. More code needs to be simplified later. Also: tune the BITS values to be 24 and make use of WEBP_RIGHT_JUSTIFY Here are the typical timing for decoding a large image: ARM7-a: dwebp_justify_32_neon Time to decode picture: 3.280s dwebp_justify_24_neon Time to decode picture: 2.640s dwebp_justify_16_neon Time to decode picture: 2.723s dwebp_justify_8_neon Time to decode picture: 2.802s dwebp_justify_32 Time to decode picture: 4.264s dwebp_justify_24 Time to decode picture: 3.696s dwebp_justify_16 Time to decode picture: 3.779s dwebp_justify_8 Time to decode picture: 3.834s dwebp_32_neon Time to decode picture: 4.010s dwebp_24_neon Time to decode picture: 2.725s dwebp_16_neon Time to decode picture: 2.852s dwebp_8_neon Time to decode picture: 2.778s dwebp_32 Time to decode picture: 4.587s dwebp_24 Time to decode picture: 3.800s dwebp_16 Time to decode picture: 3.902s dwebp_8 Time to decode picture: 3.815s REFERENCE (HEAD) Time to decode picture: 3.818s x86_64: dwebp_justify_32 Time to decode picture: 0.473s dwebp_justify_24 Time to decode picture: 0.434s dwebp_justify_16 Time to decode picture: 0.450s dwebp_justify_8 Time to decode picture: 0.467s dwebp_32 Time to decode picture: 0.474s dwebp_24 Time to decode picture: 0.468s dwebp_16 Time to decode picture: 0.468s dwebp_8 Time to decode picture: 0.481s REFERENCE (HEAD) Time to decode picture: 0.436s i386: dwebp_justify_32 Time to decode picture: 0.723s dwebp_justify_24 Time to decode picture: 0.618s dwebp_justify_16 Time to decode picture: 0.626s dwebp_justify_8 Time to decode picture: 0.651s dwebp_32 Time to decode picture: 0.744s dwebp_24 Time to decode picture: 0.627s dwebp_16 Time to decode picture: 0.642s dwebp_8 Time to decode picture: 0.642s Change-Id: Ie56c7235733a24f94fbfc2e4351aae36ec39c225	2013-02-14 15:46:12 +01:00
Vikas Arora	94a48b4bc3	Provide option to swap bytes for 16 bit colormodes Color modes: RGB_565 & RGBA_4444 Change-Id: I571b6832b9848e5c4109272978f68623ca373383	2013-01-22 14:51:20 -08:00
vikas arora	e6409adc2e	Remove redundant include from dsp/lossless code. Change-Id: Ie8a497a486653f907c2a27f4027640a3308c6cc8	2013-01-10 15:09:19 -08:00
Urvang Joshi	f56e98fd11	Alignment fix Change-Id: Ia5475247f03456b01571ae7531da90f74c068045	2012-08-10 02:10:32 +05:30
Urvang Joshi	a0a488554d	Lossless decoder fix for a special transform order Fix the lossless decoder for the case when it has to apply other inverse transforms before applying Color indexing inverse transform. The main idea is to make ColorIndexingInverse virtually in-place: we use the fact that the argb_cache is allocated to accommodate all unpacked pixels of a macro-row, not just packed pixels. Change-Id: I27f11f3043f863dfd753cc2580bc5b36376800c4	2012-08-08 23:52:08 -07:00
Pascal Massimino	4af3f6c4d3	fix indentation Change-Id: Ib00b3cdc21ac336a56390f1e71c169e7fd4767a6	2012-08-02 11:55:55 -07:00
Pascal Massimino	323dc4d9b9	remove use of log2(). Use VP8LFastLog2() instead. Order-by-cost mostly unchanged (up to a scaling constant 1/log(2)) (except for few minor diff in < 2% of cases) + remove unused field cost_mode->cache_bits_ Change-Id: I714f8ab12f49a23f5d499a64c741382c9b489a3e	2012-08-02 00:08:58 -07:00
Pascal Massimino	2fc1301577	harmonize authors as "Name (mail@address)" Change-Id: I85bfae61a37de75a5ed945a906002de2ef75149f	2012-07-19 16:09:47 -07:00
Pascal Massimino	78f3e34504	Enable lossless encoder code Remove USE_LOSSLESS_ENCODER compile flag Update Makefile.am and makefile.unix Change-Id: If7080c4d8f37994c7c784730c5e547bb0a851455	2012-06-13 00:26:58 -07:00
James Zern	42f6df9da3	fix some implicit type conversion warnings Change-Id: I0653d10410c0d46f91fedad4c4dffa9c1de402cb	2012-06-04 22:33:32 -07:00
Pascal Massimino	48f827574e	add colorspace for premultiplied alpha The new modes are MODE_rgbA MODE_bgrA MODE_Argb MODE_rgbA_4444 It's binary incompatible, since the enums changed. While at it, i removed the now unneeded KeepAlpha methods. -> Saved ~12k of code! * made explicit mention that alpha_plane is persistent, so we have access to the full alpha plane data at all time. Incremental decoding of alpha was planned for, but not implemented. So better not dragged this constaint for now and make the code easier until we revisit that. Change-Id: Idaba281a6ca819965ca062d1c23329f36d90c7ff	2012-06-04 07:50:41 -07:00
Pascal Massimino	75d7f3b222	Merge "make input data be 'const' for VP8LInverseTransform()"	2012-05-23 07:54:12 -07:00
Pascal Massimino	9a721c6d24	make input data be 'const' for VP8LInverseTransform() Change-Id: I5b5b1e29bca6c42704df141b21632a0d0fcb07cf	2012-05-23 07:21:53 -07:00
Vikas Arora	237eab6764	Add two more color-spaces for lossless decoding. Added color-spaces (RGBA_4444 and RGB_565), required for Android device to lossless decoding. Change-Id: I229832edd4deca59e066f463e7454f77457c5bcd	2012-05-23 12:10:13 +05:30
James Zern	37a77a6bf4	remove some variable shadowing Change-Id: I4348253ec6b50639095b22c4745dc26da0904466	2012-05-15 14:04:24 -07:00
James Zern	e38602d2ad	Merge branch 'lossless_encoder' * lossless_encoder: (46 commits) split StoreHuffmanCode() into smaller functions more consolidation: introduce VP8LHistogramSet big code clean-up and refactoring and optimization Some cosmetics in histogram.c Approximate FastLog between value range [256, 8192] Forgot to update out_bit_costs to symbol_bit_costs at one instance. Evaluate output cluster's bit_costs once in HistogramRefine. Simple Huffman code changes. Lossless decoder: remove an unneeded param in ReadHuffmanCodeLengths(). Reducing emerging palette size from 11 to 9 bits. Move GetHistImageSymbols to histogram.c Improve predict vs no-predict heuristic. code-moving and clean-up reduce memory usage by allocating only one histo Restrict histo_bits to ensure histo_image size is under 32MB further simplification for the meta-Huffman coding A quick pass of cleanup in backward reference code Make transform bits a function of encode method (-m). introduce -lossless option, protected by USE_LOSSLESS_ENCODER Run TraceBackwards for higher qualities. ... Conflicts: src/enc/webpenc.c Change-Id: I9a5d98cba0889ea91d10699466939cc283da345a	2012-05-07 14:27:17 -07:00
Vikas Arora	ada6ff77df	Approximate FastLog between value range [256, 8192] Profiled data: Profiled few images and found that in the function VP8LFastLog, 90% of time table lookup is performed, while rest of time (10%) call to log function is made. Typical lookup accounts for 10 CPU instructions and call to log 200 instruction counts. The weighted average comes out to be 30 instructions per call. For mid qualities (25-75), this function (VP8LFastLog) accounts for 30-50% of total CPU cycles (via call path: VP8LCOlorSpaceTransform -> PredictionCostCrossColor -> ShannonEntropy). After this change, the log is called less that 1% of time, with average instructions being 15 per call. Measured the performance over 1000 files for various qualities and found overall compression speedup between 10-15% (in quality range [0, 75]). The compression density loss is around 0.5% (though at some qualities, compression is little better as well). Change-Id: I247bc6a8d4351819c871f19d65455dc23aea8650	2012-05-07 14:25:26 -07:00
Urvang Joshi	0993a611cd	Full and final fix for prediction transform use (tile_size + 1) rows of scratch area. Change-Id: I06d612fff1794fc045ba76275e94e7210802c332	2012-05-07 14:24:43 -07:00
Urvang Joshi	afd2102f43	Fix cross-color transform in lossless encoder make elements of "Multiplier" struct unsigned, so that any negative values are automatically converted to "mod 256" values. Change-Id: Iab4f9bacc50dcd94a557944727d9338dbb0982f7	2012-05-07 14:24:41 -07:00
Urvang Joshi	4f0c5caf67	Fix prediction transform in lossless encoder. (Keep one tile as a scratch buffer). Change-Id: If112ada29bfd0bdc81b82e849a566b30dd331d2f	2012-05-07 14:24:35 -07:00
Vikas Arora	d673b6b9a0	Change the predictor function to pass left pixel instead of pointer to the source. Change-Id: Ia2c8e17c3140709a825c2f85a88c5e31bd6e462f	2012-05-07 14:24:29 -07:00
Urvang Joshi	b2f99465a7	Fix CopyTileWithPrediction() so that it uses original values of left, top etc for prediction rather than the predicted values of the same. Also, do some renaming in the same to make it more readable. Change-Id: I2fe94e35a6700bd437f5c601e2af12323bf32445	2012-05-07 14:24:27 -07:00
Urvang Joshi	6b38378acb	Guard the lossless encoder (in flux) under a flag Change-Id: I6dd8fd17089c199001c06b1afde14233dc3e3234	2012-05-07 14:24:23 -07:00
Vikas Arora	09f7532cce	Fix few nits (const qualifiers) Change-Id: I527e82af49956b695ab18625d34e143854067421	2012-05-07 14:24:21 -07:00
Vikas Arora	648be3939f	Added implementation for various lossless functions - VP8LEncAnalyze, EvalAndApplySubtractGreen, ApplyPredictFilter, ApplyCrossColorFilter - Added palette handling and transform buffer management in VP8LEncodeImage() - Add Transforms (subtract Green, Predict, cross_color) to dsp/lossless.c. These are more-or-less copied from src/lossless code. After this Change, will implement the EncodeImageInternal() method. Change-Id: Idf71f803c24b3b5ae3b5079b15e019721784611d	2012-05-07 14:24:19 -07:00
Pascal Massimino	b38dfccf8d	remove unneeded reference to NUM_LITERAL_CODES Change-Id: I3e98acce3a69fa45054ffcf77644fcbbc04bd366	2012-05-04 19:01:09 -07:00
James Zern	532020f24a	lossless: remove some size_t -> int conversions Sizes are given as ints in the documentation and used as such elsewhere. Change-Id: I51ecd9e501cf9b4e3948aa0e947d2c9b5c85a30f	2012-04-24 16:00:00 -07:00
James Zern	b08819a624	dsp/lossless: silence some build warnings src/dsp/lossless.c: In function 'VP8LInverseTransform': src/dsp/lossless.c:312:23: warning: 'packed_pixels' may be used uninitialized in this function [-Wuninitialized] src/dsp/lossless.c:304:16: note: 'packed_pixels' was declared here src/dsp/lossless.c:258:34: warning: 'm.red_to_blue_' may be used uninitialized in this function [-Wuninitialized] src/dsp/lossless.c:275:17: note: 'm.red_to_blue_' was declared here src/dsp/lossless.c:257:34: warning: 'm.green_to_blue_' may be used uninitialized in this function [-Wuninitialized] src/dsp/lossless.c:275:17: note: 'm.green_to_blue_' was declared here src/dsp/lossless.c:255:33: warning: 'm.green_to_red_' may be used uninitialized in this function [-Wuninitialized] src/dsp/lossless.c:275:17: note: 'm.green_to_red_' was declared here patch by pepijn vaneeckhoudt Change-Id: Iffa4764487a75479df45e772169325cd9ee60d94	2012-04-20 12:35:35 -07:00
James Zern	514d008921	add dsp/lossless.[hc] from experimental Pulled from the current HEAD (218c32e). The history of this and related files is a bit entangled so rather trying to split the changes and introduce some noise in master's history we'll start with a fresh snapshot. The file progression is still available in the experimental branch. Change-Id: I40538799dbf999abb9408ac83f55b897d8e22498	2012-04-10 17:37:44 -07:00

1 2

83 Commits