libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-10-15 15:01:05 +02:00

Author	SHA1	Message	Date
skal	744930dbe2	add support for BITS > 32 on x86_64 desktop, it's a little faster to use BITS=56 on MacOS (/llvm) it's _much_ faster (~10%) Change-Id: I47c66ab7488341d8d1696d9301954b86b241b36d	2013-03-15 18:37:08 -07:00
skal	1667bded67	Remove ReadOneBit() and ReadSymbolUnsafe() Simplify and re-organize the VP8L bit-reader functions (e.g.: the 40-bit look-ahead code was helping much) Speed-up with LBITS=64, on arm7-a: => before: ./dwebp_justify_24_neon -v bryce_ll.webp Time to decode picture: 11.393s File bryce_ll.webp can be decoded (dimensions: 11158 x 2156). ... => after (LBITS=64): Time to decode picture: 9.953s making the VP8L bit-reader in 32 bit mode is going to be harder (because we need to be able to read two symbols at a time, each with max length 15 bits) Change-Id: I89746fb103b87b5e2fd40a3208a6fbc584b88297	2013-02-20 00:13:23 +01:00
skal	b7490f8553	introduce WEBP_REFERENCE_IMPLEMENTATION compile option This flag will make the code use no uint64, no asm, and no fancy trick, but instead aim at being as simple and straightforward as possible. Main use is to help emscripten generate proper JS code. More code needs to be simplified later. Also: tune the BITS values to be 24 and make use of WEBP_RIGHT_JUSTIFY Here are the typical timing for decoding a large image: ARM7-a: dwebp_justify_32_neon Time to decode picture: 3.280s dwebp_justify_24_neon Time to decode picture: 2.640s dwebp_justify_16_neon Time to decode picture: 2.723s dwebp_justify_8_neon Time to decode picture: 2.802s dwebp_justify_32 Time to decode picture: 4.264s dwebp_justify_24 Time to decode picture: 3.696s dwebp_justify_16 Time to decode picture: 3.779s dwebp_justify_8 Time to decode picture: 3.834s dwebp_32_neon Time to decode picture: 4.010s dwebp_24_neon Time to decode picture: 2.725s dwebp_16_neon Time to decode picture: 2.852s dwebp_8_neon Time to decode picture: 2.778s dwebp_32 Time to decode picture: 4.587s dwebp_24 Time to decode picture: 3.800s dwebp_16 Time to decode picture: 3.902s dwebp_8 Time to decode picture: 3.815s REFERENCE (HEAD) Time to decode picture: 3.818s x86_64: dwebp_justify_32 Time to decode picture: 0.473s dwebp_justify_24 Time to decode picture: 0.434s dwebp_justify_16 Time to decode picture: 0.450s dwebp_justify_8 Time to decode picture: 0.467s dwebp_32 Time to decode picture: 0.474s dwebp_24 Time to decode picture: 0.468s dwebp_16 Time to decode picture: 0.468s dwebp_8 Time to decode picture: 0.481s REFERENCE (HEAD) Time to decode picture: 0.436s i386: dwebp_justify_32 Time to decode picture: 0.723s dwebp_justify_24 Time to decode picture: 0.618s dwebp_justify_16 Time to decode picture: 0.626s dwebp_justify_8 Time to decode picture: 0.651s dwebp_32 Time to decode picture: 0.744s dwebp_24 Time to decode picture: 0.627s dwebp_16 Time to decode picture: 0.642s dwebp_8 Time to decode picture: 0.642s Change-Id: Ie56c7235733a24f94fbfc2e4351aae36ec39c225	2013-02-14 15:46:12 +01:00
skal	3383885799	faster decoding (3%-6%) . revamped the boolean decoder to use less shifts . added some description and ASCII art as explanations too. . clarified the types further (bit_t, lbit_t, range_t, etc.) . changed the negative field 'missing_' into positive 'bits_' Some stats, decoding some randomly encoded WebP files: with USE_RIGHT_JUSTIFY: BITS=32 => 133 files, 50 loops => 7.3s (1.097 ms/file/iterations) BITS=24 => 133 files, 50 loops => 7.3s (1.097 ms/file/iterations) BITS=16 => 133 files, 50 loops => 7.4s (1.120 ms/file/iterations) BITS=8 => 133 files, 50 loops => 7.5s (1.128 ms/file/iterations) without USE_RIGHT_JUSTIFY: BITS=32 => 133 files, 50 loops => 7.5s (1.131 ms/file/iterations) BITS=24 => 133 files, 50 loops => 7.6s (1.142 ms/file/iterations) BITS=16 => 133 files, 50 loops => 7.6s (1.143 ms/file/iterations) BITS=8 => 133 files, 50 loops => 7.6s (1.149 ms/file/iterations) Change-Id: I9277fb051676c05582e9c7ea3cb5a4b2a3ffb12e	2013-02-14 15:42:58 +01:00
skal	ba2aa0fdda	Add support for BITS=24 case The main advantage is that you can avoid the use of uint64_t some times, sticking to 32bit only. Default still is BITS=32, this is mainly "in case". Change-Id: Id694028793117ba822c37d46ef6c52fa0afed4ac	2012-11-26 23:47:08 +01:00
skal	2afee60a7c	speed up for ARM using 8bit for boolean decoder SBITS=8 is reported 20-30% faster on ARM (where 64bit ops are expensive). Also use 32bits for i32. Change-Id: Id6a7197d805061aeb8832f20432512d0d930ebfa	2012-09-10 23:27:58 +02:00
Pascal Massimino	2cf1f81590	Merge "fix the BITS=8 case"	2012-09-03 02:36:03 -07:00
Pascal Massimino	12f78aec48	fix the BITS=8 case spotted by Måns Rullgård (mans at mansr dot com) Change-Id: I4720dc2eeb645af894e396739be6fa11b5fe2739	2012-09-03 02:29:14 -07:00
Pascal Massimino	6920c71f0a	fix MSVC warnings regarding implicit uint64 to uint32 conversions Change-Id: I284dae9222a3817bba3c5ba6be271b31b5bf660d	2012-09-01 07:21:45 -07:00
James Zern	1bd0bd0d4d	bit_reader.h: correct include use webp/types.h rather than webp/decode_vp8.h Change-Id: I9c6da04b92ff00d6dac47ce3eb0bcb2d6a96712d	2012-04-23 17:04:22 -07:00
James Zern	dceb8b4d9a	Merge changes If1331d3c,I86fe3847 * changes: types.h: centralize use of stddef.h vp8io: use size_t for buffer size	2012-04-14 13:01:14 -07:00
Pascal Massimino	fac0f12e1b	rename BitReader to VP8LBitReader Change-Id: I192b76422e131a94fb58c2c4a5520a5dba807126	2012-04-13 01:56:31 -07:00
James Zern	fbd82b5a39	types.h: centralize use of stddef.h for size_t / NULL Change-Id: If1331d3cf44296ed0ba9e838eae2f5b1bcaeb61b	2012-04-12 17:14:58 -07:00
James Zern	8d254a0927	cosmetics long line, remove out of date TODO Change-Id: Ic8a40c9d731178af85645b3e24c1cbd807d7d58b	2012-04-11 15:44:48 -07:00
Pascal Massimino	6f01b830e2	split the VP8 and VP8L decoding properly * each with their own decoder instances. * Refactor the incremental buffer-update code a lot. * remove br_offset_ for VP8LDecoder along the way * make VP8GetHeaders() be used only for VP8, not VP8L bitstream * remove VP8LInitDecoder() * rename VP8LBitReaderResize() to VP8LBitReaderSetBuffer() (cherry picked from commit 5529a2e6d47212a721ca4ab003215f97bd88ebb4) Change-Id: I58f0b8abe1ef31c8b0e1a6175d2d86b863793ead	2012-04-10 23:06:58 -07:00
James Zern	c4ae53c8b3	add utils/bit_reader.[hc] changes from experimental Pulled from the parent of the current version (5529a2e^). The history of this and related files is a bit entangled so rather trying to split the changes and introduce some noise in master's history we'll start with a fresh snapshot. The file progression is still available in the experimental branch. Change-Id: I6dae97fc381cd6c1d1640c4c565b2084a41ec955	2012-04-10 17:48:49 -07:00
Pascal Massimino	6f7bf645b4	issue 111: fix little-endian problem in bit-reader patch by naideflan Change-Id: I874dbd5588d5cd2559c54ca9ad5582fa3a589b1b	2012-03-27 05:55:20 -07:00
James Zern	b3e4054f14	silence msvc debug build warning _byteswap_ulong is defined in stdlib.h, release builds seem to pull it in through a different path. Change-Id: I510d2624150f89a4a77734bf3dc5b4db60a4ba95	2012-02-21 13:57:48 -08:00
Pascal Massimino	01b6380656	4-5% faster decoding, optimized byte loads in arithmetic decoder. Bits are loaded 32bits at a time (and often aligned). Rather 64bit-friendly Change-Id: If7f67dbe5e37696efbeb6d579d9d8482350b79ee	2012-01-31 02:07:15 -08:00
James Zern	ad1e163a0d	cosmetics: normalize copyright headers Change-Id: I5e2462b101e0447a4f15a1455c07131bc97a52dd	2012-01-06 14:49:06 -08:00
James Zern	964387ed19	use WEBP_INLINE for inline function declarations removes a #define inline, objectionable in certain projects Change-Id: Iebe0ce0b25a030756304d402679ef769e5f854d1	2011-11-11 10:53:58 -08:00
Pascal Massimino	6a32a0f5bf	make VP8BitReader a typedef, for better re-use Change-Id: Id91f8c5649f9fd078facc9f280a314377193b5e8	2011-09-13 15:47:24 -07:00
Pascal Massimino	b112e83647	create a libwebputils under src/utils with bit_reader bit_writer and thread for now. Change-Id: If961933fcfc43e60220913fe4d527230ba8f46bb	2011-09-13 15:34:15 -07:00

23 Commits