libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-26 18:59:48 +02:00

Author	SHA1	Message	Date
pascal massimino	35d7d095dd	Merge "Reduce memory footprint for encoding WebP lossless."	2014-04-26 01:28:43 -07:00
Vikas Arora	0b896101b4	Reduce memory footprint for encoding WebP lossless. Reduce calls to Malloc (WebPSafeMalloc/WebPSafeCalloc) for: - Building HashChain data-structure used in creating the backward references. - Creating Backward references for LZ77 or RLE coding. - Creating Huffman tree for encoding the image. For the above mentioned code-paths, allocate memory once and re-use it subsequently. Reduce the foorprint of VP8LHistogram struct by changing the Struct field 'literal_' from an array of constant size to dynamically allocated buffer based on the input parameter cache_bits. Initialize BitWriter buffer corresponding to 16bpp (2WH). There are some hard-files that are compressed at 12 bpp or more. The realloc is costly and can be avoided for most of the WebP lossless images by allocating some extra memory at the encoder initializaiton. Change-Id: I1ea8cf60df727b8eb41547901f376c9a585e6095	2014-04-26 01:14:33 -07:00
pascal massimino	9c0a60ccb3	Merge "dwebp: move webp decoding to example_util"	2014-04-26 01:05:48 -07:00
Djordje Pesut	1d62acf6af	MIPS: MIPS32r1: Added optimization for HuffmanCost functions. HuffmanCost and HuffmanCostCombined optimized and added 'const' to some variables from ExtraCost functions. Change-Id: I28b2b357a06766bee78bdab294b5fc8c05ac120d	2014-04-24 11:14:57 +02:00
James Zern	4a0e73904d	dwebp: move webp decoding to example_util this will allow reuse by cwebp Change-Id: I667252fdacfc5436112d21b040ca299273ec1515	2014-04-22 20:31:41 -07:00
James Zern	c0220460e9	Merge "Bugfix: Incremental decode of lossy-alpha"	2014-04-22 16:33:12 -07:00
Urvang Joshi	8c7cd722f6	Bugfix: Incremental decode of lossy-alpha When remapping buffer, br->eos_ was wrongly being set to true for certain images. Also, refactored the end-of-stream detection as a function. Reported in http://crbug.com/364830 Change-Id: I716ce082ef2b505fe24246b9c14912d8e97b5d84	2014-04-22 16:06:32 -07:00
Djordje Pesut	7955152d58	MIPS: fix error with number of registers. Some versions of compiler in debug build can't find a register in class 'GR_REGS' while reloading 'asm' Number of used registers is decreased in this fix. Change-Id: I7d7b8172b8f37f1de4db3d8534a346d7a72c5065	2014-04-22 12:06:45 +02:00
skal	b1dabe3767	Merge "Move the HuffmanCost() function to dsp lib"	2014-04-18 12:08:22 -07:00
skal	75b12006e3	Move the HuffmanCost() function to dsp lib This is to help further optimizations. (like in https://gerrit.chromium.org/gerrit/#/c/69787/) There's a small slowdown (~0.5% at -z 9 quality) due to function pointer usage. Note that, for speed, it's important to return VP8LStreaks by value, and not pass a pointer. Change-Id: Id4167366765fb7fc5dff89c1fd75dee456737000	2014-04-18 11:59:48 -07:00
Djordje Pesut	2772b8bd98	MIPS: fix assembler error revealed by clang's debug build .set at - Indicates that macro expansions may clobber the assembler temporary ($at or $28) register. Some macros may not be expanded without this and will generate an error message if noat is in effect. "at" also added to the clobber list. Change-Id: I67feebbd9f2944fc7f26c28496e49e1e2348529d	2014-04-18 18:10:52 +02:00
James Zern	6653b601ef	enc_mips32: fix unused symbol warning in debug move kC1 / kC2 under __OPTIMIZE__ missed in: `8dec120` enc_mips32: disable ITransform(One) in debug builds Change-Id: Ic9a12e6d73090c8c06b0e7a4bc56dd9c76b8e596	2014-04-17 23:35:36 -07:00
James Zern	8dec120975	enc_mips32: disable ITransform(One) in debug builds avoids: src/dsp/enc_mips32.c: In function 'ITransformOne': src/dsp/enc_mips32.c:123:3: can't find a register in class 'GR_REGS' while reloading 'asm' src/dsp/enc_mips32.c:123:3: 'asm' operand has impossible constraints Change-Id: Ic469667ee572f25e502c9873c913643cf7bbe89d	2014-04-17 20:10:31 -07:00
James Zern	98519dd5c1	enc_neon: convert Disto4x4 to intrinsics Change-Id: I0f00d5af2de2301e8237c2a38a9612d3645abad6	2014-04-17 18:29:31 -07:00
Pascal Massimino	fe9317c9bf	cosmetics: * remove MIPS32 suffix from static function names * fix a long line in enc_neon.c Change-Id: Ia1294ae46f471b3eb1e9ba43c6aa1b29a7aeb447	2014-04-16 00:36:19 -07:00
James Zern	953b074677	enc_neon: cosmetics fix/remove incorrect comments + whitespace Change-Id: Id1b86beb23e5bf946e73c34ab7066b6ca177f33b	2014-04-15 23:57:03 -07:00
skal	a9fc697cb6	Merge "WIP: extract the float-calculation of HuffmanCost from loop"	2014-04-15 11:33:11 -07:00
skal	3f84b5219d	Merge "replace some mult-long (vmull_u8) with mult-long-accumulate (vmlal_u8)"	2014-04-15 07:09:12 -07:00
Djordje Pesut	4ae0533f39	MIPS: MIPS32r1: Added optimizations for ExtraCost functions. ExtraCost and ExtraCostCombined Change-Id: I7eceb9ce2807296c6b43b974e4216879ddcd79f2	2014-04-15 15:37:06 +02:00
skal	b30a04cf11	WIP: extract the float-calculation of HuffmanCost from loop new function: VP8FinalHuffmanCost() Change-Id: I42102f8e5ef6d7a7af66490af77b7dc2048a9cb9	2014-04-15 14:52:52 +02:00
skal	a8fe8ce231	Merge "NEON intrinsics version of CollectHistogram"	2014-04-15 03:00:45 -07:00
skal	95203d2d1b	NEON intrinsics version of CollectHistogram apparently faster, but we might save some load/store to/from memory once we settle for the intrinsics-based FTransform() (also: fixed some #ifdef USE_INTRINSICS problems) Change-Id: I426dea299cea0c64eb21c4d81a04a960e0c263c7	2014-04-14 16:47:20 +02:00
skal	7ca2e74bb4	replace some mult-long (vmull_u8) with mult-long-accumulate (vmlal_u8) saves few instructions Change-Id: If8f464bb2894a209bba94825a4db9267df126d47	2014-04-14 15:14:45 +02:00
skal	41c6efbdc5	fix lossless_neon.c * some extra {xx , 0 } in initializers * replaced by vget_lane_u32() where appropriate Change-Id: Iabcd8ec34d7c853920491fb147a10d4472280a36	2014-04-14 14:27:11 +02:00
skal	8ff96a027a	NEON intrinsics version of FTransform as little bit slower than inlined asm it seems. So disabled for now. Change-Id: I8c942846f9bedaed57275675ea9dbbcb8dfd9ccd	2014-04-14 09:58:35 +02:00
Jovan Zelincevic	0214f4a908	Merge "MIPS: MIPS32r1: Added optimizations for FastLog2"	2014-04-10 08:54:12 -07:00
Jovan Zelincevic	baabf1ea3a	MIPS: MIPS32r1: Added optimizations for FastLog2 Functions VP8LFastLog2Slow and VP8LFastSLog2Slow also: replaced some "% y" by "& (y-1)" in the C-version (since y is a power-of-two) Change-Id: I875170384e3c333812ca42d6ce7278aecabd60f0	2014-04-10 08:32:51 -07:00
skal	3d49871dbe	NEON functions for lossless coding Verified OK, but right now they don't seem faster. So they are disabled behind a USE_INTRINSICS flag (off for now) Change-Id: I72a1c4fa3798f98c1e034f7ca781914c36d3392c	2014-04-10 15:32:08 +02:00
Slobodan Prijic	3fe0291530	MIPS: MIPS32r1: Added optimizations for SSE functions. Change-Id: I1287fa65064192cc2edc5c4be2b1974be665b9b4	2014-04-09 11:02:13 +02:00
skal	c503b485b6	Merge "fix the gcc-4.6.0 bug by implementing alternative method"	2014-04-08 23:25:59 -07:00
skal	abe6f48709	fix the gcc-4.6.0 bug by implementing alternative method previous functions are a bit faster with gcc-4.8, so we keep them for now. Change-Id: I4081e5af66fbf606295d8a83875c1b889729b4dc	2014-04-09 07:53:55 +02:00
James Zern	5598bdecd8	enc_mips32.c: fix file mode Change-Id: I5a43320e2ea2eebc88c65398acb9ea59b63af1fd	2014-04-08 15:12:54 -07:00
Slobodan Prijic	2b1b4d5ae9	MIPS: MIPS32r1: Add optimization for GetResidualCost + reorganize the cost-evaluation code by moving some functions to cost.h/cost.c and exposing VP8Residual Change-Id: Id976299b5d4484e65da8bed31b3d2eb9cb4c1f7d	2014-04-08 15:28:49 +02:00
pascal massimino	f0a1f3cd51	Merge "MIPS: MIPS32r1: Added optimization for FTransform"	2014-04-08 04:17:27 -07:00
Djordje Pesut	7231f610aa	MIPS: MIPS32r1: Added optimization for FTransform Change-Id: I9384dac483e8f98bcfdd277a0a3d6ec7c7a7b297	2014-04-08 04:16:44 -07:00
skal	869eaf6c60	~30% encoding speedup: use NEON for QuantizeBlock() also revamped the signature to avoid having to pass the 'first' parameter Change-Id: Ief9af1747dcfb5db0700b595d0073cebd57542a5	2014-04-08 03:08:22 -07:00
James Zern	f758af6b73	enc_neon: convert FTransformWHT to intrinsics slightly faster than the inline asm in practice not much faster than the C-code in a full NEON build, but still better overall in an Android-like one that only enables NEON for certain files. Change-Id: I69534016186064fd92476d5eabc0f53462d53146	2014-04-08 00:20:19 -07:00
Djordje Pesut	7dad095bb4	MIPS: MIPS32r1: Added optimization for Disto4x4 (TTransform) Change-Id: Ieb20c5c52b964247cfe46f45f9a7415725bf7c02	2014-04-07 15:04:23 +02:00
Jovan Zelincevic	2298d5f301	MIPS: MIPS32r1: Added optimization for QuantizeBlock Change-Id: I6047ab107e4d474e35b5af1dac391d5b3d8c049b	2014-04-07 09:22:35 +02:00
Djordje Pesut	e88150c9b6	Merge "MIPS: MIPS32r1: Add optimization for ITransform"	2014-04-05 10:36:05 -07:00
James Zern	de693f2502	lossless_neon: disable VP8LConvert* functions due to breakage with NDK/gcc-4.6 builds Change-Id: Id96258e710ee33e08a023354b3227f27da986620	2014-04-04 20:38:29 -07:00
skal	4143332b22	NEON intrinsics for encoding * inverse transform is actually slower with intrinsics + gcc-4.6, so is left disabled for now. With gcc-4.8, it's a bit faster than inlined assembly. * Sum of Square error function provide a 2-3% speed up There's enabled by default (since there's no inlined-asm equivalent) Change-Id: I361b3f0497bc935da4cf5b35e330e379e71f498a	2014-04-04 15:02:56 -07:00
Djordje Pesut	0ca2914b23	MIPS: MIPS32r1: Add optimization for ITransform Change-Id: Ie4c8b9bc3a7826bd443cdebf05386786fafe8c56	2014-04-04 10:50:35 +02:00
James Zern	71bca5ecf3	dec_neon: use vst_lane instead of vget_lane results in fewer instructions, small speed improvement Change-Id: I98de632d09ff09f295368c0d744cb4397b585084	2014-04-03 14:56:26 -07:00
skal	bf06105293	Intrinsics NEON version of TransformOne + misc cosmetics * seems 4% slower than inlined-asm with gcc-4.6 * is a tad faster (<1%) with gcc-4.8 (disabled for now) Change-Id: Iea6cd00053a2e9c1b1ccfdad1378be26584f1095	2014-04-03 14:41:56 -07:00
pascal massimino	19c6f1ba74	Merge "dec_neon: use vld?_lane instead of vset?_lane"	2014-04-03 01:16:29 -07:00
James Zern	7a94c0cf75	upsampling_neon: drop NEON suffix from local functions Change-Id: I6583ad74aacf78dcbeb5a0ff0218a39bc3460e5a	2014-04-02 23:24:39 -07:00
James Zern	d14669c83c	upsampling_sse2: drop SSE2 suffix from local functions Change-Id: I2349c1a8e5e15e1d204642096f84f3202721c297	2014-04-02 23:24:39 -07:00
James Zern	2ca42a4fb7	enc_sse2: drop SSE2 suffix from local functions Change-Id: I5d61605a9d410761d50b689b046114f0ab3ba24e	2014-04-02 23:24:36 -07:00
James Zern	d038e6193b	dec_sse2: drop SSE2 suffix from local functions Change-Id: Ie171778b84038d5b04c5dc6972f6015caf555882	2014-04-02 23:10:39 -07:00

1 2 3 4 5 ...

1937 Commits