libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-08-11 02:20:33 +02:00

Author	SHA1	Message	Date
Djordje Pesut	d4471637ef	MIPS: dspr2: added optimization for function FilterLoop24 affected functions: VFilter16i, HFilter16i, VFilter8i and HFilter8i Change-Id: I5d2bc7716e60e048a33d630fe4a86011bfb6d42e	2014-09-23 10:32:55 +02:00
skal	2aef54d429	Merge "prepare VP8LDecodeImage for incremental decode"	2014-09-23 00:31:27 -07:00
pascal massimino	aed0f5a231	Merge "MIPS: dspr2: added optimization for function FilterLoop26"	2014-09-23 00:17:25 -07:00
skal	286306853e	prepare VP8LDecodeImage for incremental decode - don't call VP8LClear() when there's no error (and let the caller do it) - only initialize output once if state_ is not READ_DATA - don't over-set dec->status_ = READ_DATA - don't re-set dec->status_ if DecodeImageStream() fails - remove unneeded dec->action_ field - make ReadImageInfo() check br->eos_ - use ErrorStatusLossless() more consistently Change-Id: Ica6e4b1c82e3fce8b1ce0274def551a886b73b0b	2014-09-23 00:13:52 -07:00
skal	248f3aed22	remove br->error_ field it's somewhat redundant with br->eos_ also make the status-check coherent. Change-Id: I98e755e037d45acb0760baf2344bf11fb5fb5cda	2014-09-23 00:04:58 -07:00
Djordje Pesut	49e15044ef	MIPS: dspr2: added optimization for function FilterLoop26 affected functions: VFilter16, HFilter16, VFilter8 and HFilter8 Change-Id: Ib2fc41aaa00b10c2906d689bdc5a10f4568e70a8	2014-09-23 08:46:05 +02:00
skal	c792d4129a	Premultiply with alpha during U/V downsampling This prevents the 'alpha-leak' reported in issue #220 Speed-diff is kept minimal. Change-Id: I1976de5e6de7cfcec89a54df9233c1a6586a5846	2014-09-18 23:40:34 -07:00
Vikas Arora	b901416b90	Record the lossless size stats. Record and show the lossless header and image data sizes in the cwebp. Change-Id: I08f19693cb7a756b6fdce5b55d71f5367b5f02fc	2014-09-17 15:16:05 -07:00
Pascal Massimino	cddd334050	Add a WebPExtractAlpha function to dsp This is the opposite of WebPDispatchAlpha + Implement the SSE2 version Change-Id: I0c297309255f508c5261da8aad01f7e57f924d6c	2014-09-15 08:12:03 +02:00
Pascal Massimino	0716a98eb3	fix indent after I0204949917836f74c0eb4ba5a7f4052a4797833b Change-Id: I5d9e5d0a2ad2cefd8c539571d2eaee948da60ad5	2014-09-12 19:59:53 +02:00
Vikas Arora	f9ced95a9b	Optimize lossless decoding for trivial(ARB) codes. Optimize the decoding for region that have trivial literal codes. The trivial literal is defined as huffman image with Red, Blue and Alpha huffman trees with only single code values. This speeds up lossless decoding by 3% Change-Id: I0204949917836f74c0eb4ba5a7f4052a4797833b	2014-09-12 09:08:08 -07:00
Pascal Massimino	690b491af1	fix loop bug in DispatchAlpha() * We were re-doing most of the work in plain-C as 'left-over'. * we were always returning has_alpha = true because of a bad mask all_0xff These bugs were conservative and silent, in the sense that we were 'just' doing more work than necessary. Now, the SSE2 version is really 2x faster than the C version. Change-Id: I6c8132a267fe3c7a3d1fa70e7a5fcd10719543fa	2014-09-11 22:35:08 +02:00
Djordje Pesut	3101f53720	MIPS: dspr2: added optimization for TransformOne added macros for TransformOne, TransformAC3 and TransfromDC Change-Id: I4341450f443cf46dcf91c0db17bde63c8fb8afee	2014-09-11 17:02:02 +02:00
Pascal Massimino	a6bb9b17d8	SSE2 for inverse Mult(ARGB)Row and ApplyAlphaMultiply Change-Id: Iab5c0e4a4d2b31f86736a9b277e62b6e28c3d2b4 WebPMultRow: ~7x faster WebPMultARGBRow: ~3x faster ApplyAlphaMultiply: 60% faster	2014-09-11 07:58:42 +02:00
Vikas Arora	d84a8ffdf7	Remove default initialization of decoder status. emove the default initialization of decoder status in the method VP8LDecodeImage(). Change-Id: Ie6b949606349f4e937c4c1dd2c02ff2a4f86870f	2014-09-10 14:55:46 -07:00
Vikas Arora	e0a9932161	Rectify bug in lossless incremental decoding. Handle the corner case when VP8LDecodeImage() method is called with an invalid header data. The lossless decoding doesn't support incremental mode yet. Return the error status as BITSTREAM error in case not all pixels are decoded with the provided bit-stream. Also added asserts in the VP8LDecodeImage() method to validate the decoder header with appropriate/valid data for huffman trees (htree_groups_ etc). Change-Id: Ibac9fcfc4bd0a2c5f624bb9d4a2b9f6459aa19ea	2014-09-09 15:34:16 -07:00
Djordje Pesut	e2502a97c1	MIPS: dspr2: added optimization for TransformAC3 Change-Id: Icd789ee5f6d764297e7dc0a0f8a3bc47ab92ac65	2014-09-09 14:53:36 +02:00
Djordje Pesut	24e1072aac	MIPS: dspr2: added optimization for TransformDC Change-Id: Iee69758f6442ea9c80ddaa32cea8d00dda4c6252	2014-09-09 14:15:04 +02:00
Pascal Massimino	c0e84df8e8	Merge "Slightly faster lossless decoding (1%)"	2014-09-09 03:55:00 -07:00
Pascal Massimino	8dd28bb560	Slightly faster lossless decoding (1%) -> introduce special case 64b pattern-copy, similar to the 8b one for alpha. -> use mempcy() for non-overlapping areas + cosmetics and homogenezation of the code Change-Id: I0e65e04b96fec94c009a4614137dfba2a0f98561	2014-09-09 11:18:30 +02:00
Djordje Pesut	f0103595dd	MIPS: dspr2: added optimization for ColorIndexInverseTransforms Change-Id: I5b6094ce489d4f896bc4b8f575142eb3c5054beb	2014-09-08 17:22:59 +02:00
Pascal Massimino	d3242aee16	make VP8LSetBitPos() set br->eos_ flag ReadSymbol() finishes with a VP8LSetBitPos() call only and could miss an eos_ during the decode loop. Things are faster because of inlining too. Change-Id: I2d2a275f38834ba005bc767d45c5de72d032103e	2014-09-06 08:40:20 +02:00
Pascal Massimino	a9decb5584	Lossless decoding: fix eos_ flag condition eos_ needs to be set only when superfluous bits have actually been requested. Earlier, we were assuming pre-mature end-of-stream to be an error. Now, more precisely, we mark error when we have encountered end-of-stream and we attempt to read more bits after that. This handles cases where image data requires no bits to be read Change-Id: I628e2c39c64f10c443fb51f86b1f5919cc9fd299	2014-09-05 20:21:50 +02:00
Pascal Massimino	3fea6a28da	fix erroneous dec->status_ setting We only need to set BITSTREAM_ERROR if !ok. Change-Id: I5bd13e64797e8bc509477edb29158abb39cb0ee1	2014-09-05 19:48:11 +02:00
Djordje Pesut	80b8099fd8	MIPS: dspr2: add some specific mips code to commit I2c3f2b12f8df15b785fad5a9c56316e954ae0c53 added some C-code tuning also Change-Id: I67ce70a063ef6b5821b9158a4defd6987eccbb9a	2014-09-04 13:42:39 +02:00
skal	e564062522	Merge "further refine the COPY_PATTERN optim for DecodeAlpha"	2014-09-04 03:43:55 -07:00
James Zern	854509fec0	enc/histogram.c: reindent after `f4059d0` fixes indent in HistogramRemap after: `f4059d0` Code cleanup for HistogramRemap. Change-Id: I9f53a088749e9100a70331bda1662488666c5156	2014-09-03 16:58:49 -07:00
skal	344219645b	Merge "~3-5% faster encoding optimizing PickBestIntra*()"	2014-09-03 15:53:32 -07:00
skal	865069c12e	further refine the COPY_PATTERN optim for DecodeAlpha * use functions instead of MACRO * adjust var's name Overall, same speed, with more readible code Change-Id: I2c3f2b12f8df15b785fad5a9c56316e954ae0c53	2014-09-04 00:25:27 +02:00
Djordje Pesut	a59562283f	added C-level optimization for DecodeAlphaData function Copies with short distances of 1,2 and 4 are specialized. up to 10-14% faster alpha decoding. Change-Id: I9708e98193910bfaf8ef43091f3fdea73b63896d	2014-09-03 16:49:17 +02:00
skal	187d379db6	add a fallback to ALPHA_NO_COMPRESSION if ALPHA_LOSSLESS_COMPRESSION produces a too big file (very rare!), we fall-back to no-compression automatically. Change-Id: I5f3f509c635ce43a5e7c23f5d0f0c8329a5f24b7	2014-09-02 21:55:13 +02:00
skal	a48a2d7635	~3-5% faster encoding optimizing PickBestIntra() Add early-out check for Intra16 * replace some memcpy() by pointer swap Change-Id: I5edc5f7fbc8e39984deb48e6c045c97c61418589	2014-09-01 14:40:25 +02:00
skal	77d4c7e337	address cosmetic comments from patch #71380 Change-Id: Iaba301b9e77aa4febe0efe1e6016fab42d5589f3	2014-08-28 18:08:00 -07:00
skal	f75dfbf23d	Speed up Huffman decoding for lossless speed-up is ~1.6% for photographic image to 10% for graphical image (1000 images corpus was sped up by 5.8 %) Code by akramarz@google.com and jyrki@google.com Change-Id: Iceb2e50e6cc761b9315a3865d22ec9d19b8011c6	2014-08-28 12:28:04 -07:00
James Zern	637b388809	dsp/lossless: workaround gcc-4.9 bug on arm force Sub3() to not be inlined, otherwise the code in Select() will be incorrect. https://android-review.googlesource.com/#/c/102511 Change-Id: I90ae58bf3e6cc92ca9897f69974733d562e29aaf	2014-08-27 20:31:21 -07:00
James Zern	8323a9038d	dsp.h: collect gcc/clang version test macros endian_inl.h already relies on dsp.h, grab the definitions from there. Change-Id: I445f7d0631723043c55da1070498f89965bec7b1	2014-08-27 19:33:09 -07:00
skal	e6c4b52f28	move static initialization of WebPYUV444Converters[] to the Init function. Split initialization of YUV444Converters[] out of Upsamplers init. update test for NULL function pointers Change-Id: I9603f54250f90c85a12ffbecfd6c59e9b06c47e0	2014-08-27 11:36:37 -07:00
skal	49911d4df2	Merge "fix indentation"	2014-08-27 07:52:36 -07:00
Vikas Arora	f4059d0c7d	Code cleanup for HistogramRemap. Avoid call to HistogramAddThresh when there's only one Histogram image. Change-Id: I43b09e8e2d218c95969567034224777dcce37ab9	2014-08-26 15:45:22 -07:00
skal	e632b0929b	fix indentation Change-Id: I2294a6c83e5f345f64bd5120b91532e00ed6c543	2014-08-25 23:52:09 -07:00
skal	f5c04d64b7	Merge "add a DispatchAlpha() for SSE2 that handles 8 pixels at a time"	2014-08-25 22:43:42 -07:00
skal	fc98edd936	add a DispatchAlpha() for SSE2 that handles 8 pixels at a time Only slightly faster. Change-Id: Ie2e57e6a0950166124cf1075c6c9b45b7abdad8c	2014-08-25 21:03:03 -07:00
skal	73d361dd5f	introduce VP8EncQuantize2Blocks to quantize two blocks at a time No speed diff for now. We might reorder better the instructions later, to speed things up. Change-Id: I1949525a0b329c7fd861b8dbea7db4b23d37709c	2014-08-25 20:21:42 -07:00
Djordje Pesut	0b21c30b1a	MIPS: dspr2: added optimization for EmitAlphaRGB New dsp function: WebPDispatchAlpha() Change-Id: I48e539d22471279ec75185759bc68d18b127f716	2014-08-21 20:39:35 -07:00
James Zern	953acd56a4	enc_neon: enable QuantizeBlock for aarch64 vtbl4_u8 is available everywhere except iOS arm64: use vtbl2q_u8 there with a corresponding change in the load. Change-Id: Ib84212dda3c7875348282726c29e3b79b78b0eac	2014-08-20 11:48:25 -07:00
Djordje Pesut	f4ae143720	MIPS: mips32: code rebase mips code rebased to be same as C code from commit I8c29a8a0285076cb3423b01ffae9fcc465da6a81 Change-Id: I3848f4ce43387c3a62b336606498779f7b07ec44	2014-08-19 15:13:16 +02:00
Djordje Pesut	569771549a	MIPS: dspr2: added optimizations for VP8YuvTo* VP8YuvToRgb VP8YuvToBgr VP8YuvToRgb565 VP8YuvToRgba4444 VP8YuvToArgb VP8YuvToBgra VP8YuvToRgba Change-Id: I22212a125d890e1fd28388fec906a1a5c07ff386	2014-08-19 14:29:32 +02:00
skal	2523aa73cb	SmartRGBYUV: fix odd-width problem with pixel replication rightmost pixel was missing a copy, which could lead to invalid read. Also added a lower dimension of 4, below which we use the regular conversion. This is to prevent corner cases, in addition to not being overkill. Change-Id: Iac12e7a3d74590f12fe8eeb1830b9891e61439f6	2014-08-18 15:58:36 -07:00
Pascal Massimino	ee52dc4e54	fix some MSVC64 warning about float conversion Change-Id: I27ab27fc15033d27d0505729f6275fb542c8d473	2014-08-16 00:15:29 -07:00
James Zern	3fca851a20	cpu: check for _MSC_VER before using msvc inline asm _M_IX86 will be defined in mingw builds after including windows.h. as the gcc inline asm is first, this missing check would only have caused an error if the code was reorganized. Change-Id: I395679bcfc43e94d308d1ceb0c0fbf932b2c378c	2014-08-15 15:11:40 -07:00

... 3 4 5 6 7 ...

1632 Commits