libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-08-11 02:20:33 +02:00

Author	SHA1	Message	Date
James Zern	8e517eca68	bit_reader/kVP8NewRange: range_t -> uint8_t decreases the size of each entry from 4 bytes to 1. Change-Id: I3e6a50bcbc279e5edfa411edb97b04300dedc7ae	2014-11-24 22:16:26 -08:00
James Zern	e18571393d	dsp: initialize VP8PredChroma8 in VP8DspInit() the table becomes non-const to allow for platform-specific optimizations Change-Id: I32d2b51480020dc653ecfafd20b6b0f096af349f	2014-11-24 22:12:42 -08:00
Vikas Arora	e0c809ad23	Move Entropy methods to lossless.c Move all the Entropy evaluation methods to lossless.c (from histogram.c). There's slight difference in the way entropy is computed for evaluating entropy in prediction methods and histogram (literal) for huffman trees. Plan (later) to merge few (static) methods and reduce the code size. This change has no impact on the compression speed/density. Change-Id: Ife3d96a3c4a8d78a91723d9e0a8d1b78c0256a15	2014-11-20 13:48:05 -08:00
Vikas Arora	a0df55104e	Remove handling for WEBP_HINT_GRAPH Remove handling for WEBP_HINT_GRAPH w.r.t use_palette flag. The WEBP_HINT_GRAPH is now used at one place, to set the initial size of the Bit Writer as bpp for photo images are generally larger than the graphical images. Change-Id: I1b9c4436c85a8f69da74c0dbcd292397323f2696	2014-11-13 15:49:23 -08:00
Vikas Arora	413dfc0c4b	Move static method definition before its usage. Change-Id: Id766c2bea92e7ebf0de65046f73429b74b4fdda4	2014-11-13 13:18:30 -08:00
Vikas Arora	0f23566558	Update BackwardRefsWithLocalCache. Update BackwardRefsWithLocalCache to do in-place update of backward references w.r.t local color cache index. No impact on the compression density or compression speed. Change-Id: Ie066251464c3928c044e037b43df3af28b48ca30	2014-11-13 11:54:26 -08:00
Vikas Arora	d69e36ec59	Remove TODOs from lossless encoder code. histogram.c: - Verified (earlier) that there's low correlation between Red & Blue colors (particularly after applying Cross-color transform). The Bin based histogram merge, bins on three entropies viz literal, red & blue symbols. Removing either of blue or red increases the compression density. So keeping the bins for red & blue sybmols. - Keeping the compact bins method as-is. This way it's simpler to read. huffman_encode.h: Added field comments for struct HuffmanTree and removed the TODO. Change-Id: Ia76f7bc730079d1b3b644038c5d9931db3797f0e	2014-11-12 16:10:16 -08:00
Vikas Arora	fdaac8e0ca	Optmize VP8LGetBackwardReferences LZ77 references. Use the refs_lz77 computed (with cache_bits=0) in the method 'CalculateBestCacheSize' to regenerate the LZ77 references corresponding to the optimum cache_bits and avoid calling costly 'BackwardReferencesLz77' one extra time. This change leaves the compression density unchanged and speeds up compression by 10-15%. Change-Id: I5a92e11788d3c3f656aa7e1fba54fb5d96ee0027	2014-11-12 14:50:04 -08:00
Djordje Pesut	2f0e2ba826	MIPS: dspr2: added optimization for function Select Change-Id: I22470d8b9ab8c5e90c5330ff12c9852676da1a3d	2014-11-07 09:44:16 +01:00
pascal massimino	a3e79a46f6	Merge "WebPEncode: Support encoding same pic twice (even if modified)"	2014-11-06 22:20:01 -08:00
Urvang Joshi	e4f4dddba3	WebPEncode: Support encoding same pic twice (even if modified) This wasn't working for this specific scenario: - Encode an RGBA 'pic' (with trivial alpha) using lossy encoding. (so that pic->a == NULL after import happens). - Modify the 'pic->argb' so that it has non-trivial alpha. - Encode the same 'pic' again. This used to fail to encode alpha data as pic->a == NULL. Change-Id: Ieaaa7bd09825c42f54fbd99e6781d98f0b19cc0c	2014-11-06 13:52:48 -08:00
pascal massimino	cbc3fbb4d7	Merge "Updated VP8LGetBackwardReferences and color cache."	2014-11-06 13:47:21 -08:00
Vikas Arora	95a9bd85c4	Updated VP8LGetBackwardReferences and color cache. - The optimal cache bits is evaluated inside the method 'VP8LGetBackwardReferences'. - The input cache_bits to 'VP8LGetBackwardReferences' sets the maximum cache bits to use (passing 0 implies disabling the local color cache). - The local color cache is disabled for lowerf (<= 25) quality levels (as before). - Enabled local color cache for palette images as well. This saves additional 0.017% bytes with a slight (2-3%) improvement in the compression speed. - Removed 'use_2d_locality' parameter from method VP8LGetBackwardReferences, as this option is not an option now (after we freeze the lossless bit-stream). Change-Id: I33430401e465474fa1be899f330387cd2b466280	2014-11-06 13:14:05 -08:00
Djordje Pesut	54f2c14cce	MIPS: dspr2: added optimization for function FTransform Change-Id: Ib5850edbc2a586ec9781f494b2337f024e22af78	2014-11-06 14:21:33 +01:00
Djordje Pesut	aa42f4231f	MIPS: dspr2: Added optimization for function VP8LSubtractGreenFromBlueAndRed Change-Id: I683c73cceee4a40ca810deba15e54fbf7dbe8918	2014-11-06 10:56:18 +01:00
Djordje Pesut	95ca44a718	MIPS: dspr2: added optimization for Disto4x4 enc/dec common macros moved to mips_macro.h Change-Id: I38d491e772554ac663dd5eb4d15485c0343f23b1	2014-11-05 12:06:15 +01:00
James Zern	4171b6724e	backward_references.c: reindent after `c8581b0` Change-Id: Icfc0fe8e266c0f67a70b8cb095e5aaee155290b6	2014-11-04 17:40:04 +01:00
Vikas Arora	c8581b06e1	Optimize BackwardReferences for RLE encoding. Updated BackwardReferencesRle method by utilizing the local color cache. Also changed the name of method BackwardReferencesHashChain to BackwardReferencesLz77 to reflect the LZ77 coding. For the 1000 image corpus, this change saves 0.2% bytes (at default settings) and is 2-5% faster to encode. Change-Id: Ic3f288253b3bbb101a69945a80994c3fd0917f8b	2014-11-04 08:12:07 -08:00
Djordje Pesut	5798eee6be	MIPS: dspr2: unfilters bugfix (Ie7b7387478a6b5c3f08691628ae00f059cf6d899) Change-Id: I78d97960efbd1ec1af51a5426e38dc01bdb48140	2014-11-03 15:39:00 +01:00
Vikas Arora	4167a3f5f7	Optimize backwardreferences Optimize backwardreferences (about 0.1% byte savings) with almost same compression speed (3% faster on defaut compression settings). 1.) Simplified iteration logic for HashChainFindCopy. - Remapped the iter_max constant. 2.) Simplified main for loop for BackwardReferencesHashChain - Removed 'if' conditions for corner cases in the main loop. - Refactored the method(AddSingleLiteral) for adding one pixel. Change-Id: I1bc44832fd81f11e714868a13e606c8f83157e64	2014-10-31 18:08:38 -07:00
James Zern	d18554c30d	Merge "webp/types.h: use inline for clang++/-std=c++11"	2014-10-31 03:53:06 -07:00
Vikas Arora	77bdddf016	Speed up BackwardReferences Speed up BackwardReferencesHashChainDistanceOnly method by: 1.) Remove for loop for shortmax code path. 2.) Execute the shortmax code path after regular call to HashChainFindCopy, only if HashChainFindCopy() returns length > 2 (MIN_LENGTH). 3.) Also for shortmax, call method HashChainFindOffset (for length = 2), instead of expensive method HashChainFindCopy(). 4.) Handling first pixel (i==0) outside main loop and removing one if condition (i > 0) per pixel. 5.) Handle the last pixel outside the main 'for' loop. Overall compression speedup observed is around 5% (+/- noise). Change-Id: Ifa30c4035f8d26e6e43e3c4881244d777961c22b	2014-10-30 10:58:24 -07:00
James Zern	6638710b9e	webp/types.h: use inline for clang++/-std=c++11 at least clang 3.[45] in c++ mode with -std=c++11 define __STRICT_ANSI__ this change set WEBP_INLINE to inline for c++/non-strict-ansi/> c99 fixes crbug.com/428383 Change-Id: Ief2b934353c336a75865c73c90cc3dc5e4f83913	2014-10-30 15:25:27 +01:00
Vikas Arora	abf04205b3	Enable entropy based merge histo for (q<100) Enable bin-partition entropy based heuristic for merging histograms for higher (q >= 90) qualities as well. Keep the old behavior at the maximum quality level (q==100). This speeds up the compression between Q=90-99 (method=4) by factor 5-7X and with loss of 0.5-0.8% in the compression density. Change-Id: I011182cb8ae5403c565a150362bc302630b3f330	2014-10-30 03:59:36 -07:00
James Zern	572022a350	filters_mips_dsp_r2.c: disable unfilters the output does not match the C-code. Change-Id: Ie7b7387478a6b5c3f08691628ae00f059cf6d899	2014-10-30 11:10:11 +01:00
Djordje Pesut	a28e21b141	MIPS: dspr2: Added optimization for function ClampedAddSubtractFull Change-Id: Iee98eaf007158f44a299dd5ba8d972d0d4108380	2014-10-29 13:08:06 +01:00
Djordje Pesut	18d5a1efa8	MIPS: dspr2: added optimization for function ClampedAddSubtractHalf Change-Id: Iec22e897a4f56e79c18ec00f8caa9cefac67f186	2014-10-29 11:08:37 +01:00
Djordje Pesut	829a8c19a0	MIPS: dspr2: added optimization for ITransform Change-Id: I3534fca143535c53d18a3749b3a1b0c8a7563463	2014-10-28 14:28:14 +01:00
Vikas Arora	653ace55c3	Increase the MAX_COLOR_CACHE_BITS from 9 to 10. The Maximum allowed limit is 11. The Q=25 and below is not impacted as cache bits are forced to 0. This saves 0.05% - 0.1% bytes for other quality with almost same compression speed (+/- 2-3%, that's more of a noise). Change-Id: Icf972a98f298c89e140e37a627baf709134be9a0	2014-10-27 14:19:04 -07:00
Vikas Arora	919220c7e6	Change the logic adjusting the Histogram bits. Updated the logic to limit the Histogram size to a constant, instead of computing the same based on the Histogram size (that's variable size based on the cache bits) for the maximum possible cache bits. The actual cache bits may be lower than the maximum. Note: The constant 2600 is 16MB/Sizeof(HistogramSize(MAX_COLOR_CACHE_BITS)). The compression density remains the same with this change, with little faster compression speed. Change-Id: I3149894962852e9dad2501b9aa16bb847a20fd86	2014-10-27 09:57:17 -07:00
pascal massimino	53b096c0d7	Merge "Fix bug in VP8LCalculateEstimateForCacheSize."	2014-10-27 02:31:10 -07:00
Vikas Arora	e912bd55be	Fix bug in VP8LCalculateEstimateForCacheSize. The method VP8LCalculateEstimateForCacheSize is not evaluating the all possible range for cache_bits. Also added a small penality for choosing the larger cache-size. This is done to strike a balance between additional memory/CPU cost (with larger cache-size) and byte savings from smaller WebP lossless files. This change saves about 0.07% bytes and speeds up compression by 8% (default settings). There's small speedup at Q=50 along with byte savings as well. Compression at Quality=25 is not effected by this change. Change-Id: Id8f87dee6b5bccb2baa6dbdee479ee9cda8f4f77	2014-10-26 20:05:48 -07:00
James Zern	22881c999e	dec_neon: add RD4 intra predictor based on the SSE2 version; a bit rough around the loads, but still ~38% faster. Change-Id: I22426d939a7354cbc9a85ca8c68235d6081b882f	2014-10-24 21:22:07 +02:00
James Zern	1304eb3418	Merge "dec_neon: DC4: use pair-wise adds for top row"	2014-10-23 08:08:34 -07:00
pascal massimino	7083006b61	Merge "dsp/dec_{neon,sse2}: VE4: normalize variable names"	2014-10-23 07:29:27 -07:00
James Zern	0db9031c79	dsp/dec_{neon,sse2}: VE4: normalize variable names use '0' rather than '_' when dealing with variables that result from a shift Change-Id: I29280c0dead645ce39dc4bb42c3e19929b302fd4	2014-10-23 16:04:13 +02:00
James Zern	b5bc15305b	dec_neon: DC4: use pair-wise adds for top row reduces load count, slightly faster Change-Id: I880340ef8ef75ce4ce321c330f56f86b758bda08	2014-10-23 15:48:49 +02:00
Pascal Massimino	5b90d8fe42	Unify the API between VP8BitWriter and VP8LBitWriter BitReader will be next... Change-Id: Icd9e7ab2e3890131e664c0523627d9b8c5399a74	2014-10-23 15:35:16 +02:00
pascal massimino	f7ada560ce	Merge changes I2e06907b,Ia9ed4ca6,I782282ff * changes: dec_neon: add DC4 intra predictor dec_neon: add TM4 intra predictor dec_neon: add LD4 intra predictor	2014-10-23 06:31:54 -07:00
pascal massimino	5beb6bf070	Merge "dec_neon: add VE4 intra predictor"	2014-10-23 05:38:41 -07:00
James Zern	eba6ce06c3	dec_neon: add DC4 intra predictor ~70% faster Change-Id: I2e06907b8d69be71a8c5581832c931923c24bab0	2014-10-23 14:21:08 +02:00
James Zern	79abfbd9df	dec_neon: add TM4 intra predictor ~21% faster Change-Id: Ia9ed4ca650f9d544821fa1faf3173611806a272a	2014-10-23 14:21:08 +02:00
James Zern	fe395f0e4d	dec_neon: add LD4 intra predictor based on SSE2 version, ~55% faster Change-Id: I782282ffc31dcf238890b3ba0decccf1d793dad0	2014-10-23 14:20:47 +02:00
James Zern	32de385eca	dec_neon: add VE4 intra predictor based on SSE2 version, ~59% faster Change-Id: Iaa2181eb51bd975de0e9fe5c7b66ed18188f0e3b	2014-10-23 11:46:08 +02:00
Vikas Arora	c2b5a0396a	Modify CostModel to allocate optimal memory. Change-Id: I7d52675d28bfc109d4e901581fc24cd36fcb79ee	2014-10-22 13:30:33 -07:00
Pascal Massimino	b7a33d7e91	implement VE4/HE4/RD4/... in SSE2 (30% faster prediction functions, but overall speed-up is ~1% only) Change-Id: I2c6e7074aa26a2359c9198a9015e5cbe143c2765	2014-10-22 18:25:36 +02:00
Pascal Massimino	97c76f1f30	make VP8PredLuma4[] non-const and initialize array in VP8DspInit() also convert 'type dst' to 'type dst' Change-Id: I41ab66ad15b548cc45d1cb8b10bbca4fe1528cae	2014-10-22 18:14:20 +02:00
pascal massimino	0ea8c6c219	Merge "PrintReg: output to stderr"	2014-10-22 08:55:10 -07:00
James Zern	f85ec712b0	PrintReg: output to stderr allows use of '-o -' while testing Change-Id: Ibc02d7cede2df4eb8be0a28c0ca4bf5e91864191	2014-10-22 17:28:19 +02:00
Vikas Arora	139142e440	Optimize BackwardReferenceHashChainFollowPath. Instead of calling HashChainFindMethod, call a new (subset) method HashChainFindOffset to get the offset/distance for a given length. The encoding is tad faster at default compression Before After bpp/rate bpp/rate 442 Palette 0.2720/5.270 MP/s 0.2720/5.790 MP/s 558 non-palette 3.7607/0.797 MP/s 3.7607/0.816 MP/s Change-Id: If4041a9c18f7e972f49fcbab8c3e2f013d8bf1cf	2014-10-21 10:04:27 -07:00

1 2 3 4 5 ...

1562 Commits