libwebp

mirror of https://github.com/webmproject/libwebp.git synced 2025-07-02 09:04:29 +02:00

Author	SHA1	Message	Date
Pascal Massimino	ab2dc8939f	Rescaler: fix rounding error We saturate the result to [0..255] It's the easiest and safest, given the wide variety of scaling range we cover: we're not using floats, so precision is always an issue at one end or the other of the scaling spectrum. we also use: round(a - floor(b)) instead of: floor(a - round(b)) to handle difficult cases (ratio ~= .99, e.g.) MIPS code is still disabled (and wrong) Change-Id: I18d3f5ddc4c524879c257b928329b1c648fa7fb5	2019-03-30 06:43:55 +00:00
Pascal Massimino	2563db4759	fix rescaling rounding inaccuracy We should be using 'floor' when doing the final divide. -> new MACRO is MULT_FIX_FLOOR() XXX* Mips code is DISABLED for now *XXX I'll update and re-enable it in a later patch, since this code needs some refactoring first. BUG=oss-fuzz:9179 Change-Id: Ic0693cdca4e71f5beab1029475e35c4d06b12d13	2018-07-10 22:45:50 -07:00
Pascal Massimino	c1cb86af5f	fix 16b overflow in SSE2 the 'accum' variable can be larger than 15b for large rescale values. Assert triggered: src/dsp/rescaler_sse2.c:249: RescalerExportRowExpand_SSE2: Assertion `v >= 0 && v <= 255' failed. src/dsp/rescaler_sse2.c:350: RescalerExportRowShrink_SSE2: Assertion `v >= 0 && v <= 255' failed. -> fall back to C implementation in this case for now Change-Id: I7ea1cb72301cafc1459be403f6a6f4e3cbc89bb1	2018-04-11 21:25:06 +00:00
Pascal Massimino	0df22b9eed	WEBP_REDUCE_SIZE: disable all rescaler code BUG=webp:355 Change-Id: Id87cb11902e3fb8544a214308526ea9665ce8440	2017-11-24 22:08:32 +00:00
Pascal Massimino	0a17f4712c	Merge "WIP: list includes as descendants of the project dir"	2017-10-11 08:21:42 +00:00
James Zern	a439972175	WIP: list includes as descendants of the project dir #include "(.\|..)/..." -> #include "src/..." Change-Id: I772880aa097a770722043c8a4393552ba38a89b6	2017-10-10 23:04:05 -07:00
James Zern	582a1b572a	rescaler_sse2: harmonize function suffixes BUG=webp:355 Change-Id: I978fd826ff90149c0ffd9d7607dcc6f88082d3e6	2017-10-08 14:06:19 -07:00
skal	c4568b47fd	Rescaler: harmonize the suffix naming BUG=webp:355 Change-Id: I7720502c62f96c780793d3d881eac7b3afae1418	2017-08-01 23:49:44 +00:00
James Zern	4ea49f6b82	rescaler_sse2.c: fix WEBP_RESCALER_FIX -> _RFIX typo quiets -Wundef Change-Id: I8f1facf401b6f1ab393005c93086ac3e2ae354d5	2017-07-11 15:35:27 -07:00
James Zern	668e1dd44f	src/{dec,enc,utils}: give filenames a unique suffix this avoids duplicates between these trees and dsp/, e.g., enc/tree.c, dec/tree.c, making pulling the whole library source tree into one target possible BUG=webp:279 Change-Id: I060a614833c7c24ddd37bf641702ae6a5eef1775	2017-01-19 19:09:48 -08:00
James Zern	ea0be354a0	dsp.h: remove utils.h include include utils.h directly where needed to allow utils.h to rely on defines from dsp.h in a follow-up. Change-Id: I32e26aaeb0b04ba60b3332f685f9a2be5a0a8d3d	2016-05-11 23:17:21 -07:00
Pascal Massimino	2c08aac81a	introduce WebPMemToUint32 and WebPUint32ToMem for memory access it uses memcpy() when unaligned memory write is tricky Change-Id: I5d966ca9d19e9b43ac90140fa487824116982874	2015-12-04 13:43:01 +00:00
Pascal Massimino	67c547fdcd	rescaler: ~20% faster SSE2 implementation for lossless ImportRowExpand lossy (1-channel) speed-up is more on the 5% side. Change-Id: Id19d97b9e9a34804b59604a5b48f94a37fdafd62	2015-10-14 07:32:12 +02:00
Pascal Massimino	74fb458bbc	fix for weird msvc warning message " warning C4098: 'RescalerImportRowShrinkSSE2' : 'void' function returning a value" Change-Id: Ifa893502e3e4b394910e142d954393dda9d59d1a	2015-10-10 22:35:59 -07:00
Pascal Massimino	932fd4df61	SSE2 implementation of ImportRowShrink some limitations: only for RGBA output, and if reduction factor is not too small (dst_width > src_width / 128) 20-25% faster, ~4-6% global improvement total decoding. Change-Id: I95366ddaa4a38e0a96bed754dfe790126f7bb84a	2015-10-09 13:04:54 -07:00
skal	b4e731cd93	neon-implementation for rescaler code It's better to stay with a 32b fixed-point precision overall, otherwise the C-version on ARM gets slower. Actually, gcc ARM compiler optimizes some instructions pretty well when WEBP_RESCALER_FIX is exactly 32, even in C. Change-Id: I0eea97f7db5947470f5af355dee098eca81e178d	2015-10-07 21:18:39 -07:00
Pascal Massimino	306ce4fde1	rescaler: move the 1x1 or 2x1 handling one level up => no need to handle it in the sub-functions. Change-Id: I4b0211ecfafbc9c80a73bf2206809a13c94e7911	2015-09-25 14:35:35 -07:00
Pascal Massimino	cced974bb2	remove _mm_set_epi64x(), which is too specific Change-Id: I4b1035f9c548b804f31c68a00b0a1aa8e13550bb	2015-09-25 14:35:33 -07:00
Pascal Massimino	56668c9fc5	fix warnings about uint64_t -> uint32_t conversion Change-Id: Iee027979b404d4b7edda506b844d354aa1026dae	2015-09-25 17:36:11 +02:00
Pascal Massimino	76a7dc39e5	rescaler: add some SSE2 code The rounding and arithmetic is not the same as previously, to prevent overflow cases for large upscale factors. We still rely on 32b x 32b -> 64b multiplies. Raised the fixed-point precision to 32b so that we have some nice shifts from epi64 to epi32. Changed rescaler_t type to 'uint32_t' in order to squeeze in all the precision required. The MIPS code has been disabled because it's now out-of-sync. Will be fixed in a subsequent CL when the dust settles. ~30-35% faster Change-Id: I32e4ddc00933f1b1aa3463403086199fd5dad07b	2015-09-25 15:07:13 +02:00
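
Several of the rescaler commits above (76a7dc39e5, 2563db4759, ab2dc8939f) turn on the difference between rounding and flooring a 32-bit fixed-point product, and on saturating the result to [0..255]. The following is a minimal sketch of that arithmetic, not the code from the repository. It assumes the 32-bit fixed-point precision mentioned in 76a7dc39e5, and the names `FIX_BITS`, `FIX_HALF`, `mult_fix_round`, `mult_fix_floor` and `saturate_u8` are hypothetical, chosen for illustration only.

```c
#include <stdint.h>

/* Assumption: 32 fractional bits, as described in commit 76a7dc39e5. */
#define FIX_BITS 32
#define FIX_HALF (1ull << (FIX_BITS - 1))

/* Hypothetical helper: (x * y) / 2^32, rounded to nearest.
 * The 64-bit intermediate cannot overflow for 32-bit inputs. */
static uint32_t mult_fix_round(uint32_t x, uint32_t y) {
  return (uint32_t)(((uint64_t)x * y + FIX_HALF) >> FIX_BITS);
}

/* Hypothetical helper: (x * y) / 2^32, rounded down (floor). */
static uint32_t mult_fix_floor(uint32_t x, uint32_t y) {
  return (uint32_t)(((uint64_t)x * y) >> FIX_BITS);
}

/* Hypothetical helper: clamp a possibly out-of-range value to [0..255]. */
static uint8_t saturate_u8(int64_t v) {
  return (v < 0) ? 0 : (v > 255) ? 255 : (uint8_t)v;
}
```

With a scale factor of 0.5 (`0x80000000u`), `mult_fix_round(3, 0x80000000u)` gives 2 while `mult_fix_floor(3, 0x80000000u)` gives 1. The final clamp is what keeps a one-unit rounding difference from pushing an 8-bit sample out of range.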

20 Commits
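
Commit 2c08aac81a mentions using memcpy() where unaligned memory access is tricky. As a rough sketch of that general technique (not the library's actual `WebPMemToUint32` / `WebPUint32ToMem` definitions, whose signatures are not shown in this log), the hypothetical helpers `mem_to_u32` and `u32_to_mem` below read and write a 32-bit value at an arbitrary byte address without casting the pointer:

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical helper: load 32 bits from a possibly unaligned address.
 * memcpy avoids the undefined behavior of dereferencing a misaligned
 * uint32_t pointer; the value is in native byte order. */
static uint32_t mem_to_u32(const uint8_t* ptr) {
  uint32_t v;
  memcpy(&v, ptr, sizeof(v));
  return v;
}

/* Hypothetical helper: store 32 bits to a possibly unaligned address. */
static void u32_to_mem(uint8_t* ptr, uint32_t v) {
  memcpy(ptr, &v, sizeof(v));
}
```

Compilers typically reduce a fixed-size memcpy like this to a single load or store on targets that allow unaligned access, so the portable form usually costs nothing there.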