Change-Id: If122da22b74a974262063d232f6ca0ab902ff64e
Change-Id: I79cf95bdac1164fc4de899828e9380c23df8d141
Change-Id: Ida11b079077a47fe3b92754f08aa30d81c301fcf
The transpose refactoring will help removing a transpose in a later CL. The horizontal add function helps removing a _mm_sad_epu8 in DC8uv => the latency/throughput went from 29/25 to 23/19 Change-Id: I5f3dfd4aad614eb079b1e83631e6a7cef49a3766