Aug. 23rd, 2012

izard: (Default)
It is fourth time this year I have to re-benchmark memory copy for a customer project. Each time there was a customer telling me: we need memcpy that is better then default in some specific configuration (CPU arch, OS, data amount, alignment, data location).

And it was usually possible to select the right one. Performance of different implementations of memcpy, including one from compiler library/libc differs significantly.

Except for my current case. I have three implementations: naive, SSE3, and REP MOV, and performance is exactly the same! Of course it is evident what kind of corner case I am exploring now :)

Profile

izard: (Default)
izard

July 2025

S M T W T F S
  12345
67 8 91011 12
13141516171819
20212223242526
27 28293031  

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags

No cut tags
Page generated Aug. 12th, 2025 05:26 am
Powered by Dreamwidth Studios