Aug. 23rd, 2012

izard: (Default)
It is fourth time this year I have to re-benchmark memory copy for a customer project. Each time there was a customer telling me: we need memcpy that is better then default in some specific configuration (CPU arch, OS, data amount, alignment, data location).

And it was usually possible to select the right one. Performance of different implementations of memcpy, including one from compiler library/libc differs significantly.

Except for my current case. I have three implementations: naive, SSE3, and REP MOV, and performance is exactly the same! Of course it is evident what kind of corner case I am exploring now :)

Profile

izard: (Default)
izard

November 2025

S M T W T F S
       1
2345678
910 1112131415
16171819202122
23242526272829
30      

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags

No cut tags
Page generated Nov. 16th, 2025 11:45 am
Powered by Dreamwidth Studios