izard: (Default)
[personal profile] izard
This is the fourth time this year I have had to re-benchmark memory copy for a customer project. Each time, a customer told me: we need a memcpy that is better than the default in some specific configuration (CPU arch, OS, data size, alignment, data location).

And it was usually possible to select the right one. The performance of different memcpy implementations, including the one from the compiler library/libc, differs significantly.

Except in my current case. I have three implementations: naive, SSE3, and REP MOVS, and the performance is exactly the same! Of course, it is evident what kind of corner case I am exploring now :)
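For anyone who wants to try a comparison like this at home, here is a minimal sketch of a timing harness in C. It is not the code from the customer project: `copy_naive`, `copy_rep_movsb`, `copy_libc`, and `best_copy_seconds` are hypothetical names made up for illustration, the SSE3 variant is left out, and the REP MOVSB version assumes GCC or Clang on x86. It also assumes a POSIX system for `clock_gettime`.

```c
#define _POSIX_C_SOURCE 199309L  /* for clock_gettime under strict -std modes */
#include <assert.h>
#include <stddef.h>
#include <string.h>
#include <time.h>

/* Common signature so every copy routine can be timed the same way. */
typedef void *(*copy_fn)(void *dst, const void *src, size_t n);

/* Naive byte-by-byte loop. Note: an optimizing compiler may vectorize it
   or replace it with a call to memcpy, which can hide real differences. */
static void *copy_naive(void *dst, const void *src, size_t n) {
    unsigned char *d = dst;
    const unsigned char *s = src;
    for (size_t i = 0; i < n; i++) {
        d[i] = s[i];
    }
    return dst;
}

#if defined(__GNUC__) && (defined(__x86_64__) || defined(__i386__))
/* REP MOVSB through inline assembly (assumption: GCC/Clang on x86).
   The ABI guarantees the direction flag is clear on function entry. */
static void *copy_rep_movsb(void *dst, const void *src, size_t n) {
    void *ret = dst;
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(n)
                     :
                     : "memory");
    return ret;
}
#endif

/* Thin wrapper so the libc memcpy matches the copy_fn signature. */
static void *copy_libc(void *dst, const void *src, size_t n) {
    return memcpy(dst, src, n);
}

/* Runs fn `reps` times and returns the fastest single run in seconds.
   Taking the minimum reduces noise from interrupts and cold caches. */
static double best_copy_seconds(copy_fn fn, void *dst, const void *src,
                                size_t n, int reps) {
    assert(reps > 0);
    double best = -1.0;
    for (int r = 0; r < reps; r++) {
        struct timespec t0, t1;
        clock_gettime(CLOCK_MONOTONIC, &t0);
        fn(dst, src, n);
        clock_gettime(CLOCK_MONOTONIC, &t1);
        double sec = (double)(t1.tv_sec - t0.tv_sec)
                   + (double)(t1.tv_nsec - t0.tv_nsec) * 1e-9;
        if (best < 0.0 || sec < best) {
            best = sec;
        }
    }
    return best;
}
```

To use it, allocate a source and a destination buffer with the size, alignment, and location you care about, call `best_copy_seconds` once per routine, and divide the byte count by the returned time to get throughput. Those parameters are exactly the configuration knobs from the customer requests above, so it is worth sweeping them rather than measuring a single point.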
