Aug. 23rd, 2012

izard: (Default)
It is fourth time this year I have to re-benchmark memory copy for a customer project. Each time there was a customer telling me: we need memcpy that is better then default in some specific configuration (CPU arch, OS, data amount, alignment, data location).

And it was usually possible to select the right one. Performance of different implementations of memcpy, including one from compiler library/libc differs significantly.

Except for my current case. I have three implementations: naive, SSE3, and REP MOV, and performance is exactly the same! Of course it is evident what kind of corner case I am exploring now :)

Profile

izard: (Default)
izard

September 2025

S M T W T F S
 1 23456
78910111213
14151617181920
21222324252627
282930    

Most Popular Tags

Page Summary

Style Credit

Expand Cut Tags

No cut tags
Page generated Sep. 6th, 2025 04:35 am
Powered by Dreamwidth Studios