Re: [PATCH] optimize ia32 memmove

From: H. Peter Anvin
Date: Tue Dec 30 2003 - 15:19:56 EST


Followup to: <3FF15DAB.8080203@xxxxxxxxxxxxxxxx>
By author: Manfred Spraul <manfred@xxxxxxxxxxxxxxxx>
In newsgroup: linux.dev.kernel
>
> AMD recommends to perform bulk copies backwards: That defeats the hw
> prefecher, and results in even better access patterns. Doesn't matter in
> this case, memmove is never used for bulk copies.
>

That's also a microoptimization for one particular microarchitecture
*bug*. Hardware prefetchers are going omnidirectional going forward.
Additionally, nearly all bulk copies are performed forward (DF=0) in
existing codebases.

-hpa
--
<hpa@xxxxxxxxxxxxx> at work, <hpa@xxxxxxxxx> in private!
If you send me mail in HTML format I will assume it's spam.
"Unix gives you enough rope to shoot yourself in the foot."
Architectures needed: ia64 m68k mips64 ppc ppc64 s390 s390x sh v850 x86-64
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/