Re: NS83820 2.6.0-test5 driver seems unstable on IA64
From: Andi Kleen
Date: Fri Sep 26 2003 - 23:29:06 EST
"David S. Miller" <davem@xxxxxxxxxx> writes:
> On Fri, 26 Sep 2003 10:47:12 -0700
> David Mosberger <davidm@xxxxxxxxxxxxxxxxx> wrote:
>
>> Arches that don't like/support unaligned accesses will certainly
>> have a copy_user() which handles misalignment just fine. For
>> example, on ia64, the copy will run slightly slower because of an
>> additional shift in the loop, but that penalty only shows on fully
>> cached data.
>
> Exactly correct, and on many platforms (sparc64 happens to be one)
> there is _ZERO_ performance penalty for misalignment or source or
> destination buffer during a memcpy/memmove.
You handle misalignment->misalignment copies with zero or small cost -
when both source and destination have the same misalignment. I guess
you do that by just aligning the pointer at the beginning of the
function. That works as long as both source and destination have
the same misalignment.
But that is not what happens here. The copy is a misaligned->aligned
copy and that cannot be handled at zero cost (unless you have zero
misalignment penalty in load/store). Either the load or the store in the
copy loop will be always misaligned, no matter what tricks you play. You
cannot avoid this by aligning the pointers.
The user buffers tend to be aligned, so when the kernel data is misaligned
the copy ends up requiring an misaligned load in the inner copy loop.
Of course you could try to teach the user to use intentionally misaligned
buffers in his programs to avoid this, but it would be likely a hard
sell and be a bad idea on less crippled NICs.
The copy inside the device driver has the same problem: misaligned
data -> aligned skb => the load in the copy will be misaligned, the store
aligned. You avoid a few unaligned loads in the core stack, at the cost
of hundreds in the copy. Seems like a bad trade-off to me.
-Andi
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/