Re: Bad Code generated from __{get,put}_unaligned functions

From: Falk Hueffner
Date: Fri Aug 07 2009 - 11:41:01 EST


On Thu, Aug 06, 2009 at 01:56:12PM -0400, Matt Turner wrote:
> I was researching different ways of writing unaligned load/store
> macros, so I checked how the kernel did it -- the most general way
> possible. See include/linux/unaligned.h. As such, very bad code is
> generated, for example on alpha with BWX, we can implement all these
> functions with a single instruction, whereas we get stuff like this
> generated from the generic functions.
>
> __get_unaligned_le32:
> .frame $30,0,$26,0
> .prologue 0
> ldbu $0,1($16)
> ldbu $1,2($16)
> ldbu $2,3($16)
> ldbu $3,0($16)
> sll $1,16,$1
> sll $0,8,$0
> bis $0,$1,$0
> sll $2,24,$2
> bis $0,$3,$0
> bis $0,$2,$0
> addl $31,$0,$0
> ret $31,($26),1
>
> 4 load byte instructions, shift, shift, or, shift, or, or, sign extend
> -- or ldl_u instruction. The code is more than doubly-bad for le64.
>
> Do we use the generic functions for a reason I don't see? It appears
> that it would be easy enough to add architecture-specific unaligned
> get/put functions in arch/*/include/asm/unaligned.h

There should be no need for architecture specific code for Alpha. GCC
can generate the optimal code sequence for reads from unaligned struct
members as in linux/unaligned/packed_struct.h, and this code should be
used. So you should try to find out why it isn't.

Falk
--
To unsubscribe from this list: send the line "unsubscribe linux-alpha" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html