Re: 2.6.14-mm1 RAID-1 in D< state

From: Chris Boot
Date: Wed Nov 09 2005 - 18:15:59 EST


On 9 Nov 2005, at 22:23, Neil Brown wrote:

On Wednesday November 9, bootc@xxxxxxxxx wrote:
Hi all,

I haven't noticed this until today...but my load average has been
skyrocketing past 3.00 since Monday, which is when I upgraded to
2.6.14-mm1. I've got 3 Software RAID-1 arrays across 4 SATA disks, and
all 3 processes are locked in an uninterruptible sleep.

What's interesting, though, is I haven't noticed a degradation of
performance at all, and all the arrays work absolutely fine. They aren't
rebuilding or doing anything strange that I can see.

Any ideas?

Can you
echo t > /proc/sysrq-trigger
dmesg > /tmp/log
and post the log created, possibly removing everything before
SysRq : Show State

So that's what the sysrq-trigger is for... :-) Certainly easier that way when your system still works!

If you can't find the 'Show State', then maybe your log buffer isn't
big enough. use 'dmesg -s ...' to make it bigger and try again

It was too small, but the serial console got it:

[4329954.200000] md2_raid1 D F7D776E0 0 809 6 810 799 (L-TLB)
[4329954.200000] f7db7f30 f7d2ba8c c02809e0 f7d776e0 c02c14f2 e9924580 c1b48b60 c1b8e200
[4329954.200000] f7c5bd40 7fffffff f7db7f88 00000000 23c37e00 000f6206 f7d6fa50 f7d6fb78
[4329954.200000] 7fffffff 7fffffff f7db7f88 f7db6000 c0338098 c1b8e200 f7db7f94 f7db7f88
[4329954.200000] Call Trace:
[4329954.200000] [<c02809e0>] generic_unplug_device+0x10/0x20
[4329954.200000] [<c02c14f2>] unplug_slaves+0xd2/0xe0
[4329954.200000] [<c0338098>] schedule_timeout+0x98/0xa0
[4329954.200000] [<c01295a9>] finish_wait+0x39/0x50
[4329954.200000] [<c02c9309>] md_thread+0xc9/0x100
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c01142d7>] __wake_up_common+0x37/0x60
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c02c9240>] md_thread+0x0/0x100
[4329954.200000] [<c0129174>] kthread+0xa4/0xe0
[4329954.200000] [<c01290d0>] kthread+0x0/0xe0
[4329954.200000] [<c0100f35>] kernel_thread_helper+0x5/0x10
[4329954.200000] md0_raid1 D F7D774A0 0 810 6 812 809 (L-TLB)
[4329954.200000] f7db5f30 f7d2b79c c02809e0 f7d774a0 c02c14f2 c0383bc0 c1b48ae0 c1b8e400
[4329954.200000] f7c5bb60 7fffffff f7db5f88 00000000 9bd42ec0 000f6211 f7d69090 f7d691b8
[4329954.200000] 7fffffff 7fffffff f7db5f88 f7db4000 c0338098 c1b8e400 00000002 f7db4000
[4329954.200000] Call Trace:
[4329954.200000] [<c02809e0>] generic_unplug_device+0x10/0x20
[4329954.200000] [<c02c14f2>] unplug_slaves+0xd2/0xe0
[4329954.200000] [<c0338098>] schedule_timeout+0x98/0xa0
[4329954.200000] [<c0129501>] prepare_to_wait+0x41/0x50
[4329954.200000] [<c02c9309>] md_thread+0xc9/0x100
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c01142d7>] __wake_up_common+0x37/0x60
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c02c9240>] md_thread+0x0/0x100
[4329954.200000] [<c0129174>] kthread+0xa4/0xe0
[4329954.200000] [<c01290d0>] kthread+0x0/0xe0
[4329954.200000] [<c0100f35>] kernel_thread_helper+0x5/0x10
[4329954.200000] md1_raid1 D F7D77860 0 812 6 813 810 (L-TLB)
[4329954.200000] f7dbbf30 f7d2bc04 c02809e0 f7d77860 c02c14f2 e9924580 c1b48a60 c1b8e000
[4329954.200000] f7c5f920 7fffffff f7dbbf88 00000000 2358ae40 000f6206 f7d5b5c0 f7d5b6e8
[4329954.200000] 7fffffff 7fffffff f7dbbf88 f7dba000 c0338098 c1b8e000 f7dbbf88 f7dba000
[4329954.200000] Call Trace:
[4329954.200000] [<c02809e0>] generic_unplug_device+0x10/0x20
[4329954.200000] [<c02c14f2>] unplug_slaves+0xd2/0xe0
[4329954.200000] [<c0338098>] schedule_timeout+0x98/0xa0
[4329954.200000] [<c02c29ba>] raid1d+0x32a/0x350
[4329954.200000] [<c02c9309>] md_thread+0xc9/0x100
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c01142d7>] __wake_up_common+0x37/0x60
[4329954.200000] [<c01295c0>] autoremove_wake_function+0x0/0x50
[4329954.200000] [<c02c9240>] md_thread+0x0/0x100
[4329954.200000] [<c0129174>] kthread+0xa4/0xe0
[4329954.200000] [<c01290d0>] kthread+0x0/0xe0
[4329954.200000] [<c0100f35>] kernel_thread_helper+0x5/0x10

Let me know if you need dumps of any other processes.

NeilBrown

Cheers,
Chris

--
Chris Boot
bootc@xxxxxxxxx
http://www.bootc.net/


Attachment: smime.p7s
Description: S/MIME cryptographic signature