2.4.23 && md raid1 && reiserfs panic

From: James Bromberger
Date: Sat Feb 07 2004 - 06:25:28 EST




Hello all,

Can someone point me towards the right mailing list to help get this
issue resolved; I think I've hit something in either the MD, block or
reiserfs sections, but I could be completely wrong.

The symptoms: rm a file from a working RAID1 md reiserfs filesystem,
and I get a panic, rm(1) segfaults, and all further I/O to any interactive
shells stop. The entire system is rednered incapable; reboot (via
ctrl-alt-del) doesnt shutdown and the only action is to hard reset the box.

Attached is a copy of the oops as recorded by syslog, the script I run to
start the error. The exact line that Segfaults is the rm() on a 2.2 GB
tar file.

For reference, the system is documented at
http://www.james.rcpt.to/programs/debian/raid1/

This is only using the Debian packages of the kernel.
kernel-image-2.4.23-686_2.4.23-1.

If there is any further info I can send, please let me know.

Please CC me as I am not subscribed to LKML. Ta.

Regards,
James
--
James Bromberger <james_AT_rcpt.to> www.james.rcpt.to
Remainder moved to http://www.james.rcpt.to/james/sig.html

I am in London on UK mobile +44 7952 042920.
The kernel: 2.4.23-1-686-

The script:

DAY=`date +%d`
MONTH=`date +%m`
YEAR=`date +%Y`
FROM="/usr/local/fileshare /usr/local/psql"
TO="/usr/local/share/${YEAR}-${MONTH}-${DAY}-fileshare-backup.tbz"
TAR="/bin/tar"
CMD="$TAR -Pcjlf $TO $FROM"
/bin/echo -n "Removing old backup..."
/bin/rm /usr/local/share/????-??-??-fileshare-backup.tbz

The script running:

++ date +%d
+ DAY=07
++ date +%m
+ MONTH=02
++ date +%Y
+ YEAR=2004
+ FROM=/usr/local/fileshare /usr/local/psql
+ TO=/usr/local/share/2004-02-07-fileshare-backup.tbz
+ TAR=/bin/tar
+ CMD=/bin/tar -Pcjlf /usr/local/share/2004-02-07-fileshare-backup.tbz /usr/local/fileshare /usr/local/psql
+ /bin/echo -n 'Removing old backup...'
Removing old backup...+ /bin/rm /usr/local/share/2004-01-30-fileshare-backup.tbz./makecompressedbackup.sh: line 23: 9729 Segmentation fault /bin/rm /usr/local/share/????-??-??-fileshare-backup.tbz
++ date
+ NOW=Sat Feb 7 17:21:19 WST 2004
+ echo 'Sat Feb 7 17:21:19 WST 2004 Doing /bin/tar -Pcjlf /usr/local/share/2004-02-07-fileshare-backup.tbz /usr/local/fileshare /usr/local/psql...'
Sat Feb 7 17:21:19 WST 2004 Doing /bin/tar -Pcjlf /usr/local/share/2004-02-07-fileshare-backup.tbz /usr/local/fileshare /usr/local/psql...
+ /bin/tar -Pcjf /usr/local/share/2004-02-07-fileshare-backup.tbz /usr/local/fileshare /usr/local/psql



From /var/log/syslog:


Feb 7 17:21:19 phobe kernel: md(9,5):vs-4075: reiserfs_free_block: block 1076949926 is out of range on md(9,5)
Feb 7 17:21:19 phobe kernel: md(9,5):vs-4075: reiserfs_free_block: block 2212879009 is out of range on md(9,5)
Feb 7 17:21:19 phobe kernel: Unable to handle kernel NULL pointer dereference at virtual address 0000028c
Feb 7 17:21:19 phobe kernel: printing eip:
Feb 7 17:21:19 phobe kernel: e085124d
Feb 7 17:21:19 phobe kernel: *pde = 00000000
Feb 7 17:21:19 phobe kernel: Oops: 0002
Feb 7 17:21:19 phobe kernel: CPU: 0
Feb 7 17:21:19 phobe kernel: EIP: 0010:[ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1772979/96] Not tainted
Feb 7 17:21:19 phobe kernel: EFLAGS: 00010282
Feb 7 17:21:19 phobe kernel: eax: 00000000 ebx: 0001d0a2 ecx: df47dc00 edx: e0ca624c
Feb 7 17:21:19 phobe kernel: esi: 0000146b edi: e0c170d4 ebp: 000003f4 esp: d56b1b88
Feb 7 17:21:19 phobe kernel: ds: 0018 es: 0018 ss: 0018
Feb 7 17:21:19 phobe kernel: Process rm (pid: 9729, stackpage=d56b1000)
Feb 7 17:21:19 phobe kernel: Stack: d56b1f1c d56b1f1c e851146b 00000000 e0855af0 df47dc00 e851146b e0c170d4
Feb 7 17:21:19 phobe kernel: 00005aa1 e08327ca d56b1f1c e851146b c763be68 000003f4 e08328c6 d56b1f1c
Feb 7 17:21:19 phobe kernel: df47dc00 e851146b e851146b 00000065 e084d735 d56b1f1c e851146b cbf92b00
Feb 7 17:21:19 phobe kernel: Call Trace: [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1754384/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1898550/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1898298/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1788107/96] [__make_request+892/1936]
Feb 7 17:21:19 phobe kernel: [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1541824/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1541784/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1786838/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1784312/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1782530/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1785591/96]
Feb 7 17:21:19 phobe kernel: [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1868477/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1717594/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1868624/96] [ide-floppy:__insmod_ide-floppy_O/lib/modules/2.4.23-1-686/kernel/drive+-1716640/96] [iput+304/640] [cached_lookup+27/112]
Feb 7 17:21:19 phobe kernel: [vfs_unlink+242/448] [sys_unlink+186/288] [system_call+51/56]
Feb 7 17:21:19 phobe kernel:
Feb 7 17:21:19 phobe kernel: Code: 0f ab 30 8b 5c 24 04 31 c0 8b 74 24 08 8b 7c 24 0c 83 c4 10
...
Feb 7 17:26:23 phobe sm-mta[1274]: rejecting connections on daemon MTA: load average: 12
Feb 7 17:26:23 phobe sm-mta[1274]: rejecting connections on daemon MSA: load average: 12

Attachment: signature.asc
Description: Digital signature