RE: Unknown Issue.

From: Piszcz, Justin Michael
Date: Mon Dec 13 2004 - 09:01:11 EST


Patrick,

I had the same problem on two machines with XFS. Both slackware-current
machines. The kernel on the Dell GX1 was built with GCC-3.4.2 and on my
main box was GCC-3.4.3.

There seems to be a bug in XFS with some configurations of 2.6.9 and
2.6.10-rc series.

After re-installing Slackware-10.0 and upgrading to -current, I have
installed 2.6.10-rc3 and so far, I have not been able to reproduce the
problem.

Some questions for you:

1] What kernel are you running?
2] What did you last change before you started getting these errors?

As far as severity goes, I ran XFS' fsck from a KNOPPIX CD and as a
result, I had about 500-600mb of files in my /lost+found directory when
it was finished. Files were missing from all parts of the file system.
I had to restore from backup. I would say stick with your previous
2.6.9 configuration (if you were running it) or go back to 2.6.8.1, some
2.6.9 configurations and 2.6.10-rc1 and/or 2.6.10-rc2 definitely cause
file corruption with XFS. So far, however, I have not been able to
reproduce the error with 2.6.10-rc3.

Justin.

-----Original Message-----
From: linux-kernel-owner@xxxxxxxxxxxxxxx
[mailto:linux-kernel-owner@xxxxxxxxxxxxxxx] On Behalf Of Patrick
Sent: Sunday, December 12, 2004 4:15 PM
To: linux-kernel@xxxxxxxxxxxxxxx
Subject: Unknown Issue.

Hi,

I've got a computer running gentoo, on a clean install where i've got
an odd problem :

after a while, the computer refuses to spawn processes anymore :

-/bin/bash: /bin/ps: Input/output error
-/bin/bash: /usr/bin/w: Input/output error
-/bin/bash: /bin/df: Input/output error
-/bin/bash: /bin/mount: Input/output error

It happen's randomly, i've tried everything from changing the computer
from running software raid ( scsi ) to running a hardware solution and
reinstalling, I've run the memory through memtest as well as i've
remounted the drives and i've tested the ram to make sure it was
properly mounted.

The only thing running on this box is mysql, which runs perfectly at
7500 q/s ( running super smack ) now, i'm not sure if this is a linux
kernel thing, or a gentoo thing, or a hardware thing.

I've checked and i'm not running out of file descriptors ( by looking
in /proc/sys/fs/file-nr ) and i've increased the ammount in (
/proc/sys/fs/file-max ( if i member correctly ) ) by adding a 0 after
the end of the value thus increasing it alot.

It's running XFS on the root partition with a single partition, dual
xeon 2.66 with hyperthreading enabled, dual intel gbe and a adaptec
2120S AACraid card. Dual 36gb 10krpm scsi drives in raid1.

Does anyone have any ideas on what i can do, what i can test, if it's
hardware ? software ?

guys ?

P

--
</N>

------
In the beginning, there was nothing. And God said, 'Let there be
Light.' And there was still nothing, but you could see a bit better.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel"
in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/