Re: OOM problems on 2.6.12-rc1 with many fsx tests

From: Martin J. Bligh
Date: Mon Apr 04 2005 - 15:44:03 EST


>> > > I run into OOM problem again on 2.6.12-rc1. I run some(20) fsx tests on
>> > > 2.6.12-rc1 kernel(and 2.6.11-mm4) on ext3 filesystem, after about 10
>> > > hours the system hit OOM, and OOM keep killing processes one by one. I
>> > > could reproduce this problem very constantly on a 2 way PIII 700MHZ with
>> > > 512MB RAM. Also the problem could be reproduced on running the same test
>> > > on reiser fs.
>> > >
>> > > The fsx command is:
>> > >
>> > > ./fsx -c 10 -n -r 4096 -w 4096 /mnt/test/foo1 &
>> >
>> >
>> > This ext3 bug goes all the way back to 2.6.6.
>>
>> > I don't know yet why you saw problems with reiser3 and I'm pretty sure I
>> > saw problems with ext2. More testing is needed there.
>> >
>>
>> We (Janet and I) are chasing this bug as well. Janet is able to
>> reproduce this bug on 2.6.9 but I can't. Glad to know you have nail down
>> this issue on ext3. I am pretty sure I saw this on Reiser3 once, I will
>> double check it. Will try your patch today.
>
> There's a second leak, with similar-looking symptoms. At ~50
> commits/second it has leaked ~10MB in 24 hours, so it's very slow - less
> than a hundredth the rate of the first one.

What are you using to see these with, just kgdb, and a large cranial
capacity? Or is there some more magic?

m.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/