Re: OOM problems on 2.6.12-rc1 with many fsx tests

From: Andrew Morton
Date: Mon Apr 04 2005 - 15:18:02 EST


Mingming Cao <cmm@xxxxxxxxxx> wrote:
>
> On Sun, 2005-04-03 at 18:35 -0700, Andrew Morton wrote:
> > Mingming Cao <cmm@xxxxxxxxxx> wrote:
> > >
> > > I run into OOM problem again on 2.6.12-rc1. I run some(20) fsx tests on
> > > 2.6.12-rc1 kernel(and 2.6.11-mm4) on ext3 filesystem, after about 10
> > > hours the system hit OOM, and OOM keep killing processes one by one. I
> > > could reproduce this problem very constantly on a 2 way PIII 700MHZ with
> > > 512MB RAM. Also the problem could be reproduced on running the same test
> > > on reiser fs.
> > >
> > > The fsx command is:
> > >
> > > ./fsx -c 10 -n -r 4096 -w 4096 /mnt/test/foo1 &
> >
> >
> > This ext3 bug goes all the way back to 2.6.6.
>
> > I don't know yet why you saw problems with reiser3 and I'm pretty sure I
> > saw problems with ext2. More testing is needed there.
> >
>
> We (Janet and I) are chasing this bug as well. Janet is able to
> reproduce this bug on 2.6.9 but I can't. Glad to know you have nail down
> this issue on ext3. I am pretty sure I saw this on Reiser3 once, I will
> double check it. Will try your patch today.

There's a second leak, with similar-looking symptoms. At ~50
commits/second it has leaked ~10MB in 24 hours, so it's very slow - less
than a hundredth the rate of the first one.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/