Re: [BUG?] 2.6.25-rc[23]-mm1 cgroup list corruption under load with VM Scalability patches

From: KOSAKI Motohiro
Date: Thu Mar 20 2008 - 03:58:27 EST


Hi

CC'ed KAMEZAWA-san

> > > list_del corruption in cgroup_exit() on 16 cpu, 32GB ia64 NUMA platform.
> > >
> > > I've been seeing this for a while now, but we've had known problems
> > > [page leaks, ...] with the VM scalability series. Now the system
> > > appears to be running very well with these patches under stress loads
> > > that would hang it or cause OOM kill of tests with plenty of swap space
> > > left. Eventually, [after 40-45 minutes], I hit a list corruption in
> > > cgroup_exit().
> > >
> > > I can't say for sure that our patches aren't causing this, but I've been
> > > unable to keep the system up long enough under the stress load w/o the
> > > splitlru+noreclaim patches to hit the problem.

sorry for late responce, I don't notice this thread.
fujitsu guys investigated that problem too.

AFAIK this problem already fixed KAMEZAWA-san.
but I don't know that patch merged -mm or not.

kamezawa-san, please let us know its status?
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/