Re: [PATCH 1/5] oom: select_bad_process: check PF_KTHREAD insteadof !mm to skip kthreads

From: David Rientjes
Date: Wed Jun 02 2010 - 17:46:45 EST


On Wed, 2 Jun 2010, Oleg Nesterov wrote:

> > This isn't a bugfix, it simply prevents a recall to the oom killer after
> > the kthread has called unuse_mm(). Please show where any side effects of
> > oom killing a kthread, which cannot exit, as a result of use_mm() causes a
> > problem _anywhere_.
>
> I already showed you the side effects, but you removed this part in your
> reply.
>
> From http://marc.info/?l=linux-kernel&m=127542732121077
>
> It can't die but force_sig() does bad things which shouldn't be done
> with workqueue thread. Note that it removes SIG_IGN, sets
> SIGNAL_GROUP_EXIT, makes signal_pending/fatal_signal_pedning true, etc.
>
> A workqueue thread must not run with SIGNAL_GROUP_EXIT set, SIGKILL
> must be ignored, signal_pending() must not be true.
>
> This is bug. It is minor, agreed, currently use_mm() is only used by aio.
>

It's a problem that would probably never happen in practice because you're
talking about a race between select_bad_process() and __oom_kill_task()
which is wide since it iterates the entire tasklist, which workqueue
threads will be near the beginning of, and there is an extremely small
chance that the badness score for the mm that it assumed would be
considered the ideal task to kill. If you think this is rc material, then
push it to Andrew and say that.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/