Re: [RFC,PATCH 3/3] do_wait: fix waiting for stopped group with dead leader
From: Eric W. Biederman
Date: Mon Dec 10 2007 - 19:44:12 EST
Oleg Nesterov <oleg@xxxxxxxxxx> writes:
> do_wait(WSTOPPED) assumes that p->state must be == TASK_STOPPED, this is not
> true if the leader is already dead. Check SIGNAL_STOP_STOPPED instead and use
> ->signal->group_exit_code.
>
> This patch is not complete if not buggy. At the very minimum it needs cleanup.
Thinking about this set of problems. Testing SIGNAL_STOP_STOPPED
seems more correct then testing TASK_STOPPED. It ensures we don't
have a race, and except for ptrace the only way to stop a task
triggers SIGNAL_STOP_STOPPED.
We need a similar flag for thread group exit, to mark when every task
in the thread group has exited.
With those in place we can have race free tests of our status.
/proc/<tgid>/status needs to be updated to use those the per
signal struct status bits as well.
As for the exit_code, we set tsk->exit_code = sig->group_exit_code
so that doesn't seem to be a problem either.
So to get a task group status looking at bits on the signal struct
looks like the right approach, as this ensures we can avoid races in
setting the status, and we don't need to test a dozen other fields.
There is still some value in my other approach but even it will
have small races if we continue look at per task status bits when
what we want is a per thread group status.
Eric
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/