Re: [PATCHSET RFC] ptrace,signal: clean transition between STOPPEDand TRACED

From: Tejun Heo
Date: Thu Jan 13 2011 - 10:05:24 EST


Hello, Jan. (Please ignore the other reply. I pressed the wrong keys
and it went out prematurely.)

On Wed, Jan 12, 2011 at 10:43:50PM +0100, Jan Kratochvil wrote:
> FSF GDB for `T (stopped)' processes currently does:
> PTRACE_ATTACH
> check /proc/%d/status for `T (stopped)' (by GDB's pid_is_stopped)
> if found then kill (PID, SIGSTOP) && ptrace (PTRACE_CONT, PID, 0, 0).
> waitpid (pid, &status, 0) - so that this one does not get stuck if the stop
> event was already eaten out before.
>
> If the `T (stopped)' will now always FAIL then at leat the waitpid then should
> never get stuck.

Now the child will always stop inside ptrace_stop() which
unconditionally sets ->exit_code (the state wait(2) consumes) so it
won't get stuck unless someone else consumes it inbetween, of course.

> > 2. The transition between STOPPED and TRACED involves a short window
> > of RUNNING inbetween. On attach, the transition is hidden from the
> > tracer using GROUP_STOP_TRAPPING but it still is visible to other
> > threads in the tracer's group. IOW, if another thread performs
> > WNOHANG wait(2) on the tracee while attach is in progress, the
> > wait(2) may fail even if the tracee is known to be in stopped state
> > before.
> >
> > The same problem exists the other direction during detach.
> > Currently, the code doesn't try to hide this transition even from
> > the tracer. IOW, if the tracer attaches to a stopped task,
> > detaches, reattaches and then performs WNOHANG wait(2), the wait(2)
> > may fail. However, given the previous behavior where the tracee is
> > always woken up by wake_up_process() on detach, this is highly
> > unlikely to cause any problem.
>
> FSF gdbserver --multi does PTRACE_ATTACH followed by waitpid (WNOHANG) and it
> fails if it returns ECHILD on the first try.
>
> ptrace(PTRACE_ATTACH, 22049, 0, 0) = 0
> wait4(-1, [{WIFSTOPPED(s) && WSTOPSIG(s) == SIGSTOP}], WNOHANG, NULL) = 22049
>
> It may be also a gdbserver bug, though.

How is that supposed to work even with the current code? Does it
check whether the target thread is stopped before doing that? Reading
gdbserver manpage.... Hmm... doesn't help much. Can you please help
me understanding what it expects there?

Thank you.

--
tejun
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/