Re: Fw: BUG_ONs in signal.c?

From: Jesse Barnes
Date: Mon Oct 25 2004 - 11:32:55 EST


On Friday, October 22, 2004 9:29 pm, Roland McGrath wrote:
> Oh, duh. The race is obvious. Sorry for the confusion there.
> I think this is the way to fix it.
>
> --- linux-2.6/kernel/signal.c 19 Oct 2004 15:03:02 -0000 1.143
> +++ linux-2.6/kernel/signal.c 23 Oct 2004 04:23:31 -0000
> @@ -1909,22 +1910,16 @@ relock:
> * Anything else is fatal, maybe with a core dump.
> */
> current->flags |= PF_SIGNALED;
> - if (sig_kernel_coredump(signr) &&
> - do_coredump((long)signr, signr, regs)) {
> + if (sig_kernel_coredump(signr)) {
> /*
> - * That killed all other threads in the group and
> - * synchronized with their demise, so there can't
> - * be any more left to kill now. The group_exit
> - * flags are set by do_coredump. Note that
> - * thread_group_empty won't always be true yet,
> - * because those threads were blocked in __exit_mm
> - * and we just let them go to finish dying.
> - */
> - const int code = signr | 0x80;
> - BUG_ON(!current->signal->group_exit);
> - BUG_ON(current->signal->group_exit_code != code);
> - do_exit(code);
> - /* NOTREACHED */
> + * If it was able to dump core, this kills all
> + * other threads in the group and synchronizes with
> + * their demise. If we lost the race with another
> + * thread getting here, it set group_exit_code
> + * first and our do_group_exit call below will use
> + * that value and ignore the one we pass it.
> + */
> + do_coredump((long)signr, signr, regs);
> }
>
> /*

Yeah, this looks good, although we'll end up calling do_group_exit instead of
do_exit for the dumped task, is that ok?

> While looking at this, I noticed a bug (not directly related) in
> do_coredump. It was setting the "core dumped" flag even when the format
> dumping hook failed (e.g. for memory allocation failures).
>
>
> --- linux-2.6/fs/exec.c 19 Oct 2004 15:05:13 -0000 1.146
> +++ linux-2.6/fs/exec.c 23 Oct 2004 04:23:42 -0000
> @@ -1417,7 +1417,8 @@ int do_coredump(long signr, int exit_cod
>
> retval = binfmt->core_dump(signr, regs, file);
>
> - current->signal->group_exit_code |= 0x80;
> + if (retval)
> + current->signal->group_exit_code |= 0x80;
> close_fail:
> filp_close(file, NULL);
> fail_unlock:

Yeah, saw this too, sorry I forgot to mention it.

Thanks,
Jesse
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/