Re: [PATCH] exec: Avoid RLIMIT_STACK races with prlimit()
From: Serge E. Hallyn
Date:  Wed Nov 29 2017 - 13:20:17 EST
Quoting Kees Cook (keescook@xxxxxxxxxxxx):
> While the defense-in-depth RLIMIT_STACK limit on setuid processes was
> protected against races from other threads calling setrlimit(), I missed
> protecting it against races from external processes calling prlimit().
> This adds locking around the change and makes sure that rlim_max is set
> too.
> 
> Reported-by: Ben Hutchings <ben.hutchings@xxxxxxxxxxxxxxx>
> Reported-by: Brad Spengler <spender@xxxxxxxxxxxxxx>
> Fixes: 64701dee4178e ("exec: Use sane stack rlimit under secureexec")
> Cc: stable@xxxxxxxxxxxxxxx
> Cc: James Morris <james.l.morris@xxxxxxxxxx>
> Cc: Serge Hallyn <serge@xxxxxxxxxx>
Acked-by: Serge Hallyn <serge@xxxxxxxxxx>
The only thing i'm wondering is in do_prlimit():
. 1480         if (new_rlim) {
. 1481                 if (new_rlim->rlim_cur > new_rlim->rlim_max)
. 1482                         return -EINVAL;
that bit is done not under the lock.  Does that still allow a
race, if this check is done before the below block, and then the
rest proceeds after?
I *think* not, because later in do_prlimit() it will return -EPERM if
. 1500                 if (new_rlim->rlim_max > rlim->rlim_max &&
. 1501                                 !capable(CAP_SYS_RESOURCE))
Although rlim is gathered before the lock, but that is a struct *
so should be ok?
> Cc: Andy Lutomirski <luto@xxxxxxxxxx>
> Cc: Oleg Nesterov <oleg@xxxxxxxxxx>
> Cc: Jiri Slaby <jslaby@xxxxxxx>
> Signed-off-by: Kees Cook <keescook@xxxxxxxxxxxx>
> ---
>  fs/exec.c | 7 ++++++-
>  1 file changed, 6 insertions(+), 1 deletion(-)
> 
> diff --git a/fs/exec.c b/fs/exec.c
> index 1d6243d9f2b6..6be2aa0ab26f 100644
> --- a/fs/exec.c
> +++ b/fs/exec.c
> @@ -1340,10 +1340,15 @@ void setup_new_exec(struct linux_binprm * bprm)
>  		 * avoid bad behavior from the prior rlimits. This has to
>  		 * happen before arch_pick_mmap_layout(), which examines
>  		 * RLIMIT_STACK, but after the point of no return to avoid
> -		 * needing to clean up the change on failure.
> +		 * races from other threads changing the limits. This also
> +		 * must be protected from races with prlimit() calls.
>  		 */
> +		task_lock(current->group_leader);
>  		if (current->signal->rlim[RLIMIT_STACK].rlim_cur > _STK_LIM)
>  			current->signal->rlim[RLIMIT_STACK].rlim_cur = _STK_LIM;
> +		if (current->signal->rlim[RLIMIT_STACK].rlim_max > _STK_LIM)
> +			current->signal->rlim[RLIMIT_STACK].rlim_max = _STK_LIM;
> +		task_unlock(current->group_leader);
>  	}
>  
>  	arch_pick_mmap_layout(current->mm);
> -- 
> 2.7.4
> 
> 
> -- 
> Kees Cook
> Pixel Security