Re: [PATCH 0/3] extend get/setrlimit to support setting rlimitsexternal to a process (v7)

From: Ingo Molnar
Date: Fri Nov 06 2009 - 04:26:24 EST



* Neil Horman <nhorman@xxxxxxxxxxxxx> wrote:

> On Wed, Nov 04, 2009 at 12:26:32PM +0100, Ingo Molnar wrote:
> >
> > * Neil Horman <nhorman@xxxxxxxxxxxxx> wrote:
> >
> > > On Mon, Nov 02, 2009 at 07:51:37PM +0100, Ingo Molnar wrote:
> > > >
> > > > * Neil Horman <nhorman@xxxxxxxxxxxxx> wrote:
> > > >
> > > > > > Have you ensured that no rlimit gets propagated during task init
> > > > > > into some other value - under the previously correct assumption that
> > > > > > rlimits dont change asynchronously under the feet of tasks?
> > > > >
> > > > > I've looked, and the only place that I see the rlim array getting
> > > > > copied is via copy_signal when we're in the clone path. The
> > > > > entire rlim array is copied from old task_struct to new
> > > > > task_struct under the protection of the current->group_leader task
> > > > > lock, which I also hold when updating via sys_setprlimit, so I
> > > > > think we're safe in this case.
> > > >
> > > > I mean - do we set up any data structure based on a particular
> > > > rlimit, that can get out of sync with the rlimit being updated?
> > > >
> > > > A prominent example would be the stack limit - we base address
> > > > layout decisions on it. Check arch/x86/mm/mmap.c. RLIM_INFINITY has
> > > > a special meaning plus we also set mmap_base() based on the rlim.
> > >
> > > Ah, I didn't consider those. Yes it looks like some locking might be
> > > needed for cases like that. what would you suggest, simply grabbing
> > > the task lock before looking at the rlim array? That seems a bit
> > > heavy handed, especially if we want to use the locking consistently.
> > > What if we just converted the int array of rlimit to atomic_t's?
> > > Would that be sufficient, or still to heavy?
>
> Just to provide a quick update on this, it appears that (unbeknowst to me),
> Jiri Slaby got almost this exact same feature in via the linux-next tree:
> commits
> ba9ba971a9241250646091935d77d2f31b7c15af
> 4a4a4e5f51d866284db401ea4d8ba5f0c91cc1eb
> c1b9b7eaf7386a7f142d59a2bb433ac8217b0ad1
>
> It still likely needs an audit to make sure theres no race with task
> access on the rlimit array, but it doesn't currently require
> additional security checks because the only access for a process to
> another processes limits is by writing to the /proc/<pid>/limits file,
> as I had initial proposed. I think theres still value in the
> sysscall, so I'll keep going with that aspect, but the rest of the
> work appears done.

(Cc:-ed Jiri)

Jiri, i think your patches are incomplete for the same reasons i
outlined to Neil.

Also, the locking there looks messy:

+ /* optimization: 'current' doesn't need locking, e.g. setrlimit */
+ if (tsk != current) {
+ /* protect tsk->signal and tsk->sighand from disappearing */
+ read_lock(&tasklist_lock);
+ if (!tsk->sighand) {
+ retval = -ESRCH;
+ goto out;
+ }
}

Neil's splitup into a helper function looks _far_ cleaner.

I'm also wondering, how did these commits get into linux-next? It
appears that that the 'writable_limits' tree got added by sfr to
linux-next on Oct 26 just based on Jiri's request, without acks/review
from the people generally involved with this code.

Stephen, this is the Nth incident of linux-next merging random new
feature trees on its own, without apparently having pinged/Cc:-ed the
maintainers/developers involved and without you having thought through
the stuff you merge. (Perfmon was perhaps the worst incident, about a
year ago - but there's been other cases as well since then.)

As things stand now you are treating linux-next as your own tree in
essence, merging/unmerging trees to your own desire, allowing
unreviewed/unacked commits into linux-next - which is fine but then
please lets not call it the 'next Linux' but sfr-next or so ...

Btw., this is not against Jiri's tree - i think out of Jiri's and Neil's
patches a nice rlimits feature could be done for 2.6.33 - but IMHO this
chaotic (non-)quality merge process of linux-next cannot go on like this
...

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/