Re: [RFC PATCH 01/14] addDocumentation/namespaces/user_namespace.txt

From: Serge E. Hallyn
Date: Wed Jul 13 2011 - 22:37:32 EST


Quoting David Howells (dhowells@xxxxxxxxxx):
> Serge Hallyn <serge@xxxxxxxxxx> wrote:

Thanks for the detailed comments, David.

> > +... The user namespace allows tasks to
> > +have different views of the uids and gids associated with tasks and
> > +other resources.
>
> How does this relate to UIDs/GIDs stored on disk?

The current plan (see 'flexible uid mapping' at
https://wiki.ubuntu.com/UserNamespace) is:

The uid/gid stored on disk will be that in the init_user_ns. Most
likely uid/gid in other namespaces will be stored in xattrs. But
Eric was advocating (a few years ago) leaving the details up to
filesystems while providing a lib/ stock implementation. See the
thread around here
http://www.mail-archive.com/devel@xxxxxxxxxx/msg09331.html

...

> > ... Opening such files will result in obtaining the 'user other'
> > permissions.
>
> How about "Permission to open such files will be granted according to the
> 'user other' permissions."?
>
> Do you mean 'user other' or just 'other'?

'user other'

> > ... UID comparisons will return false, and privilege will be denied.
>
> UID and GID both?

Right, GID also part of the user namespace.

> You should probably be consistent about using all 'UID/GID' or all 'uid/gid'.
> I prefer the former as it's an acronym, but that's up to you.

ok.

> > When a task belonging to userid 500 in the initial user namespace
>
> Is 500 special? Or is this just a worked example?

example.

...

> > Working notes
> > =============
> > capable checks for actions related to syslog must be against the
> > init_user_ns until syslog is containerized.
>
> Do you mean the 'capable' function? If so, I recommend you suffix it with
> '()'. Or did you mean 'Capability checks'?

I meant capability checks.

> > Same is true for reboot and power, control groups, devices, and time.
> >
> > Perf actions (kernel/event/core.c for instance) will always be
> > constrained to init_user_ns.
> >
> > Q:
> > Is accounting considered properly containerized wrt pidns? (it
> > appears to be). If so, then we can change the capable check in
>
> 'capability check' or 'capable() call'? Anyone reading this ought to know what
> capable() does.

Here I meant capable(). I definately see I need to be clearer.

> > kernel/acct.c to 'ns_capable(current_pid_ns()->user_ns, CAP_PACCT)'
> >
...

> > For drivers/tty/tty_io.c and drivers/tty/vt/vt.c, we'll want to (for
> > some of them) target at the user_ns owning the tty. That will have
> > to wait until we get userns owning files straightened out.
>
> Target what at the user_ns?

Target the capability check at the user_ns.

> > We need to figure out how to label devices. Should we just toss a user_ns
> > right into struct device?
>
> Would that isolate a device and make it exclusively accessible by that user_ns?

I think so, which is probably too restrictive until a devices namespace
can help us work around it when needed.

Thanks again,

-serge
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/