[GIT PULL] user namespace and namespace infrastructure changes for 3.8

From: Eric W. Biederman
Date: Tue Dec 11 2012 - 16:17:18 EST



Linus,

Please pull the for-linus git tree from:

git://git.kernel.org:/pub/scm/linux/kernel/git/ebiederm/user-namespace.git for-linus

HEAD: 98f842e675f96ffac96e6c50315790912b2812be proc: Usable inode numbers for the namespace file descriptors.

This tree is against v3.7-rc3

While small this set of changes is very significant with respect to
containers in general and user namespaces in particular. The user space
interface is now complete.

This set of changes adds support for unprivileged users to create user
namespaces and as a user namespace root to create other namespaces. The
tyrrany of supporting suid root preventing unprivileged users from using
cool new kernel features is broken.

This set of changes completes the work on setns, adding support for
the pid, user, mount namespaces.

This set of changes includes a bunch of basic pid namespace
cleanups/simplifications. Of particular significance is the rework of
the pid namespace cleanup so it no longer requires sending out tendrils
into all kinds of unexpected cleanup paths for operation. At least one
case of broken error handling is fixed by this cleanup.

The files under /proc/<pid>/ns/ have been converted from regular files
to magic symlinks which prevents incorrect caching by the VFS, ensuring
the files always refer to the namespace the process is currently using
and ensuring that the ptrace_mayaccess permission checks are always
applied.

The files under /proc/<pid>/ns/ have been given stable inode numbers so
it is now possible to see if different processes share the same
namespaces.

Through the David Miller's net tree are changes to relax many of the
permission checks in the networking stack to allowing the user namespace
root to usefully use the networking stack. Similar changes for the
mount namespace and the pid namespace are coming through my tree.

Two small nework namespace changes were double committed here and in
David Millers -net tree so that I could complete the work on the
/proc/<pid>/ns/ files in this tree.

The user namespace work that remains is converting, 9p, afs, ceph, cifs,
coda, gfs2, ncpfs, nfs, nfsd, ocfs2, and xfs so they are safe to enable
when user namespaces are enabled, and implementing unprivileged mounts
of more than just /proc and /sys.

I had hoped to get through more of those changes this cycle but
I turned into a cold magnet this season and the UAPI changes caused
a lot of churn late into the 3.7 -rc cycle that made a stable starting
place hard to work from hard to find.

Eric W. Biederman (37):
userns: Support autofs4 interacing with multiple user namespaces
userns: Support fuse interacting with multiple user namespaces
netns: Deduplicate and fix copy_net_ns when !CONFIG_NET_NS
userns: make each net (net_ns) belong to a user_ns
userns: On mips modify check_same_owner to use uid_eq
procfs: Use the proc generic infrastructure for proc/self.
procfs: Don't cache a pid in the root inode.
pidns: Capture the user namespace and filter ns_last_pid
pidns: Use task_active_pid_ns where appropriate
pidns: Make the pidns proc mount/umount logic obvious.
pidns: Don't allow new processes in a dead pid namespace.
pidns: Wait in zap_pid_ns_processes until pid_ns->nr_hashed == 1
pidns: Deny strange cases when creating pid namespaces.
pidns: Add setns support
pidns: Consolidate initialzation of special init task state
pidns: Support unsharing the pid namespace.
vfs: Allow chroot if you have CAP_SYS_CHROOT in your user namespace
vfs: Add setns support for the mount namespace
vfs: Add a user namespace reference from struct mnt_namespace
vfs: Only support slave subtrees across different user namespaces
vfs: Allow unprivileged manipulation of the mount namespace.
userns: Ignore suid and sgid on binaries if the uid or gid can not be mapped
userns: Allow unprivileged users to create user namespaces.
userns: Allow chown and setgid preservation
userns: Allow setting a userns mapping to your current uid.
userns: Allow unprivileged users to create new namespaces
userns: Allow unprivileged use of setns.
userns: Make create_new_namespaces take a user_ns parameter
userns: Kill task_user_ns
userns: Implent proc namespace operations
userns: Implement unshare of the user namespace
procfs: Print task uids and gids in the userns that opened the proc file
userns: For /proc/self/{uid,gid}_map derive the lower userns from the struct file
userns: Allow unprivilged mounts of proc and sysfs
proc: Generalize proc inode allocation
proc: Fix the namespace inode permission checks.
proc: Usable inode numbers for the namespace file descriptors.

Zhao Hongjiang (1):
userns: fix return value on mntns_install() failure

arch/mips/kernel/mips-mt-fpaff.c | 4 +-
arch/powerpc/platforms/cell/spufs/sched.c | 2 +-
arch/um/drivers/mconsole_kern.c | 2 +-
drivers/staging/android/binder.c | 3 +-
fs/attr.c | 11 +-
fs/autofs4/autofs_i.h | 8 +-
fs/autofs4/dev-ioctl.c | 4 +-
fs/autofs4/inode.c | 24 ++--
fs/autofs4/waitq.c | 5 +-
fs/exec.c | 9 +-
fs/fuse/dev.c | 4 +-
fs/fuse/dir.c | 20 ++--
fs/fuse/fuse_i.h | 4 +-
fs/fuse/inode.c | 23 ++--
fs/hppfs/hppfs.c | 2 +-
fs/mount.h | 3 +
fs/namespace.c | 211 ++++++++++++++++++++++++-----
fs/open.c | 2 +-
fs/pnode.h | 1 +
fs/proc/Makefile | 1 +
fs/proc/array.c | 2 +-
fs/proc/base.c | 169 +-----------------------
fs/proc/generic.c | 26 ++--
fs/proc/inode.c | 6 +-
fs/proc/internal.h | 1 +
fs/proc/namespaces.c | 185 ++++++++++++++++++++++---
fs/proc/root.c | 17 +--
fs/proc/self.c | 59 ++++++++
fs/sysfs/mount.c | 1 +
include/linux/cred.h | 2 -
include/linux/fs.h | 2 +
include/linux/ipc_namespace.h | 9 +-
include/linux/mnt_namespace.h | 3 +-
include/linux/nsproxy.h | 2 +-
include/linux/pid_namespace.h | 11 +-
include/linux/proc_fs.h | 26 ++++-
include/linux/user_namespace.h | 10 ++
include/linux/utsname.h | 7 +-
include/net/net_namespace.h | 26 +++-
init/Kconfig | 2 -
init/main.c | 1 -
init/version.c | 2 +
ipc/msgutil.c | 2 +
ipc/namespace.c | 32 ++++-
kernel/cgroup.c | 2 +-
kernel/events/core.c | 2 +-
kernel/exit.c | 12 --
kernel/fork.c | 69 +++++++---
kernel/nsproxy.c | 36 +++---
kernel/pid.c | 47 ++++++-
kernel/pid_namespace.c | 112 ++++++++++++---
kernel/ptrace.c | 10 +-
kernel/sched/core.c | 10 +-
kernel/signal.c | 2 +-
kernel/sysctl_binary.c | 2 +-
kernel/user.c | 2 +
kernel/user_namespace.c | 147 +++++++++++++++++---
kernel/utsname.c | 33 ++++-
net/core/net_namespace.c | 54 ++++++--
security/yama/yama_lsm.c | 12 ++-
60 files changed, 1026 insertions(+), 472 deletions(-)
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/