mirror_lxc

mirror of https://git.proxmox.com/git/mirror_lxc synced 2025-08-04 14:55:36 +00:00

Author	SHA1	Message	Date
Wolfgang Bumiller	e6ec0a9e71	apparmor: allow various remount,bind options RW bind mounts need to be restricted for some paths in order to avoid MAC restriction bypasses, but read-only bind mounts shouldn't have that problem. Additionally, combinations of 'nosuid', 'nodev' and 'noexec' flags shouldn't be a problem either and are required with newer systemd versions, so let's allow those as long as they're combined with 'ro,remount,bind'. Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2018-11-16 12:17:30 +01:00
Cameron Nemo	b19c5d1237	apparmor: account for specified rootfs path (closes #2617 ) Signed-off-by: Cameron Nemo <camerontnorman@gmail.com>	2018-09-20 15:56:05 -07:00
Wolfgang Bumiller	242a9fa7ee	apparmor: allow start-container to change to lxc- For generated profiles with apparmor namespaces we get profile names with slashes in them. To match those, we need to allow changing to lxc-, not just lxc-*. Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2018-07-25 14:40:26 +02:00
Wolfgang Bumiller	6e6aca3e3e	apparmor: update current profiles remove cgmanager rules and add fstype=cgroup2 variants for the existing fstype=cgroup rules Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2018-07-25 12:13:17 +02:00
Markos Chandras	733e375760	apparmor: Allow /usr/lib* paths for mount and pivot_root openSUSE Leap 15 is using --libdir=/usr/lib64 when building for x86_64 so we need to allow this path in the apparmor profiles. Link: https://bugzilla.opensuse.org/show_bug.cgi?id=1099239 Signed-off-by: Markos Chandras <mchandras@suse.de>	2018-07-19 12:17:24 +01:00
Christian Brauner	a9145d622f	tree-wide: cleanup - remove legacy binaries - conditionalize creation of docs and tests for the command line tools and the shared library helper commands Signed-off-by: Christian Brauner <christian.brauner@ubuntu.com>	2018-02-28 10:05:33 +01:00
Tycho Andersen	408dd86c76	drop useless apparmor denies mem and kmem are really in /dev, so this does us no good. Signed-off-by: Tycho Andersen <tycho@tycho.ws>	2017-10-03 09:40:08 -06:00
Frédéric Dalleau	d680929bbc	Allow containers to start in AppArmor namespaces This patch allows users to start containers in AppArmor namespaces. Users can define their own profiles for their containers, but lxc-start must be allowed to change to a namespace. A container configuration file can wrap a container in an AppArmor profile using lxc.aa_profile. A process in an AppArmor namespace is restricted to view or manage only the profiles belonging to this namespace, as if no other profiles existed. A namespace can be created as follow: sudo mkdir /sys/kernel/security/apparmor/policy/namespaces/$NAMESPACE AppArmor can stack profiles so that the contained process is bound by the intersection of all profiles of the stack. This is achieved using the '//&' operator as follow: lxc.aa_profile = $PROFILE//&:$NAMESPACE://unconfined In this case, even the guest process appears unconfined in the namespace, it is still confined by $PROFILE. A guest allowed to access "/sys/kernel/security/apparmor/** rwklix," will be able to manage its own profile set, while still being enclosed in the topmost profile $PROFILE: Different guests can be assigned the same namespace or different namespaces. In the first case, they will share their profiles. In the second case, they will have distinct sets of profiles. This is validated on privileged containers. Signed-off-by: Frédéric Dalleau <frederic.dalleau@collabora.com>	2017-07-07 12:33:38 +02:00
Stéphane Graber	60cfbd8a92	apparmor: Refresh generated file Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-06-28 15:35:58 -04:00
Wolfgang Bumiller	667cfb7c2d	AppArmor: add make-rslave to usr.bin.lxc-start The profile already contains mount options=(rw, make-slave) -> **, Which allows going through all mountpoints with make-slave, so it seems to make sense to also allow the directly recursive variant with "make-rslave". Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com> Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com>	2016-06-27 16:20:00 -04:00
Stéphane Graber	7f2b13275d	apparmor: Update mount states handling Properly list all of the states and the right apparmor stanza for them, then comment them all as actually enabling this would currently let the user bypass apparmor entirely. Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-06-27 16:03:49 -04:00
Stéphane Graber	af5f70c4b5	apparmor: allow mount move Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-06-27 15:11:47 -04:00
Stéphane Graber	e96e7a1ac7	apparmor: Allow bind-mounts and {r}shared/{r}private Bind-mounts aren't harmful in containers, so long as they're not used to bypass MAC policies. This change allows bind-mounting of any path which isn't a dangerous filesystem that's otherwise blocked by apparmor. This also allows switching paths {r}shared or {r}private. Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-06-23 16:01:29 -04:00
Stéphane Graber	e6faa0670c	Also allow fstype=fuse for fuse filesystems Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-05-11 12:59:23 -04:00
Aleksandr Mezin	a5c28c86f5	Fix installation of out-of-tree (VPATH) builds Signed-off-by: Aleksandr Mezin <mezin.alexander@gmail.com>	2016-03-25 19:30:49 +06:00
Serge Hallyn	4845c17aff	Prevent access to pci devices Prevent privileged containers from messing with the host's pci devices directly. Refuse access under /proc/bus, and drop cap_sys_rawio. Some containers may need to re-enable cap_sys_rawio (i.e. if they run an X server). It may be desirable to break some of this stuff into files which can be separately included (or not included), but this patch isn't the right place for that. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2016-03-16 15:18:51 -07:00
Serge Hallyn	537188a8ee	prevent containers from reading /sys/kernel/debug Unprivileged containers cannot read it anyway, but also prevent root owned containers from doing so. Sadly upstart's mountall won't run if we try to prevent it from being mounted at all. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2016-03-07 19:10:58 -08:00
Stéphane Graber	056f8bba0d	Properly update the generated apparmor profiles Some changes happened but the final profiles weren't generated... Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-03-07 15:01:17 -05:00
Stéphane Graber	eab570bcaf	Revert "allow cgroupfs mounts under /sys/fs/cgroup" This reverts commit `833bf9c2b2`. This change wasn't actually safe and is now superseded by the cgns profile. Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-03-07 15:00:05 -05:00
Serge Hallyn	787ff6e2d2	allow cgroup mounting in nesting profile Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2016-02-21 20:45:04 -08:00
Serge Hallyn	dc76ac7ab5	add lxc-default-cgns profile This isn't safe for privileged containers which do not use cgroup namespaces, but is required for systemd containers with cgroup namespaces. So create a new profile for it which lxc will use as the default when it knows it can. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2016-02-21 20:44:59 -08:00
Stéphane Graber	97f9856db3	Allow cgroupfs remount by systemd Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-02-15 20:08:25 -05:00
Stéphane Graber	15966fd086	Allow sysfs remount by mountall Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2016-02-15 20:03:50 -05:00
Wolfgang Bumiller	4a491a31c2	apparmor: allow binding /run/{,lock/} -> /var/run/{,lock/} Some systems need to be able to bind-mount /run to /var/run and /run/lock to /var/run/lock. (Tested with opensuse 13.1 containers migrated from openvz.) Signed-off-by: Wolfgang Bumiller <w.bumiller@proxmox.com>	2016-02-02 09:18:11 +01:00
Serge Hallyn	833bf9c2b2	allow cgroupfs mounts under /sys/fs/cgroup Systemd needs to be able to do these, and it does not bypass any of our apparmor rules. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2016-01-28 17:49:43 +01:00
Fox Wilson	e2f91e3434	Use /usr/bin/env python3 instead of /usr/bin/python3 project-wide This fixes invocations of certain commands when python3 is installed in a nonstandard path (/usr/local/bin, for example). Signed-off-by: Fox Wilson <2016fwilson@tjhsst.edu>	2015-11-10 15:53:33 -05:00
Serge Hallyn	108b88ce31	Add a nesting.conf which can be included to support nesting containers (v2) Newer kernels have added a new restriction: if /proc or /sys on the host has files or non-empty directories which are over-mounted, and there is no /proc which fully visible, then it assumes there is a "security" reason for this. It prevents anyone in a non-initial user namespace from creating a new proc or sysfs mount. To work around this, this patch adds a new 'nesting.conf' which can be lxc.include'd from a container configuration file. It adds a non-overmounted mount of /proc and /sys under /dev/.lxc, so that the kernel can see that we're not trying to hide things like /proc/uptime. and /sys/devices/virtual/net. If the host adds this to the config file for container w1, then container w1 will support unprivileged child containers. The nesting.conf file also sets the apparmor profile to the with-nesting variant, since that is required anyway. This actually means that supporting nesting isn't really more work than it used to be, just different. Instead of adding lxc.aa_profile = lxc-container-default-with-nesting you now just need to lxc.include = /usr/share/lxc/config/nesting.conf (Look, fewer characters :) Finally, in order to maintain the current apparmor protections on proc and sys, we make /dev/.lxc/{proc,sys} non-read/writeable. We don't need to be able to use them, we're just showing the kernel what's what. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2015-09-21 11:35:29 -04:00
Serge Hallyn	1b0c17462a	apparmor: support lxc.ttydir when bind-mounting ptys Because we now create the ttys from inside the container, we had to add an apparmor rule for start-container to bind-mount /dev/pts/** -> /dev/tty*/. However that's not sufficient if the container sets lxc.ttydir, in which case we need to support mounting onto files in subdirs of /dev. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2015-01-30 01:02:06 +01:00
Serge Hallyn	e8bd4e430b	create lxc.tty ptys from container process Lxc has always created the ptys for use by console and ttys early on from the monitor process. This has some advantages, but also has disadvantages, namely (1) container ptys counting against the max ptys for the host, and (2) not having a /dev/pts/N in the container to pass to getty. (2) was not a problem for us historically because we bind-mounted the host's /dev/pts/N onto a /dev/ttyN in the container. However, systemd hardocdes a check for container_ttys that the path have 'pts/' in it. If it were only for (2) I'd have opted for a systemd patch to check the device major number, but (1) made it worth moving the openpty to the container namespace. So this patch moves the tty creation into the task which becomes the container init. It then passes the fds for the opened ptys back to the monitor over a unix socketpair (for use by lxc-console). The /dev/console is still created in the monitor process, so that it can for instance be used by lxc.logfd. So now if you have a foreground container with lxc.tty = 4, you should end up with one host /dev/pts entry per container rather than 5. And lxc-console now works with systemd containers. Note that if the container init mounts its own devpts over the one mounted by lxc, the tty /dev/pts/n will be hidden. This is ok since it's only systemd that needs it, and systemd won't do that. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2015-01-29 11:46:02 +01:00
Martin Pitt	64b4c7a34b	apparmor: Fix slave bind mounts The permission to make a mount "slave" is spelt "make-slave", not "slave", see https://launchpad.net/bugs/1401619. Also, we need to make all mounts slave, not just the root dir. https://launchpad.net/bugs/1350947	2015-01-08 13:09:37 +01:00
Stéphane Graber	98b745498b	apparmor: Block access to /proc/kcore Just like we block access to mem and kmem, there's no good reason for the container to have access to kcore. Reported-by: Marc Schaefer Signed-off-by: Stéphane Graber <stgraber@ubuntu.com> Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com>	2015-01-05 16:28:33 -05:00
Jamie Strandboge	807f4c9e1e	apparmor: restrict signal and ptrace for processes Restrict signal and ptrace for processes running under the container profile. Rules based on AppArmor base abstraction. Add unix rules for processes running under the container profile. Signed-off-by: Jamie Strandboge <jamie@canonical.com> Acked-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-09-29 12:40:52 -04:00
Serge Hallyn	097796ac6c	apparmor: silence 'silent' mount denials newer lxc uses 'silent' when remounting on shutdown. Silence that denial too Author: Jamie Strandboge <jamie@canonical.com> Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-09-25 10:59:26 -04:00
Martin Pitt	7987eddb9f	apparmor: Allow slave bind mounts Without this, if the system uses shared subtrees by default (like systemd), you get a large stream of lxc-start: Permission denied - Failed to make /<mountpoint> rslave lxc-start: Continuing... with apparmor="DENIED" operation="mount" info="failed flags match" error=-13 profile="/usr/bin/lxc-start" name="/" pid=17284 comm="lxc-start" flags="rw, slave" and eventual failure plus a lot of leftover mounts in the host. https://launchpad.net/bugs/1325468	2014-07-30 16:43:10 +02:00
Jesse Tane	f2f545857c	Apparmor: allow hugetlbfs mounts everywhere Signed-off-by: Jesse Tane <jesse.tane@gmail.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-06-30 17:06:52 -04:00
Serge Hallyn	773bd28258	apparmor: allow writes to sem* and msg* sysctls /proc/sys/kernel/sem* and /proc/sys/kernel/msg* are ipc sysctls which are properly namespaced. Allow writes to them from containers. Reported-by: Dan Kegel <dank@kegel.com> Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-04-29 16:45:16 -05:00
Stéphane Graber	524505b971	apparmor: Use more generic allow rule for pivot Recent fixes in the apparmor kernel code is now making at least the CI environment and quite possibly some others fail due to an invalid path in the pivot_root stanza. So update both lines to allow a more generic pivot_root call for anything in LXC's work directory. Signed-off-by: Stéphane Graber <stgraber@ubuntu.com> Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com>	2014-04-08 14:45:00 -04:00
Stéphane Graber	2a31251cc5	apparmor: Update profiles for current upstream parser Signed-off-by: Stéphane Graber <stgraber@ubuntu.com> Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com>	2014-04-04 17:45:35 -04:00
Serge Hallyn	94a77f3fd8	apparmor: deny writes to most of /proc/sys (v2) Allow writes to kernel.shm, net., kernel/domainname and kernel/hostname, Also fix a bug in the lxc-generate-aa-rules.py script in a path which wasn't being exercised before, which returned a path element rather than its child. Changelog (v2): remove trailing / from block path Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-04-02 09:54:54 -05:00
Serge Hallyn	198b363fff	apparmor: auto-generate the blacklist rules This uses the generate-apparmor-rules.py script I sent out some time ago to auto-generate apparmor rules based on a higher level set of block/allow rules. Add apparmor policy testcase to make sure that some of the paths we expect to be denied (and allowed) write access to are in fact in effect in the final policy. With this policy, libvirt in a container is able to start its default network, which previously it could not. v2: address feedback from stgraber put lxc-generate-aa-rules.py into EXTRA_DIST add lxc-test-apparmor, container-base and container-rules to .gitignore take lxc-test-apparmor out of EXTRA_DIST make lxc-generate-aa-rules.py pep8-compliant don't automatically generate apparmor rules This is only bc we can't be guaranteed that python3 will be available. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-04-01 13:49:43 -04:00
Serge Hallyn	dc8114afd7	apparmor: don't allow mounting cgroupfs by default Leave the line to do it (commented out) as some users may not be using cgmanager, and may in fact still need those mounts. Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com> Acked-by: Stéphane Graber <stgraber@ubuntu.com>	2014-04-01 13:49:39 -04:00
Stéphane Graber	da289b2c34	apparmor: Drop unused profile This profile was never meant to get into git... Signed-off-by: Stéphane Graber <stgraber@ubuntu.com>	2014-02-20 13:51:21 -05:00
Serge Hallyn	c08a0b7c4e	cgmanager: container-base apparmor abstraction: allow mount move Signed-off-by: Serge Hallyn <serge.hallyn@ubuntu.com>	2014-02-03 15:17:43 -06:00
Stéphane Graber	8da250dad4	apparmor: Add profiles Signed-off-by: Stéphane Graber <stgraber@ubuntu.com> Acked-by: Serge E. Hallyn <serge.hallyn@ubuntu.com>	2014-01-16 17:49:23 -05:00

44 Commits