Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
Package: linux-2.6
Version: 2.6.39-3~bpo60+1
Severity: normal
I am experiencing the following issue with a Debian squeeze based server and the most recent squeeze-backports kernel:
I realized that some vstfpd daemons randomly drop connections (sending a FIN right after the initial TCP hand shake was completed). Furthermore, a "Failed to create cgroup nnnn: -17" message is logged by the kernel.
Furthermore, I am observing a steadily increasing number of directories named like pids being created in the root of the cgroup virtual filesystem (mounted at /cgroup). For each connection attempt to a vsftpd daemon a new directory is created. Those directories seem to be never deleted. After a few days of uptime there are about 7,500 directories while there constantly are only about 150 processes running (more or less idling, this server usually has low load).
When stracing vsftpd the call that fails seems to be this one (full output below):
Which makes me believe that those "zombie directories" in /cgroup might conflict with the new pid . The longer the server is up the more likely it becomes that connections are dropped.
Side note: The affected vsftpd daemons are running on a server that also is hosting an LXC-based virtual server. I have experienced a steadily increasing soft IRQ load on the server while a cgroup virtual filesystem being mounted. I have upgraded to the recent squeeze-backports kernel which seems not to suffer from this soft IRQ issue. vsftpd daemons running inside LXC containers do not drop connections.
Below is some information I thought might be useful. If required, I will gladly provide any additional information.
I am using vsftpd 2.3.2-3 which AFAIK is the most recent version available from the squeeze and squeeze-backports repositories.
** Model information
sys_vendor: HP
product_name: ProLiant DL380 G6
product_version:
chassis_vendor: HP
chassis_version:
bios_vendor: HP
bios_version: P62
Kernel: Linux 2.6.39-bpo.2-amd64 (SMP w/8 CPU cores)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash
Versions of packages linux-image-2.6.39-bpo.2-amd64 depends on:
ii debconf [debconf-2.0] 1.5.36.1 Debian configuration management sy
ii initramfs-tools [linux-init 0.99~bpo60+1 tools for generating an initramfs
ii linux-base 3.3~bpo60+1 Linux image base package
ii module-init-tools 3.12-1 tools for managing Linux kernel mo
Versions of packages linux-image-2.6.39-bpo.2-amd64 recommends:
ii firmware-linux-free 2.6.32-35 Binary firmware for various driver
Versions of packages linux-image-2.6.39-bpo.2-amd64 suggests:
ii grub-pc 1.98+20100804-14 GRand Unified Bootloader, version
pn linux-doc-2.6.39 <none> (no description available)
Versions of packages linux-image-2.6.39-bpo.2-amd64 is related to:
ii firmware-bnx2 0.32~bpo60+1 Binary firmware for Broadcom NetXt
pn firmware-bnx2x <none> (no description available)
pn firmware-ipw2x00 <none> (no description available)
pn firmware-ivtv <none> (no description available)
pn firmware-iwlwifi <none> (no description available)
pn firmware-linux <none> (no description available)
ii firmware-linux-nonfree 0.32~bpo60+1 Binary firmware for various driver
pn firmware-qlogic <none> (no description available)
pn firmware-ralink <none> (no description available)
pn xen-hypervisor <none> (no description available)
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20110827103849.4976.9707.reportbug@lenard.weinhard t.biz">http://lists.debian.org/20110827103849.4976.9707.reportbug@lenard.weinhard t.biz
08-27-2011, 12:48 PM
Ben Hutchings
Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
On Sat, 2011-08-27 at 12:38 +0200, Dirk Weinhardt wrote:
> Package: linux-2.6
> Version: 2.6.39-3~bpo60+1
> Severity: normal
>
>
> I am experiencing the following issue with a Debian squeeze based
> server and the most recent squeeze-backports kernel:
>
> I realized that some vstfpd daemons randomly drop connections (sending
> a FIN right after the initial TCP hand shake was completed).
> Furthermore, a "Failed to create cgroup nnnn: -17" message is logged
> by the kernel.
>
> Furthermore, I am observing a steadily increasing number of
> directories named like pids being created in the root of the cgroup
> virtual filesystem (mounted at /cgroup). For each connection attempt
> to a vsftpd daemon a new directory is created. Those directories seem
> to be never deleted. After a few days of uptime there are about 7,500
> directories while there constantly are only about 150 processes
> running (more or less idling, this server usually has low load).
Do they persist if you restart vsftpd?
> When stracing vsftpd the call that fails seems to be this one (full
> output below):
>
> clone(child_stack=0, flags=0x28000000|SIGCHLD) = -1 EEXIST (File
> exists)
>
> Which makes me believe that those "zombie directories" in /cgroup
> might conflict with the new pid . The longer the server is up the more
> likely it becomes that connections are dropped.
>
> Side note: The affected vsftpd daemons are running on a server that
> also is hosting an LXC-based virtual server. I have experienced a
> steadily increasing soft IRQ load on the server while a cgroup virtual
> filesystem being mounted. I have upgraded to the recent
> squeeze-backports kernel which seems not to suffer from this soft IRQ
> issue.
[...]
That is bug #629373.
Ben.
08-27-2011, 01:16 PM
Dirk Weinhardt
Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
Furthermore, I am observing a steadily increasing number of
directories named like pids being created in the root of the cgroup
virtual filesystem (mounted at /cgroup). For each connection attempt
to a vsftpd daemon a new directory is created. Those directories seem
to be never deleted. After a few days of uptime there are about 7,500
directories while there constantly are only about 150 processes
running (more or less idling, this server usually has low load).
Do they persist if you restart vsftpd?
Yes, all those directories persist if vsftp is restarted. The only way
(I am aware of so far) to get rid of those is rebooting the system.
Thanks for your quick response.
Dirk
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 4E58EE40.30107@weinhardt.biz">http://lists.debian.org/4E58EE40.30107@weinhardt.biz
08-28-2011, 03:42 PM
Ben Hutchings
Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
On Sat, 2011-08-27 at 15:16 +0200, Dirk Weinhardt wrote:
> >> Furthermore, I am observing a steadily increasing number of
> >> directories named like pids being created in the root of the cgroup
> >> virtual filesystem (mounted at /cgroup). For each connection attempt
> >> to a vsftpd daemon a new directory is created. Those directories seem
> >> to be never deleted. After a few days of uptime there are about 7,500
> >> directories while there constantly are only about 150 processes
> >> running (more or less idling, this server usually has low load).
> >
> > Do they persist if you restart vsftpd?
>
> Yes, all those directories persist if vsftp is restarted. The only way
> (I am aware of so far) to get rid of those is rebooting the system.
>
> Thanks for your quick response.
Please can you test whether this is fixed in Linux 3.0 (available in
testing and unstable).
Ben.
08-28-2011, 08:40 PM
Dirk Weinhardt
Bug#639453: linux-image-2.6.39-bpo.2-amd64: Dropped Connections and "Failed to create cgroup nnnn: -17" Kernel Message When vsftpd Spawning a New Process
Hi Ben,
Please can you test whether this is fixed in Linux 3.0 (available in
testing and unstable).
I installed the 3.0.0.1 testing kernel on the squeeze box and reran the
test. The number of directories in /cgroup does not increase even after
opening and closing 50 connections to vsftpd (using nagios' check_ftp
command).
xxx:~# uname -a
Linux xxx 3.0.0-1-amd64 #1 SMP Sun Jul 24 02:24:44 UTC 2011 x86_64 GNU/Linux
Would the test need to be done on a system that was entirely upgraded to
testing or even cleanly installed from testing?
I noticed that after upgrading to Linux 3.0 starting any LXC container
fails with this output:
xxx:~# lxc-start -n vm0
lxc-start: No such file or directory - failed to rename cgroup
/cgroup/7587->/cgroup/vm0
lxc-start: failed to spawn 'vm0'
lxc-start: No such file or directory - failed to remove cgroup '/cgroup/vm0'
Is any additional information or testing required?
Dirk
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 4E5AA7A7.3080208@weinhardt.biz">http://lists.debian.org/4E5AA7A7.3080208@weinhardt.biz