Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
Package: linux-2.6
Version: 2.6.37-2
Severity: normal
-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hi. Log files in /var/log gets loaded with messages seen below (CPU#1
stuck for 63s!), when my computer is supposed to be idle - meaning
computer in not used for 15 minutes and monitor is off. But instead
fans keep spinning like wild and log files growing.
Debian Sid @OptiPlex GX270
mihkel
- -- Package-specific info:
** Version:
Linux version 2.6.37-2-686 (Debian 2.6.37-2) (ben@decadent.org.uk) (gcc version 4.4.5 (Debian 4.4.5-11) ) #1 SMP Sun Feb 27 10:51:32 UTC 2011
auto 6in4
iface 6in4 inet6 v4tunnel
address 2001:7d0:0:f000::1209
netmask 127
endpoint 194.126.101.242
ttl 64
local 192.168.1.64
up ip link set mtu 1280 dev 6in4
up ip route add default via 2001:7d0:0:f000::1208 dev 6in4
** Network status:
*** IP interfaces and addresses:
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
inet 127.0.0.1/8 scope host lo
inet6 ::1/128 scope host
valid_lft forever preferred_lft forever
2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
link/ether 00:0d:56:1c:85:4b brd ff:ff:ff:ff:ff:ff
inet 192.168.1.64/24 brd 192.168.1.255 scope global eth0
inet6 fe80::20d:56ff:fe1c:854b/64 scope link
valid_lft forever preferred_lft forever
3: sit0: <NOARP> mtu 1480 qdisc noop state DOWN
link/sit 0.0.0.0 brd 0.0.0.0
4: 6in4: <POINTOPOINT,NOARP,UP,LOWER_UP> mtu 1280 qdisc noqueue state UNKNOWN
link/sit 192.168.1.64 peer 194.126.101.242
inet6 2001:7d0:0:f000::1209/127 scope global
valid_lft forever preferred_lft forever
inet6 fe80::c0a8:140/128 scope link
valid_lft forever preferred_lft forever
*** Protocol statistics:
Ip:
275339 total packets received
0 forwarded
0 incoming packets discarded
274893 incoming packets delivered
216041 requests sent out
Icmp:
0 ICMP messages received
0 input ICMP message failed.
ICMP input histogram:
1 ICMP messages sent
0 ICMP messages failed
ICMP output histogram:
destination unreachable: 1
IcmpMsg:
OutType3: 1
Tcp:
5078 active connections openings
1 passive connection openings
10 failed connection attempts
352 connection resets received
12 connections established
265923 segments received
207023 segments send out
187 segments retransmited
0 bad segments received.
412 resets sent
Udp:
8931 packets received
1 packets to unknown port received.
0 packet receive errors
8934 packets sent
UdpLite:
TcpExt:
466 TCP sockets finished time wait in fast timer
2 packets rejects in established connections because of timestamp
12415 delayed acks sent
Quick ack mode was activated 808 times
58 packets directly queued to recvmsg prequeue.
81 bytes directly received in process context from prequeue
199722 packet headers predicted
15844 acknowledgments not containing data payload received
31621 predicted acknowledgments
22 congestion windows recovered without slow start after partial ack
59 other TCP timeouts
802 DSACKs sent for old packets
9 DSACKs received
19 connections reset due to unexpected data
286 connections reset due to early user close
34 connections aborted due to timeout
1 times unabled to send RST due to no memory
TCPDSACKIgnoredOld: 5
TCPDSACKIgnoredNoUndo: 1
IpExt:
InBcastPkts: 444
InOctets: 250946170
OutOctets: 21870506
InBcastOctets: 55648
** USB devices:
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Bus 002 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
Bus 003 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
Bus 004 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
Bus 005 Device 001: ID 1d6b:0001 Linux Foundation 1.1 root hub
Bus 002 Device 002: ID 046d:c00e Logitech, Inc. M-BJ58/M-BJ69 Optical Wheel Mouse
Kernel: Linux 2.6.37-2-686 (SMP w/2 CPU cores)
Locale: LANG=et_EE.UTF-8, LC_CTYPE=et_EE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash
Versions of packages linux-image-2.6.37-2-686 depends on:
ii debconf [debconf-2.0] 1.5.38 Debian configuration management sy
ii initramfs-tools [linux-initra 0.98.8 tools for generating an initramfs
ii linux-base 2.6.37-2 Linux image base package
ii module-init-tools 3.12-1 tools for managing Linux kernel mo
Versions of packages linux-image-2.6.37-2-686 recommends:
ii firmware-linux-free 2.6.37-2 Binary firmware for various driver
ii libc6-i686 2.11.2-11 Embedded GNU C Library: Shared lib
Versions of packages linux-image-2.6.37-2-686 suggests:
ii grub 0.97-64 GRand Unified Bootloader (dummy pa
pn linux-doc-2.6.37 <none> (no description available)
Versions of packages linux-image-2.6.37-2-686 is related to:
pn firmware-bnx2 <none> (no description available)
pn firmware-bnx2x <none> (no description available)
pn firmware-ipw2x00 <none> (no description available)
pn firmware-ivtv <none> (no description available)
pn firmware-iwlwifi <none> (no description available)
ii firmware-linux 0.28 Binary firmware for various driver
ii firmware-linux-nonfree 0.28 Binary firmware for various driver
pn firmware-qlogic <none> (no description available)
pn firmware-ralink <none> (no description available)
pn xen-hypervisor <none> (no description available)
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20110305144447.11565.8033.reportbug@debian.lan">ht tp://lists.debian.org/20110305144447.11565.8033.reportbug@debian.lan
03-31-2012, 12:08 PM
Jean Delvare
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
I can reproduce this on an openSUSE 11.4 system (kernel
2.6.37.6-0.11-default) with integrated Intel graphics (82865G, device
ID 0x2572).
mihkel, are you using a KVM switch by any chance? I do, and I have
found that the bug only happens when the KVM is switched to my other
machine. As long as the KVM is switched to the machine with the integrated
Intel graphics, everything is fine. Do you observe the same?
--
Jean Delvare
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20120331140810.2931bbcf@endymion.delvare">http://lists.debian.org/20120331140810.2931bbcf@endymion.delvare
04-02-2012, 10:46 AM
Jean Delvare
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
I have reported the bug upstream and it is being investigated. A patch
working around the issue is already available there:
https://bugzilla.kernel.org/show_bug.cgi?id=43020
--
Jean Delvare
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20120402124628.7ba7d98a@endymion.delvare">http://lists.debian.org/20120402124628.7ba7d98a@endymion.delvare
04-02-2012, 02:07 PM
Jonathan Nieder
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
> I have reported the bug upstream and it is being investigated. A patch
> working around the issue is already available there:
>
> https://bugzilla.kernel.org/show_bug.cgi?id=43020
Thanks much.
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20120402140740.GA3365@burratino">http://lists.debian.org/20120402140740.GA3365@burratino
04-23-2012, 05:35 PM
Jonathan Nieder
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
# regression
severity 616559 important
retitle 616559 i915: [865G] soft lockup in intel_crt_load_detect after some idle time
tags 616559 + patch upstream
quit
Hi,
mihkel wrote:
> Hi. Log files in /var/log gets loaded with messages seen below (CPU#1
> stuck for 63s!), when my computer is supposed to be idle - meaning
> computer in not used for 15 minutes and monitor is off. But instead
> fans keep spinning like wild and log files growing.
[...]
> [26707.796750] Call Trace:
> [26707.796762] [<e90d6fb0>] ? intel_crt_load_detect+0x1f0/0x260 [i915]
> [26707.796774] [<e90d7295>] ? intel_crt_detect+0x275/0x2c4 [i915]
> [26707.796785] [<e8f70cef>] ? status_show+0x0/0x2f [drm]
> [26707.796795] [<e8f70d03>] ? status_show+0x14/0x2f [drm]
> [26707.796800] [<c11c390a>] ? dev_attr_show+0x16/0x32
> [26707.796805] [<c10fc020>] ? sysfs_read_file+0x8c/0xf5
> [26707.796810] [<c10fbf94>] ? sysfs_read_file+0x0/0xf5
> [26707.796815] [<c10ba3aa>] ? vfs_read+0x7c/0xd6
> [26707.796819] [<c10b8a34>] ? do_sys_open+0xb5/0xbe
> [26707.796823] [<c10ba497>] ? sys_read+0x3c/0x60
> [26707.796828] [<c1002f9f>] ? sysenter_do_call+0x12/0x28
> [26707.796833] [<c1280000>] ? init_scattered_cpuid_features+0x31/0xb6
> [26775.997590] BUG: soft lockup - CPU#1 stuck for 61s! [upowerd:7967]
Thanks again. This is said to be fixed by the attached patch, which
comes from Intel's drm-intel-testing branch[1]. That branch does not
feed directly into linux-next but patches there tend to end up in the
drm tree eventually.
The bug is said to be a regression introduced in 2.6.30-rc8. Does
that match your observations?
If you get a chance to test the patch against 3.0.y or 3.2.y, that
would be very helpful.
Hope that helps,
Jonathan
[1] git://people.freedesktop.org/~danvet/drm-intel
From: Daniel Vetter <daniel.vetter@ffwll.ch>
Date: Fri, 20 Apr 2012 21:03:36 +0200
Subject: drm/i915: fixup load-detect on enabled, but not active pipe
Somehow we have a fast-path that tries to avoid going through
the load-detect code when the encode already has a crtc associated.
But this fails horribly when the crtc is off. The load detect pipe
itself manages this case well (and also does not forget to restore the
dpms state), so just rip out this special case.
The issue seems to go back all the way to the commit that originally
introduced load-detection on the vga output:
commit e4a5d54f924ea5ce2913d9d0687d034004816465
Author: Ma Ling <ling.ma@intel.com>
Date: Tue May 26 11:31:00 2009 +0800
drm/i915: Add support for VGA load detection (pre-945).
Bugzilla: https://bugzilla.kernel.org/show_bug.cgi?id=43020
Reported-by: Jean Delvare <khali@linux-fr.org>
Reviewed-by: Chris Wilson <chris@chris-wilson.co.uk>
Signed-Off-by: Daniel Vetter <daniel.vetter@ffwll.ch>
Signed-off-by: Jonathan Nieder <jrnieder@gmail.com>
---
drivers/gpu/drm/i915/intel_crt.c | 29 +++++++++++------------------
1 file changed, 11 insertions(+), 18 deletions(-)
if (I915_HAS_HOTPLUG(dev)) {
if (intel_crt_detect_hotplug(connector)) {
@@ -449,23 +449,16 @@ intel_crt_detect(struct drm_connector *connector, bool force)
return connector->status;
/* for pre-945g platforms use load detect */
- crtc = crt->base.base.crtc;
- if (crtc && crtc->enabled) {
- status = intel_crt_load_detect(crt);
- } else {
- struct intel_load_detect_pipe tmp;
-
- if (intel_get_load_detect_pipe(&crt->base, connector, NULL,
- &tmp)) {
- if (intel_crt_detect_ddc(connector))
- status = connector_status_connected;
- else
- status = intel_crt_load_detect(crt);
- intel_release_load_detect_pipe(&crt->base, connector,
- &tmp);
- } else
- status = connector_status_unknown;
- }
+ if (intel_get_load_detect_pipe(&crt->base, connector, NULL,
+ &tmp)) {
+ if (intel_crt_detect_ddc(connector))
+ status = connector_status_connected;
+ else
+ status = intel_crt_load_detect(crt);
+ intel_release_load_detect_pipe(&crt->base, connector,
+ &tmp);
+ } else
+ status = connector_status_unknown;
return status;
}
--
1.7.10
04-23-2012, 05:48 PM
Jonathan Nieder
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
Jonathan Nieder wrote:
> If you get a chance to test the patch against 3.0.y or 3.2.y, that
> would be very helpful.
Here's a blind backport to 2.6.32.y. I'd be interested to hear
whether the kernel in squeeze is affected, and if so, whether this
patch helps and has no bad side-effects.
04-23-2012, 06:40 PM
Jean Delvare
Bug#616559: BUG: soft lockup - CPU#1 stuck for 63s!
FWIW, I did backport the patch to kernel 2.6.37.6, and it fixed the
issue, with no bad side-effect.
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20120423204046.6dc13385@endymion.delvare">http://lists.debian.org/20120423204046.6dc13385@endymion.delvare