Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
severity 648766 important
quit
Kieron Gillespie wrote:
> I've attached some images to this bug message, not sure if they will
> appear
Received; thanks much.
What version of the kernel are you using? Full "dmesg" output from a
normal boot would be useful as well, so we can get to know your
configuration a little better. (Even better is a log from boot until
and including the crash, if you can get one with netconsole[1] or a
serial console[2].)
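For readers who have not used netconsole, here is a minimal sketch of how its module parameter is put together. The field order (source port, source IP, device, target port, target IP, target MAC) follows the kernel's netconsole documentation. The helper name netconsole_param and every address in the example are hypothetical placeholders, not values taken from this system.

```shell
#!/bin/sh
# Build a netconsole= module parameter of the form
#   netconsole=<src-port>@<src-ip>/<dev>,<tgt-port>@<tgt-ip>/<tgt-mac>
# Arguments, in order: src-port src-ip dev tgt-port tgt-ip tgt-mac.
# Nothing is probed; the caller supplies every value.
netconsole_param() {
    printf 'netconsole=%s@%s/%s,%s@%s/%s\n' "$1" "$2" "$3" "$4" "$5" "$6"
}

# Example (hypothetical addresses), run as root on the machine that crashes:
#   modprobe netconsole "$(netconsole_param 6665 192.168.0.10 eth0 \
#       6666 192.168.0.20 00:11:22:33:44:55)"
# A UDP listener on the target machine (netcat, for instance) can then
# record the kernel messages; the listener flags vary between netcat variants.
```

Because the messages leave over the network, anything printed before the network driver is up will not be captured this way.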
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: http://lists.debian.org/20120403160253.GF15589@burratino
04-03-2012, 04:24 PM
Kieron Gillespie
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
Here is the dmesg output from the current system, running Linux 3.2.13
with SMP enabled and tickless disabled.
[ 0.000000] PROMLIB: Sun IEEE Boot Prom 'OBP 4.9.7 2004/05/27 07:31'
[ 0.000000] PROMLIB: Root node compatible:
[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 3.2.13-smp (root@sparc_debian) (gcc version 4.6.3 (Debian 4.6.3-1) ) #3 SMP PREEMPT Mon Apr 2 20:05:10 EDT 2012
[ 0.000000] bootconsole [earlyprom0] enabled
[ 0.000000] ARCH: SUN4U
[ 0.000000] Ethernet address: 00:03:ba:44:d1:39
[ 0.000000] Kernel: Using 2 locked TLB entries for main kernel image.
[ 0.000000] Remapping the kernel... done.
[ 0.000000] OF stdout device is: /pci@1e,600000/isa@7/serial@0,3f8
[ 0.000000] PROM: Built device tree with 94072 bytes of memory.
[ 0.000000] Top of RAM: 0x121fed0000, Total RAM: 0x3fdba000
[ 0.000000] Memory hole size: 73217MB
[ 0.000000] [0000010004000000-fffff80200400000] page_structs=131072 node=0 entry=16/8192
[ 0.000000] [0000010004000000-fffff80200800000] page_structs=131072 node=0 entry=17/8192
[ 0.000000] [0000010024000000-fffff80200c00000] page_structs=131072 node=0 entry=144/8192
[ 0.000000] [0000010024000000-fffff80201000000] page_structs=131072 node=0 entry=145/8192
[ 0.000000] Zone PFN ranges:
[ 0.000000] Normal 0x00100000 -> 0x0090ff68
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[5] active PFN ranges
[ 0.000000] 0: 0x00100000 -> 0x00110000
[ 0.000000] 0: 0x00900000 -> 0x0090f7ff
[ 0.000000] 0: 0x0090f800 -> 0x0090fe85
[ 0.000000] 0: 0x0090ff07 -> 0x0090ff53
[ 0.000000] 0: 0x0090ff5b -> 0x0090ff68
[ 0.000000] On node 0 totalpages: 130781
[ 0.000000] Normal zone: 66047 pages used for memmap
[ 0.000000] Normal zone: 0 pages reserved
[ 0.000000] Normal zone: 64734 pages, LIFO batch:15
[ 0.000000] Booting Linux...
[ 0.000000] CPU CAPS: [flush,stbar,swap,muldiv,v9,ultra3,mul32,div32]
[ 0.000000] CPU CAPS: [v8plus,vis,vis2]
[ 0.000000] PERCPU: Embedded 6 pages/cpu @fffff80201400000 s20096 r8192 d20864 u2097152
[ 0.000000] pcpu-alloc: s20096 r8192 d20864 u2097152 alloc=1*4194304
[ 0.000000] pcpu-alloc: [0] 0 1
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 64734
[ 0.000000] Kernel command line: root=UUID=e0fd1e8e-7fc3-40cf-80e2-b0ed00bffa09 ro ipv6.disable=1
[ 42.881833] audit: initializing netlink socket (disabled)
[ 42.881877] type=2000 audit(0.856:1): initialized
[ 42.928465] HugeTLB registered 4 MB page size, pre-allocated 0 pages
[ 42.929246] VFS: Disk quotas dquot_6.5.2
[ 42.929428] Dquot-cache hash table entries: 1024 (order 0, 8192 bytes)
[ 42.929686] msgmni has been set to 1996
[ 42.930240] alg: No test for stdrng (krng)
[ 42.930349] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 42.930379] io scheduler noop registered
[ 42.930396] io scheduler deadline registered
[ 42.930460] io scheduler cfq registered (default)
[ 42.931785] f008fc3c: ttyS0 at MMIO 0x7fe010003f8 (irq = 23) is a 16550A
[ 42.931809] Console: ttyS0 (SU)
[ 50.855170] console [ttyS0] enabled
[ 50.901097] f0091678: ttyS1 at MMIO 0x7fe010002e8 (irq = 23) is a 16550A
[ 50.989699] [drm] Initialized drm 1.1.0 20060810
[ 51.051731] mousedev: PS/2 mouse device common for all mice
[ 51.125618] rtc_cmos rtc_cmos: rtc core: registered rtc_cmos as rtc0
[ 51.209152] rtc0: no alarms, 114 bytes nvram
[ 51.266303] TCP cubic registered
[ 51.308627] IPv6: Loaded, but administratively disabled, reboot required to enable
[ 51.408135] Mobile IPv6
[ 51.440150] mip6_init: can't add xfrm type(destopt)
[ 51.504210] NET: Registered protocol family 17
[ 51.562555] Registering the dns_resolver key type
[ 51.624496] registered taskstats version 1
[ 51.678728] rtc_cmos rtc_cmos: setting system clock to 2012-04-03 02:57:06 UTC (1333421826)
[ 51.788635] Initializing network drop monitor service
[ 51.897599] udevd[49]: starting version 175
[ 52.077619] tg3.c:v3.121 (November 2, 2011)
[ 52.162218] SCSI subsystem initialized
[ 52.178035] PCI: Enabling device: (0000:00:03.0), cmd 2
[ 52.283552] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 52.306869] usbcore: registered new interface driver usbfs
[ 52.354035] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 52.386928] usbcore: registered new interface driver hub
[ 52.408671] PCI: Enabling device: (0002:01:0b.0), cmd 2
[ 52.411542] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 52.414164] tg3 0000:00:03.0: eth0: Tigon3 [partno(none) rev 1002] (PCI:66MHz:64-bit) MAC address 00:03:ba:44:d1:39
[ 52.414177] tg3 0000:00:03.0: eth0: attached PHY is 5703 (10/100/1000Base-T Ethernet) (WireSpeed[1], EEE[0])
[ 52.414188] tg3 0000:00:03.0: eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0] TSOcap[1]
[ 53.121329] usbcore: registered new device driver usb
[ 53.122529] sym0: No NVRAM, ID 7, Fast-80, LVD, parity checking
[ 53.134833] libata version 3.00 loaded.
[ 53.143587] PCI: Enabling device: (0002:00:0d.0), cmd 5
[ 53.151539] scsi1 : pata_ali
[ 53.309712] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 53.310095] sym0: SCSI BUS has been reset.
[ 53.311120] scsi0 : sym-2.2.3
[ 53.488255] ehci_hcd 0002:01:08.2: EHCI Host Controller
[ 53.489408] scsi2 : pata_ali
[ 53.489661] ata1: PATA max UDMA/100 cmd 0x7fe01000a00 ctl 0x7fe01000a18 bmdma 0x7fe01000a20 irq 29
[ 53.489670] ata2: PATA max UDMA/100 cmd 0x7fe01000a10 ctl 0x7fe01000a08 bmdma 0x7fe01000a28 irq 29
[ 53.490526] PCI: Enabling device: (0001:00:04.1), cmd 147
[ 53.491095] sym1: <1010-66> rev 0x1 at pci 0001:00:04.1 irq 13
[ 53.493411] sym1: No NVRAM, ID 7, Fast-80, LVD, parity checking
[ 53.533786] sym1: SCSI BUS has been reset.
[ 53.533818] scsi3 : sym-2.2.3
[ 53.823948] ata2.00: ATAPI: JLMS XJ-HD166S, D3S4, max UDMA/33
[ 53.823961] ata2.00: WARNING: ATAPI DMA disabled for reliability issues. It can be enabled
[ 53.823969] ata2.00: WARNING: via pata_ali.atapi_dma modparam or corresponding sysfs node.
[ 53.824380] ata2.00: configured for UDMA/33
[ 53.824942] scsi: waiting for bus probes to complete ...
[ 54.496327] firewire_core: created device fw0: GUID 000516000040a1e6, S400
[ 54.499026] ehci_hcd 0002:01:08.2: new USB bus registered, assigned bus number 1
[ 54.519576] ehci_hcd 0002:01:08.2: irq 33, io mem 0x7ff10004000
[ 54.531542] ehci_hcd 0002:01:08.2: USB 2.0 started, EHCI 1.00
[ 54.531612] usb usb1: New USB device found, idVendor=1d6b, idProduct=0002
[ 54.531620] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 54.531627] usb usb1: Product: EHCI Host Controller
[ 54.531633] usb usb1: Manufacturer: Linux 3.2.13-smp ehci_hcd
[ 54.531639] usb usb1: SerialNumber: 0002:01:08.2
[ 55.221992] hub 1-0:1.0: USB hub found
[ 55.273730] hub 1-0:1.0: 5 ports detected
[ 55.332367] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
[ 55.414873] ohci_hcd 0002:00:0a.0: OHCI Host Controller
[ 55.483620] ohci_hcd 0002:00:0a.0: new USB bus registered, assigned bus number 2
[ 55.580947] ohci_hcd 0002:00:0a.0: irq 27, io mem 0x7ff01000000
[ 55.715582] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[ 55.804830] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 55.899881] usb usb2: Product: OHCI Host Controller
[ 55.964451] usb usb2: Manufacturer: Linux 3.2.13-smp ohci_hcd
[ 56.040031] usb usb2: SerialNumber: 0002:00:0a.0
[ 56.101051] hub 2-0:1.0: USB hub found
[ 56.150273] hub 2-0:1.0: 2 ports detected
[ 56.203094] ohci_hcd 0002:00:0b.0: OHCI Host Controller
[ 56.271834] ohci_hcd 0002:00:0b.0: new USB bus registered, assigned bus number 3
[ 56.503567] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[ 56.503576] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 56.503583] usb usb3: Product: OHCI Host Controller
[ 56.503588] usb usb3: Manufacturer: Linux 3.2.13-smp ohci_hcd
[ 56.503594] usb usb3: SerialNumber: 0002:00:0b.0
[ 56.951300] scsi target0:0:0: tagged command queuing enabled, command queue depth 16.
[ 56.953697] hub 3-0:1.0: USB hub found
[ 56.953716] hub 3-0:1.0: 2 ports detected
[ 56.953858] ohci_hcd 0002:01:08.0: OHCI Host Controller
[ 56.953884] ohci_hcd 0002:01:08.0: new USB bus registered, assigned bus number 4
[ 56.953946] ohci_hcd 0002:01:08.0: irq 31, io mem 0x7ff10000000
[ 57.037572] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[ 57.037581] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 57.037588] usb usb4: Product: OHCI Host Controller
[ 57.037594] usb usb4: Manufacturer: Linux 3.2.13-smp ohci_hcd
[ 57.037599] usb usb4: SerialNumber: 0002:01:08.0
[ 57.784094] scsi target0:0:0: Beginning Domain Validation
[ 57.786345] hub 4-0:1.0: USB hub found
[ 57.786365] hub 4-0:1.0: 3 ports detected
[ 57.786518] ohci_hcd 0002:01:08.1: OHCI Host Controller
[ 57.786546] ohci_hcd 0002:01:08.1: new USB bus registered, assigned bus number 5
[ 57.786619] ohci_hcd 0002:01:08.1: irq 32, io mem 0x7ff10002000
[ 57.869565] usb usb5: New USB device found, idVendor=1d6b, idProduct=0001
[ 57.869575] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 57.869582] usb usb5: Product: OHCI Host Controller
[ 57.869588] usb usb5: Manufacturer: Linux 3.2.13-smp ohci_hcd
[ 57.869594] usb usb5: SerialNumber: 0002:01:08.1
[ 58.585508] hub 5-0:1.0: USB hub found
[ 58.589623] scsi target0:0:0: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 31)
[ 58.612097] scsi target0:0:1: Ending Domain Validation
[ 59.249666] hub 5-0:1.0: 2 ports detected
[ 59.303519] usb 3-1: new full-speed USB device number 2 using ohci_hcd
[ 59.560956] usb 3-1: New USB device found, idVendor=046d, idProduct=c52f
[ 59.649053] usb 3-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[ 59.742833] usb 3-1: Product: USB Receiver
[ 59.796771] usb 3-1: Manufacturer: Logitech
[ 59.878439] input: Logitech USB Receiver as /devices/root/f007134c/pci0002:00/0002:00:0b.0/usb3/3-1/3-1:1.0/input/input0
[ 60.021805] generic-usb 0003:046D:C52F.0001: input,hidraw0: USB HID v1.11 Mouse [Logitech USB Receiver] on usb-0002:00:0b.0-1/input0
[ 60.178627] usb 3-2: new low-speed USB device number 3 using ohci_hcd
[ 60.271112] input: Logitech USB Receiver as /devices/root/f007134c/pci0002:00/0002:00:0b.0/usb3/3-1/3-1:1.1/input/input1
[ 60.414413] generic-usb 0003:046D:C52F.0002: input,hiddev0,hidraw1: USB HID v1.11 Device [Logitech USB Receiver] on usb-0002:00:0b.0-1/input1
[ 60.581879] usbcore: registered new interface driver usbhid
[ 60.655259] usbhid: USB HID core driver
[ 60.732738] usb 3-2: New USB device found, idVendor=0b38, idProduct=0010
[ 60.820807] usb 3-2: New USB device strings: Mfr=0, Product=0, SerialNumber=0
[ 60.927141] input: HID 0b38:0010 as /devices/root/f007134c/pci0002:00/0002:00:0b.0/usb3/3-2/3-2:1.0/input/input2
[ 61.061540] generic-usb 0003:0B38:0010.0003: input,hidraw2: USB HID v1.10 Keyboard [HID 0b38:0010] on usb-0002:00:0b.0-2/input0
[ 61.223914] input: HID 0b38:0010 as /devices/root/f007134c/pci0002:00/0002:00:0b.0/usb3/3-2/3-2:1.1/input/input3
[ 61.357955] generic-usb 0003:0B38:0010.0004: input,hidraw3: USB HID v1.10 Device [HID 0b38:0010] on usb-0002:00:0b.0-2/input1
[ 62.854697] scsi 2:0:0:0: CD-ROM JLMS XJ-HD166S D3S4 PQ: 0 ANSI: 5
Archive: http://lists.debian.org/4F7B2432.2040900@gmail.com
04-03-2012, 05:29 PM
Jonathan Nieder
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
Kieron Gillespie wrote:
> Here are the dmesg output from the current system running Linux
> 3.2.13 with SMP enabled with tickless disabled.
Great.
Is this reproducible without nouveau? It might be possible to test
by putting
blacklist nouveau
in /etc/modprobe.d/kg-disable-nouveau.conf and booting in "recovery
mode" so X doesn't get started. That might mean it continues to
access the console using the PROM or it might mean there is no
console output at all and one has to operate "blind" or using ssh;
please forgive my ignorance.
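As a rough illustration of the blacklist step, the sketch below writes the one-line configuration file. The helper name blacklist_module and its optional directory argument are hypothetical; the directory defaults to /etc/modprobe.d and can be pointed elsewhere for a dry run.

```shell
#!/bin/sh
# blacklist_module MODULE [CONF_DIR]
# Write "blacklist MODULE" to CONF_DIR/kg-disable-MODULE.conf and print
# the path of the file written. CONF_DIR defaults to /etc/modprobe.d.
blacklist_module() {
    module=$1
    conf_dir=${2:-/etc/modprobe.d}
    conf_file="$conf_dir/kg-disable-$module.conf"
    printf 'blacklist %s\n' "$module" > "$conf_file" && echo "$conf_file"
}

# Example (as root):
#   blacklist_module nouveau
# After rebooting, "lsmod | grep nouveau" printing nothing suggests the
# module was not loaded.
```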
Thanks,
Jonathan
Archive: http://lists.debian.org/20120403172931.GG15589@burratino
04-03-2012, 11:05 PM
Kieron Gillespie
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
I went one step further and removed the NVIDIA card altogether. I
connected the system through a serial line and then put a very heavy
load on it.
I had memtester running twice with 480MB, testing for 100 loops, which
puts a lot of stress on the CPUs and memory. But that normally won't
kill it; as I said, I ran these for 16 hours straight and the system did
fine, even with the nouveau driver loaded. What normally triggers the
fault is something like this:
cat /dev/sda > /dev/null
Within 10 minutes of doing this the system hangs. Though I wasn't able
to capture the error message over the serial connection, the hang looks
very similar to how it normally fails. The system normally crashes when
a heavy load is placed on the disk drives, whether reading or writing.
Most of my errors occur when I am trying to install packages, which
makes sense.
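The read load above can be made easier to correlate with a hang by reading in fixed-size chunks and printing progress, so that the last line on a serial console shows roughly how far the read got. This is only a sketch: read_stress is a hypothetical helper, the default chunk size and count are arbitrary, and it assumes a dd that accepts bs=1M (GNU dd does).

```shell
#!/bin/sh
# read_stress DEVICE [CHUNK_MB] [CHUNKS]
# Sequentially read CHUNKS chunks of CHUNK_MB megabytes from DEVICE,
# discarding the data and printing one progress line per chunk.
# Returns non-zero as soon as a dd invocation fails.
read_stress() {
    dev=$1
    chunk_mb=${2:-64}
    chunks=${3:-16}
    i=0
    while [ "$i" -lt "$chunks" ]; do
        # skip= is counted in bs-sized blocks, so each pass starts where
        # the previous one stopped.
        dd if="$dev" of=/dev/null bs=1M count="$chunk_mb" \
            skip=$((i * chunk_mb)) 2>/dev/null || return 1
        i=$((i + 1))
        echo "chunk $i/$chunks read OK"
    done
}

# Example (as root; /dev/sda is the disk named in the report):
#   read_stress /dev/sda 64 1000
```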
I've attached the dmesg from boot, and I've also included the message
log. There are a lot of reboots in the message log, but you can see that
there is almost no warning before the system locks up and needs to be
rebooted.
So it would seem that any time there is very heavy traffic on the hard
drives we get this error. Even though nouveau spits out a lot of
warnings, it does not look like the cause of the problem.
-Kieron
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
I have also noticed that, if I am reading the traces correctly, in both
of my cases, in the original bug submitter's case, and in a report
posted on old.nabble.com, the crash always seems to happen when one CPU
is in cheetah_xcall_deliver and the other CPU is at the same instruction
in tl0_irq15. Here is a link to the post.
That poster thinks it has to do with ext4 stability issues, but I can
rule that out completely because my system does not have a single ext4
file system on it.
Archive: http://lists.debian.org/4F7B9C53.6070302@gmail.com
04-06-2012, 03:00 AM
Jonathan Nieder
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
found 648766 linux-2.6/3.2.13-1
found 648766 linux-2.6/3.2.14-1
# 3.3.1
found 648766 linux-2.6/3.3-1~experimental.1
tags 648766 + upstream
quit
Kieron Gillespie wrote:
> Now with that said I can't seem to crash the 2.6.32 kernel in the
> same way with SMP off, haven't tried with SMP on yet, but I have a
> feeling that will work fine as well. So this seems like it's some
> sort of regression in the linux kernel.
Thanks, Kieron. Just for reference: if you want to bisect through
pre-compiled kernels to narrow down which version introduced trouble,
you can find some at <http://snapshot.debian.org/package/linux-2.6/>.
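Bisecting through packaged kernels amounts to a binary search over an ordered list of versions, with one boot-and-stress test per step. A rough sketch follows. The names bisect_versions and ask_operator are hypothetical, and the search assumes the first version listed is known good, the last is known bad, and the problem does not come and go in between.

```shell
#!/bin/sh
# bisect_versions TEST_CMD VERSION...
# VERSIONs are ordered oldest to newest; the first is assumed good and the
# last assumed bad. TEST_CMD is called with one version and must return 0
# if that kernel survived the test. Prints the first version found bad.
bisect_versions() {
    test_cmd=$1
    shift
    lo=1      # position of the newest version known to be good
    hi=$#     # position of the oldest version known to be bad
    while [ $((hi - lo)) -gt 1 ]; do
        mid=$(( (lo + hi) / 2 ))
        eval "ver=\${$mid}"
        if "$test_cmd" "$ver"; then
            lo=$mid
        else
            hi=$mid
        fi
    done
    eval "echo \${$hi}"
}

# Each step needs a reboot into the kernel under test, so in practice the
# test command might simply ask the person running the bisection:
ask_operator() {
    printf 'Did kernel %s survive the disk read test? [y/n] ' "$1" >&2
    read -r answer
    [ "$answer" = y ]
}

# Example (version list is only a placeholder ordering):
#   bisect_versions ask_operator 2.6.32 3.0.0 3.2.13
```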
Ciao,
Jonathan
Archive: http://lists.debian.org/20120406030022.GA2508@burratino
04-06-2012, 03:00 AM
Kieron Gillespie
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
So what have I learned after lots of test cases?
With SMP on or off, and with the nouveau driver loaded or not, I get the
same unstable behavior and crashes on Linux kernels 3.2.13, 3.2.14 and
3.3.1. The tests covered only one CPU plugged in, both CPUs plugged in,
SMP on and off, the NVIDIA graphics card plugged in and on, the XVR-1200
graphics card plugged in and on, and no graphics card at all. I still
get the same errors. I can confirm that the system is very likely to
crash when the hard drive is read from heavily.
The system can run memory tests for 16+ hours utilizing 100% of both
CPUs without error, but the moment you do a cat /dev/sda > /dev/null you
will normally see the kernel panic within a few minutes.
I have one more idea for a test: plugging in a PATA disk drive, loading
Linux on that, and seeing whether these kernel versions still crash,
since the current hard drives are SCSI.
Now with that said I can't seem to crash the 2.6.32 kernel in the same
way with SMP off. I haven't tried with SMP on yet, but I have a feeling
that will work fine as well. So this seems like some sort of regression
in the Linux kernel, which is very sad. I have a lot more images of
various kernel panic traces, but they are all very similar to the ones
already posted. I am going to start looking into what changes were made
to the SPARC-specific parts of the kernel since 2.6.32 and try to
isolate something.
This seems like a real kernel bug.
-Kieron
Archive: http://lists.debian.org/4F7E5C59.6070803@gmail.com
04-06-2012, 02:38 PM
Kieron Gillespie
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
I am now testing one major kernel version at a time, and on 3.0.0-1 I
got an interesting error when I ran my brutality test on the system:
"sd 0:0:0:0: ABORT operation complete."
I wonder if this is a symptom of the problem as well. The
cat /dev/sda > /dev/null process was killed with a segfault after this
error popped up. I have included the entire output from dmesg. I am
still looking for the kernel version that introduced the panic.
[ 0.000000] PROMLIB: Sun IEEE Boot Prom 'OBP 4.9.7 2004/05/27 07:31'
[ 0.000000] PROMLIB: Root node compatible:
[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 3.0.0 (root@BigRed) (gcc version 4.4.5 (Debian 4.4.5-8) ) #1 SMP Fri Apr 6 00:19:17 EDT 2012
[ 0.000000] bootconsole [earlyprom0] enabled
[ 0.000000] ARCH: SUN4U
[ 0.000000] Ethernet address: 00:03:ba:44:d1:39
[ 0.000000] Kernel: Using 2 locked TLB entries for main kernel image.
[ 0.000000] Remapping the kernel... done.
[ 0.000000] OF stdout device is: /pci@1f,700000/SUNW,XVR-600@2
[ 0.000000] PROM: Built device tree with 95808 bytes of memory.
[ 0.000000] Top of RAM: 0x121feb2000, Total RAM: 0x3fd8e000
[ 0.000000] Memory hole size: 73217MB
[ 0.000000] [0000010004000000-fffff80200400000] page_structs=131072 node=0 entry=16/8192
[ 0.000000] [0000010004000000-fffff80200800000] page_structs=131072 node=0 entry=17/8192
[ 0.000000] [0000010024000000-fffff80200c00000] page_structs=131072 node=0 entry=144/8192
[ 0.000000] [0000010024000000-fffff80201000000] page_structs=131072 node=0 entry=145/8192
[ 0.000000] Zone PFN ranges:
[ 0.000000] Normal 0x00100000 -> 0x0090ff59
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[8] active PFN ranges
[ 0.000000] 0: 0x00100000 -> 0x00110000
[ 0.000000] 0: 0x00900000 -> 0x0090f7ff
[ 0.000000] 0: 0x0090f800 -> 0x0090fe64
[ 0.000000] 0: 0x0090fee6 -> 0x0090feea
[ 0.000000] 0: 0x0090feed -> 0x0090fef1
[ 0.000000] 0: 0x0090fef3 -> 0x0090fef5
[ 0.000000] 0: 0x0090fef7 -> 0x0090ff45
[ 0.000000] 0: 0x0090ff4d -> 0x0090ff59
[ 0.000000] On node 0 totalpages: 130759
[ 0.000000] Normal zone: 66047 pages used for memmap
[ 0.000000] Normal zone: 0 pages reserved
[ 0.000000] Normal zone: 64712 pages, LIFO batch:15
[ 0.000000] Booting Linux...
[ 0.000000] PERCPU: Embedded 5 pages/cpu @fffff80201400000 s11840 r8192 d20928 u2097152
[ 0.000000] pcpu-alloc: s11840 r8192 d20928 u2097152 alloc=1*4194304
[ 0.000000] pcpu-alloc: [0] 0 1
[ 0.000000] Built 1 zonelists in Zone order, mobility grouping on. Total pages: 64712
[ 0.000000] Kernel command line: root=/dev/sda2 ro
[ 0.000000] PID hash table entries: 2048 (order: 1, 16384 bytes)
[ 0.000000] Dentry cache hash table entries: 65536 (order: 6, 524288 bytes)
[ 0.000000] Inode-cache hash table entries: 32768 (order: 5, 262144 bytes)
[ 0.000000] Memory: 1012624k available (3592k kernel code, 1368k data, 224k init) [fffff80000000000,000000121feb2000]
[ 0.000000] SLUB: Genslabs=16, HWalign=32, Order=0-3, MinObjects=0, CPUs=2, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] CONFIG_RCU_FANOUT set to non-default value of 32
[ 0.000000] NR_IRQS:255
[ 0.000000] clocksource: mult[53555555] shift[24]
[ 0.000000] clockevent: mult[3126e98] shift[32]
[ 0.000000] Console: colour dummy device 80x25
[ 0.000000] console [tty0] enabled, bootconsole disabled
[ 42.069076] Calibrating delay using timer specific routine.. 24.00 BogoMIPS (lpj=48002)
[ 42.069093] pid_max: default: 32768 minimum: 301
[ 42.069222] Security Framework initialized
[ 42.069241] SELinux: Disabled at boot.
[ 42.069286] Mount-cache hash table entries: 512
[ 42.069792] Initializing cgroup subsys cpuacct
[ 42.069843] Initializing cgroup subsys devices
[ 42.069852] Initializing cgroup subsys freezer
[ 42.069860] Initializing cgroup subsys net_cls
[ 42.070062] Performance events: Supported PMU type is 'ultra3i'
[ 42.071682] CPU 0: synchronized TICK with master CPU (last diff 0 cycles, maxerr 6 cycles)
[ 42.071692] Brought up 2 CPUs
[ 42.071722] Testing NMI watchdog ... OK.
[ 42.152193] devtmpfs: initialized
[ 42.152688] print_constraints: dummy:
[ 42.152826] NET: Registered protocol family 16
[ 42.156708] /pci@1c,600000: TOMATILLO PCI Bus Module ver[4:0]
[ 42.156726] /pci@1c,600000: PCI IO[7ce01000000] MEM[7cf00000000]
[ 42.158679] PCI: Scanning PBM /pci@1c,600000
[ 42.158823] pci 0000:00:03.0: PME# supported from D3hot
[ 42.158833] pci 0000:00:03.0: PME# disabled
[ 42.159022] /pci@1d,700000: TOMATILLO PCI Bus Module ver[4:0]
[ 42.159037] /pci@1d,700000: PCI IO[7c601000000] MEM[7c700000000]
[ 42.160969] PCI: Scanning PBM /pci@1d,700000
[ 42.161130] pci 0001:00:04.0: supports D1 D2
[ 42.161180] pci 0001:00:04.1: supports D1 D2
[ 42.161364] /pci@1e,600000: TOMATILLO PCI Bus Module ver[4:0]
[ 42.161385] /pci@1e,600000: PCI IO[7fe01000000] MEM[7ff00000000]
[ 42.163328] PCI: Scanning PBM /pci@1e,600000
[ 42.163514] pci 0002:00:06.0: quirk: [io 0x7fe01000800-0x7fe0100083f] claimed by ali7101 ACPI
[ 42.163544] pci 0002:00:06.0: quirk: [io 0x7fe01000600-0x7fe0100061f] claimed by ali7101 SMB
[ 42.163620] pci 0002:00:08.0: supports D1 D2
[ 42.163627] pci 0002:00:08.0: PME# supported from D2 D3hot D3cold
[ 42.163635] pci 0002:00:08.0: PME# disabled
[ 42.163690] pci 0002:00:0a.0: PME# supported from D3cold
[ 42.163696] pci 0002:00:0a.0: PME# disabled
[ 42.163754] pci 0002:00:0b.0: PME# supported from D3cold
[ 42.163761] pci 0002:00:0b.0: PME# disabled
[ 42.163950] pci 0002:01:08.0: supports D1 D2
[ 42.163956] pci 0002:01:08.0: PME# supported from D0 D1 D2 D3hot D3cold
[ 42.163964] pci 0002:01:08.0: PME# disabled
[ 42.164034] pci 0002:01:08.1: supports D1 D2
[ 42.164040] pci 0002:01:08.1: PME# supported from D0 D1 D2 D3hot D3cold
[ 42.164048] pci 0002:01:08.1: PME# disabled
[ 42.164117] pci 0002:01:08.2: supports D1 D2
[ 42.164123] pci 0002:01:08.2: PME# supported from D0 D1 D2 D3hot
[ 42.164131] pci 0002:01:08.2: PME# disabled
[ 42.164200] pci 0002:01:0b.0: supports D1 D2
[ 42.164206] pci 0002:01:0b.0: PME# supported from D0 D1 D2 D3hot
[ 42.164214] pci 0002:01:0b.0: PME# disabled
[ 42.165051] /pci@1f,700000: TOMATILLO PCI Bus Module ver[4:0]
[ 42.165071] /pci@1f,700000: PCI IO[7f601000000] MEM[7f700000000]
[ 42.167014] PCI: Scanning PBM /pci@1f,700000
[ 42.167178] pci 0003:00:02.0: supports D2
[ 42.169200] bio: create slab <bio-0> at 0
[ 42.169521] vgaarb: loaded
[ 42.170021] /pci@1e,600000/isa@7/rtc@0,70: RTC regs at 0x7fe01000070
[ 42.170519] Switching to clocksource stick
[ 42.171318] Switched to NOHz mode on CPU #0
[ 42.173039] Switched to NOHz mode on CPU #1
[ 42.177422] NET: Registered protocol family 2
[ 42.177624] IP route cache hash table entries: 4096 (order: 2, 32768 bytes)
[ 42.178172] TCP established hash table entries: 16384 (order: 5, 262144 bytes)
[ 42.179001] TCP bind hash table entries: 16384 (order: 5, 262144 bytes)
[ 42.179796] TCP: Hash tables configured (established 16384 bind 16384)
[ 42.179812] TCP reno registered
[ 42.179829] UDP hash table entries: 256 (order: 0, 8192 bytes)
[ 42.179871] UDP-Lite hash table entries: 256 (order: 0, 8192 bytes)
[ 42.180190] NET: Registered protocol family 1
[ 42.180257] pci 0002:00:07.0: Activating ISA DMA hang workarounds
[ 42.180338] PCI: CLS 64 bytes, default 64
[ 42.180450] Unpacking initramfs...
[ 42.760636] Freeing initrd memory: 9823k freed
[ 42.761478] power: Control reg at 7fe01000800
[ 42.761845] chmc: UltraSPARC-IIIi memory controller at /memory-controller@0,0
[ 42.761878] chmc: UltraSPARC-IIIi memory controller at /memory-controller@1,0
[ 42.762217] audit: initializing netlink socket (disabled)
[ 42.762256] type=2000 audit(0.772:1): initialized
[ 42.808149] HugeTLB registered 4 MB page size, pre-allocated 0 pages
[ 42.812594] VFS: Disk quotas dquot_6.5.2
[ 42.812820] Dquot-cache hash table entries: 1024 (order 0, 8192 bytes)
[ 42.813102] msgmni has been set to 1996
[ 42.813683] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 42.813709] io scheduler noop registered
[ 42.813722] io scheduler deadline registered
[ 42.813838] io scheduler cfq registered (default)
[ 42.814374] e3d: Found device at 0003:00:02.0
[ 42.814527] fbcon: e3d (fb0) is primary device
[ 42.875062] Console: switching to colour frame buffer device 160x64
[ 42.933479] f008fc3c: ttyS0 at MMIO 0x7fe010003f8 (irq = 23) is a 16550A
[ 42.934086] f0091678: ttyS1 at MMIO 0x7fe010002e8 (irq = 23) is a 16550A
[ 42.934936] [drm] Initialized drm 1.1.0 20060810
[ 42.935906] Uniform Multi-Platform E-IDE driver
[ 42.936296] ide-gd driver 1.18
[ 42.936749] mousedev: PS/2 mouse device common for all mice
[ 42.937691] rtc_cmos rtc_cmos: rtc core: registered rtc_cmos as rtc0
[ 42.938148] rtc0: no alarms, 114 bytes nvram
[ 42.939179] TCP cubic registered
[ 42.939704] NET: Registered protocol family 10
[ 42.941337] Mobile IPv6
[ 42.941515] NET: Registered protocol family 17
[ 42.941831] Registering the dns_resolver key type
[ 42.942363] registered taskstats version 1
[ 42.943108] rtc_cmos rtc_cmos: setting system clock to 2012-04-06 14:18:59 UTC (1333721939)
[ 42.943687] Initializing network drop monitor service
[ 42.986758] udev[44]: starting version 164
[ 43.148051] SCSI subsystem initialized
[ 43.149483] tg3.c:v3.119 (May 18, 2011)
[ 43.149830] PCI: Enabling device: (0000:00:03.0), cmd 2
[ 43.160764] alim15x3 0002:00:0d.0: IDE controller (0x10b9:0x5229 rev 0xc4)
[ 43.161279] PCI: Enabling device: (0002:00:0d.0), cmd 5
[ 43.161342] alim15x3 0002:00:0d.0: 100% native mode on irq 29
[ 43.161750] ide0: BM-DMA at 0x7fe01000a20-0x7fe01000a27
[ 43.162156] ide1: BM-DMA at 0x7fe01000a28-0x7fe01000a2f
[ 43.162587] Probing IDE interface ide0...
[ 43.198759] PCI: Enabling device: (0002:01:0b.0), cmd 2
[ 43.199053] usbcore: registered new interface driver usbfs
[ 43.199515] usbcore: registered new interface driver hub
[ 43.201129] PCI: Enabling device: (0001:00:04.0), cmd 147
[ 43.201710] sym0: <1010-66> rev 0x1 at pci 0001:00:04.0 irq 12
[ 43.205740] usbcore: registered new device driver usb
[ 43.209185] ehci_hcd: USB 2.0 'Enhanced' Host Controller (EHCI) Driver
[ 43.219097] sym0: No NVRAM, ID 7, Fast-80, LVD, parity checking
[ 43.260916] sym0: SCSI BUS has been reset.
[ 43.261278] scsi0 : sym-2.2.3
[ 43.261588] firewire_ohci: Added fw-ohci device 0002:01:0b.0, OHCI v1.10, 4 IR + 8 IT contexts, quirks 0x2
[ 43.262933] PCI: Enabling device: (0002:01:08.2), cmd 2
[ 43.262956] ehci_hcd 0002:01:08.2: EHCI Host Controller
[ 43.280120] ehci_hcd 0002:01:08.2: new USB bus registered, assigned bus number 1
[ 43.294547] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 43.296958] PCI: Enabling device: (0001:00:04.1), cmd 147
[ 43.297531] sym1: <1010-66> rev 0x1 at pci 0001:00:04.1 irq 13
[ 43.316770] sym1: No NVRAM, ID 7, Fast-80, LVD, parity checking
[ 43.354552] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 43.368138] ehci_hcd 0002:01:08.2: irq 32, io mem 0x7ff03004000
[ 43.391707] sym1: SCSI BUS has been reset.
[ 43.408268] scsi1 : sym-2.2.3
[ 43.414549] tg3 0000:00:03.0: vpd r/w failed. This is likely a firmware bug on this device. Contact the card vendor for a firmware update.
[ 43.417568] tg3 0000:00:03.0: eth0: Tigon3 [partno(none) rev 1002] (PCI:66MHz:64-bit) MAC address 00:03:ba:44:d1:39
[ 43.417580] tg3 0000:00:03.0: eth0: attached PHY is 5703 (10/100/1000Base-T Ethernet) (WireSpeed[1], EEE[0])
[ 43.417590] tg3 0000:00:03.0: eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0] TSOcap[1]
[ 43.417598] tg3 0000:00:03.0: eth0: dma_rwctrl[763f0000] dma_mask[32-bit]
[ 43.430557] ehci_hcd 0002:01:08.2: USB 2.0 started, EHCI 1.00
[ 43.430629] usb usb1: New USB device found, idVendor=1d6b, idProduct=0002
[ 43.430637] usb usb1: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 43.430643] usb usb1: Product: EHCI Host Controller
[ 43.430649] usb usb1: Manufacturer: Linux 3.0.0 ehci_hcd
[ 43.430654] usb usb1: SerialNumber: 0002:01:08.2
[ 43.431154] hub 1-0:1.0: USB hub found
[ 43.431169] hub 1-0:1.0: 5 ports detected
[ 43.433097] ohci_hcd: USB 1.1 'Open' Host Controller (OHCI) Driver
[ 43.779014] firewire_core: created device fw0: GUID 000516000040a1e6, S400
[ 43.982608] Probing IDE interface ide1...
[ 44.382751] hdc: JLMS XJ-HD166S, ATAPI CD/DVD-ROM drive
[ 44.734714] hdc: host max PIO5 wanted PIO255(auto-tune) selected PIO4
[ 44.734777] hdc: UDMA/33 mode selected
[ 44.752061] ide0 at 0x7fe01000a00-0x7fe01000a07,0x7fe01000a1a on irq 29
[ 44.769955] ide1 at 0x7fe01000a10-0x7fe01000a17,0x7fe01000a0a on irq 29
[ 44.787541] ohci_hcd 0002:00:0a.0: OHCI Host Controller
[ 44.805531] ohci_hcd 0002:00:0a.0: new USB bus registered, assigned bus number 2
[ 44.823384] ohci_hcd 0002:00:0a.0: irq 27, io mem 0x7ff01000000
[ 44.898638] usb usb2: New USB device found, idVendor=1d6b, idProduct=0001
[ 44.916188] usb usb2: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 44.933754] usb usb2: Product: OHCI Host Controller
[ 44.951250] usb usb2: Manufacturer: Linux 3.0.0 ohci_hcd
[ 44.968653] usb usb2: SerialNumber: 0002:00:0a.0
[ 44.986414] hub 2-0:1.0: USB hub found
[ 45.003825] hub 2-0:1.0: 2 ports detected
[ 45.023019] PCI: Enabling device: (0002:00:0b.0), cmd 2
[ 45.023042] ohci_hcd 0002:00:0b.0: OHCI Host Controller
[ 45.040565] ohci_hcd 0002:00:0b.0: new USB bus registered, assigned bus number 3
[ 45.041737] libata version 3.00 loaded.
[ 45.058151] ohci_hcd 0002:00:0b.0: irq 28, io mem 0x7ff02000000
[ 45.134599] usb usb3: New USB device found, idVendor=1d6b, idProduct=0001
[ 45.152299] usb usb3: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 45.169845] usb usb3: Product: OHCI Host Controller
[ 45.187125] usb usb3: Manufacturer: Linux 3.0.0 ohci_hcd
[ 45.204350] usb usb3: SerialNumber: 0002:00:0b.0
[ 45.222017] hub 3-0:1.0: USB hub found
[ 45.239201] hub 3-0:1.0: 2 ports detected
[ 45.256430] PCI: Enabling device: (0002:01:08.0), cmd 2
[ 45.256446] ohci_hcd 0002:01:08.0: OHCI Host Controller
[ 45.264020] ide-cd driver 5.00
[ 45.264472] ide-cd: hdc: ATAPI 48X DVD-ROM drive, 512kB Cache
[ 45.264483] cdrom: Uniform CD-ROM driver Revision: 3.20
[ 45.324840] ohci_hcd 0002:01:08.0: new USB bus registered, assigned bus number 4
[ 45.342479] ohci_hcd 0002:01:08.0: irq 30, io mem 0x7ff03000000
[ 45.346558] usb 2-1: new full speed USB device number 2 using ohci_hcd
[ 45.464625] usb usb4: New USB device found, idVendor=1d6b, idProduct=0001
[ 45.482514] usb usb4: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 45.500347] usb usb4: Product: OHCI Host Controller
[ 45.518195] usb usb4: Manufacturer: Linux 3.0.0 ohci_hcd
[ 45.520463] usb 2-1: New USB device found, idVendor=046d, idProduct=c52f
[ 45.520473] usb 2-1: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[ 45.520479] usb 2-1: Product: USB Receiver
[ 45.520484] usb 2-1: Manufacturer: Logitech
[ 45.605267] usb usb4: SerialNumber: 0002:01:08.0
[ 45.622871] hub 4-0:1.0: USB hub found
[ 45.640321] hub 4-0:1.0: 3 ports detected
[ 45.657338] usb 2-2: new low speed USB device number 3 using ohci_hcd
[ 45.674283] PCI: Enabling device: (0002:01:08.1), cmd 2
[ 45.674302] ohci_hcd 0002:01:08.1: OHCI Host Controller
[ 45.691365] ohci_hcd 0002:01:08.1: new USB bus registered, assigned bus number 5
[ 45.708156] ohci_hcd 0002:01:08.1: irq 31, io mem 0x7ff03002000
[ 45.808609] usb usb5: New USB device found, idVendor=1d6b, idProduct=0001
[ 45.825194] usb usb5: New USB device strings: Mfr=3, Product=2, SerialNumber=1
[ 45.841692] usb usb5: Product: OHCI Host Controller
[ 45.858132] usb usb5: Manufacturer: Linux 3.0.0 ohci_hcd
[ 45.874479] usb usb5: SerialNumber: 0002:01:08.1
[ 45.891220] hub 5-0:1.0: USB hub found
[ 45.907547] hub 5-0:1.0: 2 ports detected
[ 45.930406] usb 2-2: New USB device found, idVendor=0b38, idProduct=0010
[ 45.947118] usb 2-2: New USB device strings: Mfr=0, Product=0, SerialNumber=0
[ 45.992085] input: Logitech USB Receiver as /devices/root/f007134c/pci0002:00/0002:00:0a.0/usb2/2-1/2-1:1.0/input/input0
[ 46.010087] generic-usb 0003:046D:C52F.0001: input,hidraw0: USB HID v1.11 Mouse [Logitech USB Receiver] on usb-0002:00:0a.0-1/input0
[ 46.038680] input: Logitech USB Receiver as /devices/root/f007134c/pci0002:00/0002:00:0a.0/usb2/2-1/2-1:1.1/input/input1
[ 46.058246] generic-usb 0003:046D:C52F.0002: input,hiddev0,hidraw1: USB HID v1.11 Device [Logitech USB Receiver] on usb-0002:00:0a.0-1/input1
[ 46.083952] input: HID 0b38:0010 as /devices/root/f007134c/pci0002:00/0002:00:0a.0/usb2/2-2/2-2:1.0/input/input2
[ 46.103364] generic-usb 0003:0B38:0010.0003: input,hidraw2: USB HID v1.10 Keyboard [HID 0b38:0010] on usb-0002:00:0a.0-2/input0
[ 46.134710] input: HID 0b38:0010 as /devices/root/f007134c/pci0002:00/0002:00:0a.0/usb2/2-2/2-2:1.1/input/input3
[ 46.154765] generic-usb 0003:0B38:0010.0004: input,hidraw3: USB HID v1.10 Device [HID 0b38:0010] on usb-0002:00:0a.0-2/input1
[ 46.175270] usbcore: registered new interface driver usbhid
[ 46.195415] usbhid: USB HID core driver
[ 46.339454] scsi 0:0:0:0: Direct-Access ModusLnk MXJ3735SC800600P M108 PQ: 0 ANSI: 3
[ 46.339481] scsi target0:0:0: tagged command queuing enabled, command queue depth 16.
[ 46.339502] scsi target0:0:0: Beginning Domain Validation
[ 46.344030] scsi target0:0:0: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 31)
[ 46.352225] scsi target0:0:0: Ending Domain Validation
[ 46.353493] scsi 0:0:1:0: Direct-Access ModusLnk MXJ3735SC800600P M108 PQ: 0 ANSI: 3
[ 46.353504] scsi target0:0:1: tagged command queuing enabled, command queue depth 16.
[ 46.353519] scsi target0:0:1: Beginning Domain Validation
[ 46.358019] scsi target0:0:1: FAST-80 WIDE SCSI 160.0 MB/s DT (12.5 ns, offset 31)
[ 46.366508] scsi target0:0:1: Ending Domain Validation
[ 50.235497] sd 0:0:1:0: [sdb] 143571316 512-byte logical blocks: (73.5 GB/68.4 GiB)
[ 50.257371] sd 0:0:0:0: [sda] 143571316 512-byte logical blocks: (73.5 GB/68.4 GiB)
[ 50.259530] sd 0:0:1:0: [sdb] Write Protect is off
[ 50.259540] sd 0:0:1:0: [sdb] Mode Sense: b3 00 00 08
[ 50.260839] sd 0:0:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 50.272837] sdb: sdb1 sdb3
[ 50.349696] sd 0:0:0:0: [sda] Write Protect is off
[ 50.372214] sd 0:0:0:0: [sda] Mode Sense: b3 00 00 08
[ 50.373476] sd 0:0:1:0: [sdb] Attached SCSI disk
[ 50.395654] sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[ 50.432564] sda: sda1 sda2 sda3 sda4
[ 50.463327] sd 0:0:0:0: [sda] Attached SCSI disk
[ 50.690562] EXT3-fs: barriers not enabled
[ 50.813667] kjournald starting. Commit interval 5 seconds
[ 50.813788] EXT3-fs (sda2): mounted filesystem with ordered data mode
[ 51.861601] udev[334]: starting version 164
[ 52.395991] PCI: Enabling device: (0002:00:08.0), cmd 3
[ 55.954502] AC'97 1 does not respond - RESET
[ 55.990505] AC'97 1 access is not valid [0xffffffff], removing mixer.
[ 56.013704] ali mixer 1 creating error.
[ 56.503799] Adding 2996112k swap on /dev/sda4. Priority:-1 extents:1 across:2996112k
[ 56.796695] EXT3-fs (sda2): using internal journal
[ 56.938409] loop: module loaded
[ 58.307505] EXT3-fs: barriers not enabled
[ 58.337349] kjournald starting. Commit interval 5 seconds
[ 58.357947] EXT3-fs (sdb1): using internal journal
[ 58.377874] EXT3-fs (sdb1): mounted filesystem with ordered data mode
[ 59.164654] tg3 0000:00:03.0: eth0: Failed to load firmware "tigon/tg3_tso.bin"
[ 59.184272] tg3 0000:00:03.0: eth0: TSO capability disabled
[ 60.489830] tg3 0000:00:03.0: eth0: No firmware running
[ 60.545984] ADDRCONF(NETDEV_UP): eth0: link is not ready
[ 62.082103] tg3 0000:00:03.0: eth0: Link is up at 100 Mbps, full duplex
[ 62.101803] tg3 0000:00:03.0: eth0: Flow control is on for TX and on for RX
[ 62.121351] ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
[ 62.415094] sshd (1128): /proc/1128/oom_adj is deprecated, please use /proc/1128/oom_score_adj instead.
[ 72.618433] eth0: no IPv6 routers present
[ 463.780930] sd 0:0:0:0: [sda] ABORT operation started
[ 463.957621] scsi target0:0:0: control msgout: 80 20 23 d.
[ 463.967404] sd 0:0:0:0: ABORT operation complete.
[ 463.976539] sd 0:0:0:0: [sda] ABORT operation started
[ 463.985664] sd 0:0:0:0: ABORT operation failed.
[ 463.994629] sd 0:0:0:0: [sda] ABORT operation started
[ 464.002878] sd 0:0:0:0: ABORT operation failed.
[ 464.010359] sd 0:0:0:0: [sda] ABORT operation started
[ 464.018031] sd 0:0:0:0: ABORT operation failed.
[ 464.025813] sd 0:0:0:0: [sda] ABORT operation started
[ 464.033793] sd 0:0:0:0: ABORT operation failed.
[ 464.041894] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050182] sd 0:0:0:0: ABORT operation failed.
[ 464.050188] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050194] sd 0:0:0:0: ABORT operation failed.
[ 464.050199] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050204] sd 0:0:0:0: ABORT operation failed.
[ 464.050210] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050215] sd 0:0:0:0: ABORT operation failed.
[ 464.050220] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050225] sd 0:0:0:0: ABORT operation failed.
[ 464.050230] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050235] sd 0:0:0:0: ABORT operation failed.
[ 464.050240] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050245] sd 0:0:0:0: ABORT operation failed.
[ 464.050251] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050256] sd 0:0:0:0: ABORT operation failed.
[ 464.050261] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050266] sd 0:0:0:0: ABORT operation failed.
[ 464.050271] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050276] sd 0:0:0:0: ABORT operation failed.
[ 464.050281] sd 0:0:0:0: [sda] ABORT operation started
[ 464.050286] sd 0:0:0:0: ABORT operation failed.
[ 474.048873] sd 0:0:0:0: [sda] ABORT operation started
[ 474.219132] scsi target0:0:0: control msgout: 80 20 7b d.
[ 474.229785] sd 0:0:0:0: ABORT operation complete.
[ 474.240310] sd 0:0:0:0: [sda] DEVICE RESET operation started
[ 474.251039] sd 0:0:0:0: DEVICE RESET operation complete.
[ 474.479644] scsi target0:0:0: control msgout: c.
[ 474.490834] scsi target0:0:0: has been reset
[ 474.501823] sd 0:0:0:0: [sda] BUS RESET operation started
[ 474.515257] sym0: SCSI BUS reset detected.
[ 474.531042] sym0: SCSI BUS has been reset.
[ 474.542381] sd 0:0:0:0: BUS RESET operation complete.
04-06-2012, 02:46 PM
Jonathan Nieder
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
Kieron Gillespie wrote:
> I am right now testing one major kernel version at a time, and on
> the 3.0.0-1 I got
Just to be clear, if each time you test the version halfway between
the newest known-good and oldest known-bad kernel then you only have
to test log(n) kernels instead of n.
That's neither here nor there, though.
Thanks for the update.
Jonathan
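
The halving strategy described above can be sketched in a few lines of Python. This is only an illustration, not a tool used in this bug report: the function name `first_bad`, the `is_bad` callback (standing in for "boot this kernel and see whether the lockup occurs"), and the version labels `v0` to `v15` are all hypothetical. It assumes the list is ordered oldest to newest, the first entry is known good, the last is known bad, and every kernel after the first bad one is also bad.

```python
from typing import Callable, Sequence, Tuple


def first_bad(versions: Sequence[str],
              is_bad: Callable[[str], bool]) -> Tuple[str, int]:
    """Return (oldest bad version, number of kernels actually tested).

    Assumes versions[0] is known good and versions[-1] is known bad,
    so neither endpoint is tested again.
    """
    good, bad = 0, len(versions) - 1
    tests = 0
    while bad - good > 1:
        mid = (good + bad) // 2      # halfway between known-good and known-bad
        tests += 1
        if is_bad(versions[mid]):
            bad = mid                # regression is at mid or earlier
        else:
            good = mid               # regression is after mid
    return versions[bad], tests


if __name__ == "__main__":
    # Hypothetical labels; pretend the lockup first appears in "v9".
    versions = [f"v{i}" for i in range(16)]
    culprit, tests = first_bad(versions, lambda v: int(v[1:]) >= 9)
    print(f"first bad kernel: {culprit}, kernels tested: {tests}")
```

With 16 candidates the loop needs at most 4 test boots, where stepping through the versions one at a time could need up to 14. The approach only helps if the failure reproduces reliably on every bad kernel, which is not a given for an intermittent lockup.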
--
Archive: http://lists.debian.org/20120406144619.GA9933@burratino
04-07-2012, 03:14 AM
Kieron Gillespie
Bug#648766: BUG: NMI Watchdog detected LOCKUP on CPU0
That's what I would have done, except I ran into a problem.
With a completely clean install of Debian and every major version of the
Linux kernel, I haven't run into this error again. Of course, I was
running a bare-bones Debian base with only the ssh server installed. I am
now on kernel version 3.3.0 and have slowly restored everything one piece
at a time, checking after each install whether stability has been broken.
So far nothing.
The only major part I haven't installed that may be the cause is the
non-free tg3 firmware. If I install that and there is still nothing wrong,
then I don't know. Other than that, it is possible that there is some
difference between the kernel.org versions and the ones that I have been
downloading from Debian. Or it could have been some improper
configuration of the kernel at the start.
On 04/06/2012 10:46 AM, Jonathan Nieder wrote:
> Kieron Gillespie wrote:
>> I am right now testing one major kernel version at a time, and on
>> the 3.0.0-1 I got
> Just to be clear, if each time you test the version halfway between
> the newest known-good and oldest known-bad kernel then you only have
> to test log(n) kernels instead of n.
> That's neither here nor there, though.
> Thanks for the update.
> Jonathan
--
Archive: http://lists.debian.org/4F7FB12C.6090807@gmail.com