09-18-2010, 11:48 AM
Josip Rodin

Bug#597276: qla2xxx_eh_abort(5) -> kernel NULL pointer dereference

Package: linux-2.6
Version: 2.6.32-21~bpo50+1

Hi,

Got this in dmesg on a server:

Sep 18 02:46:52 birdun kernel: [387093.744649] qla2xxx_eh_abort(5): aborting sp ffff8801b58013c0 from RISC. pid=46881441.
Sep 18 02:46:56 birdun kernel: [387093.836909] BUG: unable to handle kernel NULL pointer dereference at 0000000000000040
Sep 18 02:46:56 birdun kernel: [387093.924511] IP: [<ffffffff812f8ea1>] _spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387093.996511] PGD 22d846067 PUD 22d678067 PMD 0
Sep 18 02:46:56 birdun kernel: [387094.048511] Oops: 0002 [#1] SMP
Sep 18 02:46:56 birdun kernel: [387094.086651] last sysfs file: /sys/devices/pci0000:00/0000:00:04.0/0000:13:00.0/host4/rport-4:0-3/target4:0:3/fc_transport/target4:0:3/node_name
Sep 18 02:46:56 birdun kernel: [387094.236007] CPU 4
Sep 18 02:46:56 birdun kernel: [387094.260007] Modules linked in: ipmi_devintf nf_conntrack_ipv6 ip6t_LOG ip6table_filter ip6_tables xt_tcpudp nf_conntrack_ipv4 nf_defrag_ipv4 xt_state nf_conntrack ipt_REJECT ipt_LOG iptable_filter ip_tables x_tables bonding xfs exportfs dm_round_robin dm_multipath scsi_dh loop snd_pcsp snd_pcm snd_timer psmouse ipmi_si rng_core snd soundcore i5000_edac serio_raw hpilo ipmi_msghandler snd_page_alloc edac_core evdev container i5k_amb button processor shpchp pci_hotplug ext3 jbd mbcache dm_mirror dm_region_hash dm_log dm_snapshot dm_mod st ch osst sd_mod crc_t10dif sg sr_mod cdrom ata_piix ata_generic qla2xxx scsi_transport_fc libata scsi_tgt cciss usbhid hid bnx2 ehci_hcd uhci_hcd floppy usbcore nls_base scsi_mod thermal fan thermal_sys
Sep 18 02:46:56 birdun kernel: [387095.008511] Pid: 763, comm: scsi_eh_5 Not tainted 2.6.32-bpo.5-amd64 #1 ProLiant DL360 G5
Sep 18 02:46:56 birdun kernel: [387095.104511] RIP: 0010:[<ffffffff812f8ea1>] [<ffffffff812f8ea1>] _spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387095.204007] RSP: 0018:ffff88022b1c5d70 EFLAGS: 00010082
Sep 18 02:46:56 birdun kernel: [387095.264511] RAX: 0000000000000282 RBX: 0000000000000040 RCX: 000000000000381d
Sep 18 02:46:56 birdun kernel: [387095.348511] RDX: 0000000000010000 RSI: 0000000000000282 RDI: 0000000000000040
Sep 18 02:46:56 birdun kernel: [387095.432258] RBP: ffff8801b58013c0 R08: 00000000000a26c8 R09: 000000000000000a
Sep 18 02:46:56 birdun kernel: [387095.512512] R10: 0000000000000000 R11: ffffffff81673868 R12: 0000000000000001
Sep 18 02:46:56 birdun kernel: [387095.596512] R13: ffff88014066e100 R14: ffff8801b5801e80 R15: 0000000000000000
Sep 18 02:46:56 birdun kernel: [387095.684513] FS: 0000000000000000(0000) GS:ffff880008d00000(0000) knlGS:0000000000000000
Sep 18 02:46:56 birdun kernel: [387095.780002] CS: 0010 DS: 0018 ES: 0018 CR0: 000000008005003b
Sep 18 02:46:56 birdun kernel: [387095.844512] CR2: 0000000000000040 CR3: 000000022d42b000 CR4: 00000000000006e0
Sep 18 02:46:56 birdun kernel: [387095.928512] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Sep 18 02:46:56 birdun kernel: [387096.012511] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Sep 18 02:46:56 birdun kernel: [387096.096005] Process scsi_eh_5 (pid: 763, threadinfo ffff88022b1c4000, task ffff88022ba39c40)
Sep 18 02:46:56 birdun kernel: [387096.192511] Stack:
Sep 18 02:46:56 birdun kernel: [387096.216511] 000000000000381d ffffffffa014cb8b 0000000000000000 0000000000000286
Sep 18 02:46:56 birdun kernel: [387096.300959] <0> ffffffffffffff10 ffff8801b58013c0 0000000000002002 0000000000000286
Sep 18 02:46:56 birdun kernel: [387096.390206] <0> ffff88022df0a900 ffff88022b1c0000 ffff88022b881840 ffffffffa01407e4
Sep 18 02:46:56 birdun kernel: [387096.480511] Call Trace:
Sep 18 02:46:56 birdun kernel: [387096.508511] [<ffffffffa014cb8b>] ? qla24xx_abort_command+0x3f/0x1db [qla2xxx]
Sep 18 02:46:56 birdun kernel: [387096.592513] [<ffffffffa01407e4>] ? qla2xxx_eh_abort+0xf2/0x250 [qla2xxx]
Sep 18 02:46:56 birdun kernel: [387096.672511] [<ffffffffa001ccde>] ? scsi_error_handler+0x302/0x5b5 [scsi_mod]
Sep 18 02:46:56 birdun kernel: [387096.756512] [<ffffffffa001c9dc>] ? scsi_error_handler+0x0/0x5b5 [scsi_mod]
Sep 18 02:46:56 birdun kernel: [387096.836513] [<ffffffff81063601>] ? kthread+0x79/0x81
Sep 18 02:46:56 birdun kernel: [387096.896512] [<ffffffff81011baa>] ? child_rip+0xa/0x20
Sep 18 02:46:56 birdun kernel: [387096.956511] [<ffffffff81063588>] ? kthread+0x0/0x81
Sep 18 02:46:56 birdun kernel: [387097.012512] [<ffffffff81011ba0>] ? child_rip+0x0/0x20
Sep 18 02:46:56 birdun kernel: [387097.072511] Code: 31 d2 89 d0 c3 f0 83 2f 01 79 05 e8 ca ae e9 ff c3 48 83 ec 08 9c 58 0f 1f 44 00 00 48 89 c6 fa 66 0f 1f 44 00 00 ba 00 00 01 00 <f0> 0f c1 17 0f b7 ca c1 ea 10 39 d1 74 07 f3 90 0f b7 0f eb f5
Sep 18 02:46:56 birdun kernel: [387097.292511] RIP [<ffffffff812f8ea1>] _spin_lock_irqsave+0x1a/0x34
Sep 18 02:46:56 birdun kernel: [387097.364514] RSP <ffff88022b1c5d70>
Sep 18 02:46:56 birdun kernel: [387097.404511] CR2: 0000000000000040
Sep 18 02:46:56 birdun kernel: [387097.444512] ---[ end trace b0d3a44141011ebd ]---
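
The key fields of the trace above can be cross-checked mechanically. The following is a minimal sketch, not part of the original report. It assumes the standard x86 page-fault error-code layout (bit 0 = page present, bit 1 = write access, bit 2 = user mode), and the helper names `decode_fault` and `parse_registers` are hypothetical. It shows that the faulting address in CR2 equals RDI, the first argument passed to `_spin_lock_irqsave`. That is consistent with a lock located at offset 0x40 inside a structure whose pointer was NULL, although the trace alone does not say which structure. The marked bytes `<f0> 0f c1 17` in the Code: line appear to decode to `lock xadd %edx,(%rdi)`, which matches a write through RDI with RDX = 0x10000.

```python
import re

# Lines copied from the oops above, with the syslog prefix stripped.
OOPS = """\
Oops: 0002 [#1] SMP
RAX: 0000000000000282 RBX: 0000000000000040 RCX: 000000000000381d
RDX: 0000000000010000 RSI: 0000000000000282 RDI: 0000000000000040
CR2: 0000000000000040 CR3: 000000022d42b000 CR4: 00000000000006e0
"""


def decode_fault(code):
    """Split an x86 page-fault error code into its three low bits."""
    return {
        "page_present": bool(code & 0x1),  # 0 means the page was not mapped
        "write": bool(code & 0x2),         # 1 means the access was a write
        "user_mode": bool(code & 0x4),     # 0 means the fault came from kernel mode
    }


def parse_registers(text):
    """Collect 'NAME: 16-hex-digit value' pairs into a dict of integers."""
    pairs = re.findall(r"\b([A-Z][A-Z0-9]{1,2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


error_code = int(re.search(r"Oops: ([0-9a-f]{4})", OOPS).group(1), 16)
fault = decode_fault(error_code)
regs = parse_registers(OOPS)

print(fault)
print("CR2 == RDI:", regs["CR2"] == regs["RDI"], hex(regs["CR2"]))
```

Read this way, "Oops: 0002" describes a kernel-mode write to an unmapped page, and the address written is the small constant 0x40 rather than a valid kernel pointer.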

I'm filing this as a Debian bug report and also Cc:ing the two upstream
kernel lists as indicated by scripts/get_maintainer.pl, because
drivers/scsi/qla2xxx/qla_mbx.c isn't mentioned anywhere under debian/
so it looks generic.

Please fix this. TIA.

--
2. That which causes joy or happiness.



--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: http://lists.debian.org/20100918114821.GA31065@entuzijast.net
 
09-20-2010, 06:44 AM
Giridhar Malavali

Bug#597276: qla2xxx_eh_abort(5) -> kernel NULL pointer dereference

Hi Josip,

Thanks for letting us know about this problem. Could you please provide logs
with ql2xextended_error_logging enabled? Also, could you please provide more
details about the test case?

-- Giri
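
`ql2xextended_error_logging` is a qla2xxx module parameter. A rough sketch of how one might check its current value and build a persistent setting follows. It assumes the usual sysfs layout `/sys/module/<module>/parameters/<name>` and an `options` line in a file under /etc/modprobe.d/. The helper names are hypothetical, and the value 1 is an assumption, since the request above does not say which value to use. Whether the parameter can also be changed at runtime depends on the driver version.

```python
from pathlib import Path


def read_module_param(module, name, sysfs_root="/sys/module"):
    """Return the current value of a loaded module's parameter, or None."""
    path = Path(sysfs_root) / module / "parameters" / name
    try:
        return path.read_text().strip()
    except OSError:
        # Module not loaded, parameter not exported, or no permission.
        return None


def modprobe_options_line(module, **params):
    """Build an 'options' line for a file under /etc/modprobe.d/."""
    settings = " ".join(f"{key}={value}" for key, value in params.items())
    return f"options {module} {settings}"


if __name__ == "__main__":
    print(read_module_param("qla2xxx", "ql2xextended_error_logging"))
    # The value 1 is an assumption for illustration only.
    print(modprobe_options_line("qla2xxx", ql2xextended_error_logging=1))
```

The options line only takes effect the next time the module is loaded, which on a machine that loads qla2xxx from the initramfs may also mean regenerating the initramfs.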

On 9/18/10 4:48 AM, "Josip Rodin" <joy@debbugs.entuzijast.net> wrote:

>
> so it looks generic.




--
Archive: http://lists.debian.org/C8BC50F2.9A54%giridhar.malavali@qlogic.com
 
09-22-2010, 01:57 PM
Josip Rodin

Bug#597276: qla2xxx_eh_abort(5) -> kernel NULL pointer dereference

On Sun, Sep 19, 2010 at 11:44:50PM -0700, Giridhar Malavali wrote:
> Thanks for letting us know about this problem. Could you please provide logs
> with ql2xextended_error_logging enabled? Also, could you please provide more
> details about the test case?

OK. The machine has this hardware:

% sudo lspci -v
[...]
0b:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 16
I/O ports at 5000 [size=256]
Memory at fdef0000 (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d0000000 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable-
Capabilities: [74] Vital Product Data <?>
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting <?>
Capabilities: [138] Power Budgeting <?>
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

0b:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at 5400 [size=256]
Memory at fdee0000 (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d0040000 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable-
Capabilities: [74] Vital Product Data <?>
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting <?>
Capabilities: [138] Power Budgeting <?>
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

13:00.0 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 17
I/O ports at 6000 [size=256]
Memory at fdff0000 (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d0200000 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable-
Capabilities: [74] Vital Product Data <?>
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting <?>
Capabilities: [138] Power Budgeting <?>
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

13:00.1 Fibre Channel: QLogic Corp. ISP2432-based 4Gb Fibre Channel to PCI Express HBA (rev 02)
Subsystem: Hewlett-Packard Company Device 7041
Flags: bus master, fast devsel, latency 0, IRQ 18
I/O ports at 6400 [size=256]
Memory at fdfe0000 (64-bit, non-prefetchable) [size=16K]
[virtual] Expansion ROM at d0240000 [disabled] [size=256K]
Capabilities: [44] Power Management version 2
Capabilities: [4c] Express Endpoint, MSI 00
Capabilities: [64] Message Signalled Interrupts: Mask- 64bit+ Queue=0/4 Enable-
Capabilities: [74] Vital Product Data <?>
Capabilities: [7c] MSI-X: Enable- Mask- TabSize=16
Capabilities: [100] Advanced Error Reporting <?>
Capabilities: [138] Power Budgeting <?>
Kernel driver in use: qla2xxx
Kernel modules: qla2xxx

Anyway, we had been running an earlier 2.6.32 kernel up until a few days
ago, which gave us this on boot:

[ 2.656008] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[ 2.656188] qla2xxx 0000:0b:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
[ 2.710842] qla2xxx 0000:0b:00.0: Found an ISP2432, irq 16, iobase 0xffffc90000c6c000
[ 2.719526] qla2xxx 0000:0b:00.0: MSI-X: Unsupported ISP2432 (0x2, 0x0).
[ 2.727776] alloc irq_desc for 61 on node -1
[ 2.727778] alloc kstat_irqs on node -1
[ 2.728002] qla2xxx 0000:0b:00.0: irq 61 for MSI/MSI-X
[ 2.728184] qla2xxx 0000:0b:00.0: MSI: Enabled.
[ 2.732040] IRQ 59/cciss0: IRQF_DISABLED is not guaranteed on shared IRQs
[ 2.732058] cciss0: <0x3230> at PCI 0000:06:00.0 IRQ 59 using DAC
[ 2.747326] qla2xxx 0000:0b:00.0: Configuring PCI space...
[ 2.747479] cciss/c0d0: p1
[ 2.755773] qla2xxx 0000:0b:00.0: setting latency timer to 64
[ 2.756280] p2
[ 2.760467] qla2xxx 0000:0b:00.0: FLTL[DEF] = 0x11400.
[ 2.773807] qla2xxx 0000:0b:00.0: FLT[DEF]: boot=0x0 fw=0x20000 vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400
[ 2.787143] qla2xxx 0000:0b:00.0: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 pro=0 upro=0 wrtd=0x9c blk=0x8000.
[ 2.789701] qla2xxx 0000:0b:00.0: Configure NVRAM parameters...
[ 2.832896] qla2xxx 0000:0b:00.0: Verifying loaded RISC code...
[ 2.839290] scsi(0): **** Load RISC code ****
[ 2.857585] qla2xxx 0000:0b:00.0: firmware: requesting ql2400_fw.bin
[ 2.868458] qla2xxx 0000:0b:00.0: FW: Loading via request-firmware...
[ 2.879594] scsi(0): Verifying Checksum of loaded RISC code.
[ 2.904580] scsi(0): Checksum OK, start firmware.
[ 3.116042] qla2xxx 0000:0b:00.0: Allocated (64 KB) for EFT...
[ 3.122520] qla2xxx 0000:0b:00.0: Allocated (1285 KB) for firmware dump...
[ 3.129915] scsi(0): Issue init firmware.
[ 3.148073] DEBUG: detect hba 0 at address = ffff88022c7fa000
[ 3.154329] scsi0 : qla2xxx
[ 3.157756] qla2xxx 0000:0b:00.0:
[ 3.157757] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[ 3.157758] QLogic HPAE312A - PCI-Express Dual Port 4Gb Fibre Channel HBA
[ 3.157759] ISP2432: PCIe (2.5GT/s x4) @ 0000:0b:00.0 hdma+, host#=0, fw=4.04.05 (482)
[ 3.184054] qla2xxx 0000:0b:00.1: PCI INT B -> GSI 17 (level, low) -> IRQ 17
[ 3.192018] bnx2 0000:03:00.0: firmware: requesting bnx2/bnx2-mips-06-5.0.0.j3.fw
[ 3.200121] qla2xxx 0000:0b:00.1: Found an ISP2432, irq 17, iobase 0xffffc90000c42000
[ 3.204317] bnx2 0000:03:00.0: firmware: requesting bnx2/bnx2-rv2p-06-5.0.0.j3.fw
[ 3.208555] eth0: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem f8000000, IRQ 18, node add
[ 3.208589] alloc irq_desc for 19 on node -1
[ 3.208591] alloc kstat_irqs on node -1
[ 3.208601] bnx2 0000:05:00.0: PCI INT A -> GSI 19 (level, low) -> IRQ 19
[ 3.238063] qla2xxx 0000:0b:00.1: MSI-X: Unsupported ISP2432 (0x2, 0x0).
[ 3.246404] alloc irq_desc for 62 on node -1
[ 3.246406] alloc kstat_irqs on node -1
[ 3.246631] qla2xxx 0000:0b:00.1: irq 62 for MSI/MSI-X
[ 3.246812] qla2xxx 0000:0b:00.1: MSI: Enabled.
[ 3.252133] qla2xxx 0000:0b:00.1: Configuring PCI space...
[ 3.258081] qla2xxx 0000:0b:00.1: setting latency timer to 64
[ 3.262767] qla2xxx 0000:0b:00.1: FLTL[DEF] = 0x11400.
[ 3.276093] qla2xxx 0000:0b:00.1: FLT[DEF]: boot=0x0 fw=0x20000 vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400
[ 3.289440] qla2xxx 0000:0b:00.1: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 pro=0 upro=0 wrtd=0x9c blk=0x8000.
[ 3.291987] qla2xxx 0000:0b:00.1: Configure NVRAM parameters...
[ 3.334815] qla2xxx 0000:0b:00.1: Verifying loaded RISC code...
[ 3.340340] hda: DW-224E-V, ATAPI CD/DVD-ROM drive
[ 3.346668] scsi(1): **** Load RISC code ****
[ 3.364062] qla2xxx 0000:0b:00.1: FW: Loading via request-firmware...
[ 3.375163] scsi(1): Verifying Checksum of loaded RISC code.
[ 3.396050] scsi(1): Checksum OK, start firmware.
[ 3.415411] scsi(0): Asynchronous LIP RESET (f700).
[ 3.419402] qla2xxx 0000:0b:00.0: LIP reset occurred (f700).
[ 3.435295] scsi(0): LIP occurred (f700).
[ 3.439286] qla2xxx 0000:0b:00.0: LIP occurred (f700).
[ 3.445234] scsi(0): Asynchronous LIP RESET (f7f7).
[ 3.445237] scsi(0): qla2x00_reset_marker()
[ 3.445244] scsi(0): qla2x00_loop_resync()
[ 3.449227] qla2xxx 0000:0b:00.0: LIP reset occurred (f7f7).
[ 3.470480] scsi(0): Asynchronous P2P MODE received.
[ 3.612042] qla2xxx 0000:0b:00.1: Allocated (64 KB) for EFT...
[ 3.618478] qla2xxx 0000:0b:00.1: Allocated (1285 KB) for firmware dump...
[ 3.625787] scsi(1): Issue init firmware.
[ 3.644066] DEBUG: detect hba 1 at address = ffff88022c01a800
[ 3.650306] scsi1 : qla2xxx
[ 3.653783] qla2xxx 0000:0b:00.1:
[ 3.653784] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[ 3.653785] QLogic HPAE312A - PCI-Express Dual Port 4Gb Fibre Channel HBA
[ 3.653787] ISP2432: PCIe (2.5GT/s x4) @ 0000:0b:00.1 hdma+, host#=1, fw=4.04.05 (482)
[ 3.680882] qla2xxx 0000:13:00.0: PCI INT A -> GSI 17 (level, low) -> IRQ 17
[ 3.684018] bnx2 0000:05:00.0: firmware: requesting bnx2/bnx2-mips-06-5.0.0.j3.fw
[ 3.687598] bnx2 0000:05:00.0: firmware: requesting bnx2/bnx2-rv2p-06-5.0.0.j3.fw
[ 3.691850] eth1: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem fa000000, IRQ 19, node add
[ 3.718373] qla2xxx 0000:13:00.0: Found an ISP2432, irq 17, iobase 0xffffc90000c66000
[ 3.726941] qla2xxx 0000:13:00.0: MSI-X: Unsupported ISP2432 (0x2, 0x0).
[ 3.734215] alloc irq_desc for 63 on node -1
[ 3.734217] alloc kstat_irqs on node -1
[ 3.734228] qla2xxx 0000:13:00.0: irq 63 for MSI/MSI-X
[ 3.734234] qla2xxx 0000:13:00.0: MSI: Enabled.
[ 3.739175] qla2xxx 0000:13:00.0: Configuring PCI space...
[ 3.745371] qla2xxx 0000:13:00.0: setting latency timer to 64
[ 3.747951] qla2xxx 0000:13:00.0: FLTL[DEF] = 0x11400.
[ 3.760661] qla2xxx 0000:13:00.0: FLT[DEF]: boot=0x0 fw=0x20000 vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400
[ 3.773377] qla2xxx 0000:13:00.0: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 pro=0 upro=0 wrtd=0x9c blk=0x8000.
[ 3.775807] qla2xxx 0000:13:00.0: Configure NVRAM parameters...
[ 3.817739] qla2xxx 0000:13:00.0: Verifying loaded RISC code...
[ 3.824173] scsi(2): **** Load RISC code ****
[ 3.840037] qla2xxx 0000:13:00.0: FW: Loading via request-firmware...
[ 3.850640] scsi(2): Verifying Checksum of loaded RISC code.
[ 3.868036] scsi(2): Checksum OK, start firmware.
[ 4.024061] hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
[ 4.024277] hda: UDMA/33 mode selected
[ 4.028586] Probing IDE interface ide1...
[ 4.076533] qla2xxx 0000:13:00.0: Allocated (64 KB) for EFT...
[ 4.083015] qla2xxx 0000:13:00.0: Allocated (1285 KB) for firmware dump...
[ 4.090801] scsi(2): Issue init firmware.
[ 4.108059] DEBUG: detect hba 2 at address = ffff88022c01b800
[ 4.114303] scsi2 : qla2xxx
[ 4.117553] qla2xxx 0000:13:00.0:
[ 4.117554] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[ 4.117555] QLogic HPAE312A - PCI-Express Dual Port 4Gb Fibre Channel HBA
[ 4.117557] ISP2432: PCIe (2.5GT/s x4) @ 0000:13:00.0 hdma+, host#=2, fw=4.04.05 (482)
[ 4.143760] qla2xxx 0000:13:00.1: PCI INT B -> GSI 18 (level, low) -> IRQ 18
[ 4.151751] qla2xxx 0000:13:00.1: Found an ISP2432, irq 18, iobase 0xffffc90000c6a000
[ 4.166339] qla2xxx 0000:13:00.1: MSI-X: Unsupported ISP2432 (0x2, 0x0).
[ 4.173545] alloc irq_desc for 64 on node -1
[ 4.173547] alloc kstat_irqs on node -1
[ 4.173558] qla2xxx 0000:13:00.1: irq 64 for MSI/MSI-X
[ 4.173565] qla2xxx 0000:13:00.1: MSI: Enabled.
[ 4.178495] qla2xxx 0000:13:00.1: Configuring PCI space...
[ 4.184425] qla2xxx 0000:13:00.1: setting latency timer to 64
[ 4.187007] qla2xxx 0000:13:00.1: FLTL[DEF] = 0x11400.
[ 4.199712] qla2xxx 0000:13:00.1: FLT[DEF]: boot=0x0 fw=0x20000 vpd_nvram=0x48000 vpd=0x0 nvram=0x0 fdt=0x11000 flt=0x11400
[ 4.212431] qla2xxx 0000:13:00.1: FDT[MID]: (0xbf/0x80) erase=0x7ffd0352 pro=0 upro=0 wrtd=0x9c blk=0x8000.
[ 4.214869] qla2xxx 0000:13:00.1: Configure NVRAM parameters...
[ 4.256851] qla2xxx 0000:13:00.1: Verifying loaded RISC code...
[ 4.263248] scsi(3): **** Load RISC code ****
[ 4.280036] qla2xxx 0000:13:00.1: FW: Loading via request-firmware...
[ 4.290975] scsi(3): Verifying Checksum of loaded RISC code.
[ 4.312033] scsi(3): Checksum OK, start firmware.
[ 4.376350] scsi(2): Asynchronous LIP RESET (f700).
[ 4.380342] qla2xxx 0000:13:00.0: LIP reset occurred (f700).
[ 4.418657] scsi(2): Asynchronous P2P MODE received.
[ 4.464072] scsi(0): Asynchronous LOOP UP (4 Gbps).
[ 4.468064] qla2xxx 0000:0b:00.0: LOOP UP detected (4 Gbps).
[ 4.475593] scsi(0): Asynchronous PORT UPDATE.
[ 4.479584] scsi(0): Port database changed ffff 0006 0000.
[ 4.486263] scsi(0): F/W Ready - OK
[ 4.490191] scsi(0): fw_state=3 (2b7a, a000, 2, 0) curr time=fffedf6a.
[ 4.497985] scsi(0): Configure loop -- dpc flags =0x112e1
[ 4.503917] scsi(0): RSCN queue entry[0] = [00/000000].
[ 4.509567] scsi(0): device_resync: rscn overflow.
[ 4.521527] scsi(0): RHBA exiting normally.
[ 4.524036] qla2xxx 0000:13:00.1: Allocated (64 KB) for EFT...
[ 4.524119] qla2xxx 0000:13:00.1: Allocated (1285 KB) for firmware dump...
[ 4.524126] scsi(3): Issue init firmware.
[ 4.540560] DEBUG: detect hba 3 at address = ffff88022c01c800
[ 4.540562] scsi3 : qla2xxx
[ 4.540789] qla2xxx 0000:13:00.1:
[ 4.540790] QLogic Fibre Channel HBA Driver: 8.03.01-k6-debug
[ 4.540791] QLogic HPAE312A - PCI-Express Dual Port 4Gb Fibre Channel HBA
[ 4.540793] ISP2432: PCIe (2.5GT/s x4) @ 0000:13:00.1 hdma+, host#=3, fw=4.04.05 (482)
[ 4.582990] scsi(0): RPA exiting normally.
[ 4.590009] scsi(0): RFT_ID exiting normally.
[ 4.596128] scsi(0): RFF_ID exiting normally.
[ 4.602331] scsi(0): RNN_ID exiting normally.
...
[ 5.112512] scsi(2): qla2x00_reset_marker()
[ 5.112515] scsi(2): qla2x00_loop_resync()
[ 5.152377] scsi(0): Asynchronous PORT UPDATE ignored 0000/0004/0600.
[ 5.159802] scsi(0): Asynchronous PORT UPDATE ignored 0000/0007/0b00.
[ 5.180105] scsi(3): Asynchronous LIP RESET (f7f7).
[ 5.184099] qla2xxx 0000:13:00.1: LIP reset occurred (f7f7).
[ 5.197354] scsi(0): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbdc portid=650000.
[ 5.213366] scsi(3): Asynchronous P2P MODE received.
[ 5.251171] scsi(0): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbde portid=650100.
[ 5.263936] usbcore: registered new interface driver hiddev
[ 5.264184] scsi(0): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbd8 portid=650200.
[ 5.267203] scsi(0): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbda portid=650300.
[ 5.289445] scsi(0): GID_PT entry - nn 500110a0001838ed pn 500110a0001838ec portid=651f00.
[ 5.294155] input: HP Virtual Keyboard as /devices/pci0000:00/0000:00:1e.0/0000:01:04.4/usb6/6-1/6-1:1.0/input/input2
[ 5.294221] generic-usb 0003:03F0:1027.0001: input,hidraw0: USB HID v1.01 Keyboard [HP Virtual Keyboard] on usb-0000:01:04.
[ 5.330782] input: HP Virtual Keyboard as /devices/pci0000:00/0000:00:1e.0/0000:01:04.4/usb6/6-1/6-1:1.1/input/input3
[ 5.343555] generic-usb 0003:03F0:1027.0002: input,hidraw1: USB HID v1.01 Mouse [HP Virtual Keyboard] on usb-0000:01:04.4-1
[ 5.344607] scsi(0): GPSC failed, rejected request:
[ 5.344610] 0 1 2 3 4 5 6 7 8 9 Ah Bh Ch Dh Eh Fh
[ 5.344611] --------------------------------------------------------------
[ 5.344613] 01 00 00 00 fa 01 00 00 80 01 00 00 00 0b 00 00
[ 5.344620] scsi(0): GPSC command unsupported, disabling query...
[ 5.344625] scsi(0): device wrap (651f00)
[ 5.344632] scsi(0): LOOP READY
[ 5.344634] scsi(0): qla2x00_loop_resync - end
[ 5.364012] usb 6-2: new full speed USB device using uhci_hcd and address 3
[ 5.415668] scsi(2): Asynchronous LOOP UP (4 Gbps).
[ 5.415671] qla2xxx 0000:13:00.0: LOOP UP detected (4 Gbps).
[ 5.415688] scsi(2): Asynchronous PORT UPDATE ignored ffff/0006/0000.
[ 5.435641] usbcore: registered new interface driver usbhid
[ 5.443899] usbhid: v2.6:USB HID core driver
[ 5.513606] device-mapper: uevent: version 1.0.3
[ 5.515125] usb 6-2: New USB device found, idVendor=03f0, idProduct=1327
[ 5.515129] usb 6-2: New USB device strings: Mfr=1, Product=2, SerialNumber=0
[ 5.515131] usb 6-2: Product: Virtual Hub
[ 5.515133] usb 6-2: Manufacturer: HP
[ 5.515225] usb 6-2: configuration #1 chosen from 1 choice
[ 5.540017] scsi(3): qla2x00_reset_marker()
[ 5.540019] scsi(3): qla2x00_loop_resync()
[ 5.567492] device-mapper: ioctl: 4.15.0-ioctl (2009-04-01) initialised: dm-devel@redhat.com
[ 5.570279] hub 6-2:1.0: USB hub found
[ 5.572479] hub 6-2:1.0: 7 ports detected
[ 5.612811] scsi(2): F/W Ready - OK
[ 5.617050] scsi(2): fw_state=3 (2b2d, 0, 2, 0) curr time=fffee084.
[ 5.625072] scsi(2): Configure loop -- dpc flags =0x112e0
[ 5.631450] scsi(2): RSCN queue entry[0] = [00/000000].
[ 5.637827] scsi(2): device_resync: rscn overflow.
[ 5.645517] scsi(1): qla2x00_loop_resync()
[ 5.648317] scsi(2): RHBA exiting normally.
[ 5.652223] scsi(2): RPA exiting normally.
[ 5.655852] scsi(2): RFT_ID exiting normally.
[ 5.657260] scsi(2): RFF_ID exiting normally.
[ 5.658527] scsi(2): RNN_ID exiting normally.
[ 5.675212] scsi(2): RSNN_NN exiting normally.
[ 5.675216] scsi(2): Asynchronous PORT UPDATE ignored 0000/0004/0600.
[ 5.675233] scsi(2): Asynchronous PORT UPDATE ignored 0000/0007/0b00.
[ 5.712052] scsi(2): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbdd portid=c90000.
[ 5.726277] scsi(2): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbdf portid=c90100.
[ 5.738902] scsi(2): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbd9 portid=c90200.
[ 5.750995] scsi(2): GID_PT entry - nn 50001fe15005fbd0 pn 50001fe15005fbdb portid=c90300.
[ 5.761849] scsi(2): GID_PT entry - nn 500110a000183d19 pn 500110a000183d18 portid=c91f00.
[ 5.772193] kjournald starting. Commit interval 5 seconds
[ 5.772201] EXT3-fs: mounted filesystem with ordered data mode.
[ 5.787784] scsi(2): GPSC failed, rejected request:
[ 5.793486] 0 1 2 3 4 5 6 7 8 9 Ah Bh Ch Dh Eh Fh
[ 5.801455] --------------------------------------------------------------
[ 5.801457] 01 00 00 00 fa 01 00 00 80 01 00 00 00 0b 00 00
[ 5.801464] scsi(2): GPSC command unsupported, disabling query...
[ 5.801470] scsi(2): device wrap (c91f00)
[ 5.801478] scsi(2): LOOP READY
[ 5.801480] scsi(2): qla2x00_loop_resync - end
[ 6.108524] scsi(2:2): Async-login - loop-id=81 portid=c90000 retries=0.
[ 6.108529] scsi(2:3): Async-login - loop-id=82 portid=c90100 retries=0.
[ 6.108534] scsi(2:4): Async-login - loop-id=83 portid=c90200 retries=0.
[ 6.108538] scsi(2:5): Async-login - loop-id=84 portid=c90300 retries=0.
[ 6.117436] scsi(2:2): Async-login complete - iop0=12.
[ 6.117662] scsi(2:3): Async-login complete - iop0=12.
[ 6.119438] scsi(2:4): Async-login complete - iop0=12.
[ 6.121421] scsi(2:5): Async-login complete - iop0=12.
[ 6.148022] scsi(0:2): Async-login - loop-id=81 portid=650000 retries=0.
[ 6.148027] scsi(0:3): Async-login - loop-id=82 portid=650100 retries=0.
[ 6.148032] scsi(0:4): Async-login - loop-id=83 portid=650200 retries=0.
[ 6.148036] scsi(0:5): Async-login - loop-id=84 portid=650300 retries=0.
[ 6.148039] scsi(0): qla2x00_reset_marker()
[ 6.158717] scsi(0:2): Async-login complete - iop0=12.
[ 6.158840] scsi: waiting for bus probes to complete ...
[ 6.158878] scsi(0:3): Async-login complete - iop0=12.
[ 6.164982] scsi(0:4): Async-login complete - iop0=12.
[ 6.164990] scsi(0:5): Async-login complete - iop0=12.
[ 6.314044] scsi(3): Asynchronous LOOP UP (2 Gbps).
[ 6.318037] qla2xxx 0000:13:00.1: LOOP UP detected (2 Gbps).
[ 6.326272] scsi(3): Asynchronous PORT UPDATE ignored ffff/0006/0000.
[ 6.333728] scsi(3): Asynchronous PORT UPDATE ignored 0000/0004/0600.
[ 6.340986] scsi(3): Asynchronous PORT UPDATE ignored 0000/0007/0b00.
[ 6.552562] scsi(3): F/W Ready - OK
[ 6.556812] scsi(3): fw_state=3 (2b58, 4000, 2, 0) curr time=fffee16f.
[ 6.565065] scsi(3): Configure loop -- dpc flags =0x112e0
[ 6.571367] scsi(3): RSCN queue entry[0] = [00/000000].
[ 6.577759] scsi(3): device_resync: rscn overflow.
[ 6.602712] scsi(3): RHBA exiting normally.
[ 6.625644] scsi(3): RPA exiting normally.
[ 6.633996] scsi(3): RFT_ID exiting normally.
[ 6.644306] scsi(3): RFF_ID exiting normally.
[ 6.652941] scsi(3): RNN_ID exiting normally.
[ 6.662957] scsi(3): RSNN_NN exiting normally.
[ 6.750934] scsi(3): GID_PT entry - nn 500110a000183d19 pn 500110a000183d1a portid=010800.
[ 6.767280] scsi(3): GID_PT entry - nn 500507630013d002 pn 500507630053d002 portid=010000.
[ 6.783618] scsi(3): GID_PT entry - nn 500507630013d003 pn 500507630053d003 portid=010100.
[ 6.799930] scsi(3): GID_PT entry - nn 500507630013d001 pn 500507630053d001 portid=020000.
[ 6.816215] scsi(3): GID_PT entry - nn 500507630013d004 pn 500507630053d004 portid=020100.
[ 6.832512] scsi(3): GID_PT entry - nn 500507630013d00a pn 500507630053d00a portid=020800.
[ 6.849402] scsi(3): GID_PT entry - nn 500507630013d00b pn 500507630053d00b portid=020900.
[ 6.849959] udev: starting version 141
[ 6.888055] scsi(3): GID_PT entry - nn 500507630013d009 pn 500507630053d009 portid=020700.
[ 6.917081] scsi(3): GID_PT entry - nn 500507630013d008 pn 500507630053d008 portid=020600.
[ 6.996703] scsi(3): GPSC failed, rejected request:
[ 7.002410] 0 1 2 3 4 5 6 7 8 9 Ah Bh Ch Dh Eh Fh
[ 7.010309] --------------------------------------------------------------
[ 7.018370] 01 00 00 00 fa 01 00 00 80 01 00 00 00 0b 00 00
[ 7.026929] scsi(3): GPSC command unsupported, disabling query...
[ 7.033909] scsi(3): device wrap (020600)
[ 7.038483] scsi(3): LOOP READY
[ 7.042147] scsi(3): qla2x00_loop_resync - end
...
[ 7.544525] scsi(3:2): Async-login - loop-id=81 portid=010000 retries=0.
[ 7.544531] scsi(3:3): Async-login - loop-id=82 portid=010100 retries=0.
[ 7.544536] scsi(3:4): Async-login - loop-id=83 portid=020000 retries=0.
[ 7.544540] scsi(3:5): Async-login - loop-id=84 portid=020100 retries=0.
[ 7.544545] scsi(3:6): Async-login - loop-id=85 portid=020800 retries=0.
[ 7.544549] scsi(3:7): Async-login - loop-id=86 portid=020900 retries=0.
[ 7.544554] scsi(3:8): Async-login - loop-id=87 portid=020700 retries=0.
[ 7.544558] scsi(3:9): Async-login - loop-id=88 portid=020600 retries=0.
[ 7.556056] scsi(3:2): Async-login complete - iop0=312.
[ 7.556065] scsi(3:3): Async-login complete - iop0=312.
...
[ 7.564093] scsi(3:4): Async-login complete - iop0=312.
[ 7.564329] input: PC Speaker as /devices/platform/pcspkr/input/input5
[ 7.573778] scsi(3:6): Async-login complete - iop0=312.
[ 7.574199] scsi(3:5): Async-login complete - iop0=312.
[ 7.581808] scsi(3:7): Async-login complete - iop0=312.
[ 7.581815] scsi(3:8): Async-login complete - iop0=312.
[ 7.581819] scsi(3:9): Async-login complete - iop0=312.
...
[ 8.681156] qla2xxx 0000:0b:00.0: scsi(0:0:0:0): Queue depth adjusted-up to 4.
[ 8.689694] scsi 0:0:0:0: RAID HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.689759] qla2xxx 0000:13:00.0: scsi(2:0:0:0): Queue depth adjusted-up to 4.
[ 8.689977] scsi 2:0:0:0: RAID HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.690671] scsi 2:0:0:1: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.691167] scsi 2:0:0:2: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.691572] scsi 2:0:0:3: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.691996] scsi 2:0:0:4: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.754170] qla2xxx 0000:13:00.1: scsi(3:0:0:0): Queue depth adjusted-up to 4.
[ 8.762663] scsi 2:0:0:5: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.764131] scsi 3:0:0:0: Sequential-Access IBM ULT3580-TD2 73V1 PQ: 0 ANSI: 3
[ 8.767554] qla2xxx 0000:13:00.1: scsi(3:0:1:0): Queue depth adjusted-up to 4.
[ 8.769827] scsi 3:0:1:0: Sequential-Access IBM ULT3580-TD2 73V1 PQ: 0 ANSI: 3
[ 8.773591] qla2xxx 0000:13:00.1: scsi(3:0:2:0): Queue depth adjusted-up to 4.
[ 8.775898] scsi 3:0:2:0: Sequential-Access IBM ULT3580-TD2 73V1 PQ: 0 ANSI: 3
[ 8.816992] scsi 0:0:0:1: Direct-Access HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.817024] sd 2:0:0:2: [sdb] 2147483648 512-byte logical blocks: (1.09 TB/1.00 TiB)
[ 8.817063] sd 2:0:0:4: [sdd] 209715200 512-byte logical blocks: (107 GB/100 GiB)
[ 8.817256] sd 2:0:0:3: [sdc] 524288000 512-byte logical blocks: (268 GB/250 GiB)
[ 8.817287] qla2xxx 0000:13:00.0: scsi(2:0:1:0): Queue depth adjusted-up to 4.
[ 8.817509] sd 2:0:0:4: [sdd] Write Protect is off
[ 8.817516] scsi 2:0:1:0: RAID HP HSV210 6110 PQ: 0 ANSI: 5
[ 8.817531] sd 2:0:0:4: [sdd] Mode Sense: 97 00 10 08
...

I've redacted this liberally; let us know if you need more or less...

It is plugged into a set of HP Brocades which lead to a HP EVA 8k SAN,
and also into a set of IBM Brocades which lead to an IBM Tape Library.
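
The two fabrics can be told apart in the boot log by link speed and by what each port discovers. A small sketch follows, using three "LOOP UP detected" lines copied from the log above; the helper name `link_speeds` is hypothetical. Because the log was trimmed, the absence of a line for a port (for example 0000:0b:00.1) says nothing about that port's state.

```python
import re

# "LOOP UP detected" lines copied from the boot log above.
BOOT_LOG = """\
[ 4.468064] qla2xxx 0000:0b:00.0: LOOP UP detected (4 Gbps).
[ 5.415671] qla2xxx 0000:13:00.0: LOOP UP detected (4 Gbps).
[ 6.318037] qla2xxx 0000:13:00.1: LOOP UP detected (2 Gbps).
"""

LOOP_UP = re.compile(r"qla2xxx (\S+): LOOP UP detected \((\d+) Gbps\)")


def link_speeds(log_text):
    """Map each PCI address to the link speed it reported, in Gbps."""
    return {device: int(gbps) for device, gbps in LOOP_UP.findall(log_text)}


speeds = link_speeds(BOOT_LOG)
for device, gbps in sorted(speeds.items()):
    print(device, gbps, "Gbps")
```

In the log, the 2 Gbps port (0000:13:00.1, scsi host 3) is the one that goes on to find the IBM ULT3580-TD2 tape drives, while hosts 0 and 2 at 4 Gbps find the HP HSV210 devices.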

Then we upgraded to the kernel based on 2.6.32.21, and started getting these
messages:

Sep 20 15:09:39 birdun kernel: [ 269.976207] scsi(2:1:1) UNDERRUN status detected 0x15-0x800. resid=0xee fw_resid=0xee cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 269.976229] scsi(2:1:3) UNDERRUN status detected 0x15-0x800. resid=0xee fw_resid=0xee cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 269.976325] scsi(2:1:3) UNDERRUN status detected 0x15-0x800. resid=0xd6 fw_resid=0xd6 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 269.976576] scsi(2:3:3) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 269.976632] scsi(2:2:3) UNDERRUN status detected 0x15-0x800. resid=0xee fw_resid=0xee cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758487] scsi(4:1:1) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758492] scsi(4:2:1) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758495] scsi(4:2:3) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758499] scsi(4:1:2) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758502] scsi(4:3:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758506] scsi(4:2:2) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758510] scsi(4:2:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758513] scsi(4:3:2) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758517] scsi(4:2:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758521] scsi(4:0:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758525] scsi(4:0:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758532] scsi(2:2:3) UNDERRUN status detected 0x15-0x800. resid=0xd6 fw_resid=0xd6 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758536] scsi(2:0:1) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758540] scsi(2:1:1) UNDERRUN status detected 0x15-0x800. resid=0xd6 fw_resid=0xd6 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758544] scsi(2:1:3) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758548] scsi(2:1:1) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758551] scsi(2:3:1) UNDERRUN status detected 0x15-0x800. resid=0xd6 fw_resid=0xd6 cdb=0x12 os_underflow=0x0
Sep 20 15:09:39 birdun kernel: [ 272.758555] scsi(2:0:3) UNDERRUN status detected 0x15-0x800. resid=0xed fw_resid=0xed cdb=0x12 os_underflow=0x0

This seemed harmless, but I'm mentioning it just in case.
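
For context on these messages: every one of them carries cdb=0x12, which is the SCSI INQUIRY opcode, and an underrun means the target transferred fewer bytes than the command asked for, with `resid` being the shortfall. The sketch below parses three lines copied from the messages above. The helper name `parse_underruns` is hypothetical, and reading the triple in `scsi(a:b:c)` as host, target id and LUN is an assumption about the driver's message format.

```python
import re

# Three of the UNDERRUN messages above, with the syslog prefix stripped.
SAMPLE = """\
scsi(2:1:1) UNDERRUN status detected 0x15-0x800. resid=0xee fw_resid=0xee cdb=0x12 os_underflow=0x0
scsi(4:1:2) UNDERRUN status detected 0x15-0x800. resid=0x2 fw_resid=0x2 cdb=0x12 os_underflow=0x0
scsi(2:2:3) UNDERRUN status detected 0x15-0x800. resid=0xd6 fw_resid=0xd6 cdb=0x12 os_underflow=0x0
"""

# A few well-known SCSI opcodes; only INQUIRY occurs in this thread.
SCSI_OPCODES = {
    0x00: "TEST UNIT READY",
    0x12: "INQUIRY",
    0x28: "READ(10)",
    0x2A: "WRITE(10)",
}

UNDERRUN = re.compile(
    r"scsi\((\d+):(\d+):(\d+)\) UNDERRUN .* resid=(0x[0-9a-f]+) "
    r"fw_resid=0x[0-9a-f]+ cdb=(0x[0-9a-f]+)"
)


def parse_underruns(text):
    """Return one dict per UNDERRUN line with decoded fields."""
    events = []
    for host, target, lun, resid, cdb in UNDERRUN.findall(text):
        opcode = int(cdb, 16)
        events.append({
            "host": int(host),      # assumed meaning of the first number
            "target": int(target),  # assumed meaning of the second number
            "lun": int(lun),        # assumed meaning of the third number
            "resid": int(resid, 16),
            "opcode": SCSI_OPCODES.get(opcode, hex(opcode)),
        })
    return events


events = parse_underruns(SAMPLE)
for event in events:
    print(event)
```

A short INQUIRY response is not by itself an error, since devices may return less data than the allocation length allows, which fits the impression that these messages are harmless.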

A few days later, this BUG happened. At about the same time, the machine had
some other OOM-related issues, so the two might be related.

--
2. That which causes joy or happiness.



--
Archive: http://lists.debian.org/20100922135703.GA12071@entuzijast.net
 
