When running rsnapshot backups from an IBM fibre channel disk system using LVM2 snapshots to a Promise fibre channel disk system, the qla2xxx driver causes a system crash
and reboot. I'm running Lenny with kernel 2.6.22--3-vserver-amd64 and stock Debian qla2xxx module. I've already replaced the Qlogic HBA and the Qlogic switch connecting to the storage. Three other servers with similar hardware running the same Debian version don't have this problem. These events were logged with the ql2xextended_error_logging parameter enabled:
...(25 more port retries)...
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-0: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-4: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-5: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel: qla2xxx 0000:08:01.0: scsi(0:0:0): DEVICE RESET ISSUED.
Feb* 6 13:41:33 hqhost kernel: qla2x00_wait_for_hba_online return_status=0
Is this a hardware problem, a kernel problem, or a qlogic driver problem-- or perhaps all three at once? Thanks in advance,
--
Daniel Bakken
Systems Administrator
Economic Modeling Specialists Inc
Moscow, Idaho
02-08-2008, 04:15 AM
"Daniel Bakken"
qla2xxx mailbox timeout crashes lenny
When running rsnapshot backups from an IBM fibre channel disk system
using LVM2 snapshots to a Promise fibre channel disk system, the
qla2xxx driver causes a system crash
and reboot. I'm running Lenny with kernel 2.6.22--3-vserver-amd64 and
stock Debian qla2xxx module. I've already replaced the Qlogic HBA and
the Qlogic switch connecting to the storage. Three other servers with
similar hardware running the same Debian version don't have this
problem.
...(25 more port retries)...
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-0: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-4: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel:* rport-0:0-5: blocked FC remote port time out: removing target and saving binding
Feb* 6 13:41:33 hqhost kernel: qla2xxx 0000:08:01.0: scsi(0:0:0): DEVICE RESET ISSUED.
Feb* 6 13:41:33 hqhost kernel: qla2x00_wait_for_hba_online return_status=0
Is this a hardware problem, a kernel problem, or a qlogic driver problem-- or perhaps all three at once? Thanks in advance,--
Daniel Bakken
Systems Administrator