Bug#579393: linux-image-2.6.32-3-amd64: kernel crashes at booting when QLogic IB7220 firmware not available
On Tue, 2010-04-27 at 16:16 +0200, Sebastian Hegler wrote:
> Package: linux-2.6
> Version: 2.6.32-9
> Severity: normal
>
> Booting this kernel fails when QLogic IBA7220 hardware is present, but
> the firmware image (in package "qlogic-firmware" from the non-free
> repositorys) is not.
>
> Confirmed by removing the card: machine's booting all fine. With the
> IBA board inserted, the machine fails to boot, spitting error messages
> on the screen.
> After installing the "qlogic-firmware" package and re-inserting the
> IBA board, everything works fine.
Sorry about this. Please can you send the error messages, so I can
understand what's going wrong.
Ben.
--
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.
06-11-2010, 02:47 AM
Ben Hutchings
Bug#579393: linux-image-2.6.32-3-amd64: kernel crashes at booting when QLogic IB7220 firmware not available
On Wed, 2010-06-09 at 13:02 +0200, Sebastian Hegler wrote:
> Hi!
>
> Here we go:
>
> ==========================================
> [ 28.007218] ib_ipath 000:07:00.0: firmware: requesting qlogic/sd7220.fw
> [ 28.070446] ib_ipath 000:07:00.0: infinipath0: Failed to load IB SERDES image
> [ 28.070455] ib_ipath 000:07:00.0: infinipath0: Could not initialize SerDes, not usable
> [ 28.070456] ib_ipath 000:07:00.0: infinipath0: Failed to bring up IB link
>
> ...
>
> [ 29.068010] BUG: unable to handle kernel NULL pointer dereference at (null)
> [ 29.072001] IP: [<(null)>] (null)
> [ 29.072001] PGD 23d60c067 PUD 23d958067 PMD 0
> [ 29.072001] Thread overran stack, or stack corrupted
> [ 29.072001] Oops: 0010 [#1] SMP
> [ 29.072001] last sysfs file: /sys/kernel/uevent_seqnum
> [ 29.072001] CPU 0
> [ 29.072001] Modules linked in: snd_pcm snd_timer amd64_edac_mod snd edac_core soundcore ib_ipath i2c_nforce2 shpchp snd_page_alloc edac_mce_amd psmouse evdev i2c_core button pci_hotplug pcspkr serio_raw ib_core processor ext3 jbd dm_mod raid456 md_mod async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx sd_mod crc_t10dif ses enclosure ata_generic usb_hid hid ide_pci_generic ide_core ohci_hcd mptsas mptscsih mptbase scsi_transport_sas thermal thermal_sys sata_nv ehci_hcd forcedeth libata scsi_mod usbcore nls_base [las unloaded: scsi_wait_scan]
> [ 29.192763] Pid: 0, comm: swapper Not tainted 2.6.32-3-amd64 #1 H8DM8-2
> ==========================================
>
> Please also see the "screenshot" attached.
[...]
I can't get much information from this log, but by inspecting the code I
do see some problems with the handling of errors during initialisation.
Could you please test whether the attached patch fixes this bug?
Instructions for rebuilding the Debian kernel package are available at
<http://kernel-handbook.alioth.debian.org/ch-common-tasks.html#s-common-official>.
Ben.
--
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.
08-02-2010, 02:51 AM
Moritz Muehlenhoff
Bug#579393: linux-image-2.6.32-3-amd64: kernel crashes at booting when QLogic IB7220 firmware not available
tags 579393 moreinfo
thanks
On Fri, Jun 11, 2010 at 03:47:54AM +0100, Ben Hutchings wrote:
> On Wed, 2010-06-09 at 13:02 +0200, Sebastian Hegler wrote:
> > Hi!
> >
> > Here we go:
> >
> > ==========================================
> > [ 28.007218] ib_ipath 000:07:00.0: firmware: requesting qlogic/sd7220.fw
> > [ 28.070446] ib_ipath 000:07:00.0: infinipath0: Failed to load IB SERDES image
> > [ 28.070455] ib_ipath 000:07:00.0: infinipath0: Could not initialize SerDes, not usable
> > [ 28.070456] ib_ipath 000:07:00.0: infinipath0: Failed to bring up IB link
> >
> > ...
> >
> > [ 29.068010] BUG: unable to handle kernel NULL pointer dereference at (null)
> > [ 29.072001] IP: [<(null)>] (null)
> > [ 29.072001] PGD 23d60c067 PUD 23d958067 PMD 0
> > [ 29.072001] Thread overran stack, or stack corrupted
> > [ 29.072001] Oops: 0010 [#1] SMP
> > [ 29.072001] last sysfs file: /sys/kernel/uevent_seqnum
> > [ 29.072001] CPU 0
> > [ 29.072001] Modules linked in: snd_pcm snd_timer amd64_edac_mod snd edac_core soundcore ib_ipath i2c_nforce2 shpchp snd_page_alloc edac_mce_amd psmouse evdev i2c_core button pci_hotplug pcspkr serio_raw ib_core processor ext3 jbd dm_mod raid456 md_mod async_raid6_recov async_pq raid6_pq async_xor xor async_memcpy async_tx sd_mod crc_t10dif ses enclosure ata_generic usb_hid hid ide_pci_generic ide_core ohci_hcd mptsas mptscsih mptbase scsi_transport_sas thermal thermal_sys sata_nv ehci_hcd forcedeth libata scsi_mod usbcore nls_base [las unloaded: scsi_wait_scan]
> > [ 29.192763] Pid: 0, comm: swapper Not tainted 2.6.32-3-amd64 #1 H8DM8-2
> > ==========================================
> >
> > Please also see the "screenshot" attached.
> [...]
>
> I can't get much information from this log, but by inspecting the code I
> do see some problems with the handling of errors during initialisation.
>
> Could you please test whether the attached patch fixes this bug?
> Instructions for rebuilding the Debian kernel package are available at
> <http://kernel-handbook.alioth.debian.org/ch-common-tasks.html#s-common-official>.
Sebastian,
did you test the patch?
Cheers,
Moritz
--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: 20100802025142.GA8598@galadriel.inutil.org">http://lists.debian.org/20100802025142.GA8598@galadriel.inutil.org
08-09-2010, 11:33 AM
Sebastian Hegler
Bug#579393: linux-image-2.6.32-3-amd64: kernel crashes at booting when QLogic IB7220 firmware not available
> <0001-ipath-Fix-probe-failure-path.patch>
I'm sorry to report that your patch did not fix the issue.
See "fail_2.png" attached: Compared with the previous one, the difference now is that the interrupt is being disabled.
That, however, does not stop the system from crashing erratically: sometimes it just freezes hard (as in "fail_2.png"), sometimes I get a piece of a kernel backtrace, sometimes I get a full one which is larger than a screen of text (as in "fail_1.png").
Sebastian
08-11-2010, 02:55 AM
Ben Hutchings
Bug#579393: linux-image-2.6.32-3-amd64: kernel crashes at booting when QLogic IB7220 firmware not available
On Mon, 2010-08-09 at 13:33 +0200, Sebastian Hegler wrote:
> > <0001-ipath-Fix-probe-failure-path.patch>
>
> I'm sorry to report that your patch did not fix the issue.
>
> See "fail_2.png" attached: Compared with the previous one, the
> difference now is that the interrupt is being disabled.
This is correct behaviour when the firmware is not installed. So I'm
going to include the change to ib_ipath in the next kernel package
version (2.6.32-20).
> That, however, does not stop the system from crashing erratically:
> sometimes it just freezes hard (as in "fail_2.png"), sometimes I get a
> piece of a kernel backtrace, sometimes I get a full one which is
> larger than a screen of text (as in "fail_1.png").
This seems to be a separate bug. If you can reproduce this kind of
crash with that version, please make a new bug report.
Ben.
--
Ben Hutchings
Once a job is fouled up, anything done to improve it makes it worse.