Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Package: linux-2.6
Version: 2.6.32-16
Severity: important

I have been running linux-image-2.6.32-5-amd64 2.6.32-15 mostly without
problems (except for #580889) since 2010-06-14 in debian squeeze running
under Xen. When I heard that -16 includes CONFIG_KPROBES I wanted to test
it early. I installed the package from unstable with

wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-image-2.6.32-5-amd64_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-image-2.6.32-5-amd64-dbg_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-image-2.6.32-5-amd64-common_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-headers-2.6.32-5-amd64-common_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-headers-2.6.32-5-common_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-headers-2.6.32-5-amd64_2.6.32-16_amd64.deb
wget http://ftp.fi.debian.org/debian/pool/main/l/linux-2.6/linux-base_2.6.32-16_all.deb
sudo dpkg -i *.deb

but boot stopped at

[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 2.6.32-5-amd64 (Debian 2.6.32-16) (ben@decadent.org.uk) (gcc version 4.3.5 (Debian 4.3.5-1) ) #1 SMP Tue Jul 6 14:14:38 UTC 2010
[ 0.000000] Command line: root=/dev/xvda ro console=hvc0
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Centaur CentaurHauls
[ 0.000000] ACPI in unprivileged domain disabled
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] Xen: 0000000000000000 - 00000000000a0000 (usable)
[ 0.000000] Xen: 00000000000a0000 - 0000000000100000 (reserved)
[ 0.000000] Xen: 0000000000100000 - 0000000080000000 (usable)
[ 0.000000] DMI not present or invalid.
[ 0.000000] last_pfn = 0x80000 max_arch_pfn = 0x400000000
[ 0.000000] init_memory_mapping: 0000000000000000-0000000080000000
[ 0.000000] RAMDISK: 016a5000 - 03085000
[ 0.000000] No NUMA configuration found
[ 0.000000] Faking a node at 0000000000000000-0000000080000000
[ 0.000000] Bootmem setup node 0 0000000000000000-0000000080000000
[ 0.000000] NODE_DATA [0000000000008000 - 000000000000ffff]
[ 0.000000] bootmap [0000000000010000 - 000000000001ffff] pages 10
[ 0.000000] (7 early reservations) ==> bootmem [0000000000 - 0080000000]
[ 0.000000] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000]
[ 0.000000] #1 [0003488000 - 00034a7000] XEN PAGETABLES ==> [0003488000 - 00034a7000]
[ 0.000000] #2 [0000006000 - 0000008000] TRAMPOLINE ==> [0000006000 - 0000008000]
[ 0.000000] #3 [0001000000 - 0001684754] TEXT DATA BSS ==> [0001000000 - 0001684754]
[ 0.000000] #4 [00016a5000 - 0003085000] RAMDISK ==> [00016a5000 - 0003085000]
[ 0.000000] #5 [0003085000 - 0003488000] XEN START INFO ==> [0003085000 - 0003488000]
[ 0.000000] #6 [0000100000 - 00004e1000] PGTABLE ==> [0000100000 - 00004e1000]
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0x00000000 -> 0x00001000
[ 0.000000] DMA32 0x00001000 -> 0x00100000
[ 0.000000] Normal 0x00100000 -> 0x00100000
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[2] active PFN ranges
[ 0.000000] 0: 0x00000000 -> 0x000000a0
[ 0.000000] 0: 0x00000100 -> 0x00080000
[ 0.000000] SFI: Simple Firmware Interface v0.7 http://simplefirmware.org
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[ 0.000000] No local APIC present
[ 0.000000] APIC: disable apic facility
[ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 0000000000100000
[ 0.000000] Allocating PCI resources starting at 80000000 (gap: 80000000:80000000)
[ 0.000000] Booting paravirtualized kernel on Xen
[ 0.000000] Xen version: 3.4.1_19718_04-2.1 (preserve-AD)
[ 0.000000] NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:2 nr_node_ids:1
[ 0.000000] PERCPU: Embedded 30 pages/cpu @ffff8800034dd000 s90328 r8192 d24360 u122880
[ 0.000000] pcpu-alloc: s90328 r8192 d24360 u122880 alloc=30*4096
[ 0.000000] pcpu-alloc: [0] 0 [0] 1
[ 0.000000] Xen: using vcpu_info placement
[ 0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 516028
[ 0.000000] Policy zone: DMA32
[ 0.000000] Kernel command line: root=/dev/xvda ro console=hvc0
[ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
[ 0.000000] Initializing CPU#0
[ 0.000000] Checking aperture...
[ 0.000000] No AGP bridge found
[ 0.000000] Memory: 2026096k/2097152k available (3067k kernel code, 384k absent, 70672k reserved, 1887k data, 588k init)
[ 0.000000] SLUB: Genslabs=14, HWalign=64, Order=0-3, MinObjects=0, CPUs=2, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] NR_IRQS:4352 nr_irqs:288
[ 0.000000] Console: colour dummy device 80x25
[ 0.000000] console [tty0] enabled
[ 0.000000] console [hvc0] enabled
[ 0.000000] installing Xen timer for CPU 0
[ 0.000000] Detected 2393.998 MHz processor.
[ 0.004000] Calibrating delay loop (skipped), value calculated using timer frequency.. 4787.99 BogoMIPS (lpj=9575992)
[ 0.004000] Security Framework initialized
[ 0.004000] SELinux: Disabled at boot.
[ 0.004000] Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes)
[ 0.004000] Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes)
[ 0.004000] Mount-cache hash table entries: 256
[ 0.004000] Initializing cgroup subsys ns
[ 0.004000] Initializing cgroup subsys cpuacct
[ 0.004000] Initializing cgroup subsys devices
[ 0.004000] Initializing cgroup subsys freezer
[ 0.004000] Initializing cgroup subsys net_cls
[ 0.004000] CPU: L1 I cache: 32K, L1 D cache: 32K
[ 0.004000] CPU: L2 cache: 4096K
[ 0.004000] CPU 0/0x3 -> Node 0
[ 0.004000] CPU: Unsupported number of siblings 4
[ 0.004000] Performance Events: unsupported p6 CPU model 15 no PMU driver, software events only.
[ 0.004000] SMP alternatives: switching to UP code
[ 0.004256] installing Xen timer for CPU 1
[ 0.004299] SMP alternatives: switching to SMP code
[ 0.000011] Initializing CPU#1
[ 0.000056] CPU: L1 I cache: 32K, L1 D cache: 32K
[ 0.000058] CPU: L2 cache: 4096K
[ 0.000062] CPU 1/0x3 -> Node 0
[ 0.000065] CPU: Unsupported number of siblings 4
[ 0.008111] Brought up 2 CPUs
[ 0.008244] devtmpfs: initialized
[ 0.010738] Grant table initialized
[ 0.010738] regulator: core version 0.5
[ 0.010738] NET: Registered protocol family 16
[ 0.012092] PCI: Fatal: No config space access function found
[ 0.012450] bio: create slab <bio-0> at 0
[ 0.012450] ACPI: Interpreter disabled.
[ 0.012450] xen_balloon: Initialising balloon driver.
[ 0.012450] vgaarb: loaded
[ 0.012450] PCI: System does not support PCI
[ 0.012450] PCI: System does not support PCI
[ 0.012450] Switching to clocksource xen
[ 0.013525] pnp: PnP ACPI: disabled
[ 0.013799] NET: Registered protocol family 2
[ 0.013980] IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
[ 0.015183] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[ 0.016788] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 0.017179] TCP: Hash tables configured (established 262144 bind 65536)
[ 0.017186] TCP reno registered
[ 0.017294] NET: Registered protocol family 1
[ 0.017387] Unpacking initramfs...
[ 0.048660] Freeing initrd memory: 26496k freed
[ 0.059989] platform rtc_cmos: registered platform RTC device (no PNP device found)
[ 0.060322] audit: initializing netlink socket (disabled)
[ 0.060340] type=2000 audit(1278662017.485:1): initialized
[ 0.063019] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 0.064516] VFS: Disk quotas dquot_6.5.2
[ 0.064572] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 0.064663] msgmni has been set to 4008
[ 0.064927] alg: No test for stdrng (krng)
[ 0.064990] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 0.064999] io scheduler noop registered
[ 0.065004] io scheduler anticipatory registered
[ 0.068938] io scheduler deadline registered
[ 0.068938] io scheduler cfq registered (default)
[ 0.147536] Linux agpgart interface v0.103
[ 0.147575] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 0.147766] input: Macintosh mouse button emulation as /devices/virtual/input/input0
[ 0.147819] PNP: No PS/2 controller found. Probing ports directly.
[ 0.148640] i8042.c: No controller found.
[ 0.148712] mice: PS/2 mouse device common for all mice
[ 0.148759] rtc_cmos: probe of rtc_cmos failed with error -16
[ 0.148786] cpuidle: using governor ladder
[ 0.148792] cpuidle: using governor menu
[ 0.148799] No iBFT detected.
[ 0.149076] TCP cubic registered
[ 0.149194] NET: Registered protocol family 10
[ 0.149632] lo: Disabled Privacy Extensions
[ 0.149861] Mobile IPv6
[ 0.149867] NET: Registered protocol family 17
[ 0.149963] registered taskstats version 1
[ 0.149982] XENBUS: Device with no driver: device/vbd/51712
[ 0.149988] XENBUS: Device with no driver: device/vbd/51728
[ 0.149993] XENBUS: Device with no driver: device/vif/0
[ 0.149997] XENBUS: Device with no driver: device/console/0
[ 0.150013] /build/buildd-linux-2.6_2.6.32-16-amd64-pXyi6J/linux-2.6-2.6.32/debian/build/source_amd64_none/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 0.150038] Initalizing network drop monitor service
[ 0.150104] Freeing unused kernel memory: 588k freed
[ 0.150287] Write protecting the kernel read-only data: 4220k

In this state the kernel does not react to any of the following commands:

sudo xm sysrq lindi1 h
sudo xm sysrq lindi1 s
sudo xm sysrq lindi1 r
sudo xm sysrq lindi1 b

which suggests that the kernel is totally dead.

With -15 the boot messages are

[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 2.6.32-5-amd64 (Debian 2.6.32-15) (ben@decadent.org.uk) (gcc version 4.3.5 (Debian 4.3.5-1) ) #1 SMP Tue Jun 1 04:34:03 UTC 2010
[ 0.000000] Command line: root=/dev/xvda ro console=hvc0
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Centaur CentaurHauls
[ 0.000000] ACPI in unprivileged domain disabled
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] Xen: 0000000000000000 - 00000000000a0000 (usable)
[ 0.000000] Xen: 00000000000a0000 - 0000000000100000 (reserved)
[ 0.000000] Xen: 0000000000100000 - 0000000080000000 (usable)
[ 0.000000] DMI not present or invalid.
[ 0.000000] last_pfn = 0x80000 max_arch_pfn = 0x400000000
[ 0.000000] init_memory_mapping: 0000000000000000-0000000080000000
[ 0.000000] RAMDISK: 016a1000 - 0307e000
[ 0.000000] No NUMA configuration found
[ 0.000000] Faking a node at 0000000000000000-0000000080000000
[ 0.000000] Bootmem setup node 0 0000000000000000-0000000080000000
[ 0.000000] NODE_DATA [0000000000008000 - 000000000000ffff]
[ 0.000000] bootmap [0000000000010000 - 000000000001ffff] pages 10
[ 0.000000] (7 early reservations) ==> bootmem [0000000000 - 0080000000]
[ 0.000000] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000]
[ 0.000000] #1 [0003481000 - 00034a0000] XEN PAGETABLES ==> [0003481000 - 00034a0000]
[ 0.000000] #2 [0000006000 - 0000008000] TRAMPOLINE ==> [0000006000 - 0000008000]
[ 0.000000] #3 [0001000000 - 0001680314] TEXT DATA BSS ==> [0001000000 - 0001680314]
[ 0.000000] #4 [00016a1000 - 000307e000] RAMDISK ==> [00016a1000 - 000307e000]
[ 0.000000] #5 [000307e000 - 0003481000] XEN START INFO ==> [000307e000 - 0003481000]
[ 0.000000] #6 [0000100000 - 00004e1000] PGTABLE ==> [0000100000 - 00004e1000]
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0x00000000 -> 0x00001000
[ 0.000000] DMA32 0x00001000 -> 0x00100000
[ 0.000000] Normal 0x00100000 -> 0x00100000
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[2] active PFN ranges
[ 0.000000] 0: 0x00000000 -> 0x000000a0
[ 0.000000] 0: 0x00000100 -> 0x00080000
[ 0.000000] SFI: Simple Firmware Interface v0.7 http://simplefirmware.org
[ 0.000000] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[ 0.000000] No local APIC present
[ 0.000000] APIC: disable apic facility
[ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 0000000000100000
[ 0.000000] Allocating PCI resources starting at 80000000 (gap: 80000000:80000000)
[ 0.000000] Booting paravirtualized kernel on Xen
[ 0.000000] Xen version: 3.4.1_19718_04-2.1 (preserve-AD)
[ 0.000000] NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:2 nr_node_ids:1
[ 0.000000] PERCPU: Embedded 29 pages/cpu @ffff8800034d6000 s90008 r8192 d20584 u118784
[ 0.000000] pcpu-alloc: s90008 r8192 d20584 u118784 alloc=29*4096
[ 0.000000] pcpu-alloc: [0] 0 [0] 1
[ 0.000000] Xen: using vcpu_info placement
[ 0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 516028
[ 0.000000] Policy zone: DMA32
[ 0.000000] Kernel command line: root=/dev/xvda ro console=hvc0
[ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
[ 0.000000] Initializing CPU#0
[ 0.000000] Checking aperture...
[ 0.000000] No AGP bridge found
[ 0.000000] Memory: 2026132k/2097152k available (3060k kernel code, 384k absent, 70636k reserved, 1885k data, 584k init)
[ 0.000000] SLUB: Genslabs=14, HWalign=64, Order=0-3, MinObjects=0, CPUs=2, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] NR_IRQS:4352 nr_irqs:288
[ 0.000000] Console: colour dummy device 80x25
[ 0.000000] console [tty0] enabled
[ 0.000000] console [hvc0] enabled
[ 0.000000] installing Xen timer for CPU 0
[ 0.000000] Detected 2393.998 MHz processor.
[ 0.004000] Calibrating delay loop (skipped), value calculated using timer frequency.. 4787.99 BogoMIPS (lpj=9575992)
[ 0.004000] Security Framework initialized
[ 0.004000] SELinux: Disabled at boot.
[ 0.004000] Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes)
[ 0.004000] Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes)
[ 0.004000] Mount-cache hash table entries: 256
[ 0.004000] Initializing cgroup subsys ns
[ 0.004000] Initializing cgroup subsys cpuacct
[ 0.004000] Initializing cgroup subsys devices
[ 0.004000] Initializing cgroup subsys freezer
[ 0.004000] Initializing cgroup subsys net_cls
[ 0.004000] CPU: L1 I cache: 32K, L1 D cache: 32K
[ 0.004000] CPU: L2 cache: 4096K
[ 0.004000] CPU 0/0x0 -> Node 0
[ 0.004000] CPU: Unsupported number of siblings 4
[ 0.004000] Performance Events: unsupported p6 CPU model 15 no PMU driver, software events only.
[ 0.004000] SMP alternatives: switching to UP code
[ 0.004256] installing Xen timer for CPU 1
[ 0.004296] SMP alternatives: switching to SMP code
[ 0.000011] Initializing CPU#1
[ 0.000056] CPU: L1 I cache: 32K, L1 D cache: 32K
[ 0.000059] CPU: L2 cache: 4096K
[ 0.000062] CPU 1/0x0 -> Node 0
[ 0.000065] CPU: Unsupported number of siblings 4
[ 0.011074] Brought up 2 CPUs
[ 0.011204] devtmpfs: initialized
[ 0.014219] Grant table initialized
[ 0.014219] regulator: core version 0.5
[ 0.014219] NET: Registered protocol family 16
[ 0.014219] PCI: Fatal: No config space access function found
[ 0.016066] bio: create slab <bio-0> at 0
[ 0.016070] ACPI: Interpreter disabled.
[ 0.016070] xen_balloon: Initialising balloon driver.
[ 0.016070] vgaarb: loaded
[ 0.016093] PCI: System does not support PCI
[ 0.016093] PCI: System does not support PCI
[ 0.016132] Switching to clocksource xen
[ 0.017497] pnp: PnP ACPI: disabled
[ 0.017757] NET: Registered protocol family 2
[ 0.017941] IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
[ 0.019183] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[ 0.020794] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 0.021188] TCP: Hash tables configured (established 262144 bind 65536)
[ 0.021195] TCP reno registered
[ 0.021301] NET: Registered protocol family 1
[ 0.021367] Unpacking initramfs...
[ 0.052691] Freeing initrd memory: 26484k freed
[ 0.063529] platform rtc_cmos: registered platform RTC device (no PNP device found)
[ 0.063830] audit: initializing netlink socket (disabled)
[ 0.063847] type=2000 audit(1278663313.982:1): initialized
[ 0.064602] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 0.066082] VFS: Disk quotas dquot_6.5.2
[ 0.066138] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 0.066228] msgmni has been set to 4009
[ 0.066520] alg: No test for stdrng (krng)
[ 0.066591] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 0.066602] io scheduler noop registered
[ 0.070536] io scheduler anticipatory registered
[ 0.070536] io scheduler deadline registered
[ 0.070536] io scheduler cfq registered (default)
[ 0.149547] Linux agpgart interface v0.103
[ 0.149587] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 0.149785] input: Macintosh mouse button emulation as /devices/virtual/input/input0
[ 0.149839] PNP: No PS/2 controller found. Probing ports directly.
[ 0.150660] i8042.c: No controller found.
[ 0.150728] mice: PS/2 mouse device common for all mice
[ 0.150778] rtc_cmos: probe of rtc_cmos failed with error -16
[ 0.150805] cpuidle: using governor ladder
[ 0.150811] cpuidle: using governor menu
[ 0.150818] No iBFT detected.
[ 0.151085] TCP cubic registered
[ 0.151201] NET: Registered protocol family 10
[ 0.151632] lo: Disabled Privacy Extensions
[ 0.151860] Mobile IPv6
[ 0.151867] NET: Registered protocol family 17
[ 0.151958] registered taskstats version 1
[ 0.151976] XENBUS: Device with no driver: device/vbd/51712
[ 0.151982] XENBUS: Device with no driver: device/vbd/51728
[ 0.151987] XENBUS: Device with no driver: device/vif/0
[ 0.151992] XENBUS: Device with no driver: device/console/0
[ 0.152021] /build/buildd-linux-2.6_2.6.32-15-amd64-PisqNL/linux-2.6-2.6.32/debian/build/source_amd64_none/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 0.152047] Initalizing network drop monitor service
[ 0.152118] Freeing unused kernel memory: 584k freed
[ 0.152296] Write protecting the kernel read-only data: 4208k
Loading, please wait...
[ 0.184507] udev: starting version 158
[ 0.226097] Initialising Xen virtual ethernet driver.
[ 0.236524] blkfront: xvda: barriers enabled
[ 0.236792] xvda: unknown partition table
[ 0.248869] blkfront: xvdb: barriers enabled
[ 0.249054] xvdb: unknown partition table
Begin: Loading essential drivers ... [ 0.473159] nbd: registered device at major 43
done.
Begin: Running /scripts/init-premount ... done.
Begin: Mounting root file system ...
Begin: Running /scripts/local-top ...
Begin: Setting up nbd-client ...
Failure: Insufficient information to set up nbd, quitting (nbdsrv= nbdport= nbdroot= root=)
Begin: Loading[ 0.493366] md: linear personality registered for level -1
Success: loaded module linear.
[ 0.496673] md: multipath personality registered for level -4
Success: loaded module multipath.
[ 0.499504] md: raid0 personality registered for level 0
Success: loaded module raid0.
[ 0.503467] md: raid1 personality registered for level 1
Success: loaded module raid1.
[ 0.506094] async_tx: api initialized (async)
[ 0.506541] xor: automatically using best checksumming function: generic_sse
[ 0.524008] generic_sse: 2616.000 MB/sec
[ 0.524014] xor: using function: generic_sse (2616.000 MB/sec)
[ 0.592006] raid6: int64x1 1995 MB/s
[ 0.660029] raid6: int64x2 2699 MB/s
[ 0.728038] raid6: int64x4 2115 MB/s
[ 0.796034] raid6: int64x8 1814 MB/s
[ 0.864018] raid6: sse2x1 3985 MB/s
[ 0.932018] raid6: sse2x2 4203 MB/s
[ 1.000044] raid6: sse2x4 6372 MB/s
[ 1.000050] raid6: using algorithm sse2x4 (6372 MB/s)
[ 1.003857] md: raid6 personality registered for level 6
[ 1.003865] md: raid5 personality registered for level 5
[ 1.003870] md: raid4 personality registered for level 4
Success: loaded module raid456.
Success: loaded module raid5.
Success: loaded module raid6.
[ 1.012709] md: raid10 personality registered for level 10
Success: loaded module raid10.
done.
Begin: Assembling all MD arrays ...
mdadm: No arrays found in config file or automatically
Failure: failed to assemble all arrays.
done.
done.
Begin: Running /scripts/local-premount ... done.
[ 1.057208] kjournald starting. Commit interval 5 seconds
[ 1.057226] EXT3-fs: mounted filesystem with ordered data mode.
Begin: Running /scripts/local-bottom ... done.
done.
Begin: Running /scripts/init-bottom ... done.
INIT: version 2.88 booting
Using makefile-style concurrent boot in runlevel S.
Starting the hotplug events dispatcher: udevd
udevd[206]: BUS= will be removed in a future udev version, please use SUBSYSTEM= to match the event device, or SUBSYSTEMS= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:6
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:10
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:14
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:18
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:20
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:24
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:26
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:28
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:30
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:32
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:34
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:36
udevd[206]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/52-nut-usbups.rules:38
[removed hundreds of lines from udevd]
udevd[207]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/z60_xserver-xorg-input-wacom.rules:71
udevd[207]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/z60_xserver-xorg-input-wacom.rules:72
udevd[207]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/z60_xserver-xorg-input-wacom.rules:73
udevd[207]: SYSFS{}= will be removed in a future udev version, please use ATTR{}= to match the event device, or ATTRS{}= to match a parent device, in /etc/udev/rules.d/z60_xserver-xorg-input-wacom.rules:78
[ 4.610051] input: PC Speaker as /devices/platform/pcspkr/input/input1
done.
Setting parameters of disc: (none).
@Activating swap...[ 5.648619] Adding 6291448k swap on /dev/xvdb. Priority:-1 extents:1 across:6291448k SS
done.
Checking root file system...fsck from util-linux-ng 2.17.2
/dev/xvda: clean, 519373/1638400 files, 5956632/6553600 blocks
done.
[ 5.857372] EXT3 FS on xvda, internal journal
Cleaning up ifupdown....
Setting up networking....
[ 6.045779] loop: module loaded
Loading kernel modules...done.
Activating lvm and md swap...done.
Checking file systems...fsck from util-linux-ng 2.17.2
done.
Mounting local filesystems...done.
Activating swapfile swap...done.
Cleaning up temporary files....
Setting kernel variables ...done.
Configuring network interfaces...[ 7.306534] Bridge firewalling registered
[ 7.336191] device eth0 entered promiscuous mode
[ 7.346179] br0: port 1(eth0) entering learning state
Waiting for br0 to get ready (MAXWAIT is 20 seconds).
[ 16.344025] br0: port 1(eth0) entering forwarding state
Starting portmap daemon....
Starting NFS common utilities: statd.
mount.nfs: Failed to resolve server nfs: Temporary failure in name resolution
done.
Starting portmap daemon...Already running..
Starting NFS common utilities: statd.
Cleaning up temporary files....
Setting up ALSA...done (none loaded).
Setting console screen modes and fonts.
cannot (un)set powersave mode
9;30]14;30][ 17.623522] fuse init (API version 7.13)
Recovering schroot sessions:.
Running scripts in rcS.d/ took 16 seconds.
INIT: Entering runlevel: 2
Using makefile-style concurrent boot in runlevel 2.
CPUFreq Utilities: Setting ondemand CPUFreq governor...disabled, governor not available...done.
Starting NFS common utilities: statd.
Starting portmap daemon...Already running..
Enabling additional executable binary formats: binfmt-support.
Starting enhanced syslogd: rsyslogd.
Starting web server: apache2apache2: apr_sockaddr_info_get() failed for lindi1
apache2: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1 for ServerName
.
Starting domain name service...: bind9.
Starting periodic command scheduler: cron.
Starting system message bus: dbus.
Starting MTA:nut disabled, please adjust the configuration to your needs.
Then set MODE to a suitable value in /etc/nut/nut.conf to enable it.
SSL tunnels disabled, see /etc/default/stunnel4
Starting Nagios Service Check Acceptor: nsca.
Starting NTP server: ntpd.
Starting OpenBSD Secure Shell server: sshd.
exim4.
Starting nagios3 monitoring daemon: nagios3.
[ 22.793187] RPC: Registered udp transport module.
[ 22.793197] RPC: Registered tcp transport module.
[ 22.793203] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 22.837231] Slow work thread pool: Starting up
[ 22.837312] Slow work thread pool: Ready
[ 22.837383] FS-Cache: Loaded
[ 22.893685] FS-Cache: Netfs 'nfs' registered for caching
[ 23.006650] svc: failed to register lockdv1 RPC service (errno 97).
Running scripts in rc2.d/ took 5 seconds.

Debian GNU/Linux squeeze/sid lindi1 hvc0

lindi1 login:

In this state the kernel does react to

sudo xm sysrq lindi1 h

with

SysRq : HELP : loglevel(0-9) reBoot Crashdump terminate-all-tasks(E) memory-full-oom-kill(F) kill-all-tasks(I) thaw-filesystems(J) saK show-backtrace-all-active-cpus(L) show-memory-usage(M) nice-all-RT-tasks(N) powerOff show-registers(P) show-all-timers(Q) unRaw Sync show-task-states(T) Unmount show-blocked-tasks(W) dump-ftrace-buffer(Z)

as expected.

The following information is naturally about -15 since I can not boot -16:

-- Package-specific info:
** Version:
Linux version 2.6.32-5-amd64 (Debian 2.6.32-15) (ben@decadent.org.uk) (gcc version 4.3.5 (Debian 4.3.5-1) ) #1 SMP Tue Jun 1 04:34:03 UTC 2010

** Command line:
root=/dev/xvda ro console=hvc0

** Not tainted

** Kernel log:
[ 0.016093] PCI: System does not support PCI
[ 0.016093] PCI: System does not support PCI
[ 0.016132] Switching to clocksource xen
[ 0.017497] pnp: PnP ACPI: disabled
[ 0.017757] NET: Registered protocol family 2
[ 0.017941] IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
[ 0.019183] TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
[ 0.020794] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 0.021188] TCP: Hash tables configured (established 262144 bind 65536)
[ 0.021195] TCP reno registered
[ 0.021301] NET: Registered protocol family 1
[ 0.021367] Unpacking initramfs...
[ 0.052691] Freeing initrd memory: 26484k freed
[ 0.063529] platform rtc_cmos: registered platform RTC device (no PNP device found)
[ 0.063830] audit: initializing netlink socket (disabled)
[ 0.063847] type=2000 audit(1278663313.982:1): initialized
[ 0.064602] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 0.066082] VFS: Disk quotas dquot_6.5.2
[ 0.066138] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 0.066228] msgmni has been set to 4009
[ 0.066520] alg: No test for stdrng (krng)
[ 0.066591] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 0.066602] io scheduler noop registered
[ 0.070536] io scheduler anticipatory registered
[ 0.070536] io scheduler deadline registered
[ 0.070536] io scheduler cfq registered (default)
[ 0.149547] Linux agpgart interface v0.103
[ 0.149587] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 0.149785] input: Macintosh mouse button emulation as /devices/virtual/input/input0
[ 0.149839] PNP: No PS/2 controller found. Probing ports directly.
[ 0.150660] i8042.c: No controller found.
[ 0.150728] mice: PS/2 mouse device common for all mice
[ 0.150778] rtc_cmos: probe of rtc_cmos failed with error -16
[ 0.150805] cpuidle: using governor ladder
[ 0.150811] cpuidle: using governor menu
[ 0.150818] No iBFT detected.
[ 0.151085] TCP cubic registered
[ 0.151201] NET: Registered protocol family 10
[ 0.151632] lo: Disabled Privacy Extensions
[ 0.151860] Mobile IPv6
[ 0.151867] NET: Registered protocol family 17
[ 0.151951] PM: Resume from disk failed.
[ 0.151958] registered taskstats version 1
[ 0.151976] XENBUS: Device with no driver: device/vbd/51712
[ 0.151982] XENBUS: Device with no driver: device/vbd/51728
[ 0.151987] XENBUS: Device with no driver: device/vif/0
[ 0.151992] XENBUS: Device with no driver: device/console/0
[ 0.152021] /build/buildd-linux-2.6_2.6.32-15-amd64-PisqNL/linux-2.6-2.6.32/debian/build/source_amd64_none/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 0.152047] Initalizing network drop monitor service
[ 0.152118] Freeing unused kernel memory: 584k freed
[ 0.152296] Write protecting the kernel read-only data: 4208k
[ 0.184507] udev: starting version 158
[ 0.226097] Initialising Xen virtual ethernet driver.
[ 0.236524] blkfront: xvda: barriers enabled
[ 0.236792] xvda: unknown partition table
[ 0.248869] blkfront: xvdb: barriers enabled
[ 0.249054] xvdb: unknown partition table
[ 0.473159] nbd: registered device at major 43
[ 0.493366] md: linear personality registered for level -1
[ 0.496673] md: multipath personality registered for level -4
[ 0.499504] md: raid0 personality registered for level 0
[ 0.503467] md: raid1 personality registered for level 1
[ 0.506094] async_tx: api initialized (async)
[ 0.506541] xor: automatically using best checksumming function: generic_sse
[ 0.524008] generic_sse: 2616.000 MB/sec
[ 0.524014] xor: using function: generic_sse (2616.000 MB/sec)
[ 0.592006] raid6: int64x1 1995 MB/s
[ 0.660029] raid6: int64x2 2699 MB/s
[ 0.728038] raid6: int64x4 2115 MB/s
[ 0.796034] raid6: int64x8 1814 MB/s
[ 0.864018] raid6: sse2x1 3985 MB/s
[ 0.932018] raid6: sse2x2 4203 MB/s
[ 1.000044] raid6: sse2x4 6372 MB/s
[ 1.000050] raid6: using algorithm sse2x4 (6372 MB/s)
[ 1.003857] md: raid6 personality registered for level 6
[ 1.003865] md: raid5 personality registered for level 5
[ 1.003870] md: raid4 personality registered for level 4
[ 1.012709] md: raid10 personality registered for level 10
[ 1.057208] kjournald starting. Commit interval 5 seconds
[ 1.057226] EXT3-fs: mounted filesystem with ordered data mode.
[ 3.240597] udev: starting version 158
[ 4.610051] input: PC Speaker as /devices/platform/pcspkr/input/input1
[ 5.648619] Adding 6291448k swap on /dev/xvdb. Priority:-1 extents:1 across:6291448k SS
[ 5.857372] EXT3 FS on xvda, internal journal
[ 6.045779] loop: module loaded
[ 7.306534] Bridge firewalling registered
[ 7.336191] device eth0 entered promiscuous mode
[ 7.346179] br0: port 1(eth0) entering learning state
[ 16.344025] br0: port 1(eth0) entering forwarding state
[ 17.623522] fuse init (API version 7.13)
[ 17.860012] eth0: no IPv6 routers present
[ 18.229009] br0: no IPv6 routers present
[ 22.793187] RPC: Registered udp transport module.
[ 22.793197] RPC: Registered tcp transport module.
[ 22.793203] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 22.837231] Slow work thread pool: Starting up
[ 22.837312] Slow work thread pool: Ready
[ 22.837383] FS-Cache: Loaded
[ 22.893685] FS-Cache: Netfs 'nfs' registered for caching
[ 23.006650] svc: failed to register lockdv1 RPC service (errno 97).

** Model information
not available

** Loaded modules:
Module                  Size  Used by
nfs                   240826  1
lockd                  57603  1 nfs
fscache                29834  1 nfs
nfs_acl                 2031  1 nfs
auth_rpcgss            33460  1 nfs
sunrpc                161317  11 nfs,lockd,nfs_acl,auth_rpcgss
binfmt_misc             6431  1
fuse                   50190  1
bridge                 39598  0
stp                     1440  1 bridge
loop                   11783  0
pcspkr                  1699  0
evdev                   7352  0
ext3                  106518  1
jbd                    37085  1 ext3
mbcache                 5050  1 ext3
raid10                 17697  0
raid456                44500  0
async_raid6_recov       5170  1 raid456
async_pq                3479  2 raid456,async_raid6_recov
raid6_pq               77179  2 async_raid6_recov,async_pq
async_xor               2478  3 raid456,async_raid6_recov,async_pq
xor                     4380  1 async_xor
async_memcpy            1198  2 raid456,async_raid6_recov
async_tx                1734  5 raid456,async_raid6_recov,async_pq,async_xor,async_memcpy
raid1                  18191  0
raid0                   5517  0
multipath               6059  0
linear                  3455  0
md_mod                 73664  6 raid10,raid456,raid1,raid0,multipath,linear
nbd                     7985  0
xen_netfront           15180  0
xen_blkfront            8102  2

** PCI devices:

** Sound cards:

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (500, 'testing')
Architecture: amd64 (x86_64)

Kernel: Linux 2.6.32-5-amd64 (SMP w/2 CPU cores)
Locale: LANG=C, LC_CTYPE=fi_FI (charmap=ISO-8859-1)
Shell: /bin/sh linked to /bin/dash

Versions of packages linux-image-2.6.32-5-amd64 depends on:
ii  debconf [debconf-2.0]         1.5.32     Debian configuration management sy
ii  initramfs-tools [linux-initra 0.97.2     tools for generating an initramfs
ii  linux-base                    2.6.32-15  Linux image base package
ii  module-init-tools             3.12-1     tools for managing Linux kernel mo

Versions of packages linux-image-2.6.32-5-amd64 recommends:
ii  firmware-linux-free           2.6.32-15  Binary firmware for various driver

Versions of packages linux-image-2.6.32-5-amd64 suggests:
ii  grub-legacy [grub]            0.97-61    GRand Unified Bootloader (Legacy v
ii  linux-doc-2.6.32              2.6.32-15  Linux kernel specific documentatio

Versions of packages linux-image-2.6.32-5-amd64 is related to:
pn  firmware-bnx2                 <none>     (no description available)
pn  firmware-bnx2x                <none>     (no description available)
pn  firmware-ipw2x00              <none>     (no description available)
pn  firmware-ivtv                 <none>     (no description available)
pn  firmware-iwlwifi              <none>     (no description available)
pn  firmware-linux                <none>     (no description available)
pn  firmware-linux-nonfree        <none>     (no description available)
pn  firmware-qlogic               <none>     (no description available)
pn  firmware-ralink               <none>     (no description available)
pn  xen-hypervisor                <none>     (no description available)

-- debconf information excluded

--
To UNSUBSCRIBE, email to debian-kernel-REQUEST@lists.debian.org
with a subject of "unsubscribe". Trouble? Contact listmaster@lists.debian.org
Archive: http://lists.debian.org/84bpah9gl3.fsf@sauna.l.org
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Would it be ok to raise the severity to serious to prevent migration
to testing? Even if investigating this takes time I'd hate to have a
kernel that does not boot under Xen in testing.

Archive: http://lists.debian.org/84k4p1kti2.fsf@sauna.l.org
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Sorry, I forgot to include Xen version number information:

$ sudo xm info
host                   : overlord2
release                : 2.6.31.12-0.2-xen
version                : #1 SMP 2010-03-16 21:25:39 +0100
machine                : x86_64
nr_cpus                : 4
nr_nodes               : 1
cores_per_socket       : 4
threads_per_core       : 1
cpu_mhz                : 2394
hw_caps                : bfebfbff:20100800:00000000:00000140:0000e3bd:00000000:00000001:00000000
virt_caps              : hvm
total_memory           : 8189
free_memory            : 16
max_free_memory        : 2420
max_para_memory        : 2416
max_hvm_memory         : 2402
node_to_cpu            : node0:0-3
node_to_memory         : node0:16
xen_major              : 3
xen_minor              : 4
xen_extra              : .1_19718_04-2.1
xen_caps               : xen-3.0-x86_64 xen-3.0-x86_32p hvm-3.0-x86_32 hvm-3.0-x86_32p hvm-3.0-x86_64
xen_scheduler          : credit
xen_pagesize           : 4096
platform_params        : virt_start=0xffff800000000000
xen_changeset          : 19718
cc_compiler            : gcc version 4.4.1 [gcc-4_4-branch revision 150839] (SUSE Linux)
cc_compile_by          : abuild
cc_compile_domain      :
cc_compile_date        : Tue Oct 27 16:31:01 UTC 2009
xend_config_format     : 4

(If these look odd it is because Dom0 runs openSUSE 11.2)

Archive: http://lists.debian.org/84fwzkizf2.fsf@sauna.l.org
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Here's some debugging output. I booted domU with an older kernel to be able to install
On opensuse dom0 I ran sudo gdbserver-xen 127.0.0.1:9999 --attach 37 where 37 is domid printed by xm list and then used sudo gdb /local/lindi/vmlinux-2.6.32-5-amd64 (gdb) target remote 127.0.0.1:9999 to start the debugger. It seems that the kernel is stuck trying to print information about a kernel OOPS: (gdb) info threads * 1 Remote target 0xffffffff812f8c78 in __ticket_spin_lock ( lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:114 (gdb) bt full #0 0xffffffff812f8c78 in __ticket_spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:114 inc = 1 tmp = 0 #1 __raw_spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:190 No locals. #2 __spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/include/linux/spinlock_api_smp.h:337 No locals. #3 _spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/spinlock.c:138 No locals. 
#4 0xffffffff8104d344 in vprintk ( fmt=0xffffffff813a8798 "<1>BUG: unable to handle kernel ", args=0xffff88003fdb93a8) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/printk.c:705 printed_len = 0 current_log_level = 4 flags = 0 p = <value optimized out> #5 0xffffffff812f6eec in printk (fmt=0xffffffff815d13c0 "") at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/printk.c:595 ---Type <return> to continue, or q <return> to quit--- args = {{gp_offset = 8, fp_offset = 48, overflow_arg_area = 0xffff88003fdb9488, reg_save_area = 0xffff88003fdb93c8}} r = 1 #6 0xffffffff810311b0 in show_fault_oops (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:583 No locals. #7 no_context (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:658 tsk = 0xffff88003fdb4000 flags = 0 sig = <value optimized out> #8 0xffffffff81031429 in __bad_area_nosemaphore (regs=0xffff88003fdb9598, error_code=<value optimized out>, address=18446744071582043416, si_code=196609) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:741 tsk = 0xffff88003fdb4000 #9 0xffffffff812f9015 in ?? () at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/kernel/entry_64.S:1383 No locals. #10 0x0005398d646e415d in ?? () No symbol table info available. #11 0x00000000d5de35ab in ?? () ---Type <return> to continue, or q <return> to quit--- No symbol table info available. #12 0x0000000000000002 in ?? () No symbol table info available. #13 0x000c82198774ef00 in ?? 
() No symbol table info available. #14 0x0000000000000001 in ?? () No symbol table info available. #15 0xffff8800032e6040 in ?? () No symbol table info available. #16 0x0000000000000202 in ?? () No symbol table info available. #17 0xffff880000000000 in ?? () No symbol table info available. #18 0xffffffff813a87b8 in ?? () __kcrctab_unregister_net_sysctl_table = 2623496682 __kstrtab_unregister_net_sysctl_table = "unregister_net_sysctl_table" net_sysctl_ro_root = {root_list = {next = 0xffffffff814649b0, prev = 0xffffffff814a6350}, default_set = {list = { next = 0xffff88003fe34c08, prev = 0xffff88003d8dd808}, parent = 0xffffffff814649c0, is_seen = 0}, lookup = 0, permissions = 0xffffffff812e4313 <net_ctl_ro_header_perms>} sysctl_pernet_ops = {list = {next = 0xffffffff8149bcc0, prev = 0xffffffff814a6140}, init = 0xffffffff812e4323 <sysctl_net_init>, exit = 0xffffffff812e4375 <sysctl_net_exit>} __ksymtab_register_net_sysctl_table = {value = 18446744071581877167, name = 0xffffffff81422f46 "register_net_sysctl_table"} __kstrtab_register_net_sysctl_table = "register_net_sysctl_table" __initcall_sysctl_init4 = 0 ---Type <return> to continue, or q <return> to quit--- __kcrctab_register_net_sysctl_table = 829491708 __ksymtab_register_net_sysctl_rotable = {value = 18446744071581877142, name = 0xffffffff81422f2a "register_net_sysctl_rotable"} __ksymtab_unregister_net_sysctl_table = {value = 18446744071581877137, name = 0xffffffff81422f0e "unregister_net_sysctl_table"} net_sysctl_root = {root_list = {next = 0xffffffff814a6390, prev = 0xffffffff814649b0}, default_set = {list = {next = 0x0, prev = 0x0}, parent = 0x0, is_seen = 0}, lookup = 0xffffffff812e42e8 <net_ctl_header_lookup>, permissions = 0xffffffff812e4342 <net_ctl_permissions>} __kcrctab_register_net_sysctl_rotable = 2528268687 __kstrtab_register_net_sysctl_rotable = "register_net_sysctl_rotable" #19 0x0000000000000020 in ?? () No symbol table info available. #20 0x0005398d90ee1210 in ?? () No symbol table info available. 
#21 0x00000000f2572b01 in ?? () No symbol table info available. #22 0x0005398d9113c2e5 in ?? () No symbol table info available. #23 0xffffffff813a87b8 in ?? () __kcrctab_unregister_net_sysctl_table = 2623496682 __kstrtab_unregister_net_sysctl_table = "unregister_net_sysctl_table" net_sysctl_ro_root = {root_list = {next = 0xffffffff814649b0, prev = 0xffffffff814a6350}, default_set = {list = { next = 0xffff88003fe34c08, prev = 0xffff88003d8dd808}, parent = 0xffffffff814649c0, is_seen = 0}, lookup = 0, permissions = 0xffffffff812e4313 <net_ctl_ro_header_perms>} sysctl_pernet_ops = {list = {next = 0xffffffff8149bcc0, ---Type <return> to continue, or q <return> to quit--- prev = 0xffffffff814a6140}, init = 0xffffffff812e4323 <sysctl_net_init>, exit = 0xffffffff812e4375 <sysctl_net_exit>} __ksymtab_register_net_sysctl_table = {value = 18446744071581877167, name = 0xffffffff81422f46 "register_net_sysctl_table"} __kstrtab_register_net_sysctl_table = "register_net_sysctl_table" __initcall_sysctl_init4 = 0 __kcrctab_register_net_sysctl_table = 829491708 __ksymtab_register_net_sysctl_rotable = {value = 18446744071581877142, name = 0xffffffff81422f2a "register_net_sysctl_rotable"} __ksymtab_unregister_net_sysctl_table = {value = 18446744071581877137, name = 0xffffffff81422f0e "unregister_net_sysctl_table"} net_sysctl_root = {root_list = {next = 0xffffffff814a6390, prev = 0xffffffff814649b0}, default_set = {list = {next = 0x0, prev = 0x0}, parent = 0x0, is_seen = 0}, lookup = 0xffffffff812e42e8 <net_ctl_header_lookup>, permissions = 0xffffffff812e4342 <net_ctl_permissions>} __kcrctab_register_net_sysctl_rotable = 2528268687 __kstrtab_register_net_sysctl_rotable = "register_net_sysctl_rotable" #24 0xffff8800032e6040 in ?? () No symbol table info available. #25 0xffffffffffffffff in ?? () No symbol table info available. 
#26 0xffffffff8102bddf in pvclock_clocksource_read (src=0x1) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/kernel/pvclock.c:145 ret = 0 offset = <value optimized out> last = 1 ---Type <return> to continue, or q <return> to quit--- #27 0x0000000000000000 in ?? () No symbol table info available. I think we should figure out what caused the OOPS so the following information about the printk() being stuck is not important: (gdb) x/10i $rip 0xffffffff812f8c78 <__ticket_spin_lock+21>: movzwl (%rdi),%edx 0xffffffff812f8c7b <__ticket_spin_lock+24>: jmp 0xffffffff812f8c72 <__ticket_spin_lock+15> 0xffffffff812f8c7d <_spin_lock+26>: retq 0xffffffff812f8c7e <__raw_write_lock>: subl $0x1000000,%ds:(%rdi) 0xffffffff812f8c85 <__raw_write_lock+7>: je 0xffffffff812f8c8c <_write_lock+14> 0xffffffff812f8c87 <__raw_write_lock+9>: callq 0xffffffff81193b80 0xffffffff812f8c8c <_write_lock+14>: retq 0xffffffff812f8c8d <__ticket_spin_unlock>: incw (%rdi) 0xffffffff812f8c90 <raw_local_irq_restore>: mov %rsi,%rdi 0xffffffff812f8c93 <raw_local_irq_restore+3>: callq *0xffffffff814629b8 (gdb) si 0xffffffff812f8c7b 114 asm volatile(LOCK_PREFIX "xaddl %0, %1 " (gdb) si 0xffffffff812f8c72 114 asm volatile(LOCK_PREFIX "xaddl %0, %1 " (gdb) x/10i $rip 0xffffffff812f8c72 <__ticket_spin_lock+15>: cmp %eax,%edx 0xffffffff812f8c74 <__ticket_spin_lock+17>: je 0xffffffff812f8c7d <_spin_lock+26> 0xffffffff812f8c76 <__ticket_spin_lock+19>: pause 0xffffffff812f8c78 <__ticket_spin_lock+21>: movzwl (%rdi),%edx 0xffffffff812f8c7b <__ticket_spin_lock+24>: jmp 0xffffffff812f8c72 <__ticket_spin_lock+15> 0xffffffff812f8c7d <_spin_lock+26>: retq 0xffffffff812f8c7e <__raw_write_lock>: subl $0x1000000,%ds:(%rdi) 0xffffffff812f8c85 <__raw_write_lock+7>: je 0xffffffff812f8c8c <_write_lock+14> 0xffffffff812f8c87 <__raw_write_lock+9>: callq 0xffffffff81193b80 0xffffffff812f8c8c <_write_lock+14>: retq (gdb) info register rax 0x1 1 rbx 0x0 0 rcx 0xfffedb34 
4294892340 rdx 0x0 0 rsi 0xffff88003fdb93a8 -131940323978328 rdi 0xffffffff815d13c0 -2124606528 rbp 0xffff88003fdb93a8 0xffff88003fdb93a8 rsp 0xffff88003fdb92f0 0xffff88003fdb92f0 r8 0x2 2 r9 0xffff88003fdb4000 -131940323999744 r10 0xffff880000000000 -131941395333120 r11 0x202 514 r12 0xffffffff813a8798 -2126870632 r13 0x4 4 r14 0x3 3 r15 0x0 0 rip 0xffffffff812f8c72 0xffffffff812f8c72 <__ticket_spin_lock+15> eflags 0x1397 [ CF PF AF SF TF IF #12 ] cs 0xe033 57395 ss 0xe02b 57387 ds 0x0 0 es 0x0 0 fs 0x0 0 gs 0x0 0 fctrl 0x37f 895 fstat 0x0 0 ftag 0xffff 65535 fiseg 0x0 0 ---Type <return> to continue, or q <return> to quit--- fioff 0x0 0 foseg 0x0 0 fooff 0x0 0 fop 0x0 0 mxcsr 0x1f80 [ IM DM ZM OM UM PM ] (gdb) However, why did the kernel OOPS in the first place? (gdb) frame 7 #7 no_context (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:658 658 show_fault_oops(regs, error_code, address); (gdb) p/x *regs $10 = {r15 = 0x5398d646e415d, r14 = 0xd5de35ab, r13 = 0x2, r12 = 0xc82198774ef00, bp = 0x1, bx = 0xffff8800032e6040, r11 = 0x202, r10 = 0xffff880000000000, r9 = 0xffffffff813a87b8, r8 = 0x20, ax = 0x5398d90ee1210, cx = 0xf2572b01, dx = 0x5398d9113c2e5, si = 0xffffffff813a87b8, di = 0xffff8800032e6040, orig_ax = 0xffffffffffffffff, ip = 0xffffffff8102bddf, cs = 0x10000e030, flags = 0x10002, sp = 0xffff88003fdb9640, ss = 0xe02b} (gdb) p error_code $11 = 3 (gdb) p/x address $22 = 0xffffffff8130cd18 (gdb) x/6i regs->ip 0xffffffff8102bddf <__cmpxchg>: cmpxchg %rdx,%ds:0x2e0f30(%rip) # 0xffffffff8130cd18 0xffffffff8102bde8 <pvclock_clocksource_read+122>: cmp %rdx,%rax 0xffffffff8102bdeb <pvclock_clocksource_read+125>: jne 0xffffffff8102bdda <pvclock_clocksource_read+108> 0xffffffff8102bded <pvclock_clocksource_read+127>: pop %rcx 0xffffffff8102bdee <pvclock_clocksource_read+128>: pop %rbx 0xffffffff8102bdef 
<pvclock_clocksource_read+129>: pop %rbp
(gdb) shell objdump -axdt /local/lindi/vmlinux-2.6.32-5-amd64 | grep ffffffff8130cd18
ffffffff8130cd18 l     O .rodata  0000000000000008 last_value
ffffffff8102bdd3:  48 8b 05 3e 0f 2e 00   mov    0x2e0f3e(%rip),%rax        # ffffffff8130cd18 <last_value>
ffffffff8102bddf:  f0 48 0f b1 15 30 0f   lock cmpxchg %rdx,0x2e0f30(%rip)  # ffffffff8130cd18 <last_value>

Archive: http://lists.debian.org/8439vjip2l.fsf@sauna.l.org
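
For anyone decoding the fault by hand, here is a minimal sketch that spells out what error_code = 3 means. It assumes the usual x86 page-fault error code layout (the same bits the kernel names PF_PROT, PF_WRITE, PF_USER, PF_RSVD and PF_INSTR in arch/x86/mm/fault.c). The SK_ constants and sk_describe_fault() are hypothetical names made up for this illustration; they are not kernel or gdb APIs.

```c
#include <stddef.h>
#include <stdio.h>

/* Bits of the x86 page-fault error code. The SK_ prefix marks these as
 * illustration-only names; the kernel calls them PF_PROT, PF_WRITE, ... */
enum {
    SK_PF_PROT  = 1 << 0, /* 0: page not present, 1: protection violation */
    SK_PF_WRITE = 1 << 1, /* 0: read access,      1: write access         */
    SK_PF_USER  = 1 << 2, /* 0: kernel mode,      1: user mode            */
    SK_PF_RSVD  = 1 << 3, /* reserved bit set in a page-table entry       */
    SK_PF_INSTR = 1 << 4  /* fault happened on an instruction fetch       */
};

/* Write a human-readable description of error_code into buf, return buf. */
const char *sk_describe_fault(unsigned long error_code, char *buf, size_t len)
{
    snprintf(buf, len, "%s, %s, %s mode%s%s",
             (error_code & SK_PF_PROT)  ? "protection violation" : "page not present",
             (error_code & SK_PF_WRITE) ? "write" : "read",
             (error_code & SK_PF_USER)  ? "user" : "kernel",
             (error_code & SK_PF_RSVD)  ? ", reserved bit set" : "",
             (error_code & SK_PF_INSTR) ? ", instruction fetch" : "");
    return buf;
}
```

Under these assumptions, error_code = 3 decodes to a kernel-mode write that hit a mapped but write-protected page, which fits a faulting address inside .rodata. The decimal address shown in frame #8 (18446744071582043416) is the same value as 0xffffffff8130cd18.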
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Here's some debugging output. I installed
linux-image-2.6.32-5-amd64-dbg and linux-source-2.6.32 and then used sudo gdbserver-xen 127.0.0.1:9999 --attach 37 where 37 is the domid printed by xm list and sudo gdb /local/lindi/vmlinux-2.6.32-5-amd64 (gdb) target remote 127.0.0.1:9999 on the opensuse dom0 to attach gdb to linux. It seems that Linux (-17) is stuck trying to print an OOPS: (gdb) info threads * 1 Remote target 0xffffffff812f8c78 in __ticket_spin_lock ( lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:114 (gdb) bt full #0 0xffffffff812f8c78 in __ticket_spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:114 inc = 1 tmp = 0 #1 __raw_spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/include/asm/spinlock.h:190 No locals. #2 __spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/include/linux/spinlock_api_smp.h:337 No locals. #3 _spin_lock (lock=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/spinlock.c:138 No locals. 
#4 0xffffffff8104d344 in vprintk ( fmt=0xffffffff813a8798 "<1>BUG: unable to handle kernel ", args=0xffff88003fdb93a8) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/printk.c:705 printed_len = 0 current_log_level = 4 flags = 0 p = <value optimized out> #5 0xffffffff812f6eec in printk (fmt=0xffffffff815d13c0 "") at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/printk.c:595 ---Type <return> to continue, or q <return> to quit--- args = {{gp_offset = 8, fp_offset = 48, overflow_arg_area = 0xffff88003fdb9488, reg_save_area = 0xffff88003fdb93c8}} r = 1 #6 0xffffffff810311b0 in show_fault_oops (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:583 No locals. #7 no_context (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:658 tsk = 0xffff88003fdb4000 flags = 0 sig = <value optimized out> #8 0xffffffff81031429 in __bad_area_nosemaphore (regs=0xffff88003fdb9598, error_code=<value optimized out>, address=18446744071582043416, si_code=196609) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:741 tsk = 0xffff88003fdb4000 #9 0xffffffff812f9015 in ?? () at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/kernel/entry_64.S:1383 No locals. #10 0x0005398d646e415d in ?? () No symbol table info available. #11 0x00000000d5de35ab in ?? () ---Type <return> to continue, or q <return> to quit--- No symbol table info available. #12 0x0000000000000002 in ?? () No symbol table info available. #13 0x000c82198774ef00 in ?? 
() No symbol table info available. #14 0x0000000000000001 in ?? () No symbol table info available. #15 0xffff8800032e6040 in ?? () No symbol table info available. #16 0x0000000000000202 in ?? () No symbol table info available. #17 0xffff880000000000 in ?? () No symbol table info available. #18 0xffffffff813a87b8 in ?? () __kcrctab_unregister_net_sysctl_table = 2623496682 __kstrtab_unregister_net_sysctl_table = "unregister_net_sysctl_table" net_sysctl_ro_root = {root_list = {next = 0xffffffff814649b0, prev = 0xffffffff814a6350}, default_set = {list = { next = 0xffff88003fe34c08, prev = 0xffff88003d8dd808}, parent = 0xffffffff814649c0, is_seen = 0}, lookup = 0, permissions = 0xffffffff812e4313 <net_ctl_ro_header_perms>} sysctl_pernet_ops = {list = {next = 0xffffffff8149bcc0, prev = 0xffffffff814a6140}, init = 0xffffffff812e4323 <sysctl_net_init>, exit = 0xffffffff812e4375 <sysctl_net_exit>} __ksymtab_register_net_sysctl_table = {value = 18446744071581877167, name = 0xffffffff81422f46 "register_net_sysctl_table"} __kstrtab_register_net_sysctl_table = "register_net_sysctl_table" __initcall_sysctl_init4 = 0 ---Type <return> to continue, or q <return> to quit--- __kcrctab_register_net_sysctl_table = 829491708 __ksymtab_register_net_sysctl_rotable = {value = 18446744071581877142, name = 0xffffffff81422f2a "register_net_sysctl_rotable"} __ksymtab_unregister_net_sysctl_table = {value = 18446744071581877137, name = 0xffffffff81422f0e "unregister_net_sysctl_table"} net_sysctl_root = {root_list = {next = 0xffffffff814a6390, prev = 0xffffffff814649b0}, default_set = {list = {next = 0x0, prev = 0x0}, parent = 0x0, is_seen = 0}, lookup = 0xffffffff812e42e8 <net_ctl_header_lookup>, permissions = 0xffffffff812e4342 <net_ctl_permissions>} __kcrctab_register_net_sysctl_rotable = 2528268687 __kstrtab_register_net_sysctl_rotable = "register_net_sysctl_rotable" #19 0x0000000000000020 in ?? () No symbol table info available. #20 0x0005398d90ee1210 in ?? () No symbol table info available. 
#21 0x00000000f2572b01 in ?? () No symbol table info available. #22 0x0005398d9113c2e5 in ?? () No symbol table info available. #23 0xffffffff813a87b8 in ?? () __kcrctab_unregister_net_sysctl_table = 2623496682 __kstrtab_unregister_net_sysctl_table = "unregister_net_sysctl_table" net_sysctl_ro_root = {root_list = {next = 0xffffffff814649b0, prev = 0xffffffff814a6350}, default_set = {list = { next = 0xffff88003fe34c08, prev = 0xffff88003d8dd808}, parent = 0xffffffff814649c0, is_seen = 0}, lookup = 0, permissions = 0xffffffff812e4313 <net_ctl_ro_header_perms>} sysctl_pernet_ops = {list = {next = 0xffffffff8149bcc0, ---Type <return> to continue, or q <return> to quit--- prev = 0xffffffff814a6140}, init = 0xffffffff812e4323 <sysctl_net_init>, exit = 0xffffffff812e4375 <sysctl_net_exit>} __ksymtab_register_net_sysctl_table = {value = 18446744071581877167, name = 0xffffffff81422f46 "register_net_sysctl_table"} __kstrtab_register_net_sysctl_table = "register_net_sysctl_table" __initcall_sysctl_init4 = 0 __kcrctab_register_net_sysctl_table = 829491708 __ksymtab_register_net_sysctl_rotable = {value = 18446744071581877142, name = 0xffffffff81422f2a "register_net_sysctl_rotable"} __ksymtab_unregister_net_sysctl_table = {value = 18446744071581877137, name = 0xffffffff81422f0e "unregister_net_sysctl_table"} net_sysctl_root = {root_list = {next = 0xffffffff814a6390, prev = 0xffffffff814649b0}, default_set = {list = {next = 0x0, prev = 0x0}, parent = 0x0, is_seen = 0}, lookup = 0xffffffff812e42e8 <net_ctl_header_lookup>, permissions = 0xffffffff812e4342 <net_ctl_permissions>} __kcrctab_register_net_sysctl_rotable = 2528268687 __kstrtab_register_net_sysctl_rotable = "register_net_sysctl_rotable" #24 0xffff8800032e6040 in ?? () No symbol table info available. #25 0xffffffffffffffff in ?? () No symbol table info available. 
#26 0xffffffff8102bddf in pvclock_clocksource_read (src=0x1) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/kernel/pvclock.c:145 ret = 0 offset = <value optimized out> last = 1 ---Type <return> to continue, or q <return> to quit--- #27 0x0000000000000000 in ?? () No symbol table info available. We probably should figure out what caused the OOPS so the following information about printk being stuck is probably not very relevant: (gdb) x/10i $rip 0xffffffff812f8c78 <__ticket_spin_lock+21>: movzwl (%rdi),%edx 0xffffffff812f8c7b <__ticket_spin_lock+24>: jmp 0xffffffff812f8c72 <__ticket_spin_lock+15> 0xffffffff812f8c7d <_spin_lock+26>: retq 0xffffffff812f8c7e <__raw_write_lock>: subl $0x1000000,%ds:(%rdi) 0xffffffff812f8c85 <__raw_write_lock+7>: je 0xffffffff812f8c8c <_write_lock+14> 0xffffffff812f8c87 <__raw_write_lock+9>: callq 0xffffffff81193b80 0xffffffff812f8c8c <_write_lock+14>: retq 0xffffffff812f8c8d <__ticket_spin_unlock>: incw (%rdi) 0xffffffff812f8c90 <raw_local_irq_restore>: mov %rsi,%rdi 0xffffffff812f8c93 <raw_local_irq_restore+3>: callq *0xffffffff814629b8 (gdb) si 0xffffffff812f8c7b 114 asm volatile(LOCK_PREFIX "xaddl %0, %1 " (gdb) si 0xffffffff812f8c72 114 asm volatile(LOCK_PREFIX "xaddl %0, %1 " (gdb) x/10i $rip 0xffffffff812f8c72 <__ticket_spin_lock+15>: cmp %eax,%edx 0xffffffff812f8c74 <__ticket_spin_lock+17>: je 0xffffffff812f8c7d <_spin_lock+26> 0xffffffff812f8c76 <__ticket_spin_lock+19>: pause 0xffffffff812f8c78 <__ticket_spin_lock+21>: movzwl (%rdi),%edx 0xffffffff812f8c7b <__ticket_spin_lock+24>: jmp 0xffffffff812f8c72 <__ticket_spin_lock+15> 0xffffffff812f8c7d <_spin_lock+26>: retq 0xffffffff812f8c7e <__raw_write_lock>: subl $0x1000000,%ds:(%rdi) 0xffffffff812f8c85 <__raw_write_lock+7>: je 0xffffffff812f8c8c <_write_lock+14> 0xffffffff812f8c87 <__raw_write_lock+9>: callq 0xffffffff81193b80 0xffffffff812f8c8c <_write_lock+14>: retq (gdb) info register rax 0x1 1 rbx 0x0 0 rcx 0xfffedb34 
4294892340 rdx 0x0 0 rsi 0xffff88003fdb93a8 -131940323978328 rdi 0xffffffff815d13c0 -2124606528 rbp 0xffff88003fdb93a8 0xffff88003fdb93a8 rsp 0xffff88003fdb92f0 0xffff88003fdb92f0 r8 0x2 2 r9 0xffff88003fdb4000 -131940323999744 r10 0xffff880000000000 -131941395333120 r11 0x202 514 r12 0xffffffff813a8798 -2126870632 r13 0x4 4 r14 0x3 3 r15 0x0 0 rip 0xffffffff812f8c72 0xffffffff812f8c72 <__ticket_spin_lock+15> eflags 0x1397 [ CF PF AF SF TF IF #12 ] cs 0xe033 57395 ss 0xe02b 57387 ds 0x0 0 es 0x0 0 fs 0x0 0 gs 0x0 0 fctrl 0x37f 895 fstat 0x0 0 ftag 0xffff 65535 fiseg 0x0 0 ---Type <return> to continue, or q <return> to quit--- fioff 0x0 0 foseg 0x0 0 fooff 0x0 0 fop 0x0 0 mxcsr 0x1f80 [ IM DM ZM OM UM PM ] (gdb) However, it seems that the OOPS is caused by a write (error_code==3) to variable called "last_value" that is in read-only memory (.rodata): (gdb) frame 7 #7 no_context (address=<value optimized out>, error_code=<value optimized out>, regs=<value optimized out>) at /build/buildd-linux-2.6_2.6.32-17-amd64-FO3n8v/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/fault.c:658 658 show_fault_oops(regs, error_code, address); (gdb) p/x *regs $10 = {r15 = 0x5398d646e415d, r14 = 0xd5de35ab, r13 = 0x2, r12 = 0xc82198774ef00, bp = 0x1, bx = 0xffff8800032e6040, r11 = 0x202, r10 = 0xffff880000000000, r9 = 0xffffffff813a87b8, r8 = 0x20, ax = 0x5398d90ee1210, cx = 0xf2572b01, dx = 0x5398d9113c2e5, si = 0xffffffff813a87b8, di = 0xffff8800032e6040, orig_ax = 0xffffffffffffffff, ip = 0xffffffff8102bddf, cs = 0x10000e030, flags = 0x10002, sp = 0xffff88003fdb9640, ss = 0xe02b} (gdb) p error_code $11 = 3 (gdb) p/x address $22 = 0xffffffff8130cd18 (gdb) x/6i regs->ip 0xffffffff8102bddf <__cmpxchg>: cmpxchg %rdx,%ds:0x2e0f30(%rip) # 0xffffffff8130cd18 0xffffffff8102bde8 <pvclock_clocksource_read+122>: cmp %rdx,%rax 0xffffffff8102bdeb <pvclock_clocksource_read+125>: jne 0xffffffff8102bdda <pvclock_clocksource_read+108> 0xffffffff8102bded <pvclock_clocksource_read+127>: pop 
%rcx 0xffffffff8102bdee <pvclock_clocksource_read+128>: pop %rbx 0xffffffff8102bdef <pvclock_clocksource_read+129>: pop %rbp (gdb) shell objdump -axdt /local/lindi/vmlinux-2.6.32-5-amd64 | grep ffffffff8130cd18 ffffffff8130cd18 l O .rodata 0000000000000008 last_value ffffffff8102bdd3: 48 8b 05 3e 0f 2e 00 mov 0x2e0f3e(%rip),%rax # ffffffff8130cd18 <last_value> ffffffff8102bddf: f0 48 0f b1 15 30 0f lock cmpxchg %rdx,0x2e0f30(%rip) # ffffffff8130cd18 <last_value> What is interesting is that -15 does not contain such a variable: $ objdump -axdt 15/usr/lib/debug/boot/vmlinux-2.6.32-5-amd64|grep last_value ffffffff81671780 l O .bss 0000000000000001 last_value.26980 ffffffff811f5947: 0f b6 05 32 be 47 00 movzbl 0x47be32(%rip),%eax # ffffffff81671780 <last_value.26980> ffffffff811f5955: 88 15 25 be 47 00 mov %dl,0x47be25(%rip) # ffffffff81671780 <last_value.26980> $ objdump -axdt 17/usr/lib/debug/boot/vmlinux-2.6.32-5-amd64|grep last_value ffffffff8130cd18 l O .rodata 0000000000000008 last_value ffffffff81675bc0 l O .bss 0000000000000001 last_value.26980 ffffffff8102bdd3: 48 8b 05 3e 0f 2e 00 mov 0x2e0f3e(%rip),%rax # ffffffff8130cd18 <last_value> ffffffff8102bddf: f0 48 0f b1 15 30 0f lock cmpxchg %rdx,0x2e0f30(%rip) # ffffffff8130cd18 <last_value> ffffffff811f49f7: 0f b6 05 c2 11 48 00 movzbl 0x4811c2(%rip),%eax # ffffffff81675bc0 <last_value.26980> ffffffff811f4a05: 88 15 b5 11 48 00 mov %dl,0x4811b5(%rip) # ffffffff81675bc0 <last_value.26980> interdiff -z linux-2.6_2.6.32-15.diff.gz linux-2.6_2.6.32-17.diff.gz shows ++static atomic64_t last_value = ATOMIC64_INIT(0); ++ + cycle_t pvclock_clocksource_read(struct pvclock_vcpu_time_info *src) + { + struct pvclock_shadow_time shadow; + unsigned version; + cycle_t ret, offset; ++ u64 last; + + do { + version = pvclock_get_time_values(&shadow, src); +@@ -123,6 +126,27 @@ cycle_t pvclock_clocksource_read(struct pvclock_vcpu_time_info *src) + barrier(); + } while (version != src->version); + ++ /* ++ * Assumption here is that 
last_value, a global accumulator, always goes
++	 * forward. If we are less than that, we should not be much smaller.
++	 * We assume there is an error marging we're inside, and then the correction
++	 * does not sacrifice accuracy.
++	 *
++	 * For reads: global may have changed between test and return,
++	 * but this means someone else updated poked the clock at a later time.
++	 * We just need to make sure we are not seeing a backwards event.
++	 *
++	 * For updates: last_value = ret is not enough, since two vcpus could be
++	 * updating at the same time, and one of them could be slightly behind,
++	 * making the assumption that last_value always go forward fail to hold.
++	 */
++	last = atomic64_read(&last_value);
++	do {
++		if (ret < last)
++			return last;
++		last = atomic64_cmpxchg(&last_value, last, ret);
++	} while (unlikely(last != ret));
++
+ 	return ret;
+ }
+

-Timo

Archive: http://lists.debian.org/84pqynh98f.fsf@sauna.l.org
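
To make the intent of the quoted hunk easier to follow outside the kernel, here is a small userspace sketch of the same idea: a shared value that only moves forward, updated with a compare-and-swap loop. It uses C11 <stdatomic.h> instead of the kernel's atomic64_t helpers, and the loop exit condition is written differently from the patch, so treat it as an analogue under those assumptions rather than as the kernel code. The names sk_last_value and sk_monotonic_read() are hypothetical.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical stand-in for the patch's global accumulator. Note that it
 * must live in writable memory, since every successful update stores to it. */
static _Atomic uint64_t sk_last_value = 0;

/* Return a reading that never goes backwards across callers.
 * `ret` is the freshly computed clock value for this caller. */
uint64_t sk_monotonic_read(uint64_t ret)
{
    uint64_t last = atomic_load(&sk_last_value);

    do {
        /* Someone already published a later time: report that instead. */
        if (ret < last)
            return last;
        /* Try to publish ret. On failure `last` is reloaded with the
         * current value and the comparison above is repeated. */
    } while (!atomic_compare_exchange_weak(&sk_last_value, &last, ret));

    return ret;
}
```

Calling sk_monotonic_read(10), then sk_monotonic_read(5), then sk_monotonic_read(12) returns 10, 10 and 12: the second caller is clamped to the value already published. The store performed by the compare-and-swap is the kind of write that faults when the variable ends up in a read-only section.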
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
Timo Juhani Lindfors <timo.lindfors@iki.fi> writes:

> However, it seems that the OOPS is caused by a write (error_code==3)
> to variable called "last_value" that is in read-only memory (.rodata):

Yes, Linus has already explained why here:
http://permalink.gmane.org/gmane.linux.kernel/1009506

Note that the patch he posts there isn't complete, and isn't fixing the
relevant part for the above problem, but the explanation is still valid.

Bjørn

Archive: http://lists.debian.org/87pqynh70z.fsf@nemi.mork.no
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
I changed

    static atomic64_t last_value = ATOMIC64_INIT(0);

to

    volatile static atomic64_t last_value = ATOMIC64_INIT(0);

but then Linux got stuck in

(gdb) bt 8
#0  panic (fmt=0x0) at /local/lindi/debian-linux-2.6/linux-2.6-2.6.32/debian/build/source_amd64_none/kernel/panic.c:137
#1  0xffffffff812e550f in find_early_table_space (use_gbpages=<value optimized out>, use_pse=<value optimized out>, end=<value optimized out>) at /local/lindi/debian-linux-2.6/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/init.c:80
#2  init_memory_mapping (use_gbpages=<value optimized out>, use_pse=<value optimized out>, end=<value optimized out>) at /local/lindi/debian-linux-2.6/linux-2.6-2.6.32/debian/build/source_amd64_none/arch/x86/mm/init.c:271
#3  0xffffffff814f0e25 in ?? ()
#4  0xffffffff814ed9d0 in start_kernel () at /local/lindi/debian-linux-2.6/linux-2.6-2.6.32/debian/build/source_amd64_none/init/main.c:546
#5  0xffffffff8307f000 in ?? ()
#6  0xffffffff83080000 in ?? ()
#7  0xffffffff83081000 in ?? ()
(More stack frames follow...)

without printing anything. The check in question is probably

    e820_table_start = find_e820_area(start, max_pfn_mapped<<PAGE_SHIFT,
                                      tables, PAGE_SIZE);
    if (e820_table_start == -1UL)
        panic("Cannot find space for the kernel page tables");

Any idea what is going on?

-Timo

Archive: http://lists.debian.org/84lj9bgxoy.fsf@sauna.l.org
Bug#588509: linux-image-2.6.32-5-amd64: hangs under Xen after Write protecting the kernel read-only data: 4220k
package linux-image-2.6.32-5-amd64
tags 588509 patch
thanks

The trouble I had in message #47 was caused by accidental mem=nopentium option that got enabled when I installed my modified package. When I remove that I can boot:

[ 0.000000] Initializing cgroup subsys cpuset
[ 0.000000] Initializing cgroup subsys cpu
[ 0.000000] Linux version 2.6.32-5-amd64 (Debian 2.6.32-17+lindi1) (lindi@kurp.hut.fi) (gcc version 4.3.5 (Debian 4.3.5-1) ) #1 SMP Fri Jul 16 18:55:57 EEST 2010
[ 0.000000] Command line: root=/dev/xvda ro console=hvc0
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Centaur CentaurHauls
[ 0.000000] ACPI in unprivileged domain disabled
[ 0.000000] BIOS-provided physical RAM map:
[ 0.000000] Xen: 0000000000000000 - 00000000000a0000 (usable)
[ 0.000000] Xen: 00000000000a0000 - 0000000000100000 (reserved)
[ 0.000000] Xen: 0000000000100000 - 0000000040000000 (usable)
[ 0.000000] DMI not present or invalid.
[ 0.000000] last_pfn = 0x40000 max_arch_pfn = 0x400000000
[ 0.000000] init_memory_mapping: 0000000000000000-0000000040000000
[ 0.000000] RAMDISK: 016a3000 - 02fb5000
[ 0.000000] No NUMA configuration found
[ 0.000000] Faking a node at 0000000000000000-0000000040000000
[ 0.000000] Bootmem setup node 0 0000000000000000-0000000040000000
[ 0.000000] NODE_DATA [0000000000008000 - 000000000000ffff]
[ 0.000000] bootmap [0000000000010000 - 0000000000017fff] pages 8
[ 0.000000] (7 early reservations) ==> bootmem [0000000000 - 0040000000]
[ 0.000000] #0 [0000000000 - 0000001000] BIOS data page ==> [0000000000 - 0000001000]
[ 0.000000] #1 [00031b8000 - 00031d5000] XEN PAGETABLES ==> [00031b8000 - 00031d5000]
[ 0.000000] #2 [0000006000 - 0000008000] TRAMPOLINE ==> [0000006000 - 0000008000]
[ 0.000000] #3 [0001000000 - 0001682794] TEXT DATA BSS ==> [0001000000 - 0001682794]
[ 0.000000] #4 [00016a3000 - 0002fb5000] RAMDISK ==> [00016a3000 - 0002fb5000]
[ 0.000000] #5 [0002fb5000 - 00031b8000] XEN START INFO ==> [0002fb5000 - 00031b8000]
[ 0.000000] #6 [0000100000 - 00002e2000] PGTABLE ==> [0000100000 - 00002e2000]
[ 0.000000] Zone PFN ranges:
[ 0.000000] DMA 0x00000000 -> 0x00001000
[ 0.000000] DMA32 0x00001000 -> 0x00100000
[ 0.000000] Normal 0x00100000 -> 0x00100000
[ 0.000000] Movable zone start PFN for each node
[ 0.000000] early_node_map[2] active PFN ranges
[ 0.000000] 0: 0x00000000 -> 0x000000a0
[ 0.000000] 0: 0x00000100 -> 0x00040000
[ 0.000000] SFI: Simple Firmware Interface v0.7 http://simplefirmware.org
[ 0.000000] SMP: Allowing 1 CPUs, 0 hotplug CPUs
[ 0.000000] No local APIC present
[ 0.000000] APIC: disable apic facility
[ 0.000000] PM: Registered nosave memory: 00000000000a0000 - 0000000000100000
[ 0.000000] Allocating PCI resources starting at 40000000 (gap: 40000000:c0000000)
[ 0.000000] Booting paravirtualized kernel on Xen
[ 0.000000] Xen version: 3.4.1_19718_04-2.1 (preserve-AD)
[ 0.000000] NR_CPUS:512 nr_cpumask_bits:512 nr_cpu_ids:1 nr_node_ids:1
[ 0.000000] PERCPU: Embedded 30 pages/cpu @ffff88000320b000 s90328 r8192 d24360 u122880
[ 0.000000] pcpu-alloc: s90328 r8192 d24360 u122880 alloc=30*4096
[ 0.000000] pcpu-alloc: [0] 0
[ 0.000000] Xen: using vcpu_info placement
[ 0.000000] Built 1 zonelists in Node order, mobility grouping on. Total pages: 257979
[ 0.000000] Policy zone: DMA32
[ 0.000000] Kernel command line: root=/dev/xvda ro console=hvc0
[ 0.000000] PID hash table entries: 4096 (order: 3, 32768 bytes)
[ 0.000000] Initializing CPU#0
[ 0.000000] Checking aperture...
[ 0.000000] No AGP bridge found
[ 0.000000] Memory: 996936k/1048576k available (3067k kernel code, 384k absent, 51256k reserved, 1879k data, 588k init)
[ 0.000000] SLUB: Genslabs=14, HWalign=64, Order=0-3, MinObjects=0, CPUs=1, Nodes=1
[ 0.000000] Hierarchical RCU implementation.
[ 0.000000] NR_IRQS:4352 nr_irqs:256
[ 0.000000] Console: colour dummy device 80x25
[ 0.000000] console [tty0] enabled
[ 0.000000] console [hvc0] enabled
[ 0.000000] installing Xen timer for CPU 0
[ 0.000000] Detected 2393.998 MHz processor.
[ 0.004000] Calibrating delay loop (skipped), value calculated using timer frequency.. 4787.99 BogoMIPS (lpj=9575992)
[ 0.004000] Security Framework initialized
[ 0.004000] SELinux: Disabled at boot.
[ 0.004000] Dentry cache hash table entries: 131072 (order: 8, 1048576 bytes)
[ 0.004000] Inode-cache hash table entries: 65536 (order: 7, 524288 bytes)
[ 0.004000] Mount-cache hash table entries: 256
[ 0.004000] Initializing cgroup subsys ns
[ 0.004000] Initializing cgroup subsys cpuacct
[ 0.004000] Initializing cgroup subsys devices
[ 0.004000] Initializing cgroup subsys freezer
[ 0.004000] Initializing cgroup subsys net_cls
[ 0.004000] CPU: L1 I cache: 32K, L1 D cache: 32K
[ 0.004000] CPU: L2 cache: 4096K
[ 0.004000] CPU 0/0x3 -> Node 0
[ 0.004000] CPU: Unsupported number of siblings 4
[ 0.004000] Performance Events: unsupported p6 CPU model 15 no PMU driver, software events only.
[ 0.004000] SMP alternatives: switching to UP code
[ 0.004000] Freeing SMP alternatives: 28k freed
[ 0.004181] Brought up 1 CPUs
[ 0.004304] devtmpfs: initialized
[ 0.007061] Grant table initialized
[ 0.007066] regulator: core version 0.5
[ 0.007120] NET: Registered protocol family 16
[ 0.007244] PCI: Fatal: No config space access function found
[ 0.008390] bio: create slab <bio-0> at 0
[ 0.008462] ACPI: Interpreter disabled.
[ 0.008487] xen_balloon: Initialising balloon driver.
[ 0.008487] vgaarb: loaded
[ 0.008487] PCI: System does not support PCI
[ 0.008487] PCI: System does not support PCI
[ 0.008487] Switching to clocksource xen
[ 0.009590] pnp: PnP ACPI: disabled
[ 0.009805] NET: Registered protocol family 2
[ 0.009937] IP route cache hash table entries: 32768 (order: 6, 262144 bytes)
[ 0.010591] TCP established hash table entries: 131072 (order: 9, 2097152 bytes)
[ 0.011346] TCP bind hash table entries: 65536 (order: 8, 1048576 bytes)
[ 0.011643] TCP: Hash tables configured (established 131072 bind 65536)
[ 0.011650] TCP reno registered
[ 0.011799] NET: Registered protocol family 1
[ 0.011866] Unpacking initramfs...
[ 0.039069] Freeing initrd memory: 25672k freed
[ 0.049563] platform rtc_cmos: registered platform RTC device (no PNP device found)
[ 0.049801] audit: initializing netlink socket (disabled)
[ 0.049818] type=2000 audit(1279303811.069:1): initialized
[ 0.052432] HugeTLB registered 2 MB page size, pre-allocated 0 pages
[ 0.053762] VFS: Disk quotas dquot_6.5.2
[ 0.053811] Dquot-cache hash table entries: 512 (order 0, 4096 bytes)
[ 0.053901] msgmni has been set to 1997
[ 0.054089] alg: No test for stdrng (krng)
[ 0.054144] Block layer SCSI generic (bsg) driver version 0.4 loaded (major 253)
[ 0.054152] io scheduler noop registered
[ 0.054157] io scheduler anticipatory registered
[ 0.054161] io scheduler deadline registered
[ 0.054198] io scheduler cfq registered (default)
[ 0.055711] Linux agpgart interface v0.103
[ 0.055745] Serial: 8250/16550 driver, 4 ports, IRQ sharing enabled
[ 0.175044] input: Macintosh mouse button emulation as /devices/virtual/input/input0
[ 0.175098] PNP: No PS/2 controller found. Probing ports directly.
[ 0.175919] i8042.c: No controller found.
[ 0.175982] mice: PS/2 mouse device common for all mice
[ 0.176044] rtc_cmos: probe of rtc_cmos failed with error -16
[ 0.176070] cpuidle: using governor ladder
[ 0.176075] cpuidle: using governor menu
[ 0.176082] No iBFT detected.
[ 0.176320] TCP cubic registered
[ 0.176427] NET: Registered protocol family 10
[ 0.176811] lo: Disabled Privacy Extensions
[ 0.177016] Mobile IPv6
[ 0.177022] NET: Registered protocol family 17
[ 0.177115] registered taskstats version 1
[ 0.177129] XENBUS: Device with no driver: device/vbd/51712
[ 0.177135] XENBUS: Device with no driver: device/vbd/51728
[ 0.177140] XENBUS: Device with no driver: device/vif/0
[ 0.177144] XENBUS: Device with no driver: device/console/0
[ 0.177160] /local/lindi/debian-linux-2.6/linux-2.6-2.6.32/debian/build/source_amd64_none/drivers/rtc/hctosys.c: unable to open rtc device (rtc0)
[ 0.177180] Initalizing network drop monitor service
[ 0.177242] Freeing unused kernel memory: 588k freed
[ 0.177423] Write protecting the kernel read-only data: 4212k
Loading, please wait...
[ 0.211975] udev: starting version 158
[ 0.235828] Initialising Xen virtual ethernet driver.
[ 0.285814] blkfront: xvda: barriers enabled
[ 0.286096] xvda: unknown partition table
[ 0.294449] blkfront: xvdb: barriers enabled
[ 0.294637] xvdb: unknown partition table
Begin: Loading essential drivers ... done.
Begin: Running /scripts/init-premount ... done.
Begin: Mounting root file system ... Begin: Running /scripts/local-top ... done.
Begin: Running /scripts/local-premount ... done.
[ 0.602866] kjournald starting. Commit interval 5 seconds
[ 0.602884] EXT3-fs: mounted filesystem with ordered data mode.
Begin: Running /scripts/local-bottom ... done.
done.
Begin: Running /scripts/init-bottom ... done.
INIT: version 2.88 booting
Using makefile-style concurrent boot in runlevel S.
Starting the hotplug events dispatcher: udevd[ 1.989980] udev: starting version 158
.
Synthesizing the initial hotplug events...done.
Waiting for /dev to be fully populated...[ 2.356664] input: PC Speaker as /devices/platform/pcspkr/input/input1
done.
Setting parameters of disc: (none).
@Activating swap...[ 3.469274] Adding 6291448k swap on /dev/xvdb. Priority:-1 extents:1 across:6291448k SS
done.
Checking root file system...fsck from util-linux-ng 2.17.2
/dev/xvda: clean, 584600/1638400 files, 5684183/6553600 blocks
done.
[ 3.678095] EXT3 FS on xvda, internal journal
Cleaning up ifupdown....
Setting up networking....
[ 3.825556] loop: module loaded
Loading kernel modules...done.
Activating lvm and md swap...done.
Checking file systems...fsck from util-linux-ng 2.17.2
done.
Mounting local filesystems...done.
Activating swapfile swap...done.
Cleaning up temporary files....
Setting kernel variables ...done.
Configuring network interfaces...[ 5.285452] Bridge firewalling registered
[ 5.314820] device eth0 entered promiscuous mode
[ 5.323739] br0: port 1(eth0) entering learning state
Waiting for br0 to get ready (MAXWAIT is 20 seconds).
[ 14.320018] br0: port 1(eth0) entering forwarding state
Starting portmap daemon....
Starting NFS common utilities: statd.
mount.nfs: Failed to resolve server nfs: Temporary failure in name resolution
done.
Starting portmap daemon...Already running..
Starting NFS common utilities: statd.
Cleaning up temporary files....
Setting up ALSA...done (none loaded).
Setting console screen modes and fonts.
cannot (un)set powersave mode
9;30]14;30][ 15.147035] fuse init (API version 7.13)
Recovering schroot sessions:.
Running scripts in rcS.d/ took 14 seconds.
INIT: Entering runlevel: 2
Using makefile-style concurrent boot in runlevel 2.
CPUFreq Utilities: Setting ondemand CPUFreq governor...disabled, governor not available...done.
Starting NFS common utilities: statd.
Starting portmap daemon...Already running..
Starting enhanced syslogd: rsyslogd.
Starting web server: apache2apache2: apr_sockaddr_info_get() failed for lindi1
apache2: Could not reliably determine the server's fully qualified domain name, using 127.0.0.1 for ServerName
.
Starting domain name service...: bind9.
Starting periodic command scheduler: cron.
Starting system message bus: dbus.
Starting MTA:Starting Nagios Service Check Acceptor: nsca.
exim4.
Starting NTP server: ntpd.
Starting OpenBSD Secure Shell server: sshd.
Starting nagios3 monitoring daemon: nagios3.
[ 19.902202] RPC: Registered udp transport module.
[ 19.902214] RPC: Registered tcp transport module.
[ 19.902219] RPC: Registered tcp NFSv4.1 backchannel transport module.
[ 19.988412] Slow work thread pool: Starting up
[ 19.988473] Slow work thread pool: Ready
[ 19.988532] FS-Cache: Loaded
[ 20.314792] FS-Cache: Netfs 'nfs' registered for caching
[ 20.360724] svc: failed to register lockdv1 RPC service (errno 97).
Running scripts in rc2.d/ took 5 seconds.

Debian GNU/Linux squeeze/sid lindi1 hvc0

lindi1 login:

I cannot test kprobes support: as mentioned in http://wiki.debian.org/HowToRebuildAnOfficialDebianKernelPackage, I cannot generate my own linux-headers-2.6.32-5-common package unless I build all kernel-related packages, and I have no disk space for that on the squeeze machine right now.

-Timo