Re: [2.6.17-rc5-mm2] crash when doing second suspend: BUG in arch/i386/kernel/nmi.c:174

From: Rafael J. Wysocki
Date: Sun Jun 04 2006 - 07:51:09 EST


Hi,

On Saturday 03 June 2006 00:51, Jeremy Fitzhardinge wrote:
> I'm trying to get suspend/resume working properly on my Thinkpad X60.
> This is a dual-core machine, so its running in SMP mode.
>
> Now that I have a set of patches to make AHCI resume properly, I'm
> getting a crash on the second suspend. I can't get an actual listing of
> the oops, but I have a set of screenshots if anyone needs more details.
>
> The gist is that there's a BUG_ON failing at arch/i386/kernel/nmi.c:174
> (BUG_ON(counter > NMI_MAX_COUNTER_BITS)), in release_evntsel_nmi. The
> backtrace is:
>
> release_evntsel_nmi
> stop_apci_nmi_watchdog
> on_each_cpu
> disable_lapic_nmi_watchdog
> lapic_nmi_suspend
> sysdev_suspend
> device_power_down
> suspend_enter
> enter_state
> state_store
> subsys_attr_store
> sysfs_write_file
> vfs_write
> sys_write
> sysenter_past_esp
>
> This happens after all the devices have suspended themselves; then
> there's a longish pause (several seconds), and the oops appears. The
> first suspend is very quick.
>
> Everything works as expected when I disable nmi watchdog with
> nmi_watchdog=0 on the kernel command line.
>
> dmesg after a single successful suspend/resume cycle attached.
> (resent without .config to get under linux-kernel's size limit; mail if
> you want a copy)

Well, this looks like a tough one. Could you please create a bugzilla entry
with all of the relevant information?

Greetings,
Rafael
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/