[BUG or cosmic ray] WARNING: at net/sched/sch_generic.c:222 dev_watchdog+0xe8/0x100()

From: c4p7n1
Date: Thu Jun 05 2008 - 03:34:36 EST


I've got this on rc3. It's _not_ reproducible even on heavy load.
After that the Ethernet PCIe chip went into a borken state until next reboot.

[ 46.316926] r8169: eth0: link up
[ 48.901435] [drm] Initialized drm 1.1.0 20060810
[ 48.904771] PCI: Setting latency timer of device 0000:00:02.0 to 64
[ 48.908103] [drm] Initialized i915 1.6.0 20060119 on minor 0
[ 49.544674] NET: Registered protocol family 17
[ 125.004020] NETDEV WATCHDOG: eth0: transmit timed out
[ 125.004043] ------------[ cut here ]------------
[ 125.004049] WARNING: at net/sched/sch_generic.c:222 dev_watchdog+0xe8/0x100()
[ 125.004053] Modules linked in: af_packet i915 drm rfcomm l2cap bluetooth acpi_cpufreq cpufreq_powersave cpufreq_stats cpufreq_userspace cpufreq_conservative wmi sbs sbshc iptable_filter ip_tables x_tables fuse usbhid hid joydev snd_hda_intel snd_pcm_oss snd_mixer_oss snd_pcm snd_page_alloc snd_hwdep snd_seq_dummy snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq video psmouse output snd_timer snd_seq_device snd r8169 backlight evdev soundcore uhci_hcd ehci_hcd iTCO_wdt iTCO_vendor_support usbcore
[ 125.004143] Pid: 0, comm: swapper Not tainted 2.6.26-rc3c4p7n1-00308-g75d3bce #1
[ 125.004153] [<c0125aaf>] warn_on_slowpath+0x5f/0x90
[ 125.004168] [<c0138e00>] autoremove_wake_function+0x0/0x40
[ 125.004179] [<c011ae7b>] __wake_up_common+0x4b/0x80
[ 125.004192] [<c011ccae>] __wake_up+0x3e/0x60
[ 125.004201] [<c012626b>] wake_up_klogd+0x3b/0x40
[ 125.004209] [<c0126966>] vprintk+0x346/0x3b0
[ 125.004221] [<c012ee47>] lock_timer_base+0x27/0x60
[ 125.004230] [<c0135e10>] delayed_work_timer_fn+0x0/0x20
[ 125.004238] [<c012ef8d>] __mod_timer+0x9d/0xb0
[ 125.004248] [<c0136044>] queue_delayed_work_on+0x84/0xc0
[ 125.004258] [<c0392818>] dev_watchdog+0xe8/0x100
[ 125.004267] [<c012e8de>] run_timer_softirq+0x15e/0x1d0
[ 125.004274] [<c0392730>] dev_watchdog+0x0/0x100
[ 125.004282] [<c0142870>] tick_handle_oneshot_broadcast+0x100/0x120
[ 125.004292] [<c0392730>] dev_watchdog+0x0/0x100
[ 125.004302] [<c012ab13>] __do_softirq+0x63/0xc0
[ 125.004310] [<c012aba7>] do_softirq+0x37/0x40
[ 125.004317] [<c012ad18>] irq_exit+0x68/0x80
[ 125.004323] [<c0106370>] do_IRQ+0x40/0x70
[ 125.004333] [<c0104767>] common_interrupt+0x23/0x28
[ 125.004344] [<c01300d8>] prepare_signal+0x28/0x170
[ 125.004352] [<c02e67ed>] acpi_idle_enter_bm+0x28c/0x2f6
[ 125.004365] [<c0372cab>] cpuidle_idle_call+0x7b/0xc0
[ 125.004373] [<c0372c30>] cpuidle_idle_call+0x0/0xc0
[ 125.004381] [<c010275a>] cpu_idle+0x5a/0xe0
[ 125.004395] =======================
[ 125.004399] ---[ end trace b7a56030e016660f ]---
[ 125.682725] r8169: eth0: link up
...
[ 1337.654531] NETDEV WATCHDOG: eth0: transmit timed out
[ 1338.333962] r8169: eth0: link up

...

rmmod r8169 && modprobe r8169

[ 1504.171195] ACPI: PCI interrupt for device 0000:02:00.0 disabled
[ 1515.491199] r8169 Gigabit Ethernet driver 2.2LK-NAPI loaded
[ 1515.491285] ACPI: PCI Interrupt 0000:02:00.0[A] -> GSI 16 (level, low) -> IRQ 16
[ 1515.494466] PCI: cache line size of 32 is not supported by device 0000:02:00.0
[ 1515.494539] ACPI: PCI interrupt for device 0000:02:00.0 disabled
[ 1515.495561] r8169: probe of 0000:02:00.0 failed with error -22

lspci
02:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8101E PCI Express Fast Ethernet controller [10ec:8136] (rev ff) (prog-if ff)
!!! Unknown header type 7f

-------------------

after a reboot:

[ 833.345672] r8169: eth0: link up
[ 835.599177] NET: Registered protocol family 17
...
[ 1355.282122] r8169: eth0: link down
[ 1381.528391] r8169: eth0: link up
...

02:00.0 Ethernet controller [0200]: Realtek Semiconductor Co., Ltd. RTL8101E PCI Express Fast Ethernet controller [10ec:8136] (rev 02)
Subsystem: Toshiba America Info Systems Unknown device [1179:ff64]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
Latency: 0, Cache Line Size: 32 bytes
Interrupt: pin A routed to IRQ 220
Region 0: I/O ports at 3000 [size=256]
Region 2: Memory at 90010000 (64-bit, prefetchable) [size=4K]
Region 4: Memory at 90000000 (64-bit, prefetchable) [size=64K]
Capabilities: [40] Power Management version 7
Flags: PMEClk- DSI+ D1+ D2- AuxCurrent=375mA PME(D0-,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable+
Address: 00000000fee0300c Data: 41e9
Capabilities: [70] Express Endpoint IRQ 1
Device: Supported: MaxPayload 256 bytes, PhantFunc 0, ExtTag-
Device: Latency L0s <512ns, L1 <64us
Device: AtnBtn- AtnInd- PwrInd-
Device: Errors: Correctable- Non-Fatal- Fatal- Unsupported-
Device: RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop-
Device: MaxPayload 128 bytes, MaxReadReq 512 bytes
Link: Supported Speed 2.5Gb/s, Width x1, ASPM L0s L1, Port 0
Link: Latency L0s <512ns, L1 <64us
Link: ASPM L1 Enabled RCB 64 bytes CommClk+ ExtSynch-
Link: Speed 2.5Gb/s, Width x1
Capabilities: [ac] MSI-X: Enable- Mask- TabSize=2
Vector table: BAR=4 offset=00000000
PBA: BAR=4 offset=00000800
Capabilities: [cc] Vital Product Data
Capabilities: [100] Advanced Error Reporting
Capabilities: [140] Virtual Channel
Capabilities: [160] Device Serial Number 00-00-ff-ff-00-00-00-08


--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/