2.6.21: sky2 "hw csum failure" problem

From: Håkan Lindqvist
Date: Sat Apr 28 2007 - 06:31:37 EST


I have a machine with an onboard sky2 NIC:

0000:03:00.0 0200: 11ab:4343 (rev 14)

0000:03:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8062
PCI-E IPMI Gigabit Ethernet Controller (rev 14)
Subsystem: ASUSTeK Computer Inc.: Unknown device 81da
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B-
Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort-
<MAbort- >SERR- <PERR-
Latency: 0, Cache Line Size: 0x04 (16 bytes)
Interrupt: pin A routed to IRQ 219
Region 0: Memory at ff7fc000 (64-bit, non-prefetchable) [size=16K]
Region 2: I/O ports at b800 [size=256]
Expansion ROM at ff7c0000 [disabled] [size=128K]
Capabilities: [48] Power Management version 2
Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0+,D1+,D2+,D3hot
+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [50] Vital Product Data
Capabilities: [5c] Message Signalled Interrupts: 64bit+ Queue=0/1
Enable+
Address: 00000000fee0100c Data: 414a
Capabilities: [e0] #10 [0011]



Detected as:
ACPI: PCI Interrupt 0000:03:00.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:03:00.0 to 64
sky2 0000:03:00.0: v1.14 addr 0xff7fc000 irq 16 Yukon-XL (0xb3) rev 1
sky2 eth0: addr 00:15:f2:d3:62:e9
sky2 eth1: addr 00:15:f2:d3:62:ea
sky2 eth0: enabling interface
sky2 eth0: ram buffer 96K
sky2 eth1: enabling interface
sky2 eth1: ram buffer 96K
sky2 eth0: Link is up at 100 Mbps, full duplex, flow control both


It works normally to start with, but after some random time (presumably
when network traffic rises), no further traffic gets through.

In dmesg there are, at that point, lots of "eth0: hw csum failure."
messages with stack traces. (Log attached, unfortunately this is the
last part of the log, not including when the problem first appeared.)

I don't think this problem is actually new in 2.6.21, but what I do know
is that 2.6.16.1 is a known working kernel on this machine.


Regards,
HÃkan Lindqvist

(I am not subscribed to the list, so I would appreciate if you CC:ed me
on any replies.)
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c016fa55>] bio_free+0x1d/0x2c
[<c016fcdd>] bio_put+0x20/0x26
[<c020a6eb>] xfs_end_bio+0x44/0x51
[<c01708e8>] bio_endio+0x5d/0x82
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02bab56>] clone_endio+0xa9/0xc5
[<c0225ca8>] elv_next_request+0x37/0x127
[<c028cca6>] scsi_delete_timer+0x15/0x5e
[<c028fb78>] scsi_request_fn+0x1c3/0x2d8
[<c022cf88>] as_completed_request+0x81/0x1a3
[<c02299bf>] __end_that_request_first+0x1d8/0x250
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c032924e>] __sched_text_start+0x2e6/0x586
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c0115afc>] __activate_task+0x21/0x3a
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02ffe15>] tcp_rcv_established+0x4cc/0x6a3
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c01164b2>] __wake_up_common+0x33/0x56
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02ffe15>] tcp_rcv_established+0x4cc/0x6a3
[<c03071a4>] tcp_v4_do_rcv+0xd5/0xd9
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c028f734>] scsi_setup_fs_cmnd+0x36/0x5a
[<c010465b>] common_interrupt+0x23/0x28
[<c028fb4d>] scsi_request_fn+0x198/0x2d8
[<c0227efc>] __generic_unplug_device+0x1f/0x21
[<c0227f04>] generic_unplug_device+0x6/0x8
[<c02bd1cc>] dm_table_unplug_all+0x2a/0x2c
[<c02bb29a>] dm_unplug_all+0x17/0x21
[<c0227f13>] blk_backing_dev_unplug+0xd/0xf
[<c020d0a8>] xfs_buf_iowait+0x3c/0x3e
[<c020cc40>] xfs_buf_iostart+0x51/0x7e
[<c020c667>] xfs_buf_read_flags+0x76/0x78
[<c01ff1a8>] xfs_trans_read_buf+0x157/0x2fd
[<c01d3428>] xfs_btree_read_bufs+0x69/0x81
[<c01b95eb>] xfs_alloc_lookup+0xf7/0x3a9
[<c01d353c>] xfs_btree_reada_bufs+0x64/0x78
[<c01b5c65>] xfs_alloc_fixup_trees+0x2b8/0x2dc
[<c01d36bc>] xfs_btree_setbuf+0x28/0xb8
[<c01b62eb>] xfs_alloc_ag_vextent_near+0x31a/0x9cd
[<c01b5e24>] xfs_alloc_ag_vextent+0xf2/0xf4
[<c01b81c1>] xfs_alloc_vextent+0x37c/0x48b
[<c01c7647>] xfs_bmap_btalloc+0x301/0x8fd
[<c01ec74f>] xfs_iext_bno_to_ext+0x106/0x1aa
[<c01cb851>] xfs_bmapi+0x128b/0x16f8
[<c01ec74f>] xfs_iext_bno_to_ext+0x106/0x1aa
[<c022d64b>] as_add_request+0x68/0x8a
[<c01c0000>] xfs_attr_leaf_remove+0xb6/0x554
[<c01ef1d7>] xfs_iomap_write_allocate+0x27a/0x481
[<c02bb231>] dm_request+0x81/0x98
[<c01edf0e>] xfs_iomap+0x3ed/0x4c4
[<c0170009>] bio_add_page+0x37/0x43
[<c020a684>] xfs_map_blocks+0x45/0x68
[<c020b4d5>] xfs_page_state_convert+0x386/0x607
[<c020b7ae>] xfs_vm_writepage+0x58/0xd5
[<c013cda2>] generic_writepages+0x1c9/0x2e3
[<c020b756>] xfs_vm_writepage+0x0/0xd5
[<c020b82b>] xfs_vm_writepages+0x0/0x1b
[<c013cee3>] do_writepages+0x27/0x43
[<c0169b03>] __sync_single_inode+0x4e/0x1ac
[<c0169ca8>] __writeback_single_inode+0x47/0x17f
[<c020c98e>] xfs_buf_rele+0x15/0x59
[<c020d03d>] xfs_buf_iorequest+0x34/0x63
[<c01fed77>] xfs_trans_first_ail+0x16/0x26
[<c01f20d6>] xfs_log_need_covered+0x73/0x99
[<c02bd168>] dm_table_any_congested+0xd/0x47
[<c0169f6a>] sync_sb_inodes+0x18a/0x257
[<c016a0aa>] writeback_inodes+0x73/0x75
[<c013c9ca>] background_writeout+0x5a/0x90
[<c013d467>] pdflush+0x0/0x28
[<c013d3c5>] __pdflush+0xa9/0x14b
[<c013d48b>] pdflush+0x24/0x28
[<c013c970>] background_writeout+0x0/0x90
[<c013d467>] pdflush+0x0/0x28
[<c0128f3d>] kthread+0xaf/0xb3
[<c0128e8e>] kthread+0x0/0xb3
[<c010480b>] kernel_thread_helper+0x7/0x10
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c016fa55>] bio_free+0x1d/0x2c
[<c016fcdd>] bio_put+0x20/0x26
[<c020a6eb>] xfs_end_bio+0x44/0x51
[<c01708e8>] bio_endio+0x5d/0x82
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02bab56>] clone_endio+0xa9/0xc5
[<c02baaad>] clone_endio+0x0/0xc5
[<c028cca6>] scsi_delete_timer+0x15/0x5e
[<c022cf88>] as_completed_request+0x81/0x1a3
[<c02299bf>] __end_that_request_first+0x1d8/0x250
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c032924e>] __sched_text_start+0x2e6/0x586
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c020a6a7>] xfs_end_bio+0x0/0x51
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02d3304>] sk_reset_timer+0xc/0x15
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<c0115afc>] __activate_task+0x21/0x3a
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c032924e>] __sched_text_start+0x2e6/0x586
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c020a6a7>] xfs_end_bio+0x0/0x51
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02d3304>] sk_reset_timer+0xc/0x15
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<c0280008>] autoconfig_read_divisor_id+0x35/0xb0
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c032924e>] __sched_text_start+0x2e6/0x586
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
eth0: hw csum failure.
[<c02d7103>] __skb_checksum_complete+0x66/0x6c
[<c031e8cd>] nf_ip_checksum+0xe9/0x124
[<f8872eee>] tcp_error+0x131/0x1b8 [nf_conntrack]
[<c016fa55>] bio_free+0x1d/0x2c
[<c016fcdd>] bio_put+0x20/0x26
[<c020a6eb>] xfs_end_bio+0x44/0x51
[<c01708e8>] bio_endio+0x5d/0x82
[<f887033a>] nf_conntrack_in+0x94/0x2b6 [nf_conntrack]
[<c02bab56>] clone_endio+0xa9/0xc5
[<c0225ca8>] elv_next_request+0x37/0x127
[<c028fb78>] scsi_request_fn+0x1c3/0x2d8
[<c022cf88>] as_completed_request+0x81/0x1a3
[<c02299bf>] __end_that_request_first+0x1d8/0x250
[<c02e8899>] nf_iterate+0x69/0x83
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02e8910>] nf_hook_slow+0x5d/0xc7
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<c02ee01a>] ip_rcv+0x3a0/0x458
[<c02ee279>] ip_rcv_finish+0x0/0x233
[<f91d3ef7>] sky2_rx_submit+0x22/0x64 [sky2]
[<c02d9b08>] netif_receive_skb+0x1ec/0x26c
[<f91d5da1>] sky2_status_intr+0x2dc/0x463 [sky2]
[<f91d64a8>] sky2_poll+0x49/0xd6 [sky2]
[<c02d9ccf>] net_rx_action+0x61/0x129
[<c011d280>] __do_softirq+0x6d/0x79
[<c011d2b3>] do_softirq+0x27/0x29
[<c0105fad>] do_IRQ+0x3e/0x6d
[<c032924e>] __sched_text_start+0x2e6/0x586
[<c010465b>] common_interrupt+0x23/0x28
[<c0102162>] mwait_idle_with_hints+0x39/0x48
[<c01020a8>] cpu_idle+0x52/0x5b
[<c03fca11>] start_kernel+0x17f/0x1bb
[<c03fc54c>] unknown_bootoption+0x0/0x192
=======================
sky2 eth1: disabling interface
sky2 eth0: disabling interface
ACPI: PCI interrupt for device 0000:03:00.0 disabled
ACPI: PCI Interrupt 0000:03:00.0[A] -> GSI 16 (level, low) -> IRQ 16
PCI: Setting latency timer of device 0000:03:00.0 to 64
sky2 0000:03:00.0: v1.14 addr 0xff7fc000 irq 16 Yukon-XL (0xb3) rev 1
sky2 eth0: addr 00:15:f2:d3:62:e9
sky2 eth1: addr 00:15:f2:d3:62:ea
sky2 eth0: enabling interface
sky2 eth0: ram buffer 96K
ADDRCONF(NETDEV_UP): eth0: link is not ready
sky2 eth0: Link is up at 100 Mbps, full duplex, flow control both
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
eth0: no IPv6 routers present

Attachment: smime.p7s
Description: S/MIME cryptographic signature