INFO: rcu_preempt self-detected stall on CPU with 3.10 kernel

From: wyang
Date: Mon Feb 24 2014 - 03:25:51 EST


Hi Paul,

I ran into a cpu stall warning in a MIPS board with 3.10 kernel, I noted you push a patch(c896054f) to reduce default RCU CPU stall warning timeout to 21s.
why is it 21s? what decide this time? seems this time is too short for MIPS, can I retore it to 60s? Do you give me any suggestion on this issue?

Thanks
Wei
[ 34.480000] [sched_delayed] sched: RT throttling activated
[ 54.230000] INFO: rcu_preempt self-detected stall on CPU
[ 54.230000] INFO: rcu_preempt self-detected stall on CPU

[ 54.230000] 5: (2099 ticks this GP) idle=cf7/140000000000001/0 softirq=63/70
[ 54.230000]
[ 54.230000] (t=2100 jiffies g=18446744073709551451 c=18446744073709551450 q=682)
[ 54.230000] CPU: 5 PID: 133 Comm: udevd Not tainted 3.10.19-WR6.0.0.0_cgl #5
[ 54.230000] Stack : c0000000f5fcf170 ffffffffc1b2dfb0 c0000000f5fcf180 ffffffffc10abd8c
ffffffffc1b30000 0000000000000004 c0000000f5fcf1a0 ffffffffc10776fc
ffffffffc1eb0000 0000000000000000 c0000000f5fcf1c0 ffffffffc1b30000
c0000000f5fcf1d0 ffffffffc10785b4 0000000000000000 0000000000000000
0000000000000000 ffffffffc1e90000 ffffffffc1e912c8 ffffffffc1e90000
ffffffffc19bb388 ffffffffc1b1d277 ffffffffc1e912c8 c0000000f5d04c68
0000000000000085 0000000000000005 ffffffffc1b10000 ffffffffc1b10000
c0000000f5fcf250 c0000000f5fcf160 c0000000f5fcf278 ffffffffc111bbb0
c0000000f5fcf2b0 ffffffffc107a058 c0000000f5d048d8 ffffffffc19bb388
0000000000000005 ffffffffc1048e38 0000000000000000 0000000000000000


[ 54.230000] Call Trace:
[ 54.230000] [<ffffffffc1048e38>] show_stack+0xd8/0xf8
[ 54.230000] [<ffffffffc111bbb0>] rcu_check_callbacks+0x400/0xa80
[ 54.230000] [<ffffffffc108ae00>] update_process_times+0x60/0xa8
[ 54.230000] [<ffffffffc10d7d3c>] tick_sched_timer+0x54/0x90
[ 54.230000] [<ffffffffc10a91f4>] __run_hrtimer+0xa4/0x298
[ 54.230000] [<ffffffffc10a9fb8>] hrtimer_interrupt+0x148/0x2e0
[ 54.230000] [<ffffffffc104d238>] c0_compare_interrupt+0x98/0xe8
[ 54.230000] [<ffffffffc1110b1c>] handle_irq_event_percpu+0x84/0x368
[ 54.230000] [<ffffffffc11158f0>] handle_percpu_irq+0xa0/0xe8
[ 54.230000] [<ffffffffc110feec>] generic_handle_irq+0x4c/0x80
[ 54.230000] [<ffffffffc189f90c>] do_IRQ+0x2c/0x40
[ 54.230000] [<ffffffffc1034650>] plat_irq_dispatch+0x380/0x3d0
[ 54.230000] [<ffffffffc1042500>] ret_from_irq+0x0/0x4
[ 54.230000] [<ffffffffc1898844>] __schedule+0x4dc/0xa48
[ 54.230000] [<ffffffffc18991dc>] preempt_schedule+0x54/0x78
[ 54.230000] [<ffffffffc1716e70>] sock_def_readable+0x50/0x88
[ 54.230000] [<ffffffffc176888c>] netlink_sendskb+0x44/0xb8
[ 54.230000] [<ffffffffc1768b88>] netlink_unicast+0x288/0x2e0
[ 54.230000] [<ffffffffc1769010>] netlink_sendmsg+0x2e0/0x3d8
[ 54.230000] [<ffffffffc1710a38>] sock_sendmsg+0xe8/0x128
[ 54.230000] [<ffffffffc1711cc0>] ___sys_sendmsg+0x378/0x388
[ 54.230000] [<ffffffffc1713b54>] __sys_sendmsg+0x5c/0xc0
[ 54.230000] [<ffffffffc104fce4>] handle_sys64+0x44/0x68
[ 54.410000] 1: (2099 ticks this GP) idle=d7d/140000000000001/0 softirq=131/134
[ 54.420000] (t=2119 jiffies g=18446744073709551451 c=18446744073709551450 q=682)
[ 54.430000] CPU: 1 PID: 70 Comm: irq/23-eth%d Not tainted 3.10.19-WR6.0.0.0_cgl #5
[ 54.430000] Stack : c0000000f4d7f4c0 ffffffffc107809c 0000000000000001 0000000000000285
ffffffffc1b30000 0000000000000004 0000000000000001 0000000000000000
c0000000f4d7f500 ffffffffc1077e9c 0000000000000046 ffffffffc1b30000
c0000000f4d7f520 ffffffffc107850c 0000000000000000 0000000000000000
0000000000000000 ffffffffc1e90000 ffffffffc1e912c8 ffffffffc1e90000
ffffffffc19bb388 ffffffffc1b1d277 ffffffffc1e912c8 c0000000f8c35c98
0000000000000046 0000000000000001 ffffffffc1b10000 ffffffffc1b10000
c0000000f4d7f5a0 c0000000f4d7f4b0 c0000000f4d7f5c8 ffffffffc111bbb0
c0000000f4d7f600 ffffffffc107a058 c0000000f8c35908 ffffffffc19bb388
0000000000000001 ffffffffc1048e38 0000000000000000 0000000000000000
...
[ 54.500000] Call Trace:
[ 54.500000] [<ffffffffc1048e38>] show_stack+0xd8/0xf8
[ 54.500000] [<ffffffffc111bbb0>] rcu_check_callbacks+0x400/0xa80
[ 54.510000] [<ffffffffc108ae00>] update_process_times+0x60/0xa8
[ 54.510000] [<ffffffffc10d7d3c>] tick_sched_timer+0x54/0x90
[ 54.510000] [<ffffffffc10a91f4>] __run_hrtimer+0xa4/0x298
[ 54.520000] [<ffffffffc10a9fb8>] hrtimer_interrupt+0x148/0x2e0
[ 54.520000] [<ffffffffc104d238>] c0_compare_interrupt+0x98/0xe8
[ 54.530000] [<ffffffffc1110b1c>] handle_irq_event_percpu+0x84/0x368
[ 54.530000] [<ffffffffc11158f0>] handle_percpu_irq+0xa0/0xe8
[ 54.540000] [<ffffffffc110feec>] generic_handle_irq+0x4c/0x80
[ 54.540000] [<ffffffffc189f90c>] do_IRQ+0x2c/0x40
[ 54.540000] [<ffffffffc1034650>] plat_irq_dispatch+0x380/0x3d0
[ 54.550000] [<ffffffffc1042500>] ret_from_irq+0x0/0x4
[ 54.550000] [<ffffffffc1774614>] ip_rcv_finish+0x2b4/0x3c8
[ 54.560000] [<ffffffffc172f710>] __netif_receive_skb_core+0x370/0x9d0
[ 54.560000] [<ffffffffc172fe6c>] netif_receive_skb+0x4c/0xf0
[ 54.570000] [<ffffffffc161b974>] smsc911x_poll+0x11c/0x2b0
[ 54.570000] [<ffffffffc1731754>] net_rx_action+0x1ac/0x320
[ 54.570000] [<ffffffffc107f910>] __do_softirq+0x150/0x340
[ 54.580000] [<ffffffffc107fc60>] do_softirq+0xb8/0xc0
[ 54.580000] [<ffffffffc107fe6c>] local_bh_enable+0xfc/0x108
[ 54.590000] [<ffffffffc1111d38>] irq_forced_thread_fn+0x68/0x90
[ 54.590000] [<ffffffffc1112190>] irq_thread+0x160/0x1f8
[ 54.600000] [<ffffffffc10a47d4>] kthread+0xcc/0xd8
[ 54.600000] [<ffffffffc104258c>] ret_from_kernel_thread+0x14/0x1c

[ 117.260000] INFO: rcu_preempt self-detected stall on CPU
[ 117.260000] 7: (8402 ticks this GP) idle=c6f/140000000000001/0 softirq=379/383
[ 117.270000] (t=8404 jiffies g=18446744073709551451 c=18446744073709551450 q=690)
[ 117.280000] INFO: rcu_preempt detected stalls on CPUs/tasks:
[ 117.280000] 7: (8402 ticks this GP) idle=c6f/140000000000001/0 softirq=379/383
[ 117.280000] (detected by 0, t=8405 jiffies, g=18446744073709551451, c=18446744073709551450, q=690)
[ 117.280000] Task dump for CPU 7:
[ 117.280000] ksoftirqd/7 R running task 0 36 2 0x00100004
[ 117.280000] Stack : 0000000080e0b38a 80e0a5140b010000 80e0a51480e0a514 c0000000f873fac0
000002fe00000000 c0000000f5a306c0 c0000000f8ac1710 ffffffff80e0b38a
000000000000000b 0000000000000001 c0000000f873fbb8 ffffffffc189a15c
c0000000f5a34380 ffffffffc1b7aa80 c0000000f5a18300 ffffffffc1775e50
c0000000f5a34380 ffffffffc1b90000 c0000000f5a34388 0000000000000007
c0000000f873fc08 ffffffffc1775ff0 c0000000f5a342c0 c0000000f8598000
c0000000f5a34388 0000000000000100 c0000000f873fc38 ffffffffc10898d0
c0000000f5a327c8 c0000000f8598000 0000000000000000 ffffffffc1775e50
c0000000f5a34380 0000000000200200 c0000000f873fc78 ffffffffc1089d08
c0000000f873fc78 c0000000f873fc78 c0000000f8599028 c0000000f8599428
...
[ 117.280000] Call Trace:
[ 117.280000] [<ffffffffc1898814>] __schedule+0x4ac/0xa48
[ 117.280000] [<ffffffffc10afc30>] smpboot_thread_fn+0x280/0x320
[ 117.280000] [<ffffffffc10a47d4>] kthread+0xcc/0xd8
[ 117.280000] [<ffffffffc104258c>] ret_from_kernel_thread+0x14/0x1c

[ 117.380000] CPU: 7 PID: 36 Comm: ksoftirqd/7 Not tainted 3.10.19-WR6.0.0.0_cgl #5
[ 117.390000] Stack : c0000000f873f3e0 ffffffffc107809c 0000000000000001 00000000000002b1
ffffffffc1b30000 0000000000000004 0000000000000001 0000000000000000
c0000000f873f420 ffffffffc1077e9c 0000000000000045 ffffffffc1b30000
c0000000f873f440 ffffffffc107850c 0000000000000000 0000000000000000
0000000000000000 ffffffffc1e90000 ffffffffc1e912c8 ffffffffc1e90000
ffffffffc19bb388 ffffffffc1b1d277 ffffffffc1e912c8 c0000000f84c5c98
0000000000000024 0000000000000007 ffffffffc1b10000 ffffffffc1b10000
c0000000f873f4c0 c0000000f873f3d0 c0000000f873f4e8 ffffffffc111bbb0
c0000000f873f520 ffffffffc107a058 c0000000f84c5908 ffffffffc19bb388
0000000000000007 ffffffffc1048e38 0000000000000000 0000000000000000
...
[ 117.450000] Call Trace:
[ 117.450000] [<ffffffffc1048e38>] show_stack+0xd8/0xf8
[ 117.460000] [<ffffffffc111bbb0>] rcu_check_callbacks+0x400/0xa80
[ 117.460000] [<ffffffffc108ae00>] update_process_times+0x60/0xa8
[ 117.470000] [<ffffffffc10d7d3c>] tick_sched_timer+0x54/0x90
[ 117.470000] [<ffffffffc10a91f4>] __run_hrtimer+0xa4/0x298
[ 117.470000] [<ffffffffc10a9fb8>] hrtimer_interrupt+0x148/0x2e0
[ 117.480000] [<ffffffffc104d238>] c0_compare_interrupt+0x98/0xe8
[ 117.480000] [<ffffffffc1110b1c>] handle_irq_event_percpu+0x84/0x368
[ 117.490000] [<ffffffffc11158f0>] handle_percpu_irq+0xa0/0xe8
[ 117.490000] [<ffffffffc110feec>] generic_handle_irq+0x4c/0x80
[ 117.500000] [<ffffffffc189f90c>] do_IRQ+0x2c/0x40
[ 117.500000] [<ffffffffc1034650>] plat_irq_dispatch+0x380/0x3d0
[ 117.510000] [<ffffffffc1042500>] ret_from_irq+0x0/0x4
[ 117.510000] [<ffffffffc17dfdd4>] xfrm_lookup+0x284/0x5d8
[ 117.510000] [<ffffffffc17af814>] icmp_send+0x5d4/0x680
[ 117.520000] [<ffffffffc1775ff0>] ip_expire+0x1a0/0x1e0
[ 117.520000] [<ffffffffc10898d0>] call_timer_fn+0x58/0x1a0
[ 117.530000] [<ffffffffc1089d08>] run_timer_softirq+0x1f8/0x2d0
[ 117.530000] [<ffffffffc107f910>] __do_softirq+0x150/0x340
[ 117.530000] [<ffffffffc107fb80>] run_ksoftirqd+0x80/0xa8
[ 117.540000] [<ffffffffc10afc30>] smpboot_thread_fn+0x280/0x320
[ 117.540000] [<ffffffffc10a47d4>] kthread+0xcc/0xd8
[ 117.550000] [<ffffffffc104258c>] ret_from_kernel_thread+0x14/0x1c
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/