Re: 5.6-rc3: WARNING: CPU: 48 PID: 17435 at kernel/sched/fair.c:380 enqueue_task_fair+0x328/0x440

From: Christian Borntraeger
Date: Fri Feb 28 2020 - 10:08:17 EST


This also happened with 5.4 (trace below).
It seems that I just happen to have an interesting interaction between test workload
and system size on a newly installed system that triggers this.


[ 9761.439278] ------------[ cut here ]------------
[ 9761.439283] rq->tmp_alone_branch != &rq->leaf_cfs_rq_list
[ 9761.439300] WARNING: CPU: 58 PID: 17405 at kernel/sched/fair.c:381 enqueue_task_fair+0x7cc/0x9b0
[ 9761.439303] Modules linked in: kvm xt_CHECKSUM xt_MASQUERADE nf_nat_tftp nf_conntrack_tftp xt_CT tun bridge stp llc xt_tcpudp ip6t_REJECT nf_reject_ipv6 ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ip_set nfnetlink ip6table_filter ip6_tables iptable_filter rpcrdma sunrpc rdma_ucm rdma_cm iw_cm ib_cm configfs s390_trng ghash_s390 prng mlx5_ib aes_s390 ib_uverbs des_s390 libdes ib_core sha3_512_s390 sha3_256_s390 sha512_s390 genwqe_card sha1_s390 crc_itu_t vfio_ccw vfio_mdev mdev vfio_iommu_type1 eadm_sch vfio zcrypt_cex4 sch_fq_codel ip_tables x_tables mlx5_core sha256_s390 sha_common pkey zcrypt rng_core autofs4
[ 9761.439335] CPU: 58 PID: 17405 Comm: sh Not tainted 5.4.0 #27
[ 9761.439336] Hardware name: IBM 3906 M04 704 (LPAR)
[ 9761.439338] Krnl PSW : 0404c00180000000 00000007353f2d4c (enqueue_task_fair+0x7cc/0x9b0)
[ 9761.439340]            R:0 T:1 IO:0 EX:0 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 RI:0 EA:3
[ 9761.439342] Krnl GPRS: 00000000000003e0 0400000735f500bc 000000000000002d 00000007365f4bc2
[ 9761.439343]            000000000000002c 0000000735a49388 0000000000000001 0400001f00000000
[ 9761.439344]            000003e0015ebc88 0000001fbd856c00 0000001fbd856d00 0000000000000000
[ 9761.439345]            0000001bc8a12000 0000000735d853c0 00000007353f2d48 000003e0015ebad0
[ 9761.439385] Krnl Code: 00000007353f2d3c: c020005ae9c0 larl %r2,735f500bc
                          00000007353f2d42: c0e5fffdc487 brasl %r14,7353ab650
                         #00000007353f2d48: a7f40001 brc 15,7353f2d4a
                         >00000007353f2d4c: a7f4fcda brc 15,7353f2700
                          00000007353f2d50: e33073480004 lg %r3,840(%r7)
                          00000007353f2d56: 41b07340 la %r11,832(%r7)
                          00000007353f2d5a: b9040063 lgr %r6,%r3
                          00000007353f2d5e: b904004b lgr %r4,%r11
[ 9761.439397] Call Trace:
[ 9761.439399] ([<00000007353f2d48>] enqueue_task_fair+0x7c8/0x9b0)
[ 9761.439401] [<00000007353e1b48>] activate_task+0x88/0xf0
[ 9761.439403] [<00000007353e20c6>] ttwu_do_activate+0x56/0x80
[ 9761.439405] [<00000007353e3106>] try_to_wake_up+0x256/0x650
[ 9761.439408] [<000000073540353e>] swake_up_locked.part.0+0x2e/0x70
[ 9761.439409] [<0000000735403764>] swake_up_one+0x54/0x90
[ 9761.439449] [<000003ff8047be52>] kvm_vcpu_wake_up+0x52/0x80 [kvm]
[ 9761.439458] [<000003ff80498e3a>] kvm_s390_vcpu_wakeup+0x2a/0x40 [kvm]
[ 9761.439466] [<000003ff8049959e>] kvm_s390_idle_wakeup+0x6e/0xa0 [kvm]
[ 9761.439470] [<000000073544acb4>] __hrtimer_run_queues+0x114/0x2f0
[ 9761.439472] [<000000073544b97c>] hrtimer_interrupt+0x12c/0x2b0
[ 9761.439475] [<0000000735370a1a>] do_IRQ+0xaa/0xb0
[ 9761.439480] [<0000000735d75998>] ext_int_handler+0x128/0x12c
[ 9761.439485] [<00000007355abd28>] get_page_from_freelist+0x528/0x1860
[ 9761.439486] ([<00000007355abc36>] get_page_from_freelist+0x436/0x1860)
[ 9761.439488] [<00000007355ae420>] __alloc_pages_nodemask+0x120/0x320
[ 9761.439492] [<00000007355cca8a>] alloc_pages_vma+0x9a/0x280
[ 9761.439494] [<0000000735588062>] wp_page_copy+0xb2/0x730
[ 9761.439495] [<000000073558b642>] do_wp_page+0xa2/0x760
[ 9761.439497] [<000000073558def2>] __handle_mm_fault+0x852/0x910
[ 9761.439498] [<000000073558e076>] handle_mm_fault+0xc6/0x180
[ 9761.439500] [<0000000735389c44>] do_protection_exception+0x164/0x4b0
[ 9761.439502] [<0000000735d7558c>] pgm_check_handler+0x1c8/0x220
[ 9761.439502] Last Breaking-Event-Address:
[ 9761.439503] [<00000007353f2d48>] enqueue_task_fair+0x7c8/0x9b0
[ 9761.439504] ---[ end trace 40ea9b5f62b01ed1 ]---