Kernel 4.1.5 not stable - crashes

From: Gerhard Wiesinger
Date: Fri Aug 21 2015 - 02:51:27 EST


Hello,

I'm having big problems with Fedora FC22 kernel 4.1.5 (happened with all tried kernels 4.1.x from FC22) which is not stable at all. At the nightly backup jobs (database dumps, rsync via ssh, etc.) maschine crashes reproduceable at every night with the stack trace below. Message repeats on different CPUs in around 1~10s with same message.

Kernel 4.0.8 from Fedora FC22 works well with long uptimes, also previous kernel versions are highly stable. Kernel 4.1.4/4.1.5 had a lot of RAID fixes so I tried it again but it didn't help. So something critical must be different from 4.0.8 to 4.1.2 and later.

I'm running 2 RAID5 volumes with each LVM and cryptsetup above. After the crash RAID does a resync.

Machine:
- Mainboard: ASUS - M3N-H HDMI with latest BIOS
- CPU: AMD Phenom II X4 940 Black Edition, 4x 3.00GHz, boxed (HDZ940XCGIBOX)
- NIC: HP Broadcom Netxtreme Gigabit PCIe Netzwerkkarte 482914-001 (BCM5761)

If you need further information please let me know.

Any ideas?

Thank you.

Ciao,
Gerhard

[63525.726812] NMI watchdog: BUG: soft lockup - CPU#1 stuck for 22s! [ping:18283]
[63525.734015] Modules linked in: tun ebtable_filter ebtables bridge stp llc cfg80211 rfkill ipt_MASQUERADE nf_nat_masquerade_ipv4 ip6t_REJECT nf_reject_ipv6 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_i
pv4 nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat xt_CHECKSUM xt_conntrack nf_conntrack iptable_mangle iptable_security ip6table_filter ip6_tables iptable_raw hwmon_vid snd_hda_codec_hdmi lnbp21 stb6100 stb0899 snd_hd
a_codec_realtek snd_hda_codec_generic kvm_amd kvm snd_hda_intel snd_hda_controller snd_hda_codec snd_hda_core edac_core edac_mce_amd mantis snd_hwdep mantis_core snd_seq k10temp snd_seq_device dvb_core snd_pcm s
nd_timer snd soundcore shpchp i2c_nforce2 asus_atk0110 acpi_cpufreq nfsd auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc dm_crypt raid1 ata_generic raid456 async_raid6_recov async_memcpy async_pq async_xor xo
r async_tx pata_acpi raid6_pq nouveau i2c_algo_bit drm_kms_helper ttm mxm_wmi drm tg3 serio_raw ptp pps_core firewire_ohci forcedeth firewire_core crc_itu_t pata_amd video wmi uas usb_storage
[63525.825481] CPU: 1 PID: 18283 Comm: ping Tainted: G D W L 4.1.5-200.fc22.x86_64 #1
[63525.833809] Hardware name: System manufacturer System Product Name/M3N-H/HDMI, BIOS ASUS M3N-H/HDMI ACPI BIOS Revision 2603 06/11/2010
[63525.845863] task: ffff88019de5c520 ti: ffff880117f50000 task.ti: ffff880117f50000
[63525.853325] RIP: 0010:[<ffffffff81121cc2>] [<ffffffff81121cc2>] smp_call_function_many+0x222/0x280
[63525.862366] RSP: 0018:ffff880117f53c58 EFLAGS: 00000202
[63525.867663] RAX: 0000000000000003 RBX: 0000000000000293 RCX: 0000000000000000
[63525.874781] RDX: ffff88023fc1b8c8 RSI: 0000000000000008 RDI: ffff880237406bb0
[63525.881897] RBP: ffff880117f53c98 R08: 0000000000000000 R09: 000000000000000d
[63525.889015] R10: ffffffff813ad019 R11: ffffffff813acfa4 R12: ffff880117f53c28
[63525.896131] R13: ffff880117f53bc8 R14: ffffffff813acfa4 R15: 00000000000082d2
[63525.903249] FS: 00007f4227e48700(0000) GS:ffff88023fc40000(0000) knlGS:0000000000000000
[63525.911319] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[63525.917051] CR2: 00007fb542e34000 CR3: 0000000014833000 CR4: 00000000000006e0
[63525.924166] Stack:
[63525.926176] 0000000000000001 0100000000000001 0000000000000002 0000000000000000
[63525.933620] ffffffff81069d90 0000000000000000 ffff880117f53db0 0000000000000001
[63525.941067] ffff880117f53cc8 ffffffff81121d81 ffffc90001130000 0000000000000000
[63525.948513] Call Trace:
[63525.950956] [<ffffffff81069d90>] ? unmap_pte_range+0xe0/0xe0
[63525.956688] [<ffffffff81121d81>] on_each_cpu+0x31/0x60
[63525.961901] [<ffffffff8106bcd1>] change_page_attr_set_clr+0x421/0x530
[63525.968412] [<ffffffff8106c8bf>] set_memory_ro+0x2f/0x40
[63525.973797] [<ffffffff81191e99>] bpf_prog_select_runtime+0x29/0x40
[63525.980047] [<ffffffff81699130>] bpf_prepare_filter+0x160/0x180
[63525.986038] [<ffffffff81699462>] sk_attach_filter+0xe2/0x190
[63525.991772] [<ffffffff810dee91>] ? pick_next_task_fair+0x7e1/0x980
[63525.998022] [<ffffffff8166b005>] sock_setsockopt+0x3f5/0x9a0
[63526.003755] [<ffffffff81665966>] SyS_setsockopt+0xd6/0xf0
[63526.009225] [<ffffffff810250d7>] ? syscall_trace_leave+0xc7/0x140
[63526.015391] [<ffffffff817a1e6e>] system_call_fastpath+0x12/0x71
[63526.021382] Code: 05 78 a2 c0 00 89 c1 0f 8d 73 fe ff ff 48 98 49 8b 16 48 03 14 c5 a0 77 d2 81 8b 42 18 a8 01 74 c8 0f 1f 84 00 00 00 00 00 f3 90 <8b> 42 18 a8 01 75 f7 eb b5 0f b6 4d c8 4c 89 ea 4c 89 e6 44
89
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/