Re: kernel BUG at mm/huge_memory.c:2613!

From: Roman Gushchin
Date: Fri Jun 19 2020 - 18:57:34 EST


On Fri, Jun 19, 2020 at 01:56:28PM -0700, David Rientjes wrote:
> On Fri, 19 Jun 2020, Roman Gushchin wrote:
>
> > [ 40.287524] BUG: unable to handle page fault for address: ffffa77b833df000
> > [ 40.287529] #PF: supervisor write access in kernel mode
> > [ 40.287531] #PF: error_code(0x000b) - reserved bit violation
> > [ 40.287532] PGD 40d14e067 P4D 40d14e067 PUD 40d14f067 PMD 3ec54d067 PTE 80001688033d9163
> > [ 40.287538] Oops: 000b [#2] SMP NOPTI
> > [ 40.287542] CPU: 9 PID: 1986 Comm: pulseaudio Tainted: G D 5.8.0-rc1+ #697
> > [ 40.287544] Hardware name: Gigabyte Technology Co., Ltd. AB350-Gaming/AB350-Gaming-CF, BIOS F25 01/16/2019
> > [ 40.287550] RIP: 0010:__memset+0x24/0x30
> > [ 40.287553] Code: cc cc cc cc cc cc 0f 1f 44 00 00 49 89 f9 48 89 d1 83 e2 07 48 c1 e9 03 40 0f b6 f6 48 b8 01 01 01 01 01 01 01 01 48 0f af c6 <f3> 48 ab 89 d1 f3 aa 4c 89 c8 c3 90 49 89 f9 40 88 f0 48 89 d1 f3
> > [ 40.287556] RSP: 0018:ffffa77b827a7e08 EFLAGS: 00010216
> > [ 40.287558] RAX: 0000000000000000 RBX: ffff90f77dced800 RCX: 00000000000008a0
> > [ 40.287560] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffffa77b833df000
> > [ 40.287561] RBP: ffff90f7898c7000 R08: ffff90f78c507768 R09: ffffa77b833df000
> > [ 40.287563] R10: ffffa77b833df000 R11: ffff90f7839f2d40 R12: 0000000000000000
> > [ 40.287564] R13: ffff90f76a802e00 R14: ffffffffc0cb8880 R15: ffff90f770f4e700
> > [ 40.287567] FS: 00007f3d8e8df880(0000) GS:ffff90f78ee40000(0000) knlGS:0000000000000000
> > [ 40.287569] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> > [ 40.287570] CR2: ffffa77b833df000 CR3: 00000003fa556000 CR4: 00000000003406e0
> > [ 40.287572] Call Trace:
> > [ 40.287584] snd_pcm_hw_params+0x3fd/0x490 [snd_pcm]
> > [ 40.287593] snd_pcm_common_ioctl+0x1c5/0x1110 [snd_pcm]
> > [ 40.287601] ? snd_pcm_info_user+0x64/0x80 [snd_pcm]
> > [ 40.287608] snd_pcm_ioctl+0x23/0x30 [snd_pcm]
> > [ 40.287613] ksys_ioctl+0x82/0xc0
> > [ 40.287617] __x64_sys_ioctl+0x16/0x20
> > [ 40.287622] do_syscall_64+0x4d/0x90
> > [ 40.287627] entry_SYSCALL_64_after_hwframe+0x44/0xa9
>
> Hi Roman,
>
> If you have CONFIG_AMD_MEM_ENCRYPT set, this should be resolved by
>
> commit dbed452a078d56bc7f1abecc3edd6a75e8e4484e
> Author: David Rientjes <rientjes@xxxxxxxxxx>
> Date: Thu Jun 11 00:25:57 2020 -0700
>
> dma-pool: decouple DMA_REMAP from DMA_COHERENT_POOL
>
> Or you might want to wait for 5.8-rc2 instead which includes this fix.
>

Hello, David!

Thank you for pointing it out! Unfortunately, your patch didn't help much:
I still see the same stack trace, so there must be something wrong with the
drivers.
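
For anyone reading along, the error code in the quoted oops can be decoded
by hand. Below is a minimal sketch in Python, assuming the usual x86
page-fault error-code bit layout (P, W/R, U/S, RSVD, I/D, PK). The names
PF_BITS and decode_pf_error_code are made up for illustration and are not
part of any kernel tool.

```python
# Hypothetical helper (not a kernel tool): decode an x86 #PF error code.
# Each entry is (bit mask, meaning when set, meaning when clear or None).
PF_BITS = [
    (0x01, "protection violation", "not-present page"),  # P
    (0x02, "write access", "read access"),               # W/R
    (0x04, "user mode", "kernel mode"),                  # U/S
    (0x08, "reserved bit violation", None),              # RSVD
    (0x10, "instruction fetch", None),                   # I/D
    (0x20, "protection keys violation", None),           # PK
]


def decode_pf_error_code(code):
    """Return a list of human-readable descriptions for the given error code."""
    descriptions = []
    for mask, if_set, if_clear in PF_BITS:
        text = if_set if code & mask else if_clear
        if text is not None:
            descriptions.append(text)
    return descriptions


if __name__ == "__main__":
    # 0x000b is the error_code reported in the oops quoted above.
    print(decode_pf_error_code(0x000B))
```

For 0x000b this reports a write from kernel mode to a present page with a
reserved bit set in the paging entries, which matches the #PF lines in the
log.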

I'll try again once 5.8-rc2 is out.

Thanks!