Re: general protection fault in find_device

From: Nikolay Borisov
Date: Mon Jun 18 2018 - 04:26:44 EST


[Actually adding Anand to CC, Anand see my analysis of the issue below
from previous email]

On 18.06.2018 10:03, Nikolay Borisov wrote:
> [Adding Anand to CC list since he's been doing devices-related work]
>
> On 18.06.2018 08:55, syzbot wrote:
>> Hello,
>>
>> syzbot found the following crash on:
>>
>> HEAD commit:ÂÂÂ ce397d215ccd Linux 4.18-rc1
>> git tree:ÂÂÂÂÂÂ upstream
>> console output: https://syzkaller.appspot.com/x/log.txt?x=14e765f8400000
>> kernel config:Â https://syzkaller.appspot.com/x/.config?x=f390986c4f7cd566
>> dashboard link:
>> https://syzkaller.appspot.com/bug?extid=923aa93978c7ad27a9b1
>> compiler:ÂÂÂÂÂÂ gcc (GCC) 8.0.1 20180413 (experimental)
>>
>> Unfortunately, I don't have any reproducer for this crash yet.
>>
>> IMPORTANT: if you fix the bug, please add the following tag to the commit:
>> Reported-by: syzbot+923aa93978c7ad27a9b1@xxxxxxxxxxxxxxxxxxxxxxxxx
>>
>> kasan: CONFIG_KASAN_INLINE enabled
>> kasan: GPF could be caused by NULL-ptr deref or user memory access
>> general protection fault: 0000 [#1] SMP KASAN
>> CPU: 0 PID: 14460 Comm: syz-executor5 Not tainted 4.18.0-rc1+ #107
>> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS
>> Google 01/01/2011
>> RIP: 0010:find_device+0x94/0x130 fs/btrfs/volumes.c:366
>> Code: 42 80 3c 28 00 0f 85 9d 00 00 00 48 8b 1b 4c 39 f3 0f 84 86 00 00
>> 00 e8 6a 79 b1 fe 48 8d bb c0 00 00 00 48 89 f8 48 c1 e8 03 <42> 80 3c
>> 28 00 75 70 4c 8b bb c0 00 00 00 4c 89 e6 4c 89 ff e8 f3
>> RSP: 0018:ffff8801d880ee70 EFLAGS: 00010206
>> RAX: 0000000000000018 RBX: 0000000000000000 RCX: ffffc9000d8a5000
>> RDX: 0000000000002d14 RSI: ffffffff82ca3136 RDI: 00000000000000c0
>> RBP: ffff8801d880eea8 R08: ffff8801abee0240 R09: fffffbfff123dea8
>> R10: ffff8801d880f178 R11: ffffffff891ef547 R12: 231f7dc339e55e1c
>> R13: dffffc0000000000 R14: ffff8801d7a65b98 R15: 0000000000000000
>> FS:Â 00007faa9dcb2700(0000) GS:ffff8801dae00000(0000)
>> knlGS:0000000000000000
>> CS:Â 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
>> CR2: 000000000093002d CR3: 00000001bd208000 CR4: 00000000001406f0
>> DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
>> DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
>> Call Trace:
>> Âdevice_list_add+0x230/0x1530 fs/btrfs/volumes.c:771
>> Âbtrfs_scan_one_device+0x474/0xb00 fs/btrfs/volumes.c:1247
>> Âbtrfs_mount_root+0x3ae/0x1e90 fs/btrfs/super.c:1542
>> Âmount_fs+0xae/0x328 fs/super.c:1277
>> Âvfs_kern_mount.part.34+0xdc/0x4e0 fs/namespace.c:1037
>> Âvfs_kern_mount+0x40/0x60 fs/namespace.c:1027
>> Âbtrfs_mount+0x4a9/0x215e fs/btrfs/super.c:1661
>> Âmount_fs+0xae/0x328 fs/super.c:1277
>> Âvfs_kern_mount.part.34+0xdc/0x4e0 fs/namespace.c:1037
>> Âvfs_kern_mount fs/namespace.c:1027 [inline]
>> Âdo_new_mount fs/namespace.c:2518 [inline]
>> Âdo_mount+0x581/0x30e0 fs/namespace.c:2848
>> Âksys_mount+0x12d/0x140 fs/namespace.c:3064
>> Â__do_sys_mount fs/namespace.c:3078 [inline]
>> Â__se_sys_mount fs/namespace.c:3075 [inline]
>> Â__x64_sys_mount+0xbe/0x150 fs/namespace.c:3075
>> Âdo_syscall_64+0x1b9/0x820 arch/x86/entry/common.c:290
>> Âentry_SYSCALL_64_after_hwframe+0x49/0xbe
>> RIP: 0033:0x45855a
>> Code: b8 a6 00 00 00 0f 05 48 3d 01 f0 ff ff 0f 83 dd 8f fb ff c3 66 2e
>> 0f 1f 84 00 00 00 00 00 66 90 49 89 ca b8 a5 00 00 00 0f 05 <48> 3d 01
>> f0 ff ff 0f 83 ba 8f fb ff c3 66 0f 1f 84 00 00 00 00 00
>> RSP: 002b:00007faa9dcb1a88 EFLAGS: 00000202 ORIG_RAX: 00000000000000a5
>> RAX: ffffffffffffffda RBX: 0000000020000428 RCX: 000000000045855a
>> RDX: 00007faa9dcb1ad0 RSI: 00000000200000c0 RDI: 00007faa9dcb1af0
>> RBP: 0000000000000001 R08: 00007faa9dcb1b30 R09: 00007faa9dcb1ad0
>> R10: 0000000000000000 R11: 0000000000000202 R12: 0000000000000013
>> R13: 0000000000000001 R14: 00000000004d2d78 R15: 0000000000000000
>
>
> So this suggests some inconsistency on fs_devices->devices list. On a
> quick look indeed it doesn't seem clear what the locking rules for this
> list are. In device_list_add in the !device case a device is added with
> fs_devices->device_list_Mutex held and using list_add_rcu. In the same
> function if we want to read the list ie invoke find_devices (because we
> have found an fsid) we are using plain list_for_each_entry (ie not the
> _rcu version and i don't see device_list_mutex being held while
> iterating the list). Additionally in btrfs_free_extra_devids the
> fs_devices->devices list is iterated with uuid_mutex being held and not
> device_list_mutex. In open_fs_devices we don't get any protection
> whatsoever while reading the list. Same thing in
> btrfs_find_next_active_device. If the list is supposed to be
> RCU-protected then the rules are:
>
> 1. There needs to be an out of band (ie not RCU) mutual exclusion of
> modifiers
> 2. Iterating the list should use _rcu list primitives.
>
> Currently I don't see those 2 invariants being enforced in every code path.
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>