[syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm

linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed

* [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
@ 2024-07-16  8:29 syzbot
  2024-07-18 10:59 ` Vlastimil Babka (SUSE)
  0 siblings, 1 reply; 10+ messages in thread
From: syzbot @ 2024-07-16  8:29 UTC (permalink / raw)
  To: akpm, davem, herbert, linux-crypto, linux-kernel, linux-mm,
	syzkaller-bugs

Hello,

syzbot found the following issue on:

HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
git tree:       linux-next
console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000

Downloadable assets:
disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz

IMPORTANT: if you fix the issue, please add the following tag to the commit:
Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com

==================================================================
BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235

CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
Call Trace:
 <TASK>
 __dump_stack lib/dump_stack.c:94 [inline]
 dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
 print_address_description mm/kasan/report.c:377 [inline]
 print_report+0x169/0x550 mm/kasan/report.c:488
 kasan_report+0x143/0x180 mm/kasan/report.c:601
 handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
 faultin_page mm/gup.c:1194 [inline]
 __get_user_pages+0x6ec/0x16a0 mm/gup.c:1493
 populate_vma_page_range+0x264/0x330 mm/gup.c:1932
 __mm_populate+0x27a/0x460 mm/gup.c:2035
 mm_populate include/linux/mm.h:3429 [inline]
 vm_mmap_pgoff+0x2c3/0x3d0 mm/util.c:593
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f
RIP: 0033:0x7f093ce17fe9
Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 1d 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
RSP: 002b:00007f093cd9e158 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
RAX: ffffffffffffffda RBX: 00007f093ce9f4b8 RCX: 00007f093ce17fe9
RDX: 0000000000000002 RSI: 0000000000b36000 RDI: 0000000020000000
RBP: 00007f093ce9f4b0 R08: 00000000ffffffff R09: 0000000000000000
R10: 0000000000008031 R11: 0000000000000246 R12: 00007f093ce9f4bc
R13: 000000000000006e R14: 00007ffe8008cc30 R15: 00007ffe8008cd18
 </TASK>

Allocated by task 5235:
 kasan_save_stack mm/kasan/common.c:47 [inline]
 kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
 unpoison_slab_object mm/kasan/common.c:312 [inline]
 __kasan_slab_alloc+0x66/0x80 mm/kasan/common.c:338
 kasan_slab_alloc include/linux/kasan.h:201 [inline]
 slab_post_alloc_hook mm/slub.c:3988 [inline]
 slab_alloc_node mm/slub.c:4037 [inline]
 kmem_cache_alloc_noprof+0x135/0x2a0 mm/slub.c:4044
 vm_area_alloc+0x24/0x1d0 kernel/fork.c:472
 mmap_region+0xc3d/0x2090 mm/mmap.c:2944
 do_mmap+0x8f9/0x1010 mm/mmap.c:1468
 vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f

Freed by task 5237:
 kasan_save_stack mm/kasan/common.c:47 [inline]
 kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
 kasan_save_free_info+0x40/0x50 mm/kasan/generic.c:579
 poison_slab_object+0xe0/0x150 mm/kasan/common.c:240
 __kasan_slab_free+0x37/0x60 mm/kasan/common.c:256
 kasan_slab_free include/linux/kasan.h:184 [inline]
 slab_free_hook mm/slub.c:2252 [inline]
 slab_free mm/slub.c:4473 [inline]
 kmem_cache_free+0x145/0x350 mm/slub.c:4548
 rcu_do_batch kernel/rcu/tree.c:2569 [inline]
 rcu_core+0xafd/0x1830 kernel/rcu/tree.c:2843
 handle_softirqs+0x2c4/0x970 kernel/softirq.c:554
 __do_softirq kernel/softirq.c:588 [inline]
 invoke_softirq kernel/softirq.c:428 [inline]
 __irq_exit_rcu+0xf4/0x1c0 kernel/softirq.c:637
 irq_exit_rcu+0x9/0x30 kernel/softirq.c:649
 instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1043 [inline]
 sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1043
 asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702

Last potentially related work creation:
 kasan_save_stack+0x3f/0x60 mm/kasan/common.c:47
 __kasan_record_aux_stack+0xac/0xc0 mm/kasan/generic.c:541
 __call_rcu_common kernel/rcu/tree.c:3106 [inline]
 call_rcu+0x167/0xa70 kernel/rcu/tree.c:3210
 remove_vma mm/mmap.c:189 [inline]
 remove_mt mm/mmap.c:2415 [inline]
 do_vmi_align_munmap+0x155c/0x18c0 mm/mmap.c:2758
 do_vmi_munmap+0x261/0x2f0 mm/mmap.c:2830
 mmap_region+0x72f/0x2090 mm/mmap.c:2881
 do_mmap+0x8f9/0x1010 mm/mmap.c:1468
 vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
 ksys_mmap_pgoff+0x544/0x720 mm/mmap.c:1514
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f

The buggy address belongs to the object at ffff88802c4719b0
 which belongs to the cache vm_area_struct of size 184
The buggy address is located 32 bytes inside of
 freed 184-byte region [ffff88802c4719b0, ffff88802c471a68)

The buggy address belongs to the physical page:
page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x2c471
memcg:ffff888024312e01
flags: 0xfff00000000000(node=0|zone=1|lastcpupid=0x7ff)
page_type: 0xfdffffff(slab)
raw: 00fff00000000000 ffff888015eefb40 ffffea000091a480 0000000000000008
raw: 0000000000000000 0000000080100010 00000001fdffffff ffff888024312e01
page dumped because: kasan: bad access detected
page_owner tracks the page as allocated
page last allocated via order 0, migratetype Unmovable, gfp_mask 0x52cc0(GFP_KERNEL|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP), pid 4769, tgid 4769 (S41dhcpcd), ts 42147845478, free_ts 42112307396
 set_page_owner include/linux/page_owner.h:32 [inline]
 post_alloc_hook+0x1f3/0x230 mm/page_alloc.c:1493
 prep_new_page mm/page_alloc.c:1501 [inline]
 get_page_from_freelist+0x2ccb/0x2d80 mm/page_alloc.c:3474
 __alloc_pages_noprof+0x256/0x6c0 mm/page_alloc.c:4732
 __alloc_pages_node_noprof include/linux/gfp.h:269 [inline]
 alloc_pages_node_noprof include/linux/gfp.h:296 [inline]
 alloc_slab_page+0x5f/0x120 mm/slub.c:2321
 allocate_slab+0x5a/0x2f0 mm/slub.c:2484
 new_slab mm/slub.c:2537 [inline]
 ___slab_alloc+0xcd1/0x14b0 mm/slub.c:3723
 __slab_alloc+0x58/0xa0 mm/slub.c:3813
 __slab_alloc_node mm/slub.c:3866 [inline]
 slab_alloc_node mm/slub.c:4025 [inline]
 kmem_cache_alloc_noprof+0x1c1/0x2a0 mm/slub.c:4044
 vm_area_dup+0x27/0x290 kernel/fork.c:487
 dup_mmap kernel/fork.c:696 [inline]
 dup_mm kernel/fork.c:1674 [inline]
 copy_mm+0xc7b/0x1f30 kernel/fork.c:1723
 copy_process+0x186b/0x3d90 kernel/fork.c:2376
 kernel_clone+0x226/0x8f0 kernel/fork.c:2788
 __do_sys_clone kernel/fork.c:2931 [inline]
 __se_sys_clone kernel/fork.c:2915 [inline]
 __x64_sys_clone+0x258/0x2a0 kernel/fork.c:2915
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f
page last free pid 4768 tgid 4768 stack trace:
 reset_page_owner include/linux/page_owner.h:25 [inline]
 free_pages_prepare mm/page_alloc.c:1094 [inline]
 free_unref_folios+0x103a/0x1b00 mm/page_alloc.c:2692
 folios_put_refs+0x76e/0x860 mm/swap.c:1039
 free_pages_and_swap_cache+0x5c8/0x690 mm/swap_state.c:335
 __tlb_batch_free_encoded_pages mm/mmu_gather.c:136 [inline]
 tlb_batch_pages_flush mm/mmu_gather.c:149 [inline]
 tlb_flush_mmu_free mm/mmu_gather.c:366 [inline]
 tlb_flush_mmu+0x3a3/0x680 mm/mmu_gather.c:373
 tlb_finish_mmu+0xd4/0x200 mm/mmu_gather.c:465
 exit_mmap+0x44f/0xc80 mm/mmap.c:3425
 __mmput+0x115/0x390 kernel/fork.c:1347
 exit_mm+0x220/0x310 kernel/exit.c:571
 do_exit+0x9b2/0x27f0 kernel/exit.c:869
 do_group_exit+0x207/0x2c0 kernel/exit.c:1031
 __do_sys_exit_group kernel/exit.c:1042 [inline]
 __se_sys_exit_group kernel/exit.c:1040 [inline]
 __x64_sys_exit_group+0x3f/0x40 kernel/exit.c:1040
 x64_sys_call+0x26c3/0x26d0 arch/x86/include/generated/asm/syscalls_64.h:232
 do_syscall_x64 arch/x86/entry/common.c:52 [inline]
 do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
 entry_SYSCALL_64_after_hwframe+0x77/0x7f

Memory state around the buggy address:
 ffff88802c471880: fc fc fc fc fc fc fc 00 00 00 00 00 00 00 00 00
 ffff88802c471900: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 fc fc
>ffff88802c471980: fc fc fc fc fc fc fa fb fb fb fb fb fb fb fb fb
                                                 ^
 ffff88802c471a00: fb fb fb fb fb fb fb fb fb fb fb fb fb fc fc fc
 ffff88802c471a80: fc fc fc fc fc 00 00 00 00 00 00 00 00 00 00 00
==================================================================


---
This report is generated by a bot. It may contain errors.
See https://goo.gl/tpsmEJ for more information about syzbot.
syzbot engineers can be reached at syzkaller@googlegroups.com.

syzbot will keep track of this issue. See:
https://goo.gl/tpsmEJ#status for how to communicate with syzbot.

If the report is already addressed, let syzbot know by replying with:
#syz fix: exact-commit-title

If you want syzbot to run the reproducer, reply with:
#syz test: git://repo/address.git branch-or-commit-hash
If you attach or paste a git patch, syzbot will apply it before testing.

If you want to overwrite report's subsystems, reply with:
#syz set subsystems: new-subsystem
(See the list of subsystem names on the web dashboard)

If the report is a duplicate of another one, reply with:
#syz dup: exact-subject-of-another-report

If you want to undo deduplication, reply with:
#syz undup


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-16  8:29 [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault syzbot
@ 2024-07-18 10:59 ` Vlastimil Babka (SUSE)
  2024-07-18 15:43   ` Liam R. Howlett
  0 siblings, 1 reply; 10+ messages in thread
From: Vlastimil Babka (SUSE) @ 2024-07-18 10:59 UTC (permalink / raw)
  To: syzbot, akpm, davem, herbert, linux-crypto, linux-kernel,
	linux-mm, syzkaller-bugs
  Cc: Suren Baghdasaryan, Liam R. Howlett, Lorenzo Stoakes

On 7/16/24 10:29 AM, syzbot wrote:
> Hello,

dunno about the [crypto?] parts, sounds rather something for Suren or Liam
or maybe it's due to some changes to gup?

> syzbot found the following issue on:
> 
> HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> git tree:       linux-next
> console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> 
> Downloadable assets:
> disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> 
> IMPORTANT: if you fix the issue, please add the following tag to the commit:
> Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> 
> ==================================================================
> BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> 
> CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> Call Trace:
>  <TASK>
>  __dump_stack lib/dump_stack.c:94 [inline]
>  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
>  print_address_description mm/kasan/report.c:377 [inline]
>  print_report+0x169/0x550 mm/kasan/report.c:488
>  kasan_report+0x143/0x180 mm/kasan/report.c:601
>  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
>  faultin_page mm/gup.c:1194 [inline]
>  __get_user_pages+0x6ec/0x16a0 mm/gup.c:1493
>  populate_vma_page_range+0x264/0x330 mm/gup.c:1932
>  __mm_populate+0x27a/0x460 mm/gup.c:2035
>  mm_populate include/linux/mm.h:3429 [inline]
>  vm_mmap_pgoff+0x2c3/0x3d0 mm/util.c:593
>  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
>  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
>  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> RIP: 0033:0x7f093ce17fe9
> Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 1d 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
> RSP: 002b:00007f093cd9e158 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
> RAX: ffffffffffffffda RBX: 00007f093ce9f4b8 RCX: 00007f093ce17fe9
> RDX: 0000000000000002 RSI: 0000000000b36000 RDI: 0000000020000000
> RBP: 00007f093ce9f4b0 R08: 00000000ffffffff R09: 0000000000000000
> R10: 0000000000008031 R11: 0000000000000246 R12: 00007f093ce9f4bc
> R13: 000000000000006e R14: 00007ffe8008cc30 R15: 00007ffe8008cd18
>  </TASK>
> 
> Allocated by task 5235:
>  kasan_save_stack mm/kasan/common.c:47 [inline]
>  kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
>  unpoison_slab_object mm/kasan/common.c:312 [inline]
>  __kasan_slab_alloc+0x66/0x80 mm/kasan/common.c:338
>  kasan_slab_alloc include/linux/kasan.h:201 [inline]
>  slab_post_alloc_hook mm/slub.c:3988 [inline]
>  slab_alloc_node mm/slub.c:4037 [inline]
>  kmem_cache_alloc_noprof+0x135/0x2a0 mm/slub.c:4044
>  vm_area_alloc+0x24/0x1d0 kernel/fork.c:472
>  mmap_region+0xc3d/0x2090 mm/mmap.c:2944
>  do_mmap+0x8f9/0x1010 mm/mmap.c:1468
>  vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
>  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
>  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
>  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> 
> Freed by task 5237:
>  kasan_save_stack mm/kasan/common.c:47 [inline]
>  kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
>  kasan_save_free_info+0x40/0x50 mm/kasan/generic.c:579
>  poison_slab_object+0xe0/0x150 mm/kasan/common.c:240
>  __kasan_slab_free+0x37/0x60 mm/kasan/common.c:256
>  kasan_slab_free include/linux/kasan.h:184 [inline]
>  slab_free_hook mm/slub.c:2252 [inline]
>  slab_free mm/slub.c:4473 [inline]
>  kmem_cache_free+0x145/0x350 mm/slub.c:4548
>  rcu_do_batch kernel/rcu/tree.c:2569 [inline]
>  rcu_core+0xafd/0x1830 kernel/rcu/tree.c:2843
>  handle_softirqs+0x2c4/0x970 kernel/softirq.c:554
>  __do_softirq kernel/softirq.c:588 [inline]
>  invoke_softirq kernel/softirq.c:428 [inline]
>  __irq_exit_rcu+0xf4/0x1c0 kernel/softirq.c:637
>  irq_exit_rcu+0x9/0x30 kernel/softirq.c:649
>  instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1043 [inline]
>  sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1043
>  asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
> 
> Last potentially related work creation:
>  kasan_save_stack+0x3f/0x60 mm/kasan/common.c:47
>  __kasan_record_aux_stack+0xac/0xc0 mm/kasan/generic.c:541
>  __call_rcu_common kernel/rcu/tree.c:3106 [inline]
>  call_rcu+0x167/0xa70 kernel/rcu/tree.c:3210
>  remove_vma mm/mmap.c:189 [inline]
>  remove_mt mm/mmap.c:2415 [inline]
>  do_vmi_align_munmap+0x155c/0x18c0 mm/mmap.c:2758
>  do_vmi_munmap+0x261/0x2f0 mm/mmap.c:2830
>  mmap_region+0x72f/0x2090 mm/mmap.c:2881
>  do_mmap+0x8f9/0x1010 mm/mmap.c:1468
>  vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
>  ksys_mmap_pgoff+0x544/0x720 mm/mmap.c:1514
>  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
>  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
>  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> 
> The buggy address belongs to the object at ffff88802c4719b0
>  which belongs to the cache vm_area_struct of size 184
> The buggy address is located 32 bytes inside of
>  freed 184-byte region [ffff88802c4719b0, ffff88802c471a68)
> 
> The buggy address belongs to the physical page:
> page: refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x2c471
> memcg:ffff888024312e01
> flags: 0xfff00000000000(node=0|zone=1|lastcpupid=0x7ff)
> page_type: 0xfdffffff(slab)
> raw: 00fff00000000000 ffff888015eefb40 ffffea000091a480 0000000000000008
> raw: 0000000000000000 0000000080100010 00000001fdffffff ffff888024312e01
> page dumped because: kasan: bad access detected
> page_owner tracks the page as allocated
> page last allocated via order 0, migratetype Unmovable, gfp_mask 0x52cc0(GFP_KERNEL|__GFP_NOWARN|__GFP_NORETRY|__GFP_COMP), pid 4769, tgid 4769 (S41dhcpcd), ts 42147845478, free_ts 42112307396
>  set_page_owner include/linux/page_owner.h:32 [inline]
>  post_alloc_hook+0x1f3/0x230 mm/page_alloc.c:1493
>  prep_new_page mm/page_alloc.c:1501 [inline]
>  get_page_from_freelist+0x2ccb/0x2d80 mm/page_alloc.c:3474
>  __alloc_pages_noprof+0x256/0x6c0 mm/page_alloc.c:4732
>  __alloc_pages_node_noprof include/linux/gfp.h:269 [inline]
>  alloc_pages_node_noprof include/linux/gfp.h:296 [inline]
>  alloc_slab_page+0x5f/0x120 mm/slub.c:2321
>  allocate_slab+0x5a/0x2f0 mm/slub.c:2484
>  new_slab mm/slub.c:2537 [inline]
>  ___slab_alloc+0xcd1/0x14b0 mm/slub.c:3723
>  __slab_alloc+0x58/0xa0 mm/slub.c:3813
>  __slab_alloc_node mm/slub.c:3866 [inline]
>  slab_alloc_node mm/slub.c:4025 [inline]
>  kmem_cache_alloc_noprof+0x1c1/0x2a0 mm/slub.c:4044
>  vm_area_dup+0x27/0x290 kernel/fork.c:487
>  dup_mmap kernel/fork.c:696 [inline]
>  dup_mm kernel/fork.c:1674 [inline]
>  copy_mm+0xc7b/0x1f30 kernel/fork.c:1723
>  copy_process+0x186b/0x3d90 kernel/fork.c:2376
>  kernel_clone+0x226/0x8f0 kernel/fork.c:2788
>  __do_sys_clone kernel/fork.c:2931 [inline]
>  __se_sys_clone kernel/fork.c:2915 [inline]
>  __x64_sys_clone+0x258/0x2a0 kernel/fork.c:2915
>  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
>  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
>  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> page last free pid 4768 tgid 4768 stack trace:
>  reset_page_owner include/linux/page_owner.h:25 [inline]
>  free_pages_prepare mm/page_alloc.c:1094 [inline]
>  free_unref_folios+0x103a/0x1b00 mm/page_alloc.c:2692
>  folios_put_refs+0x76e/0x860 mm/swap.c:1039
>  free_pages_and_swap_cache+0x5c8/0x690 mm/swap_state.c:335
>  __tlb_batch_free_encoded_pages mm/mmu_gather.c:136 [inline]
>  tlb_batch_pages_flush mm/mmu_gather.c:149 [inline]
>  tlb_flush_mmu_free mm/mmu_gather.c:366 [inline]
>  tlb_flush_mmu+0x3a3/0x680 mm/mmu_gather.c:373
>  tlb_finish_mmu+0xd4/0x200 mm/mmu_gather.c:465
>  exit_mmap+0x44f/0xc80 mm/mmap.c:3425
>  __mmput+0x115/0x390 kernel/fork.c:1347
>  exit_mm+0x220/0x310 kernel/exit.c:571
>  do_exit+0x9b2/0x27f0 kernel/exit.c:869
>  do_group_exit+0x207/0x2c0 kernel/exit.c:1031
>  __do_sys_exit_group kernel/exit.c:1042 [inline]
>  __se_sys_exit_group kernel/exit.c:1040 [inline]
>  __x64_sys_exit_group+0x3f/0x40 kernel/exit.c:1040
>  x64_sys_call+0x26c3/0x26d0 arch/x86/include/generated/asm/syscalls_64.h:232
>  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
>  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
>  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> 
> Memory state around the buggy address:
>  ffff88802c471880: fc fc fc fc fc fc fc 00 00 00 00 00 00 00 00 00
>  ffff88802c471900: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 fc fc
>>ffff88802c471980: fc fc fc fc fc fc fa fb fb fb fb fb fb fb fb fb
>                                                  ^
>  ffff88802c471a00: fb fb fb fb fb fb fb fb fb fb fb fb fb fc fc fc
>  ffff88802c471a80: fc fc fc fc fc 00 00 00 00 00 00 00 00 00 00 00
> ==================================================================
> 
> 
> ---
> This report is generated by a bot. It may contain errors.
> See https://goo.gl/tpsmEJ for more information about syzbot.
> syzbot engineers can be reached at syzkaller@googlegroups.com.
> 
> syzbot will keep track of this issue. See:
> https://goo.gl/tpsmEJ#status for how to communicate with syzbot.
> 
> If the report is already addressed, let syzbot know by replying with:
> #syz fix: exact-commit-title
> 
> If you want syzbot to run the reproducer, reply with:
> #syz test: git://repo/address.git branch-or-commit-hash
> If you attach or paste a git patch, syzbot will apply it before testing.
> 
> If you want to overwrite report's subsystems, reply with:
> #syz set subsystems: new-subsystem
> (See the list of subsystem names on the web dashboard)
> 
> If the report is a duplicate of another one, reply with:
> #syz dup: exact-subject-of-another-report
> 
> If you want to undo deduplication, reply with:
> #syz undup
> 



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 10:59 ` Vlastimil Babka (SUSE)
@ 2024-07-18 15:43   ` Liam R. Howlett
  2024-07-18 16:20     ` Suren Baghdasaryan
  0 siblings, 1 reply; 10+ messages in thread
From: Liam R. Howlett @ 2024-07-18 15:43 UTC (permalink / raw)
  To: Vlastimil Babka (SUSE)
  Cc: syzbot, akpm, davem, herbert, linux-crypto, linux-kernel,
	linux-mm, syzkaller-bugs, Suren Baghdasaryan, Lorenzo Stoakes

* Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> On 7/16/24 10:29 AM, syzbot wrote:
> > Hello,
> 
> dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> or maybe it's due to some changes to gup?

Yes, that crypto part is very odd.

> 
> > syzbot found the following issue on:
> > 
> > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > git tree:       linux-next
> > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > 
> > Downloadable assets:
> > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > 
> > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > 
> > ==================================================================
> > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > 
> > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > Call Trace:
> >  <TASK>
> >  __dump_stack lib/dump_stack.c:94 [inline]
> >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> >  print_address_description mm/kasan/report.c:377 [inline]
> >  print_report+0x169/0x550 mm/kasan/report.c:488
> >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842

/*
 * By the time we get here, we already hold the mm semaphore
 *
 * The mmap_lock may have been released depending on flags and our
 * return value.  See filemap_fault() and __folio_lock_or_retry().
 */

Somehow we are here without an RCU or mmap_lock held?

> >  faultin_page mm/gup.c:1194 [inline]

/*
 * mmap_lock must be held on entry.  If @flags has FOLL_UNLOCKABLE but not
 * FOLL_NOWAIT, the mmap_lock may be released.  If it is, *@locked will be set
 * to 0 and -EBUSY returned.
 */

We should probably have a lockdep check there then?

> >  __get_user_pages+0x6ec/0x16a0 mm/gup.c:1493
> >  populate_vma_page_range+0x264/0x330 mm/gup.c:1932
> >  __mm_populate+0x27a/0x460 mm/gup.c:2035

/*
 * __mm_populate - populate and/or mlock pages within a range of address space.
 *
 * This is used to implement mlock() and the MAP_POPULATE / MAP_LOCKED mmap
 * flags. VMAs must be already marked with the desired vm_flags, and
 * mmap_lock must not be held.
 */

What ensures the vma doesn't go away then? - I guess nothing, because it
went away.

I don't get it.. __mm_populate() must NOT have the mmap_lock, but
faultin_page() must hold the mmap_lock...

> >  mm_populate include/linux/mm.h:3429 [inline]
> >  vm_mmap_pgoff+0x2c3/0x3d0 mm/util.c:593
> >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > RIP: 0033:0x7f093ce17fe9
> > Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 1d 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
> > RSP: 002b:00007f093cd9e158 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
> > RAX: ffffffffffffffda RBX: 00007f093ce9f4b8 RCX: 00007f093ce17fe9
> > RDX: 0000000000000002 RSI: 0000000000b36000 RDI: 0000000020000000
> > RBP: 00007f093ce9f4b0 R08: 00000000ffffffff R09: 0000000000000000
> > R10: 0000000000008031 R11: 0000000000000246 R12: 00007f093ce9f4bc
> > R13: 000000000000006e R14: 00007ffe8008cc30 R15: 00007ffe8008cd18
> >  </TASK>
> > 
> > Allocated by task 5235:
...

> > 
> > Freed by task 5237:
> >  kasan_save_stack mm/kasan/common.c:47 [inline]
> >  kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
> >  kasan_save_free_info+0x40/0x50 mm/kasan/generic.c:579
> >  poison_slab_object+0xe0/0x150 mm/kasan/common.c:240
> >  __kasan_slab_free+0x37/0x60 mm/kasan/common.c:256
> >  kasan_slab_free include/linux/kasan.h:184 [inline]
> >  slab_free_hook mm/slub.c:2252 [inline]
> >  slab_free mm/slub.c:4473 [inline]
> >  kmem_cache_free+0x145/0x350 mm/slub.c:4548
> >  rcu_do_batch kernel/rcu/tree.c:2569 [inline]
> >  rcu_core+0xafd/0x1830 kernel/rcu/tree.c:2843

This seems right.  RCU freeing of a vma here, so that's okay.

> >  handle_softirqs+0x2c4/0x970 kernel/softirq.c:554
> >  __do_softirq kernel/softirq.c:588 [inline]
> >  invoke_softirq kernel/softirq.c:428 [inline]
> >  __irq_exit_rcu+0xf4/0x1c0 kernel/softirq.c:637
> >  irq_exit_rcu+0x9/0x30 kernel/softirq.c:649
> >  instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1043 [inline]
> >  sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1043
> >  asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
> > 
> > Last potentially related work creation:

Also fine.

> >  kasan_save_stack+0x3f/0x60 mm/kasan/common.c:47
> >  __kasan_record_aux_stack+0xac/0xc0 mm/kasan/generic.c:541
> >  __call_rcu_common kernel/rcu/tree.c:3106 [inline]
> >  call_rcu+0x167/0xa70 kernel/rcu/tree.c:3210
> >  remove_vma mm/mmap.c:189 [inline]
> >  remove_mt mm/mmap.c:2415 [inline]
> >  do_vmi_align_munmap+0x155c/0x18c0 mm/mmap.c:2758
> >  do_vmi_munmap+0x261/0x2f0 mm/mmap.c:2830
> >  mmap_region+0x72f/0x2090 mm/mmap.c:2881
> >  do_mmap+0x8f9/0x1010 mm/mmap.c:1468
> >  vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
> >  ksys_mmap_pgoff+0x544/0x720 mm/mmap.c:1514
> >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > 
> > The buggy address belongs to the object at ffff88802c4719b0
> >  which belongs to the cache vm_area_struct of size 184

...



^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 15:43   ` Liam R. Howlett
@ 2024-07-18 16:20     ` Suren Baghdasaryan
  2024-07-18 16:23       ` Suren Baghdasaryan
  0 siblings, 1 reply; 10+ messages in thread
From: Suren Baghdasaryan @ 2024-07-18 16:20 UTC (permalink / raw)
  To: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Suren Baghdasaryan, Lorenzo Stoakes

On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
>
> * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > On 7/16/24 10:29 AM, syzbot wrote:
> > > Hello,
> >
> > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > or maybe it's due to some changes to gup?
>
> Yes, that crypto part is very odd.
>
> >
> > > syzbot found the following issue on:
> > >
> > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > git tree:       linux-next
> > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > >
> > > Downloadable assets:
> > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > >
> > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > >
> > > ==================================================================
> > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > >
> > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > Call Trace:
> > >  <TASK>
> > >  __dump_stack lib/dump_stack.c:94 [inline]
> > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > >  print_address_description mm/kasan/report.c:377 [inline]
> > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
>
> /*
>  * By the time we get here, we already hold the mm semaphore
>  *
>  * The mmap_lock may have been released depending on flags and our
>  * return value.  See filemap_fault() and __folio_lock_or_retry().
>  */
>
> Somehow we are here without an RCU or mmap_lock held?

I'm guessing we did enter handle_mm_fault() with mmap_lock held but
__handle_mm_fault() dropped it before returning, see the comment for
__handle_mm_fault():

/*
 * On entry, we hold either the VMA lock or the mmap_lock
 * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
 * the result, the mmap_lock is not held on exit.  See filemap_fault()
 * and __folio_lock_or_retry().
 */

So after that there is nothing that guarantees VMA is not destroyed
from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
Hillf's suggestion should fix this issue but we need to figure out how
to make this path more robust. Currently it's very easy to make a
similar mistake. Maybe a WARNING comment after __handle_mm_fault()
that VMA might be unstable after that function and should not be used?

>
> > >  faultin_page mm/gup.c:1194 [inline]
>
> /*
>  * mmap_lock must be held on entry.  If @flags has FOLL_UNLOCKABLE but not
>  * FOLL_NOWAIT, the mmap_lock may be released.  If it is, *@locked will be set
>  * to 0 and -EBUSY returned.
>  */
>
> We should probably have a lockdep check there then?
>
> > >  __get_user_pages+0x6ec/0x16a0 mm/gup.c:1493
> > >  populate_vma_page_range+0x264/0x330 mm/gup.c:1932
> > >  __mm_populate+0x27a/0x460 mm/gup.c:2035
>
> /*
>  * __mm_populate - populate and/or mlock pages within a range of address space.
>  *
>  * This is used to implement mlock() and the MAP_POPULATE / MAP_LOCKED mmap
>  * flags. VMAs must be already marked with the desired vm_flags, and
>  * mmap_lock must not be held.
>  */
>
> What ensures the vma doesn't go away then? - I guess nothing, because it
> went away.
>
> I don't get it.. __mm_populate() must NOT have the mmap_lock, but
> faultin_page() must hold the mmap_lock...
>
> > >  mm_populate include/linux/mm.h:3429 [inline]
> > >  vm_mmap_pgoff+0x2c3/0x3d0 mm/util.c:593
> > >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> > >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > RIP: 0033:0x7f093ce17fe9
> > > Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 1d 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
> > > RSP: 002b:00007f093cd9e158 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
> > > RAX: ffffffffffffffda RBX: 00007f093ce9f4b8 RCX: 00007f093ce17fe9
> > > RDX: 0000000000000002 RSI: 0000000000b36000 RDI: 0000000020000000
> > > RBP: 00007f093ce9f4b0 R08: 00000000ffffffff R09: 0000000000000000
> > > R10: 0000000000008031 R11: 0000000000000246 R12: 00007f093ce9f4bc
> > > R13: 000000000000006e R14: 00007ffe8008cc30 R15: 00007ffe8008cd18
> > >  </TASK>
> > >
> > > Allocated by task 5235:
> ...
>
> > >
> > > Freed by task 5237:
> > >  kasan_save_stack mm/kasan/common.c:47 [inline]
> > >  kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
> > >  kasan_save_free_info+0x40/0x50 mm/kasan/generic.c:579
> > >  poison_slab_object+0xe0/0x150 mm/kasan/common.c:240
> > >  __kasan_slab_free+0x37/0x60 mm/kasan/common.c:256
> > >  kasan_slab_free include/linux/kasan.h:184 [inline]
> > >  slab_free_hook mm/slub.c:2252 [inline]
> > >  slab_free mm/slub.c:4473 [inline]
> > >  kmem_cache_free+0x145/0x350 mm/slub.c:4548
> > >  rcu_do_batch kernel/rcu/tree.c:2569 [inline]
> > >  rcu_core+0xafd/0x1830 kernel/rcu/tree.c:2843
>
> This seems right.  RCU freeing of a vma here, so that's okay.
>
> > >  handle_softirqs+0x2c4/0x970 kernel/softirq.c:554
> > >  __do_softirq kernel/softirq.c:588 [inline]
> > >  invoke_softirq kernel/softirq.c:428 [inline]
> > >  __irq_exit_rcu+0xf4/0x1c0 kernel/softirq.c:637
> > >  irq_exit_rcu+0x9/0x30 kernel/softirq.c:649
> > >  instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1043 [inline]
> > >  sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1043
> > >  asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
> > >
> > > Last potentially related work creation:
>
> Also fine.
>
> > >  kasan_save_stack+0x3f/0x60 mm/kasan/common.c:47
> > >  __kasan_record_aux_stack+0xac/0xc0 mm/kasan/generic.c:541
> > >  __call_rcu_common kernel/rcu/tree.c:3106 [inline]
> > >  call_rcu+0x167/0xa70 kernel/rcu/tree.c:3210
> > >  remove_vma mm/mmap.c:189 [inline]
> > >  remove_mt mm/mmap.c:2415 [inline]
> > >  do_vmi_align_munmap+0x155c/0x18c0 mm/mmap.c:2758
> > >  do_vmi_munmap+0x261/0x2f0 mm/mmap.c:2830
> > >  mmap_region+0x72f/0x2090 mm/mmap.c:2881
> > >  do_mmap+0x8f9/0x1010 mm/mmap.c:1468
> > >  vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
> > >  ksys_mmap_pgoff+0x544/0x720 mm/mmap.c:1514
> > >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> > >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > >
> > > The buggy address belongs to the object at ffff88802c4719b0
> > >  which belongs to the cache vm_area_struct of size 184
>
> ...
>


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:20     ` Suren Baghdasaryan
@ 2024-07-18 16:23       ` Suren Baghdasaryan
  2024-07-18 16:36         ` Jason A. Donenfeld
  0 siblings, 1 reply; 10+ messages in thread
From: Suren Baghdasaryan @ 2024-07-18 16:23 UTC (permalink / raw)
  To: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Suren Baghdasaryan, Lorenzo Stoakes
  Cc: Jason

On Thu, Jul 18, 2024 at 4:20 PM Suren Baghdasaryan <surenb@google.com> wrote:
>
> On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
> >
> > * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > > On 7/16/24 10:29 AM, syzbot wrote:
> > > > Hello,
> > >
> > > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > > or maybe it's due to some changes to gup?
> >
> > Yes, that crypto part is very odd.
> >
> > >
> > > > syzbot found the following issue on:
> > > >
> > > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > > git tree:       linux-next
> > > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > > >
> > > > Downloadable assets:
> > > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > > >
> > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > > >
> > > > ==================================================================
> > > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > > >
> > > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > > Call Trace:
> > > >  <TASK>
> > > >  __dump_stack lib/dump_stack.c:94 [inline]
> > > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > > >  print_address_description mm/kasan/report.c:377 [inline]
> > > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> >
> > /*
> >  * By the time we get here, we already hold the mm semaphore
> >  *
> >  * The mmap_lock may have been released depending on flags and our
> >  * return value.  See filemap_fault() and __folio_lock_or_retry().
> >  */
> >
> > Somehow we are here without an RCU or mmap_lock held?
>
> I'm guessing we did enter handle_mm_fault() with mmap_lock held but
> __handle_mm_fault() dropped it before returning, see the comment for
> __handle_mm_fault():
>
> /*
>  * On entry, we hold either the VMA lock or the mmap_lock
>  * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
>  * the result, the mmap_lock is not held on exit.  See filemap_fault()
>  * and __folio_lock_or_retry().
>  */
>
> So after that there is nothing that guarantees VMA is not destroyed
> from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
> Hillf's suggestion should fix this issue but we need to figure out how
> to make this path more robust. Currently it's very easy to make a
> similar mistake. Maybe a WARNING comment after __handle_mm_fault()
> that VMA might be unstable after that function and should not be used?

CC'ing Jason.

>
> >
> > > >  faultin_page mm/gup.c:1194 [inline]
> >
> > /*
> >  * mmap_lock must be held on entry.  If @flags has FOLL_UNLOCKABLE but not
> >  * FOLL_NOWAIT, the mmap_lock may be released.  If it is, *@locked will be set
> >  * to 0 and -EBUSY returned.
> >  */
> >
> > We should probably have a lockdep check there then?
> >
> > > >  __get_user_pages+0x6ec/0x16a0 mm/gup.c:1493
> > > >  populate_vma_page_range+0x264/0x330 mm/gup.c:1932
> > > >  __mm_populate+0x27a/0x460 mm/gup.c:2035
> >
> > /*
> >  * __mm_populate - populate and/or mlock pages within a range of address space.
> >  *
> >  * This is used to implement mlock() and the MAP_POPULATE / MAP_LOCKED mmap
> >  * flags. VMAs must be already marked with the desired vm_flags, and
> >  * mmap_lock must not be held.
> >  */
> >
> > What ensures the vma doesn't go away then? - I guess nothing, because it
> > went away.
> >
> > I don't get it.. __mm_populate() must NOT have the mmap_lock, but
> > faultin_page() must hold the mmap_lock...
> >
> > > >  mm_populate include/linux/mm.h:3429 [inline]
> > > >  vm_mmap_pgoff+0x2c3/0x3d0 mm/util.c:593
> > > >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> > > >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> > > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > > RIP: 0033:0x7f093ce17fe9
> > > > Code: 28 00 00 00 75 05 48 83 c4 28 c3 e8 e1 1d 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 c7 c1 b0 ff ff ff f7 d8 64 89 01 48
> > > > RSP: 002b:00007f093cd9e158 EFLAGS: 00000246 ORIG_RAX: 0000000000000009
> > > > RAX: ffffffffffffffda RBX: 00007f093ce9f4b8 RCX: 00007f093ce17fe9
> > > > RDX: 0000000000000002 RSI: 0000000000b36000 RDI: 0000000020000000
> > > > RBP: 00007f093ce9f4b0 R08: 00000000ffffffff R09: 0000000000000000
> > > > R10: 0000000000008031 R11: 0000000000000246 R12: 00007f093ce9f4bc
> > > > R13: 000000000000006e R14: 00007ffe8008cc30 R15: 00007ffe8008cd18
> > > >  </TASK>
> > > >
> > > > Allocated by task 5235:
> > ...
> >
> > > >
> > > > Freed by task 5237:
> > > >  kasan_save_stack mm/kasan/common.c:47 [inline]
> > > >  kasan_save_track+0x3f/0x80 mm/kasan/common.c:68
> > > >  kasan_save_free_info+0x40/0x50 mm/kasan/generic.c:579
> > > >  poison_slab_object+0xe0/0x150 mm/kasan/common.c:240
> > > >  __kasan_slab_free+0x37/0x60 mm/kasan/common.c:256
> > > >  kasan_slab_free include/linux/kasan.h:184 [inline]
> > > >  slab_free_hook mm/slub.c:2252 [inline]
> > > >  slab_free mm/slub.c:4473 [inline]
> > > >  kmem_cache_free+0x145/0x350 mm/slub.c:4548
> > > >  rcu_do_batch kernel/rcu/tree.c:2569 [inline]
> > > >  rcu_core+0xafd/0x1830 kernel/rcu/tree.c:2843
> >
> > This seems right.  RCU freeing of a vma here, so that's okay.
> >
> > > >  handle_softirqs+0x2c4/0x970 kernel/softirq.c:554
> > > >  __do_softirq kernel/softirq.c:588 [inline]
> > > >  invoke_softirq kernel/softirq.c:428 [inline]
> > > >  __irq_exit_rcu+0xf4/0x1c0 kernel/softirq.c:637
> > > >  irq_exit_rcu+0x9/0x30 kernel/softirq.c:649
> > > >  instr_sysvec_apic_timer_interrupt arch/x86/kernel/apic/apic.c:1043 [inline]
> > > >  sysvec_apic_timer_interrupt+0xa6/0xc0 arch/x86/kernel/apic/apic.c:1043
> > > >  asm_sysvec_apic_timer_interrupt+0x1a/0x20 arch/x86/include/asm/idtentry.h:702
> > > >
> > > > Last potentially related work creation:
> >
> > Also fine.
> >
> > > >  kasan_save_stack+0x3f/0x60 mm/kasan/common.c:47
> > > >  __kasan_record_aux_stack+0xac/0xc0 mm/kasan/generic.c:541
> > > >  __call_rcu_common kernel/rcu/tree.c:3106 [inline]
> > > >  call_rcu+0x167/0xa70 kernel/rcu/tree.c:3210
> > > >  remove_vma mm/mmap.c:189 [inline]
> > > >  remove_mt mm/mmap.c:2415 [inline]
> > > >  do_vmi_align_munmap+0x155c/0x18c0 mm/mmap.c:2758
> > > >  do_vmi_munmap+0x261/0x2f0 mm/mmap.c:2830
> > > >  mmap_region+0x72f/0x2090 mm/mmap.c:2881
> > > >  do_mmap+0x8f9/0x1010 mm/mmap.c:1468
> > > >  vm_mmap_pgoff+0x1dd/0x3d0 mm/util.c:588
> > > >  ksys_mmap_pgoff+0x544/0x720 mm/mmap.c:1514
> > > >  do_syscall_x64 arch/x86/entry/common.c:52 [inline]
> > > >  do_syscall_64+0xf3/0x230 arch/x86/entry/common.c:83
> > > >  entry_SYSCALL_64_after_hwframe+0x77/0x7f
> > > >
> > > > The buggy address belongs to the object at ffff88802c4719b0
> > > >  which belongs to the cache vm_area_struct of size 184
> >
> > ...
> >


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:23       ` Suren Baghdasaryan
@ 2024-07-18 16:36         ` Jason A. Donenfeld
  2024-07-18 16:42           ` Suren Baghdasaryan
  0 siblings, 1 reply; 10+ messages in thread
From: Jason A. Donenfeld @ 2024-07-18 16:36 UTC (permalink / raw)
  To: Suren Baghdasaryan
  Cc: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Lorenzo Stoakes

On Thu, Jul 18, 2024 at 04:23:47PM +0000, Suren Baghdasaryan wrote:
> On Thu, Jul 18, 2024 at 4:20 PM Suren Baghdasaryan <surenb@google.com> wrote:
> >
> > On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
> > >
> > > * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > > > On 7/16/24 10:29 AM, syzbot wrote:
> > > > > Hello,
> > > >
> > > > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > > > or maybe it's due to some changes to gup?
> > >
> > > Yes, that crypto part is very odd.
> > >
> > > >
> > > > > syzbot found the following issue on:
> > > > >
> > > > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > > > git tree:       linux-next
> > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > > > >
> > > > > Downloadable assets:
> > > > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > > > >
> > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > > > >
> > > > > ==================================================================
> > > > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > > > >
> > > > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > > > Call Trace:
> > > > >  <TASK>
> > > > >  __dump_stack lib/dump_stack.c:94 [inline]
> > > > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > > > >  print_address_description mm/kasan/report.c:377 [inline]
> > > > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > > > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > > > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > >
> > > /*
> > >  * By the time we get here, we already hold the mm semaphore
> > >  *
> > >  * The mmap_lock may have been released depending on flags and our
> > >  * return value.  See filemap_fault() and __folio_lock_or_retry().
> > >  */
> > >
> > > Somehow we are here without an RCU or mmap_lock held?
> >
> > I'm guessing we did enter handle_mm_fault() with mmap_lock held but
> > __handle_mm_fault() dropped it before returning, see the comment for
> > __handle_mm_fault():
> >
> > /*
> >  * On entry, we hold either the VMA lock or the mmap_lock
> >  * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
> >  * the result, the mmap_lock is not held on exit.  See filemap_fault()
> >  * and __folio_lock_or_retry().
> >  */
> >
> > So after that there is nothing that guarantees VMA is not destroyed
> > from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
> > Hillf's suggestion should fix this issue but we need to figure out how
> > to make this path more robust. Currently it's very easy to make a
> > similar mistake. Maybe a WARNING comment after __handle_mm_fault()
> > that VMA might be unstable after that function and should not be used?
> 
> CC'ing Jason.

Thanks for bringing this to my attention. I'll incorporate Hillf's patch
and also add a comment as you suggested. Something like the below?

diff --git a/mm/memory.c b/mm/memory.c
index 18fe893ce96d..f596a8d508ef 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -5660,6 +5660,7 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
 	/* If the fault handler drops the mmap_lock, vma may be freed */
 	struct mm_struct *mm = vma->vm_mm;
 	vm_fault_t ret;
+	bool is_droppable;

 	__set_current_state(TASK_RUNNING);

@@ -5674,6 +5675,8 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
 		goto out;
 	}

+	is_droppable = !!(vma->vm_flags & VM_DROPPABLE);
+
 	/*
 	 * Enable the memcg OOM handling for faults triggered in user
 	 * space.  Kernel faults are handled more gracefully.
@@ -5688,10 +5691,15 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
 	else
 		ret = __handle_mm_fault(vma, address, flags);

+	/*
+	 * It is no longer safe to dereference vma-> after this point, as
+	 * __handle_mm_fault may have already destroyed it.
+	 */
+
 	lru_gen_exit_fault();

-	/* If the mapping is droppable, then errors due to OOM aren't fatal. */
-	if (vma->vm_flags & VM_DROPPABLE)
+	/* If the mapping is is_droppable, then errors due to OOM aren't fatal. */
+	if (is_droppable)
 		ret &= ~VM_FAULT_OOM;

 	if (flags & FAULT_FLAG_USER) {



^ permalink raw reply related	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:36         ` Jason A. Donenfeld
@ 2024-07-18 16:42           ` Suren Baghdasaryan
  2024-07-18 16:44             ` Jason A. Donenfeld
  0 siblings, 1 reply; 10+ messages in thread
From: Suren Baghdasaryan @ 2024-07-18 16:42 UTC (permalink / raw)
  To: Jason A. Donenfeld
  Cc: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Lorenzo Stoakes

On Thu, Jul 18, 2024 at 4:36 PM Jason A. Donenfeld <Jason@zx2c4.com> wrote:
>
> On Thu, Jul 18, 2024 at 04:23:47PM +0000, Suren Baghdasaryan wrote:
> > On Thu, Jul 18, 2024 at 4:20 PM Suren Baghdasaryan <surenb@google.com> wrote:
> > >
> > > On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
> > > >
> > > > * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > > > > On 7/16/24 10:29 AM, syzbot wrote:
> > > > > > Hello,
> > > > >
> > > > > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > > > > or maybe it's due to some changes to gup?
> > > >
> > > > Yes, that crypto part is very odd.
> > > >
> > > > >
> > > > > > syzbot found the following issue on:
> > > > > >
> > > > > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > > > > git tree:       linux-next
> > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > > > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > > > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > > > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > > > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > > > > >
> > > > > > Downloadable assets:
> > > > > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > > > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > > > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > > > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > > > > >
> > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > > > > >
> > > > > > ==================================================================
> > > > > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > > > > >
> > > > > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > > > > Call Trace:
> > > > > >  <TASK>
> > > > > >  __dump_stack lib/dump_stack.c:94 [inline]
> > > > > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > > > > >  print_address_description mm/kasan/report.c:377 [inline]
> > > > > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > > > > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > > > > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > >
> > > > /*
> > > >  * By the time we get here, we already hold the mm semaphore
> > > >  *
> > > >  * The mmap_lock may have been released depending on flags and our
> > > >  * return value.  See filemap_fault() and __folio_lock_or_retry().
> > > >  */
> > > >
> > > > Somehow we are here without an RCU or mmap_lock held?
> > >
> > > I'm guessing we did enter handle_mm_fault() with mmap_lock held but
> > > __handle_mm_fault() dropped it before returning, see the comment for
> > > __handle_mm_fault():
> > >
> > > /*
> > >  * On entry, we hold either the VMA lock or the mmap_lock
> > >  * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
> > >  * the result, the mmap_lock is not held on exit.  See filemap_fault()
> > >  * and __folio_lock_or_retry().
> > >  */
> > >
> > > So after that there is nothing that guarantees VMA is not destroyed
> > > from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
> > > Hillf's suggestion should fix this issue but we need to figure out how
> > > to make this path more robust. Currently it's very easy to make a
> > > similar mistake. Maybe a WARNING comment after __handle_mm_fault()
> > > that VMA might be unstable after that function and should not be used?
> >
> > CC'ing Jason.
>
> Thanks for bringing this to my attention. I'll incorporate Hillf's patch
> and also add a comment as you suggested. Something like the below?
>
> diff --git a/mm/memory.c b/mm/memory.c
> index 18fe893ce96d..f596a8d508ef 100644
> --- a/mm/memory.c
> +++ b/mm/memory.c
> @@ -5660,6 +5660,7 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
>         /* If the fault handler drops the mmap_lock, vma may be freed */
>         struct mm_struct *mm = vma->vm_mm;
>         vm_fault_t ret;
> +       bool is_droppable;
>
>         __set_current_state(TASK_RUNNING);
>
> @@ -5674,6 +5675,8 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
>                 goto out;
>         }
>
> +       is_droppable = !!(vma->vm_flags & VM_DROPPABLE);
> +
>         /*
>          * Enable the memcg OOM handling for faults triggered in user
>          * space.  Kernel faults are handled more gracefully.
> @@ -5688,10 +5691,15 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
>         else
>                 ret = __handle_mm_fault(vma, address, flags);
>
> +       /*
> +        * It is no longer safe to dereference vma-> after this point, as
> +        * __handle_mm_fault may have already destroyed it.

__handle_mm_fault does not really destroy the vma. It might drop
mmap_lock and another task might destroy it from under us.

> +        */
> +
>         lru_gen_exit_fault();
>
> -       /* If the mapping is droppable, then errors due to OOM aren't fatal. */
> -       if (vma->vm_flags & VM_DROPPABLE)
> +       /* If the mapping is is_droppable, then errors due to OOM aren't fatal. */
> +       if (is_droppable)
>                 ret &= ~VM_FAULT_OOM;
>
>         if (flags & FAULT_FLAG_USER) {
>


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:42           ` Suren Baghdasaryan
@ 2024-07-18 16:44             ` Jason A. Donenfeld
  2024-07-18 16:49               ` Suren Baghdasaryan
  0 siblings, 1 reply; 10+ messages in thread
From: Jason A. Donenfeld @ 2024-07-18 16:44 UTC (permalink / raw)
  To: Suren Baghdasaryan
  Cc: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Lorenzo Stoakes

On Thu, Jul 18, 2024 at 6:42 PM Suren Baghdasaryan <surenb@google.com> wrote:
>
> On Thu, Jul 18, 2024 at 4:36 PM Jason A. Donenfeld <Jason@zx2c4.com> wrote:
> >
> > On Thu, Jul 18, 2024 at 04:23:47PM +0000, Suren Baghdasaryan wrote:
> > > On Thu, Jul 18, 2024 at 4:20 PM Suren Baghdasaryan <surenb@google.com> wrote:
> > > >
> > > > On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
> > > > >
> > > > > * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > > > > > On 7/16/24 10:29 AM, syzbot wrote:
> > > > > > > Hello,
> > > > > >
> > > > > > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > > > > > or maybe it's due to some changes to gup?
> > > > >
> > > > > Yes, that crypto part is very odd.
> > > > >
> > > > > >
> > > > > > > syzbot found the following issue on:
> > > > > > >
> > > > > > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > > > > > git tree:       linux-next
> > > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > > > > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > > > > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > > > > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > > > > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > > > > > >
> > > > > > > Downloadable assets:
> > > > > > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > > > > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > > > > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > > > > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > > > > > >
> > > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > > > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > > > > > >
> > > > > > > ==================================================================
> > > > > > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > > > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > > > > > >
> > > > > > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > > > > > Call Trace:
> > > > > > >  <TASK>
> > > > > > >  __dump_stack lib/dump_stack.c:94 [inline]
> > > > > > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > > > > > >  print_address_description mm/kasan/report.c:377 [inline]
> > > > > > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > > > > > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > > > > > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > >
> > > > > /*
> > > > >  * By the time we get here, we already hold the mm semaphore
> > > > >  *
> > > > >  * The mmap_lock may have been released depending on flags and our
> > > > >  * return value.  See filemap_fault() and __folio_lock_or_retry().
> > > > >  */
> > > > >
> > > > > Somehow we are here without an RCU or mmap_lock held?
> > > >
> > > > I'm guessing we did enter handle_mm_fault() with mmap_lock held but
> > > > __handle_mm_fault() dropped it before returning, see the comment for
> > > > __handle_mm_fault():
> > > >
> > > > /*
> > > >  * On entry, we hold either the VMA lock or the mmap_lock
> > > >  * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
> > > >  * the result, the mmap_lock is not held on exit.  See filemap_fault()
> > > >  * and __folio_lock_or_retry().
> > > >  */
> > > >
> > > > So after that there is nothing that guarantees VMA is not destroyed
> > > > from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
> > > > Hillf's suggestion should fix this issue but we need to figure out how
> > > > to make this path more robust. Currently it's very easy to make a
> > > > similar mistake. Maybe a WARNING comment after __handle_mm_fault()
> > > > that VMA might be unstable after that function and should not be used?
> > >
> > > CC'ing Jason.
> >
> > Thanks for bringing this to my attention. I'll incorporate Hillf's patch
> > and also add a comment as you suggested. Something like the below?
> >
> > diff --git a/mm/memory.c b/mm/memory.c
> > index 18fe893ce96d..f596a8d508ef 100644
> > --- a/mm/memory.c
> > +++ b/mm/memory.c
> > @@ -5660,6 +5660,7 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> >         /* If the fault handler drops the mmap_lock, vma may be freed */
> >         struct mm_struct *mm = vma->vm_mm;
> >         vm_fault_t ret;
> > +       bool is_droppable;
> >
> >         __set_current_state(TASK_RUNNING);
> >
> > @@ -5674,6 +5675,8 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> >                 goto out;
> >         }
> >
> > +       is_droppable = !!(vma->vm_flags & VM_DROPPABLE);
> > +
> >         /*
> >          * Enable the memcg OOM handling for faults triggered in user
> >          * space.  Kernel faults are handled more gracefully.
> > @@ -5688,10 +5691,15 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> >         else
> >                 ret = __handle_mm_fault(vma, address, flags);
> >
> > +       /*
> > +        * It is no longer safe to dereference vma-> after this point, as
> > +        * __handle_mm_fault may have already destroyed it.
>
> __handle_mm_fault does not really destroy the vma. It might drop
> mmap_lock and another task might destroy it from under us.

Err, right. Okay, wording time:

> Warning: It is no longer safe to dereference vma-> after this point, because mmap_lock is dropped, so vma might be destroyed from underneath us.

How about that?


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:44             ` Jason A. Donenfeld
@ 2024-07-18 16:49               ` Suren Baghdasaryan
  2024-07-18 16:51                 ` Jason A. Donenfeld
  0 siblings, 1 reply; 10+ messages in thread
From: Suren Baghdasaryan @ 2024-07-18 16:49 UTC (permalink / raw)
  To: Jason A. Donenfeld
  Cc: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Lorenzo Stoakes

On Thu, Jul 18, 2024 at 9:44 AM Jason A. Donenfeld <Jason@zx2c4.com> wrote:
>
> On Thu, Jul 18, 2024 at 6:42 PM Suren Baghdasaryan <surenb@google.com> wrote:
> >
> > On Thu, Jul 18, 2024 at 4:36 PM Jason A. Donenfeld <Jason@zx2c4.com> wrote:
> > >
> > > On Thu, Jul 18, 2024 at 04:23:47PM +0000, Suren Baghdasaryan wrote:
> > > > On Thu, Jul 18, 2024 at 4:20 PM Suren Baghdasaryan <surenb@google.com> wrote:
> > > > >
> > > > > On Thu, Jul 18, 2024 at 3:43 PM Liam R. Howlett <Liam.Howlett@oracle.com> wrote:
> > > > > >
> > > > > > * Vlastimil Babka (SUSE) <vbabka@kernel.org> [240718 07:00]:
> > > > > > > On 7/16/24 10:29 AM, syzbot wrote:
> > > > > > > > Hello,
> > > > > > >
> > > > > > > dunno about the [crypto?] parts, sounds rather something for Suren or Liam
> > > > > > > or maybe it's due to some changes to gup?
> > > > > >
> > > > > > Yes, that crypto part is very odd.
> > > > > >
> > > > > > >
> > > > > > > > syzbot found the following issue on:
> > > > > > > >
> > > > > > > > HEAD commit:    3fe121b62282 Add linux-next specific files for 20240712
> > > > > > > > git tree:       linux-next
> > > > > > > > console output: https://syzkaller.appspot.com/x/log.txt?x=1097ebed980000
> > > > > > > > kernel config:  https://syzkaller.appspot.com/x/.config?x=98dd8c4bab5cdce
> > > > > > > > dashboard link: https://syzkaller.appspot.com/bug?extid=4c882a4a0697c4a25364
> > > > > > > > compiler:       Debian clang version 15.0.6, GNU ld (GNU Binutils for Debian) 2.40
> > > > > > > > syz repro:      https://syzkaller.appspot.com/x/repro.syz?x=11d611a5980000
> > > > > > > > C reproducer:   https://syzkaller.appspot.com/x/repro.c?x=13ce3259980000
> > > > > > > >
> > > > > > > > Downloadable assets:
> > > > > > > > disk image: https://storage.googleapis.com/syzbot-assets/8c6fbf69718d/disk-3fe121b6.raw.xz
> > > > > > > > vmlinux: https://storage.googleapis.com/syzbot-assets/39fc7e43dfc1/vmlinux-3fe121b6.xz
> > > > > > > > kernel image: https://storage.googleapis.com/syzbot-assets/0a78e70e4b4e/bzImage-3fe121b6.xz
> > > > > > > > mounted in repro: https://storage.googleapis.com/syzbot-assets/66cfe5a679f2/mount_0.gz
> > > > > > > >
> > > > > > > > IMPORTANT: if you fix the issue, please add the following tag to the commit:
> > > > > > > > Reported-by: syzbot+4c882a4a0697c4a25364@syzkaller.appspotmail.com
> > > > > > > >
> > > > > > > > ==================================================================
> > > > > > > > BUG: KASAN: slab-use-after-free in handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > > > > > Read of size 8 at addr ffff88802c4719d0 by task syz-executor125/5235
> > > > > > > >
> > > > > > > > CPU: 1 UID: 0 PID: 5235 Comm: syz-executor125 Not tainted 6.10.0-rc7-next-20240712-syzkaller #0
> > > > > > > > Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 06/07/2024
> > > > > > > > Call Trace:
> > > > > > > >  <TASK>
> > > > > > > >  __dump_stack lib/dump_stack.c:94 [inline]
> > > > > > > >  dump_stack_lvl+0x241/0x360 lib/dump_stack.c:120
> > > > > > > >  print_address_description mm/kasan/report.c:377 [inline]
> > > > > > > >  print_report+0x169/0x550 mm/kasan/report.c:488
> > > > > > > >  kasan_report+0x143/0x180 mm/kasan/report.c:601
> > > > > > > >  handle_mm_fault+0x14f0/0x19a0 mm/memory.c:5842
> > > > > >
> > > > > > /*
> > > > > >  * By the time we get here, we already hold the mm semaphore
> > > > > >  *
> > > > > >  * The mmap_lock may have been released depending on flags and our
> > > > > >  * return value.  See filemap_fault() and __folio_lock_or_retry().
> > > > > >  */
> > > > > >
> > > > > > Somehow we are here without an RCU or mmap_lock held?
> > > > >
> > > > > I'm guessing we did enter handle_mm_fault() with mmap_lock held but
> > > > > __handle_mm_fault() dropped it before returning, see the comment for
> > > > > __handle_mm_fault():
> > > > >
> > > > > /*
> > > > >  * On entry, we hold either the VMA lock or the mmap_lock
> > > > >  * (FAULT_FLAG_VMA_LOCK tells you which).  If VM_FAULT_RETRY is set in
> > > > >  * the result, the mmap_lock is not held on exit.  See filemap_fault()
> > > > >  * and __folio_lock_or_retry().
> > > > >  */
> > > > >
> > > > > So after that there is nothing that guarantees VMA is not destroyed
> > > > > from under us and if (vma->vm_flags & VM_DROPPABLE) check is unsafe.
> > > > > Hillf's suggestion should fix this issue but we need to figure out how
> > > > > to make this path more robust. Currently it's very easy to make a
> > > > > similar mistake. Maybe a WARNING comment after __handle_mm_fault()
> > > > > that VMA might be unstable after that function and should not be used?
> > > >
> > > > CC'ing Jason.
> > >
> > > Thanks for bringing this to my attention. I'll incorporate Hillf's patch
> > > and also add a comment as you suggested. Something like the below?
> > >
> > > diff --git a/mm/memory.c b/mm/memory.c
> > > index 18fe893ce96d..f596a8d508ef 100644
> > > --- a/mm/memory.c
> > > +++ b/mm/memory.c
> > > @@ -5660,6 +5660,7 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> > >         /* If the fault handler drops the mmap_lock, vma may be freed */
> > >         struct mm_struct *mm = vma->vm_mm;
> > >         vm_fault_t ret;
> > > +       bool is_droppable;
> > >
> > >         __set_current_state(TASK_RUNNING);
> > >
> > > @@ -5674,6 +5675,8 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> > >                 goto out;
> > >         }
> > >
> > > +       is_droppable = !!(vma->vm_flags & VM_DROPPABLE);
> > > +
> > >         /*
> > >          * Enable the memcg OOM handling for faults triggered in user
> > >          * space.  Kernel faults are handled more gracefully.
> > > @@ -5688,10 +5691,15 @@ vm_fault_t handle_mm_fault(struct vm_area_struct *vma, unsigned long address,
> > >         else
> > >                 ret = __handle_mm_fault(vma, address, flags);
> > >
> > > +       /*
> > > +        * It is no longer safe to dereference vma-> after this point, as
> > > +        * __handle_mm_fault may have already destroyed it.
> >
> > __handle_mm_fault does not really destroy the vma. It might drop
> > mmap_lock and another task might destroy it from under us.
>
> Err, right. Okay, wording time:
>
> > Warning: It is no longer safe to dereference vma-> after this point, because mmap_lock is dropped, so vma might be destroyed from underneath us.

Better but I would change "mmap_lock is dropped" to "mmap_lock might
have been dropped by __handle_mm_fault()" because mmap_lock is not
always dropped by __handle_mm_fault(). Technicality but better be
clear about it.
With that changed feel free to add:

Reviewed-by: Suren Baghdasaryan <surenb@google.com>

>
> How about that?


^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault
  2024-07-18 16:49               ` Suren Baghdasaryan
@ 2024-07-18 16:51                 ` Jason A. Donenfeld
  0 siblings, 0 replies; 10+ messages in thread
From: Jason A. Donenfeld @ 2024-07-18 16:51 UTC (permalink / raw)
  To: Suren Baghdasaryan
  Cc: Liam R. Howlett, Vlastimil Babka (SUSE), syzbot, akpm, davem,
	herbert, linux-crypto, linux-kernel, linux-mm, syzkaller-bugs,
	Lorenzo Stoakes

Hi Suren,

On Thu, Jul 18, 2024 at 6:49 PM Suren Baghdasaryan <surenb@google.com> wrote:
> > > Warning: It is no longer safe to dereference vma-> after this point, because mmap_lock is dropped, so vma might be destroyed from underneath us.
>
> Better but I would change "mmap_lock is dropped" to "mmap_lock might
> have been dropped by __handle_mm_fault()" because mmap_lock is not
> always dropped by __handle_mm_fault(). Technicality but better be
> clear about it.
> With that changed feel free to add:
>
> Reviewed-by: Suren Baghdasaryan <surenb@google.com>

Will do.

Thanks,
Jason


^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2024-07-18 16:52 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2024-07-16  8:29 [syzbot] [crypto?] KASAN: slab-use-after-free Read in handle_mm_fault syzbot
2024-07-18 10:59 ` Vlastimil Babka (SUSE)
2024-07-18 15:43   ` Liam R. Howlett
2024-07-18 16:20     ` Suren Baghdasaryan
2024-07-18 16:23       ` Suren Baghdasaryan
2024-07-18 16:36         ` Jason A. Donenfeld
2024-07-18 16:42           ` Suren Baghdasaryan
2024-07-18 16:44             ` Jason A. Donenfeld
2024-07-18 16:49               ` Suren Baghdasaryan
2024-07-18 16:51                 ` Jason A. Donenfeld

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).