Message ID | 20180511161127.15158-1-zlang@redhat.com (mailing list archive) |
---|---|
State | New, archived |
Headers | show |
On Sat, May 12, 2018 at 12:11:27AM +0800, Zorro Lang wrote: > There's a situation where the directory structure and the inobt > thinks the inode is free, but the inode on disk thinks it is still > in use. XFS should detect it and prevent the kernel from oopsing > on lookup. Isn't this testing the same thing that I recently posted "xfs: test inobt/on disk free state mismatches" for? Cheers, Dave.
On Sat, May 12, 2018 at 09:32:39AM +1000, Dave Chinner wrote: > On Sat, May 12, 2018 at 12:11:27AM +0800, Zorro Lang wrote: > > There's a situation where the directory structure and the inobt > > thinks the inode is free, but the inode on disk thinks it is still > > in use. XFS should detect it and prevent the kernel from oopsing > > on lookup. > > Isn't this testing the same thing that I recently posted "xfs: test > inobt/on disk free state mismatches" for? Hmm... I think so. This case is written to test your patch: https://marc.info/?l=linux-xfs&m=152161877728015&w=2 Sorry I sent this case about 2 month ago, but I just got free time to send V2 patch yesterday :P As you're the original author of that bug fix, I think your case will be better than mine. Please keep your case reviewing, ignore this one. Thanks, Zorro > > Cheers, > > Dave. > -- > Dave Chinner > david@fromorbit.com > -- > To unsubscribe from this list: send the line "unsubscribe fstests" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe fstests" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Sat, May 12, 2018 at 09:32:39AM +1000, Dave Chinner wrote: > On Sat, May 12, 2018 at 12:11:27AM +0800, Zorro Lang wrote: > > There's a situation where the directory structure and the inobt > > thinks the inode is free, but the inode on disk thinks it is still > > in use. XFS should detect it and prevent the kernel from oopsing > > on lookup. > > Isn't this testing the same thing that I recently posted "xfs: test > inobt/on disk free state mismatches" for? Hi Dave, Last week I replied you that we wrote test case for same bug. But I just found I can't reproduce any bugs on rhel-7.4 kernel and upstream 4.16 kernel by your case [1], is there anything I misunderstood? But the case which I wrote for this bug [3] can trigger failures [2]. Even ignore that dmesg error (which I don't know if it's related with this bug), it still can trigger an error. And this error can't be triggered after merged your patch. Thanks, Zorro [1] # ./check xfs/132 FSTYP -- xfs (non-debug) PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64 MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch xfs/132 3s Ran: xfs/132 Passed all 1 tests # ./check xfs/132 FSTYP -- xfs (non-debug) PLATFORM -- Linux/x86_64 ibm-x3550m3-05 3.10.0-862.el7.x86_64 MKFS_OPTIONS -- -f -bsize=4096 /dev/sdb2 MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/sdb2 /mnt/scratch xfs/132 2s Ran: xfs/132 Passed all 1 tests # ./check xfs/132 FSTYP -- xfs (non-debug) PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64+debug MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch xfs/132 3s ... 3s Ran: xfs/132 Passed all 1 tests [2] # ./check xfs/999 FSTYP -- xfs (non-debug) PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64+debug MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch xfs/999 4s ... - output mismatch (see /root/git/xfstests-zlang/results//xfs/999.out.bad) --- tests/xfs/999.out 2018-05-11 11:12:21.129590901 -0400 +++ /root/git/xfstests-zlang/results//xfs/999.out.bad 2018-05-16 04:04:39.958393813 -0400 @@ -1,2 +1,2 @@ QA output created by 999 -SCRATCH_MNT/dir/newfile: Structure needs cleaning +_check_dmesg: something found in dmesg (see /root/git/xfstests-zlang/results//xfs/999.dmesg) ... (Run 'diff -u tests/xfs/999.out /root/git/xfstests-zlang/results//xfs/999.out.bad' to see the entire diff) Ran: xfs/999 Failures: xfs/999 Failed 1 of 1 tests # dmesg [ 1160.076247] ------------[ cut here ]------------ [ 1160.081403] kernel BUG at lib/list_debug.c:31! [ 1160.086399] invalid opcode: 0000 [#1] SMP PTI [ 1160.091274] Modules linked in: sunrpc intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel intel_cstate igb intel_uncore ptp iTCO_wdt iTCO_vendor_support pps_core intel_rapl_perf wmi tpm_tis ipmi_ssif tpm_tis_core cdc_ether usbnet mii tpm i2c_i801 ipmi_si ipmi_devintf lpc_ich ipmi_msghandler shpchp ioatdma dca xfs libcrc32c mgag200 i2c_algo_bit drm_kms_helper ttm drm crc32c_intel megaraid_sas [ 1160.138576] CPU: 21 PID: 2746 Comm: xfs_io Not tainted 4.16.7-200.fc27.x86_64+debug #1 [ 1160.147412] Hardware name: IBM System x3650 M4 -[7915ON3]-/00J6520, BIOS -[VVE124AUS-1.30]- 11/21/2012 [ 1160.157809] RIP: 0010:__list_add_valid+0x61/0x70 [ 1160.162963] RSP: 0018:ffffb9a28709b970 EFLAGS: 00010282 [ 1160.168796] RAX: 0000000000000058 RBX: ffff8c6da4a7da30 RCX: 0000000000000000 [ 1160.176760] RDX: 0000000000000000 RSI: ffff8c6db71d6c48 RDI: ffff8c6db71d6c48 [ 1160.184723] RBP: ffff8c6db1fe3000 R08: 0000000000000001 R09: 0000000000000000 [ 1160.192687] R10: ffffb9a28709b8f0 R11: 0000000000000000 R12: ffff8c6da4a7dc10 [ 1160.200651] R13: ffff8c6da4a7dc10 R14: ffff8c6db1fe3a88 R15: ffff8c6daff70360 [ 1160.208616] FS: 00007f5911fad840(0000) GS:ffff8c6db7000000(0000) knlGS:0000000000000000 [ 1160.217646] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [ 1160.224058] CR2: 0000564ff49e3290 CR3: 0000000467b92001 CR4: 00000000000606e0 [ 1160.232023] Call Trace: [ 1160.234756] inode_sb_list_add+0x47/0x80 [ 1160.239200] xfs_setup_inode+0x28/0x160 [xfs] [ 1160.244093] xfs_ialloc+0x30d/0x520 [xfs] [ 1160.248600] xfs_dir_ialloc+0x74/0x240 [xfs] [ 1160.253370] ? __lock_is_held+0x59/0xa0 [ 1160.257680] xfs_create+0x4ed/0x7e0 [xfs] [ 1160.262188] xfs_generic_create+0x21b/0x2e0 [xfs] [ 1160.267442] ? _raw_spin_unlock+0x24/0x30 [ 1160.271920] lookup_open+0x5ad/0x750 [ 1160.275915] ? __wake_up_common_lock+0x63/0xc0 [ 1160.280876] ? find_held_lock+0x34/0xa0 [ 1160.285158] path_openat+0x31a/0xc80 [ 1160.289151] do_filp_open+0x9b/0x110 [ 1160.293145] ? __alloc_fd+0xe5/0x1f0 [ 1160.297136] ? _raw_spin_unlock+0x24/0x30 [ 1160.301615] ? do_sys_open+0x1bd/0x250 [ 1160.305797] do_sys_open+0x1bd/0x250 [ 1160.309791] do_syscall_64+0x79/0x220 [ 1160.313878] entry_SYSCALL_64_after_hwframe+0x42/0xb7 [ 1160.319517] RIP: 0033:0x7f5911b8d6de [ 1160.323495] RSP: 002b:00007ffd812f97a0 EFLAGS: 00000246 ORIG_RAX: 0000000000000101 [ 1160.331947] RAX: ffffffffffffffda RBX: 0000000000000042 RCX: 00007f5911b8d6de [ 1160.339903] RDX: 0000000000000042 RSI: 00007ffd812fb46e RDI: ffffffffffffff9c [ 1160.347865] RBP: 0000000000000020 R08: 00007ffd812f99c0 R09: 0000000000000000 [ 1160.355829] R10: 0000000000000180 R11: 0000000000000246 R12: 00007ffd812fb46e [ 1160.363793] R13: 0000000000000007 R14: 00007ffd812f9a00 R15: 0000000000000180 [ 1160.371762] Code: 0f 0b 48 89 c1 4c 89 c6 48 c7 c7 d0 ba 11 90 e8 e0 ad c4 ff 0f 0b 48 89 f2 4c 89 c1 48 89 fe 48 c7 c7 20 bb 11 90 e8 c9 ad c4 ff <0f> 0b 0f 1f 00 66 2e 0f 1f 84 00 00 00 00 00 48 8b 07 48 b9 00 [ 1160.392895] RIP: __list_add_valid+0x61/0x70 RSP: ffffb9a28709b970 [ 1160.399706] ---[ end trace df0b581bb7404c65 ]--- [ 1160.404867] BUG: sleeping function called from invalid context at ./include/linux/percpu-rwsem.h:34 [ 1160.414974] in_atomic(): 1, irqs_disabled(): 0, pid: 2746, name: xfs_io [ 1160.422365] INFO: lockdep is turned off. [ 1160.426753] CPU: 21 PID: 2746 Comm: xfs_io Tainted: G D 4.16.7-200.fc27.x86_64+debug #1 [ 1160.437035] Hardware name: IBM System x3650 M4 -[7915ON3]-/00J6520, BIOS -[VVE124AUS-1.30]- 11/21/2012 [ 1160.447421] Call Trace: [ 1160.450155] dump_stack+0x85/0xbf [ 1160.453857] ___might_sleep+0x15b/0x240 [ 1160.458140] exit_signals+0x30/0x240 [ 1160.462131] do_exit+0xb8/0xd70 [ 1160.465640] rewind_stack_do_exit+0x17/0x20 [ 1160.470324] note: xfs_io[2746] exited with preempt_count 1 [ 1160.654155] XFS (dm-3): Unmounting Filesystem [3] https://marc.info/?l=fstests&m=152605509711179&w=2 > > Cheers, > > Dave. > -- > Dave Chinner > david@fromorbit.com -- To unsubscribe from this list: send the line "unsubscribe fstests" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
On Wed, May 16, 2018 at 04:18:32PM +0800, Zorro Lang wrote: > On Sat, May 12, 2018 at 09:32:39AM +1000, Dave Chinner wrote: > > On Sat, May 12, 2018 at 12:11:27AM +0800, Zorro Lang wrote: > > > There's a situation where the directory structure and the inobt > > > thinks the inode is free, but the inode on disk thinks it is still > > > in use. XFS should detect it and prevent the kernel from oopsing > > > on lookup. > > > > Isn't this testing the same thing that I recently posted "xfs: test > > inobt/on disk free state mismatches" for? > > Hi Dave, > > Last week I replied you that we wrote test case for same bug. But I just found > I can't reproduce any bugs on rhel-7.4 kernel and upstream 4.16 kernel by > your case [1], is there anything I misunderstood? > > But the case which I wrote for this bug [3] can trigger failures [2]. Even > ignore that dmesg error (which I don't know if it's related with this bug), > it still can trigger an error. And this error can't be triggered after merged > your patch. > > Thanks, > Zorro > > > [1] > # ./check xfs/132 > FSTYP -- xfs (non-debug) > PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64 > MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev > MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch > > xfs/132 3s > Ran: xfs/132 > Passed all 1 tests What did it dump in dmesg? It may be that the filesystem shut down because of a cascading failure, but I never ran the test on non-debug builds so I've got no idea what behaviour to expect there. Whoever runs xfstests without debug being enabled? :) I may be that you need KASAN turned on to catch the failure - this version of the test corrupts dentry cache memory and KASAN was reliably catching that on unfixed kernels. Memory corruption may not be immeidately visible on non-debug kernels - it's guaranteed to kill the machine sooner or later, though. > [2] > # ./check xfs/999 > FSTYP -- xfs (non-debug) > PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64+debug > MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev > MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch > > xfs/999 4s ... - output mismatch (see /root/git/xfstests-zlang/results//xfs/999.out.bad) > --- tests/xfs/999.out 2018-05-11 11:12:21.129590901 -0400 > +++ /root/git/xfstests-zlang/results//xfs/999.out.bad 2018-05-16 04:04:39.958393813 -0400 > @@ -1,2 +1,2 @@ > QA output created by 999 > -SCRATCH_MNT/dir/newfile: Structure needs cleaning > +_check_dmesg: something found in dmesg (see /root/git/xfstests-zlang/results//xfs/999.dmesg) > ... > (Run 'diff -u tests/xfs/999.out /root/git/xfstests-zlang/results//xfs/999.out.bad' to see the entire diff) > > Ran: xfs/999 > Failures: xfs/999 > Failed 1 of 1 tests > > # dmesg > [ 1160.076247] ------------[ cut here ]------------ > [ 1160.081403] kernel BUG at lib/list_debug.c:31! > [ 1160.086399] invalid opcode: 0000 [#1] SMP PTI > [ 1160.091274] Modules linked in: sunrpc intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel intel_cstate igb intel_uncore ptp iTCO_wdt iTCO_vendor_support pps_core intel_rapl_perf wmi tpm_tis ipmi_ssif tpm_tis_core cdc_ether usbnet mii tpm i2c_i801 ipmi_si ipmi_devintf lpc_ich ipmi_msghandler shpchp ioatdma dca xfs libcrc32c mgag200 i2c_algo_bit drm_kms_helper ttm drm crc32c_intel megaraid_sas > [ 1160.138576] CPU: 21 PID: 2746 Comm: xfs_io Not tainted 4.16.7-200.fc27.x86_64+debug #1 > [ 1160.147412] Hardware name: IBM System x3650 M4 -[7915ON3]-/00J6520, BIOS -[VVE124AUS-1.30]- 11/21/2012 > [ 1160.157809] RIP: 0010:__list_add_valid+0x61/0x70 > [ 1160.162963] RSP: 0018:ffffb9a28709b970 EFLAGS: 00010282 > [ 1160.168796] RAX: 0000000000000058 RBX: ffff8c6da4a7da30 RCX: 0000000000000000 > [ 1160.176760] RDX: 0000000000000000 RSI: ffff8c6db71d6c48 RDI: ffff8c6db71d6c48 > [ 1160.184723] RBP: ffff8c6db1fe3000 R08: 0000000000000001 R09: 0000000000000000 > [ 1160.192687] R10: ffffb9a28709b8f0 R11: 0000000000000000 R12: ffff8c6da4a7dc10 > [ 1160.200651] R13: ffff8c6da4a7dc10 R14: ffff8c6db1fe3a88 R15: ffff8c6daff70360 > [ 1160.208616] FS: 00007f5911fad840(0000) GS:ffff8c6db7000000(0000) knlGS:0000000000000000 > [ 1160.217646] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > [ 1160.224058] CR2: 0000564ff49e3290 CR3: 0000000467b92001 CR4: 00000000000606e0 > [ 1160.232023] Call Trace: > [ 1160.234756] inode_sb_list_add+0x47/0x80 > [ 1160.239200] xfs_setup_inode+0x28/0x160 [xfs] > [ 1160.244093] xfs_ialloc+0x30d/0x520 [xfs] > [ 1160.248600] xfs_dir_ialloc+0x74/0x240 [xfs] So this has allocated an inode that is already cached in memory, which is a different symptom of the same problem. i.e. there are two fixes for the problem. The initial cold-cache test fixes were in commit ee457001ed6c ("xfs: catch inode allocation state mismatch corruption"). The hot cache fixes were in commit afca6c5b2595 ("xfs: validate cached inodes are free when allocated"), and the commit message says: We recently fixed a similar inode allocation issue caused by inobt record corruption problem in xfs_iget_cache_miss() in commit ee457001ed6c ("xfs: catch inode allocation state mismatch corruption"). This change adds similar checks to the cache-hit path to catch it, and turns the reproducer into a corruption shutdown situation. IOWs, the tests are exercising the same corruption, just through different code paths. So it would seem that we need both tests... Cheers, Dave.
On Fri, May 18, 2018 at 01:59:39PM +1000, Dave Chinner wrote: > On Wed, May 16, 2018 at 04:18:32PM +0800, Zorro Lang wrote: > > On Sat, May 12, 2018 at 09:32:39AM +1000, Dave Chinner wrote: > > > On Sat, May 12, 2018 at 12:11:27AM +0800, Zorro Lang wrote: > > > > There's a situation where the directory structure and the inobt > > > > thinks the inode is free, but the inode on disk thinks it is still > > > > in use. XFS should detect it and prevent the kernel from oopsing > > > > on lookup. > > > > > > Isn't this testing the same thing that I recently posted "xfs: test > > > inobt/on disk free state mismatches" for? > > > > Hi Dave, > > > > Last week I replied you that we wrote test case for same bug. But I just found > > I can't reproduce any bugs on rhel-7.4 kernel and upstream 4.16 kernel by > > your case [1], is there anything I misunderstood? > > > > But the case which I wrote for this bug [3] can trigger failures [2]. Even > > ignore that dmesg error (which I don't know if it's related with this bug), > > it still can trigger an error. And this error can't be triggered after merged > > your patch. > > > > Thanks, > > Zorro > > > > > > [1] > > # ./check xfs/132 > > FSTYP -- xfs (non-debug) > > PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64 > > MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev > > MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch > > > > xfs/132 3s > > Ran: xfs/132 > > Passed all 1 tests > > What did it dump in dmesg? It may be that the filesystem shut down > because of a cascading failure, but I never ran the test on > non-debug builds so I've got no idea what behaviour to expect there. > Whoever runs xfstests without debug being enabled? :) > > I may be that you need KASAN turned on to catch the failure - this > version of the test corrupts dentry cache memory and KASAN was > reliably catching that on unfixed kernels. Memory corruption may not > be immeidately visible on non-debug kernels - it's guaranteed to > kill the machine sooner or later, though. > > > [2] > > # ./check xfs/999 > > FSTYP -- xfs (non-debug) > > PLATFORM -- Linux/x86_64 ibm-x3650m4-10 4.16.7-200.fc27.x86_64+debug > > MKFS_OPTIONS -- -f -bsize=4096 /dev/mapper/fedora-scratchdev > > MOUNT_OPTIONS -- -o context=system_u:object_r:root_t:s0 /dev/mapper/fedora-scratchdev /mnt/scratch > > > > xfs/999 4s ... - output mismatch (see /root/git/xfstests-zlang/results//xfs/999.out.bad) > > --- tests/xfs/999.out 2018-05-11 11:12:21.129590901 -0400 > > +++ /root/git/xfstests-zlang/results//xfs/999.out.bad 2018-05-16 04:04:39.958393813 -0400 > > @@ -1,2 +1,2 @@ > > QA output created by 999 > > -SCRATCH_MNT/dir/newfile: Structure needs cleaning > > +_check_dmesg: something found in dmesg (see /root/git/xfstests-zlang/results//xfs/999.dmesg) > > ... > > (Run 'diff -u tests/xfs/999.out /root/git/xfstests-zlang/results//xfs/999.out.bad' to see the entire diff) > > > > Ran: xfs/999 > > Failures: xfs/999 > > Failed 1 of 1 tests > > > > # dmesg > > [ 1160.076247] ------------[ cut here ]------------ > > [ 1160.081403] kernel BUG at lib/list_debug.c:31! > > [ 1160.086399] invalid opcode: 0000 [#1] SMP PTI > > [ 1160.091274] Modules linked in: sunrpc intel_rapl sb_edac x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel intel_cstate igb intel_uncore ptp iTCO_wdt iTCO_vendor_support pps_core intel_rapl_perf wmi tpm_tis ipmi_ssif tpm_tis_core cdc_ether usbnet mii tpm i2c_i801 ipmi_si ipmi_devintf lpc_ich ipmi_msghandler shpchp ioatdma dca xfs libcrc32c mgag200 i2c_algo_bit drm_kms_helper ttm drm crc32c_intel megaraid_sas > > [ 1160.138576] CPU: 21 PID: 2746 Comm: xfs_io Not tainted 4.16.7-200.fc27.x86_64+debug #1 > > [ 1160.147412] Hardware name: IBM System x3650 M4 -[7915ON3]-/00J6520, BIOS -[VVE124AUS-1.30]- 11/21/2012 > > [ 1160.157809] RIP: 0010:__list_add_valid+0x61/0x70 > > [ 1160.162963] RSP: 0018:ffffb9a28709b970 EFLAGS: 00010282 > > [ 1160.168796] RAX: 0000000000000058 RBX: ffff8c6da4a7da30 RCX: 0000000000000000 > > [ 1160.176760] RDX: 0000000000000000 RSI: ffff8c6db71d6c48 RDI: ffff8c6db71d6c48 > > [ 1160.184723] RBP: ffff8c6db1fe3000 R08: 0000000000000001 R09: 0000000000000000 > > [ 1160.192687] R10: ffffb9a28709b8f0 R11: 0000000000000000 R12: ffff8c6da4a7dc10 > > [ 1160.200651] R13: ffff8c6da4a7dc10 R14: ffff8c6db1fe3a88 R15: ffff8c6daff70360 > > [ 1160.208616] FS: 00007f5911fad840(0000) GS:ffff8c6db7000000(0000) knlGS:0000000000000000 > > [ 1160.217646] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > [ 1160.224058] CR2: 0000564ff49e3290 CR3: 0000000467b92001 CR4: 00000000000606e0 > > [ 1160.232023] Call Trace: > > [ 1160.234756] inode_sb_list_add+0x47/0x80 > > [ 1160.239200] xfs_setup_inode+0x28/0x160 [xfs] > > [ 1160.244093] xfs_ialloc+0x30d/0x520 [xfs] > > [ 1160.248600] xfs_dir_ialloc+0x74/0x240 [xfs] > > So this has allocated an inode that is already cached in memory, > which is a different symptom of the same problem. > > i.e. there are two fixes for the problem. The initial cold-cache > test fixes were in commit ee457001ed6c ("xfs: catch inode allocation > state mismatch corruption"). The hot cache fixes were in > commit afca6c5b2595 ("xfs: validate cached inodes > are free when allocated"), and the commit message says: > > We recently fixed a similar inode allocation issue caused by > inobt record corruption problem in xfs_iget_cache_miss() in > commit ee457001ed6c ("xfs: catch inode allocation state > mismatch corruption"). This change adds similar checks to > the cache-hit path to catch it, and turns the reproducer > into a corruption shutdown situation. > > IOWs, the tests are exercising the same corruption, just through > different code paths. So it would seem that we need both tests... Many thanks for your explanation. I'll send the case to be reviewed again. Thanks, Zorro > > Cheers, > > Dave. > -- > Dave Chinner > david@fromorbit.com > -- > To unsubscribe from this list: send the line "unsubscribe fstests" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html -- To unsubscribe from this list: send the line "unsubscribe fstests" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html
diff --git a/tests/xfs/999 b/tests/xfs/999 new file mode 100755 index 00000000..be26420a --- /dev/null +++ b/tests/xfs/999 @@ -0,0 +1,114 @@ +#! /bin/bash +# FS QA Test No. 999 +# +# Test a corruption when the directory structure and the inobt thinks the inode +# is free, but the inode on disk thinks it is still in use. +# +#----------------------------------------------------------------------- +# Copyright (c) 2018 Red Hat, Inc. All Rights Reserved. +# +# This program is free software; you can redistribute it and/or +# modify it under the terms of the GNU General Public License as +# published by the Free Software Foundation. +# +# This program is distributed in the hope that it would be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the +# GNU General Public License for more details. +# +# You should have received a copy of the GNU General Public License +# along with this program; if not, write the Free Software Foundation, +# Inc., 51 Franklin St, Fifth Floor, Boston, MA 02110-1301 USA +#----------------------------------------------------------------------- +# + +seq=`basename $0` +seqres=$RESULT_DIR/$seq +echo "QA output created by $seq" + +here=`pwd` +tmp=/tmp/$$ +status=1 # failure is the default! +trap "_cleanup; exit \$status" 0 1 2 3 15 + +_cleanup() +{ + cd / + rm -f $tmp.* +} + +# get standard environment, filters and checks +. ./common/rc +. ./common/filter + +# remove previous $seqres.full before test +rm -f $seqres.full + +# real QA test starts here + +# Modify as appropriate. +_supported_fs xfs +_supported_os Linux +_require_scratch_nocheck +_require_no_xfs_bug_on_assert + +_filter_dmesg() +{ + local warn1="Internal error xfs_trans_cancel.*fs/xfs/xfs_trans\.c.*" + local warn2="WARNING:.*fs/xfs/xfs_message\.c:.*assfail.*" + + sed -e "s#$warn1#Intentional error in xfs_trans_cancel#" \ + -e "s#$warn2#Intentional warnings in assfail#" +} + +# Use crc=0, due to this crash is only possible on v4 XFS or v5 XFS mounted +# with the ikeep mount option. For all other V5 XFS, this problem cannot +# occur because we don't read inodes we are allocating from disk - we simply +# overwrite them with the new inode information. +_scratch_mkfs_xfs -m crc=0 >> $seqres.full 2>&1 +blksz=$(_scratch_xfs_get_sb_field blocksize) +agcount=$(_scratch_xfs_get_sb_field agcount) + +_scratch_mount +# Create a directory for later allocation in same AG (AG 0, due to this's an +# empty XFS for now) +mkdir $SCRATCH_MNT/dir + +# Allocate 1 block for testfile +$XFS_IO_PROG -fc 'pwrite 0 $blksz' -c fsync $SCRATCH_MNT/dir/testfile >> $seqres.full +inum=`stat -c %i $SCRATCH_MNT/dir/testfile` +_scratch_unmount + +# Find the AG which contains testfile +agi=`_scratch_xfs_db -c "convert inode $inum agno" | sed -e 's/^.*(\([0-9]*\).*$/\1/g'` + +# Due to we only allocate 1 block for testfile, and this's the only one data +# block we use. So we use single level inobt, So the ${agi}->root->recs[1] +# should be the only one record points the chunk which contains testfile's +# inode. +# An exmaple of inode record is as below: +# recs[1] = [startino,freecount,free] 1:[1024,59,0xffffffffffffffe0] +freecount=$(_scratch_xfs_get_metadata_field "recs[1].freecount" \ + "agi $agi" "addr root") +fmask=$(_scratch_xfs_get_metadata_field "recs[1].free" "agi $agi" "addr root") + +# fmask shift right 1 bit, and freecount++, to mark testfile inode as free in +# inobt. (But the inode itself isn't freed, it still has allocated block) +freecount="$((freecount + 1))" +fmask="$((fmask / 2))" +_scratch_xfs_set_metadata_field "recs[1].freecount" "$freecount" \ + "agi $agi" "addr root" >/dev/null +_scratch_xfs_set_metadata_field "recs[1].free" "$fmask" \ + "agi $agi" "addr root" >/dev/null + +# Mount again and create a new inode cover that inode we just 'freed' from inobt +_scratch_mount +$XFS_IO_PROG -fc 'pwrite 0 $blksz' -c fsync $SCRATCH_MNT/dir/newfile 2>&1 | \ + grep -i "Structure needs cleaning" | _filter_scratch + +# filter a intentional internal errors +_check_dmesg _filter_dmesg + +# success, all done +status=0 +exit diff --git a/tests/xfs/999.out b/tests/xfs/999.out new file mode 100644 index 00000000..cb8d9e34 --- /dev/null +++ b/tests/xfs/999.out @@ -0,0 +1,2 @@ +QA output created by 999 +SCRATCH_MNT/dir/newfile: Structure needs cleaning diff --git a/tests/xfs/group b/tests/xfs/group index 32bf4f71..ec3cca62 100644 --- a/tests/xfs/group +++ b/tests/xfs/group @@ -442,3 +442,4 @@ 443 auto quick ioctl fsr 444 auto quick 445 auto quick filestreams +999 auto quick
There's a situation where the directory structure and the inobt thinks the inode is free, but the inode on disk thinks it is still in use. XFS should detect it and prevent the kernel from oopsing on lookup. Signed-off-by: Zorro Lang <zlang@redhat.com> --- Hi, V2 did below changes: 1) Fix Copyright 2) Use 'convert' command of xfs_db to get agino from inode number. V1 and related reply as below: https://marc.info/?l=fstests&m=152229518811044&w=2 Thanks, Zorro tests/xfs/999 | 114 ++++++++++++++++++++++++++++++++++++++++++++++++++++++ tests/xfs/999.out | 2 + tests/xfs/group | 1 + 3 files changed, 117 insertions(+) create mode 100755 tests/xfs/999 create mode 100644 tests/xfs/999.out