From: Chandan Rajendra <chandan@linux.vnet.ibm.com>
To: Anand Jain <anand.jain@oracle.com>
Cc: linux-btrfs@vger.kernel.org
Subject: Re: [PATCH] generic/311: Disable dmesg check
Date: Wed, 22 Feb 2017 15:02:10 +0530 [thread overview]
Message-ID: <2485150.WmAGPpieTt@localhost.localdomain> (raw)
In-Reply-To: <44614455-beac-5191-59c8-2bda3cd14881@oracle.com>
On Monday, February 20, 2017 11:03:11 PM Anand Jain wrote:
>
> Hi Chandan,
>
> On 07/17/15 12:56, Chandan Rajendra wrote:
> > When running generic/311 on Btrfs' subpagesize-blocksize patchset (on ppc64
> > with 4k sectorsize and 16k node/leaf size) I noticed the following call trace,
> >
> > BTRFS (device dm-0): parent transid verify failed on 29720576 wanted 160 found 158
> > BTRFS (device dm-0): parent transid verify failed on 29720576 wanted 160 found 158
> > BTRFS: Transaction aborted (error -5)
> >
> > WARNING: at /root/repos/linux/fs/btrfs/super.c:260
> > Modules linked in:
> > CPU: 3 PID: 30769 Comm: umount Tainted: G W L 4.0.0-rc5-11671-g8b82e73e #63
> > task: c000000079aaddb0 ti: c000000079a48000 task.ti: c000000079a48000
> > NIP: c000000000499aa0 LR: c000000000499a9c CTR: c000000000779630
> > REGS: c000000079a4b480 TRAP: 0700 Tainted: G W L (4.0.0-rc5-11671-g8b82e73e)
> > MSR: 8000000100029032 <SF,EE,ME,IR,DR,RI> CR: 28008828 XER: 20000000
> > CFAR: c000000000a23914 SOFTE: 1
> > GPR00: c000000000499a9c c000000079a4b700 c00000000103bdf8 0000000000000025
> > GPR04: 0000000000000001 0000000000000502 c00000000107e918 0000000000000cda
> > GPR08: 0000000000000007 0000000000000007 0000000000000001 c0000000010f5044
> > GPR12: 0000000028008822 c00000000fdc0d80 0000000020000000 0000000010152e00
> > GPR16: 0000010002979380 0000000010140724 0000000000000000 0000000000000000
> > GPR20: ffffffffffffffff 0000000000000000 0000000000000000 0000000000000000
> > GPR24: c0000000151f61a8 0000000000000000 c000000055e5e800 c000000000aac270
> > GPR28: 00000000000004a4 fffffffffffffffb c000000055e5e800 c0000000679204d0
> > NIP [c000000000499aa0] .__btrfs_abort_transaction+0x180/0x190
> > LR [c000000000499a9c] .__btrfs_abort_transaction+0x17c/0x190
> > Call Trace:
> > [c000000079a4b700] [c000000000499a9c] .__btrfs_abort_transaction+0x17c/0x190 (unreliable)
> > [c000000079a4b7a0] [c000000000541678] .__btrfs_run_delayed_items+0xe8/0x220
> > [c000000079a4b850] [c0000000004d5b3c] .btrfs_commit_transaction+0x37c/0xca0
> > [c000000079a4b960] [c00000000049824c] .btrfs_sync_fs+0x6c/0x1a0
> > [c000000079a4ba00] [c000000000255270] .sync_filesystem+0xd0/0x100
> > [c000000079a4ba80] [c000000000218070] .generic_shutdown_super+0x40/0x170
> > [c000000079a4bb10] [c000000000218598] .kill_anon_super+0x18/0x30
> > [c000000079a4bb90] [c000000000498418] .btrfs_kill_super+0x18/0xc0
> > [c000000079a4bc10] [c000000000218ac8] .deactivate_locked_super+0x98/0xe0
> > [c000000079a4bc90] [c00000000023e744] .cleanup_mnt+0x54/0xa0
> > [c000000079a4bd10] [c0000000000b7d14] .task_work_run+0x114/0x150
> > [c000000079a4bdb0] [c000000000015f84] .do_notify_resume+0x74/0x80
> > [c000000079a4be30] [c000000000009838] .ret_from_except_lite+0x64/0x68
> > Instruction dump:
> > ebc1fff0 ebe1fff8 4bfffb28 60000000 3ce2ffcd 38e7e818 4bffffbc 3c62ffd2
> > 7fa4eb78 3863b808 48589e1d 60000000 <0fe00000> 4bfffedc 60000000 60000000
> > BTRFS: error (device dm-0) in __btrfs_run_delayed_items:1188: errno=-5 IO failure
> >
> >
> > The call trace is seen when executing _run_test() for the 8th time.
> > The above trace is actually a false-positive failure as indicated below,
> > fsync-tester
> > fsync(fd)
> > Write delayed inode item to fs tree
> > (assume transid to be 160)
> > (assume tree block to start at logical address 29720576)
> > md5sum $testfile
> > This causes a delayed inode to be added
> > Load flakey table
> > i.e. drop writes that are initiated from now onwards
> > Unmount filesystem
> > btrfs_sync_fs is invoked
> > Write 29720576 metadata block to disk
> > free_extent_buffer(29720576)
> > release_extent_buffer(29720576)
> > Start writing delayed inode
> > Traverse the fs tree
> > (assume the parent tree block of 29720576 is still in memory)
> > When reading 29720576 from disk, parent's blkptr will have generation
> > set to 160. But the on-disk tree block will have an older
> > generation (say, 158). Transid verification fails and hence the
> > transaction gets aborted
> >
> > The test only cares about the FS instance before the unmount
> > operation (i.e. the synced FS). Hence to get the test to pass, ignore the
> > false-positive trace that could be generated.
>
> Looks like this patch didn't make it, is there any kernel patch
> which fixed this bug ? Or any hints on how to reproduce this bug ?
>
Hi Anand,
This bug is easily recreated when executing the test on Btrfs with
subpage-blocksize patchset applied. I haven't been able to test the recently
rebased subpage-blocksize patchset yet.
Coming back to the issue ... The problem exists because the test code uses
dm-flakey. Josef had suggested that using dm-log-writes instead of dm-flakey
should fix the problem. I will work on this and post a patch soon.
--
chandan
next prev parent reply other threads:[~2017-02-22 9:32 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-07-17 4:56 [PATCH] generic/311: Disable dmesg check Chandan Rajendra
2015-07-17 10:16 ` Brian Foster
2015-07-17 11:40 ` Chandan Rajendra
2015-07-19 23:54 ` Dave Chinner
2015-07-20 12:55 ` Josef Bacik
2015-07-20 22:12 ` Dave Chinner
2015-07-21 10:33 ` Chandan Rajendra
2015-07-22 0:01 ` Dave Chinner
2015-07-22 0:37 ` Josef Bacik
2015-07-22 5:27 ` Dave Chinner
2015-07-22 11:28 ` Josef Bacik
2015-07-22 21:10 ` Dave Chinner
2015-07-22 12:47 ` Josef Bacik
2015-07-22 23:02 ` Dave Chinner
2017-02-20 15:03 ` Anand Jain
2017-02-22 9:32 ` Chandan Rajendra [this message]
2017-02-23 6:40 ` Anand Jain
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=2485150.WmAGPpieTt@localhost.localdomain \
--to=chandan@linux.vnet.ibm.com \
--cc=anand.jain@oracle.com \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).