From: Chao Yu <chao@kernel.org>
To: Jaegeuk Kim <jaegeuk@kernel.org>, Yunlong Song <yunlong.song@huawei.com>
Cc: yuchao0@huawei.com, yunlong.song@icloud.com, miaoxie@huawei.com,
bintian.wang@huawei.com, shengyong1@huawei.com,
heyunlei@huawei.com, linux-f2fs-devel@lists.sourceforge.net,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH] f2fs: avoid GC causing encrypted file corrupted
Date: Fri, 21 Sep 2018 22:20:20 +0800 [thread overview]
Message-ID: <aa837149-1dc3-9b75-a54c-00703781b2b4@kernel.org> (raw)
In-Reply-To: <20180918181705.GG91945@jaegeuk-macbookpro.roam.corp.google.com>
On 2018/9/19 2:17, Jaegeuk Kim wrote:
> On 09/18, Yunlong Song wrote:
>> The encrypted file may be corrupted by GC in following case:
>>
>> Time 1: | segment 1 blkaddr = A | GC -> | segment 2 blkaddr = B |
>> Encrypted block 1 is moved from blkaddr A of segment 1 to blkaddr B of
>> segment 2,
>>
>> Time 2: | segment 1 blkaddr = B | GC -> | segment 3 blkaddr = C |
>
> segment 2 blkaddr = B?
>
>>
>> Before page 1 is written back and if segment 2 become a victim, then
>> page 1 is moved from blkaddr B of segment 2 to blkaddr Cof segment 3,
>
> C of ?
>
>> during the GC process of Time 2, f2fs should wait for page 1 written back
>> before reading it, or move_data_block will read a garbage block from
>> blkaddr B since page is not written back to blkaddr B yet.
>
> move_data_block() checks PageUptodate() so it won't get garbage, yes?
I think the problem here is:
Thread A Background GC Thread
- writepage
- f2fs_outplace_write_data
fio->encrypted_page is in-flight
- gc_data_segment
- ra_data_block
- f2fs_pagecache_get_page
- f2fs_submit_page_bio
cache garbage data in meta page
Device
Receive encrypted data
- f2fs_write_end_io
- move_data_block
- f2fs_pagecache_get_page
- if (PageUptodate(mpage)) memcpy()
So here we copy garbage data into meta page
- f2fs_submit_page_write
Here we migrate incorrect data to new address
> So, does ra_data_block need to check PageUptodate?
Yes, I think so, could improve this in another patch.
Thanks,
>
>>
>> Commit 6aa58d8a ("f2fs: readahead encrypted block during GC") introduce
>> ra_data_block to read encrypted block, but it forgets to add
>> f2fs_wait_on_page_writeback to avoid racing between GC and flush.
>>
>> Signed-off-by: Yunlong Song <yunlong.song@huawei.com>
>> ---
>> fs/f2fs/gc.c | 10 ++++++++++
>> 1 file changed, 10 insertions(+)
>>
>> diff --git a/fs/f2fs/gc.c b/fs/f2fs/gc.c
>> index a4c1a41..c55fb62 100644
>> --- a/fs/f2fs/gc.c
>> +++ b/fs/f2fs/gc.c
>> @@ -641,6 +641,14 @@ static int ra_data_block(struct inode *inode, pgoff_t index)
>> fio.page = page;
>> fio.new_blkaddr = fio.old_blkaddr = dn.data_blkaddr;
>>
>> + /*
>> + * don't cache encrypted data into meta inode until previous dirty
>> + * data were writebacked to avoid racing between GC and flush.
>> + */
>> + f2fs_wait_on_page_writeback(page, DATA, true);
>> +
>> + f2fs_wait_on_block_writeback(inode, dn.data_blkaddr);
>> +
>> fio.encrypted_page = f2fs_pagecache_get_page(META_MAPPING(sbi),
>> dn.data_blkaddr,
>> FGP_LOCK | FGP_CREAT, GFP_NOFS);
>> @@ -723,6 +731,8 @@ static void move_data_block(struct inode *inode, block_t bidx,
>> */
>> f2fs_wait_on_page_writeback(page, DATA, true);
>>
>> + f2fs_wait_on_block_writeback(inode, dn.data_blkaddr);
>> +
>> err = f2fs_get_node_info(fio.sbi, dn.nid, &ni);
>> if (err)
>> goto put_out;
>> --
>> 1.8.5.2
next prev parent reply other threads:[~2018-09-21 14:20 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-09-18 12:39 [PATCH] f2fs: avoid GC causing encrypted file corrupted Yunlong Song
2018-09-18 13:21 ` Chao Yu
2018-11-13 3:16 ` Chao Yu
2018-11-14 21:54 ` Jaegeuk Kim
2018-09-18 18:17 ` Jaegeuk Kim
2018-09-19 2:37 ` Yunlong Song
2018-09-21 14:20 ` Chao Yu [this message]
2018-10-24 8:07 ` Yunlong Song
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=aa837149-1dc3-9b75-a54c-00703781b2b4@kernel.org \
--to=chao@kernel.org \
--cc=bintian.wang@huawei.com \
--cc=heyunlei@huawei.com \
--cc=jaegeuk@kernel.org \
--cc=linux-f2fs-devel@lists.sourceforge.net \
--cc=linux-kernel@vger.kernel.org \
--cc=miaoxie@huawei.com \
--cc=shengyong1@huawei.com \
--cc=yuchao0@huawei.com \
--cc=yunlong.song@huawei.com \
--cc=yunlong.song@icloud.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).