From: Miao Xie <miaox@cn.fujitsu.com>
To: bo.li.liu@oracle.com
Cc: dsterba@suse.cz, Linux Btrfs <linux-btrfs@vger.kernel.org>
Subject: Re: [PATCH 10/11] Btrfs: use bit operation for ->fs_state
Date: Tue, 15 Jan 2013 14:03:49 +0800 [thread overview]
Message-ID: <50F4F145.9060201@cn.fujitsu.com> (raw)
In-Reply-To: <20130115040301.GA2430@liubo>
On tue, 15 Jan 2013 12:03:03 +0800, Liu Bo wrote:
> On Mon, Jan 14, 2013 at 03:50:31PM +0800, Miao Xie wrote:
>> On thu, 10 Jan 2013 18:57:35 +0100, David Sterba wrote:
>>> On Thu, Jan 10, 2013 at 08:51:59PM +0800, Miao Xie wrote:
>>>> There is no lock to protect fs_info->fs_state, it will introduce some problems,
>>>> such as the value may be covered by the other task when several tasks modify
>>>> it. Now we use bit operation for it to fix the above problem.
>>>
>>> Can you please describe in more detail how does that happen and to what
>>> problems it leads?
>>
>> For example:
>> Task0 - CPU0 Task1 - CPU1
>> mov %fs_state rax
>> or $0x1 rax
>> mov %fs_state rax
>> or $0x2 rax
>> mov rax %fs_state
>> mov rax %fs_state
>>
>> The expected value is 3, but in fact, it is 2
>
> The code shows that fs_state is only set by open_ctree() and
> save_error_info(), how could the above race can happen?
The reason that the above race can not happen is because there is only one flag currently.
But as we know, ->fs_state can be accessed and updated by multi-task, so the current code
is error prone, if we add other flags, the above problem will happen to a certainty. (Adding
new flags is very likely) So why not write right and robust code at the beginning?
> Although I'm ok with this as a harmless cleanup patch, I'm afraid the commit log
> is not persuadable anyway.
I think the changelog is right since ->fs_state can be accessed and updated by multi-task
actually.
Thanks
Miao
>
> thanks,
> liubo
>
>>
>> Thanks
>> Miao
>>
>>>
>>> thanks,
>>> david
>>>
>>>> Signed-off-by: Miao Xie <miaox@cn.fujitsu.com>
>>>> ---
>>>> fs/btrfs/ctree.h | 4 +++-
>>>> fs/btrfs/disk-io.c | 5 +++--
>>>> fs/btrfs/file.c | 2 +-
>>>> fs/btrfs/scrub.c | 2 +-
>>>> fs/btrfs/super.c | 4 ++--
>>>> fs/btrfs/transaction.c | 9 ++++-----
>>>> 6 files changed, 14 insertions(+), 12 deletions(-)
>>>>
>>>> diff --git a/fs/btrfs/ctree.h b/fs/btrfs/ctree.h
>>>> index c95b539..c34e36e 100644
>>>> --- a/fs/btrfs/ctree.h
>>>> +++ b/fs/btrfs/ctree.h
>>>> @@ -338,7 +338,9 @@ static inline unsigned long btrfs_chunk_item_size(int num_stripes)
>>>> /*
>>>> * File system states
>>>> */
>>>> +#define BTRFS_FS_STATE_ERROR 0
>>>>
>>>> +/* Super block flags */
>>>> /* Errors detected */
>>>> #define BTRFS_SUPER_FLAG_ERROR (1ULL << 2)
>>>>
>>>> @@ -1540,7 +1542,7 @@ struct btrfs_fs_info {
>>>> u64 qgroup_seq;
>>>>
>>>> /* filesystem state */
>>>> - u64 fs_state;
>>>> + unsigned long fs_state;
>>>>
>>>> struct btrfs_delayed_root *delayed_root;
>>>>
>>>> diff --git a/fs/btrfs/disk-io.c b/fs/btrfs/disk-io.c
>>>> index cf03a45..d06e50c 100644
>>>> --- a/fs/btrfs/disk-io.c
>>>> +++ b/fs/btrfs/disk-io.c
>>>> @@ -2196,7 +2196,8 @@ int open_ctree(struct super_block *sb,
>>>> goto fail_alloc;
>>>>
>>>> /* check FS state, whether FS is broken. */
>>>> - fs_info->fs_state |= btrfs_super_flags(disk_super);
>>>> + if (btrfs_super_flags(disk_super) & BTRFS_SUPER_FLAG_ERROR)
>>>> + set_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state);
>>>>
>>>> ret = btrfs_check_super_valid(fs_info, sb->s_flags & MS_RDONLY);
>>>> if (ret) {
>>>> @@ -3354,7 +3355,7 @@ int close_ctree(struct btrfs_root *root)
>>>> printk(KERN_ERR "btrfs: commit super ret %d\n", ret);
>>>> }
>>>>
>>>> - if (fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR)
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state))
>>>> btrfs_error_commit_super(root);
>>>>
>>>> btrfs_put_block_group_cache(fs_info);
>>>> diff --git a/fs/btrfs/file.c b/fs/btrfs/file.c
>>>> index 3e9fa0e..ec87b69 100644
>>>> --- a/fs/btrfs/file.c
>>>> +++ b/fs/btrfs/file.c
>>>> @@ -1531,7 +1531,7 @@ static ssize_t btrfs_file_aio_write(struct kiocb *iocb,
>>>> * although we have opened a file as writable, we have
>>>> * to stop this write operation to ensure FS consistency.
>>>> */
>>>> - if (root->fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR) {
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &root->fs_info->fs_state)) {
>>>> mutex_unlock(&inode->i_mutex);
>>>> err = -EROFS;
>>>> goto out;
>>>> diff --git a/fs/btrfs/scrub.c b/fs/btrfs/scrub.c
>>>> index af0b566..2e91b56 100644
>>>> --- a/fs/btrfs/scrub.c
>>>> +++ b/fs/btrfs/scrub.c
>>>> @@ -2700,7 +2700,7 @@ static noinline_for_stack int scrub_supers(struct scrub_ctx *sctx,
>>>> int ret;
>>>> struct btrfs_root *root = sctx->dev_root;
>>>>
>>>> - if (root->fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR)
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &root->fs_info->fs_state))
>>>> return -EIO;
>>>>
>>>> gen = atomic64_read(&root->fs_info->last_trans_committed);
>>>> diff --git a/fs/btrfs/super.c b/fs/btrfs/super.c
>>>> index 6f0524d..f714379 100644
>>>> --- a/fs/btrfs/super.c
>>>> +++ b/fs/btrfs/super.c
>>>> @@ -98,7 +98,7 @@ static void __save_error_info(struct btrfs_fs_info *fs_info)
>>>> * today we only save the error info into ram. Long term we'll
>>>> * also send it down to the disk
>>>> */
>>>> - fs_info->fs_state = BTRFS_SUPER_FLAG_ERROR;
>>>> + set_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state);
>>>> }
>>>>
>>>> static void save_error_info(struct btrfs_fs_info *fs_info)
>>>> @@ -114,7 +114,7 @@ static void btrfs_handle_error(struct btrfs_fs_info *fs_info)
>>>> if (sb->s_flags & MS_RDONLY)
>>>> return;
>>>>
>>>> - if (fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR) {
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state)) {
>>>> sb->s_flags |= MS_RDONLY;
>>>> printk(KERN_INFO "btrfs is forced readonly\n");
>>>> /*
>>>> diff --git a/fs/btrfs/transaction.c b/fs/btrfs/transaction.c
>>>> index 7999bf8..a950d48 100644
>>>> --- a/fs/btrfs/transaction.c
>>>> +++ b/fs/btrfs/transaction.c
>>>> @@ -62,7 +62,7 @@ static noinline int join_transaction(struct btrfs_root *root, int type)
>>>> spin_lock(&fs_info->trans_lock);
>>>> loop:
>>>> /* The file system has been taken offline. No new transactions. */
>>>> - if (fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR) {
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state)) {
>>>> spin_unlock(&fs_info->trans_lock);
>>>> return -EROFS;
>>>> }
>>>> @@ -114,7 +114,7 @@ loop:
>>>> kmem_cache_free(btrfs_transaction_cachep, cur_trans);
>>>> cur_trans = fs_info->running_transaction;
>>>> goto loop;
>>>> - } else if (fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR) {
>>>> + } else if (test_bit(BTRFS_FS_STATE_ERROR, &fs_info->fs_state)) {
>>>> spin_unlock(&fs_info->trans_lock);
>>>> kmem_cache_free(btrfs_transaction_cachep, cur_trans);
>>>> return -EROFS;
>>>> @@ -302,7 +302,7 @@ start_transaction(struct btrfs_root *root, u64 num_items, int type,
>>>> int ret;
>>>> u64 qgroup_reserved = 0;
>>>>
>>>> - if (root->fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR)
>>>> + if (test_bit(BTRFS_FS_STATE_ERROR, &root->fs_info->fs_state))
>>>> return ERR_PTR(-EROFS);
>>>>
>>>> if (current->journal_info) {
>>>> @@ -635,9 +635,8 @@ static int __btrfs_end_transaction(struct btrfs_trans_handle *trans,
>>>> btrfs_run_delayed_iputs(root);
>>>>
>>>> if (trans->aborted ||
>>>> - root->fs_info->fs_state & BTRFS_SUPER_FLAG_ERROR) {
>>>> + test_bit(BTRFS_FS_STATE_ERROR, &root->fs_info->fs_state))
>>>> err = -EIO;
>>>> - }
>>>> assert_qgroups_uptodate(trans);
>>>>
>>>> memset(trans, 0, sizeof(*trans));
>>>> --
>>>> 1.7.11.7
>>>> --
>>>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
>>>> the body of a message to majordomo@vger.kernel.org
>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
>>> the body of a message to majordomo@vger.kernel.org
>>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>>
>>
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
>> the body of a message to majordomo@vger.kernel.org
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
> --
> To unsubscribe from this list: send the line "unsubscribe linux-btrfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
>
next prev parent reply other threads:[~2013-01-15 6:03 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-01-10 12:51 [PATCH 10/11] Btrfs: use bit operation for ->fs_state Miao Xie
2013-01-10 17:57 ` David Sterba
2013-01-14 7:50 ` Miao Xie
2013-01-15 4:03 ` Liu Bo
2013-01-15 6:03 ` Miao Xie [this message]
2013-01-16 13:03 ` David Sterba
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=50F4F145.9060201@cn.fujitsu.com \
--to=miaox@cn.fujitsu.com \
--cc=bo.li.liu@oracle.com \
--cc=dsterba@suse.cz \
--cc=linux-btrfs@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.