linux-btrfs.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Josef Bacik <jbacik@fb.com>
To: Filipe Manana <fdmanana@suse.com>, <linux-btrfs@vger.kernel.org>
Cc: <stable@vger.kernel.org>
Subject: Re: [PATCH v2] Btrfs: fix data loss after concurrent fsyncs for files in the same subvol
Date: Fri, 17 Apr 2015 14:26:09 -0400	[thread overview]
Message-ID: <55315041.4070509@fb.com> (raw)
In-Reply-To: <1429294846-9021-1-git-send-email-fdmanana@suse.com>

On 04/17/2015 02:20 PM, Filipe Manana wrote:
> If we have concurrent fsync calls against files living in the same subvolume,
> we have some time window where we don't add the collected ordered extents
> to the running transaction's list of ordered extents and return success to
> userspace. This can result in data loss if the ordered extents complete after
> the current transaction commits and a power failure happens after the current
> transaction commits and before the next one commits.
>
> A sequence of steps that lead to this:
>
>          CPU 0                                                         CPU 1
>
> btrfs_sync_file(inode A)                               btrfs_sync_file(inode B)
>    btrfs_log_inode_parent()                               btrfs_log_inode_parent()
>
>      start_log_trans()
>        lock root->log_mutex
>        ctx->log_transid = root->log_transid = N
>        unlock root->log_mutex
>
>                                                             start_log_trans()
>                                                               lock root->log_mutex
>                                                               ctx->log_transid = root->log_transid = N
>                                                               unlock root->log_mutex
>
>      btrfs_log_inode()                                          btrfs_log_inode()
>        btrfs_get_logged_extents()                                 btrfs_get_logged_extents()
>           --> gets orderede extent A                                -> gets ordered extent B
>               into local list logged_list                              into local list logged_list
>        write items into the log tree                              write items into the log tree
>        btrfs_submit_logged_extents(&logged_list)
>          --> splices logged_list into
>              log_root->logged_list[N % 2]
>              (N == log_root->log_transid)
>
>    btrfs_sync_log()
>      lock root->log_mutex
>
>      atomic_set(&root->log_commit[N % 2], 1)
>        (N == ctx->log_transid)

Except this can't happen, we have a wait_for_writer() in between here 
that will wait for CPU 1 to finish doing it's logging since it has 
already done it's start_log_trans().  Thanks,

Josef

  reply	other threads:[~2015-04-17 18:26 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-04-17 16:43 [PATCH] Btrfs: fix data loss after concurrent fsyncs for files in the same subvol Filipe Manana
2015-04-17 18:20 ` [PATCH v2] " Filipe Manana
2015-04-17 18:26   ` Josef Bacik [this message]
2015-04-17 19:37     ` Filipe David Manana

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=55315041.4070509@fb.com \
    --to=jbacik@fb.com \
    --cc=fdmanana@suse.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=stable@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).