All of lore.kernel.org
 help / color / mirror / Atom feed
From: Josef Bacik <jbacik@fb.com>
To: Filipe Manana <fdmanana@suse.com>, <linux-btrfs@vger.kernel.org>
Cc: <stable@vger.kernel.org>
Subject: Re: [PATCH v2] Btrfs: fix data loss after concurrent fsyncs for files in the same subvol
Date: Fri, 17 Apr 2015 14:26:09 -0400	[thread overview]
Message-ID: <55315041.4070509@fb.com> (raw)
In-Reply-To: <1429294846-9021-1-git-send-email-fdmanana@suse.com>

On 04/17/2015 02:20 PM, Filipe Manana wrote:
> If we have concurrent fsync calls against files living in the same subvolume,
> we have some time window where we don't add the collected ordered extents
> to the running transaction's list of ordered extents and return success to
> userspace. This can result in data loss if the ordered extents complete after
> the current transaction commits and a power failure happens after the current
> transaction commits and before the next one commits.
>
> A sequence of steps that lead to this:
>
>          CPU 0                                                         CPU 1
>
> btrfs_sync_file(inode A)                               btrfs_sync_file(inode B)
>    btrfs_log_inode_parent()                               btrfs_log_inode_parent()
>
>      start_log_trans()
>        lock root->log_mutex
>        ctx->log_transid = root->log_transid = N
>        unlock root->log_mutex
>
>                                                             start_log_trans()
>                                                               lock root->log_mutex
>                                                               ctx->log_transid = root->log_transid = N
>                                                               unlock root->log_mutex
>
>      btrfs_log_inode()                                          btrfs_log_inode()
>        btrfs_get_logged_extents()                                 btrfs_get_logged_extents()
>           --> gets orderede extent A                                -> gets ordered extent B
>               into local list logged_list                              into local list logged_list
>        write items into the log tree                              write items into the log tree
>        btrfs_submit_logged_extents(&logged_list)
>          --> splices logged_list into
>              log_root->logged_list[N % 2]
>              (N == log_root->log_transid)
>
>    btrfs_sync_log()
>      lock root->log_mutex
>
>      atomic_set(&root->log_commit[N % 2], 1)
>        (N == ctx->log_transid)

Except this can't happen, we have a wait_for_writer() in between here 
that will wait for CPU 1 to finish doing it's logging since it has 
already done it's start_log_trans().  Thanks,

Josef

  reply	other threads:[~2015-04-17 18:26 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2015-04-17 16:43 [PATCH] Btrfs: fix data loss after concurrent fsyncs for files in the same subvol Filipe Manana
2015-04-17 18:20 ` [PATCH v2] " Filipe Manana
2015-04-17 18:26   ` Josef Bacik [this message]
2015-04-17 19:37     ` Filipe David Manana

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=55315041.4070509@fb.com \
    --to=jbacik@fb.com \
    --cc=fdmanana@suse.com \
    --cc=linux-btrfs@vger.kernel.org \
    --cc=stable@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.