From: harshad shirwadkar <harshadshirwadkar@gmail.com>
To: "Theodore Y. Ts'o" <tytso@mit.edu>, Andreas Dilger <adilger@dilger.ca>
Cc: Ext4 Developers List <linux-ext4@vger.kernel.org>
Subject: Re: [PATCH v2 05/12] jbd2: fast-commit commit path new APIs
Date: Mon, 12 Aug 2019 10:41:48 -0700 [thread overview]
Message-ID: <CAD+ocbxwraTHT0wPCZgtjC8mJ7OU6wpkd7PgC7_YW=G9W-arDQ@mail.gmail.com> (raw)
In-Reply-To: <20190812160445.GA28705@mit.edu>
Thanks Andreas and Ted for the review.
Yeah, this makes sense.
On Mon, Aug 12, 2019 at 9:04 AM Theodore Y. Ts'o <tytso@mit.edu> wrote:
>
> On Thu, Aug 08, 2019 at 08:45:45PM -0700, Harshad Shirwadkar wrote:
> > This patch adds new helper APIs that ext4 needs for fast
> > commits. These new fast commit APIs are used by subsequent fast commit
> > patches to implement fast commits. Following new APIs are added:
> >
> > /*
> > * Returns when either a full commit or a fast commit
> > * completes
> > */
> > int jbd2_fc_complete_commit(journal_tc *journal, tid_t tid,
> > tid_t tid, tid_t subtid)
>
> I think there is an opportunity to do something more efficient.
>
> Right now, the ext4_fsync() calls this function, and the file system
> can only do a "fast commit" if all of the modifications made to the
> file system to date are "fast commit eligible". Otherwise, we have to
> fall back to a normal, slow commit.
>
> We can make this decision on a much more granular level. Suppose that
> so far during the life of the current transaction, inodes A, B, and C
> have been modified. The modification to inode A is not fast commit
> eligible (maybe the inode is deleted, or it is involved in a directory
> rename, etc.). The modification to inode B is fast commit eligible,
> but an fsync was not requested for it. And the modification to inode
> C *is* fast commit eligble, *and* fsync() has been requested for it.
>
> We only need to write the information for inode C to the fast commit
> area. The fact that inode A is not fast commit eligible isn't a
> problem. It will get committed when the normal transaction closes,
> perhaps when the 5 second commit transaction timer expires. And inode
> B, even though its changes might be fast commit eligible, might
> require writing a large number of data blocks if it were included in
> the fast commit. So excluding inodes A and B from the fast commit,
> and only writing the logical changes corresponding to the those made
> to inode C, will allow a fast commit to take place.
>
> In order to do that, though, the ext4's fast commit machinery needs to
> know which inode we actually need to do the fast commit for. And so
> for that reason, it's actually probably better not to run the changes
> through the commit thread. That makes it harder to plumb the file
> system specific information through, and it also requires waking up
> the commit thread and waiting for it to get scheduled.
I see, so you mean each fsync() call will result in exactly one inode
to be committed (the inode on which fsync was called), right? I agree
this doesn't need to go through JBD2 but we need a mechanism to inform
JBD2 about this fast commit since JBD2 maintains sub-transaction ID.
JBD2 will in turn need to make sure that a subtid was allocated for
such a fast commit and it was incremented once the fast commit was
successful as well.
>
> Instead, ext4_fsync() could just call the fast commit machinery, and
> the only thing we need to expose is a way for the fast commit
> machinery to attempt to grab a mutex preventing the normal commit
> thread from starting a normal commit. If it loses the race, and the
> normal commit takes place before we manage to do the fast commit; then
> we don't need to do any thing more. Otherwise the fast commit
> machinery can do its thing, writing inode changes to the journal, and
> once it is done, it can release the mutex and ext4 fsync can return.
>
> Does that make sense?
Thanks for the suggestion, I will implement this in V3.
>
> - Ted
next prev parent reply other threads:[~2019-08-12 17:47 UTC|newest]
Thread overview: 36+ messages / expand[flat|nested] mbox.gz Atom feed top
2019-08-09 3:45 [PATCH v2 00/12] ext4: add support fast commit Harshad Shirwadkar
2019-08-09 3:45 ` [PATCH v2 01/12] ext4: add handling for extended mount options Harshad Shirwadkar
2019-08-09 19:41 ` Andreas Dilger
2019-08-12 14:22 ` Theodore Y. Ts'o
2019-08-09 3:45 ` [PATCH v2 02/12] jbd2: add fast commit fields to journal_s structure Harshad Shirwadkar
2019-08-09 19:48 ` Andreas Dilger
2019-10-01 7:50 ` harshad shirwadkar
2019-08-09 3:45 ` [PATCH v2 03/12] jbd2: fast commit setup and enable Harshad Shirwadkar
2019-08-09 20:02 ` Andreas Dilger
2019-10-01 7:52 ` harshad shirwadkar
2019-11-01 11:22 ` xiaohui li
2019-08-09 3:45 ` [PATCH v2 04/12] jbd2: fast-commit commit path changes Harshad Shirwadkar
2019-08-09 20:22 ` Andreas Dilger
2019-10-01 7:43 ` harshad shirwadkar
2019-08-09 3:45 ` [PATCH v2 05/12] jbd2: fast-commit commit path new APIs Harshad Shirwadkar
2019-08-09 20:38 ` Andreas Dilger
2019-08-09 21:11 ` Andreas Dilger
2019-08-09 21:20 ` harshad shirwadkar
2019-08-12 16:04 ` Theodore Y. Ts'o
2019-08-12 17:41 ` harshad shirwadkar [this message]
2019-08-12 18:01 ` Theodore Y. Ts'o
2019-08-09 3:45 ` [PATCH v2 06/12] jbd2: fast-commit recovery path changes Harshad Shirwadkar
2019-08-09 20:57 ` Andreas Dilger
2019-08-09 3:45 ` [PATCH v2 07/12] ext4: add fields that are needed to track changed files Harshad Shirwadkar
2019-08-09 21:23 ` Andreas Dilger
2019-10-01 7:50 ` harshad shirwadkar
2019-08-09 3:45 ` [PATCH v2 08/12] ext4: track changed files for fast commit Harshad Shirwadkar
2019-08-09 21:46 ` Andreas Dilger
2019-10-01 7:51 ` harshad shirwadkar
2019-08-09 3:45 ` [PATCH v2 09/12] ext4: fast-commit commit range tracking Harshad Shirwadkar
2019-08-09 3:45 ` [PATCH v2 10/12] ext4: fast-commit commit path changes Harshad Shirwadkar
2019-08-09 3:45 ` [PATCH v2 11/12] ext4: fast-commit recovery " Harshad Shirwadkar
2019-08-09 3:45 ` [PATCH v2 12/12] docs: Add fast commit documentation Harshad Shirwadkar
2019-08-16 1:00 ` Darrick J. Wong
2019-08-20 6:38 ` harshad shirwadkar
2019-08-21 15:21 ` Darrick J. Wong
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to='CAD+ocbxwraTHT0wPCZgtjC8mJ7OU6wpkd7PgC7_YW=G9W-arDQ@mail.gmail.com' \
--to=harshadshirwadkar@gmail.com \
--cc=adilger@dilger.ca \
--cc=linux-ext4@vger.kernel.org \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).