From: Joel Fernandes <agnel.joel@gmail.com>
To: Mulyadi Santosa <mulyadi.santosa@gmail.com>
Cc: linux-fsdevel@vger.kernel.org,
kernelnewbies <kernelnewbies@nl.linux.org>
Subject: Re: ext3 writing of data before metadata in ordered mode
Date: Mon, 26 Oct 2009 00:17:26 -0700 [thread overview]
Message-ID: <9ff7a3bc0910260017v22ed53c1q949f22201e8b048f@mail.gmail.com> (raw)
In-Reply-To: <f284c33d0910252140r648e1dd6q326a0db1e6b8e01f@mail.gmail.com>
Hi Mulyadi,
Thanks for your opinion. Well if you ask me, JBD and the I/O scheduler
are 2 independent layers, so don't think the ordering of the data and
metadata is done at that level. But there is something about the data
completion handler you're talking about - I think.
Simplistically,
During a write() In data=ordered mode:
1. During updating of metadata (before the data is copied), the kernel
updates the metadata buffers and moves the metadata block to a list in
the active trasaction (which is going to be logged).
2. Then the actually data buffers (memory) are updated with the contents.
3. Then journal_dirty_data is called on each affected data buffer
(this apparently ensures that data is written before the metadata - I
don't know how)
4. And then the block buffers are committed (marked as dirty so that
the page flushing mechanism can send them to disk).
Now steps 3 and 4 seem to be independent therefore I don't know how
step 3 knows when step 4 completes? The only way I can think of is
step 4 sends calls a callback after its done to step 3 somehow?
Let me know if the above analysis makes sense, Thanks.
-Joel
On Sun, Oct 25, 2009 at 9:40 PM, Mulyadi Santosa
<mulyadi.santosa@gmail.com> wrote:
> Hi Joel...
>
> On Mon, Oct 26, 2009 at 4:33 AM, Joel Fernandes <agnel.joel@gmail.com> wrote:
>> In data=ordered mode the ext3_ordered_commit_write function marks the
>> buffers as dirty, how then does the JBD ensure that the data is
>> written before the metadata? Once the data buffers are marked as
>> dirty, JBD doesn't have control anymore over when the data is written
>> is actually written to disk right? Because the actually writing of the
>> data is handled by the page wtriteback mechanism (pdflush) right?
>
> I am not an expert, but here's my thought:
>
> I think writing to backing device is not done simply marking the
> buffer/page cache dirty. So, I think what kernel does is first prepare
> an I/O queue to update ext3 journal. Since we talk about data=ordered
> here, only metadata are logged.
>
> Perhaps the key here is, metadata writing is done as a async
> completion handler of data writing handler. Thus, data is written
> first, followed by metadata logging
>
> Another possibility is composing a single atomic I/O writing request,
> composed of data writing and metadata logging. Thus, I/O scheduler
> won't be able to re-order the request and must complete the sequence
> as we prepared.
>
> --
> regards,
>
> Mulyadi Santosa
> Freelance Linux trainer and consultant
>
> blog: the-hydra.blogspot.com
> training: mulyaditraining.blogspot.com
>
--
To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
next prev parent reply other threads:[~2009-10-26 7:17 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-10-25 21:33 ext3 writing of data before metadata in ordered mode Joel Fernandes
2009-10-26 4:40 ` Mulyadi Santosa
2009-10-26 7:17 ` Joel Fernandes [this message]
2009-10-26 13:19 ` Josef Bacik
2009-10-26 17:21 ` Joel Fernandes
2009-10-26 17:58 ` Josef Bacik
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=9ff7a3bc0910260017v22ed53c1q949f22201e8b048f@mail.gmail.com \
--to=agnel.joel@gmail.com \
--cc=kernelnewbies@nl.linux.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=mulyadi.santosa@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).