From: Junxiao Bi <junxiao.bi@oracle.com>
To: Joseph Qi <joseph.qi@huawei.com>,
"ocfs2-devel@oss.oracle.com" <ocfs2-devel@oss.oracle.com>,
linux-ext4@vger.kernel.org
Subject: Re: [Ocfs2-devel] ocfs2 inconsistent when updating journal superblock failed
Date: Wed, 03 Jun 2015 10:40:03 +0800 [thread overview]
Message-ID: <556E6903.9090800@oracle.com> (raw)
In-Reply-To: <556D5FAC.20702@huawei.com>
Hi Joseph,
On 06/02/2015 03:47 PM, Joseph Qi wrote:
> Hi all,
> If jbd2 has failed to update superblock because of iscsi link down, it
> may cause ocfs2 inconsistent.
>
> kernel version: 3.0.93
> dmesg:
> JBD2: I/O error detected when updating journal superblock for
> dm-41-36.
>
> Case description:
> Node 1 was doing the checkpoint of global bitmap.
> ocfs2_commit_thread
> ocfs2_commit_cache
> jbd2_journal_flush
> jbd2_cleanup_journal_tail
> jbd2_journal_update_superblock
> sync_dirty_buffer
> submit_bh *failed*
> Since the error was ignored, jbd2_journal_flush would return 0.
> Then ocfs2_commit_cache thought it normal, incremented trans id and woke
> downconvert thread.
> So node 2 could get the lock because the checkpoint had been done
> successfully (in fact, bitmap on disk had been updated but journal
> superblock not). Then node 2 did the update to global bitmap as normal.
> After a while, node 2 found node 1 down and began the journal recovery.
> As a result, the new update by node 2 would be overwritten and filesystem
> became inconsistent.
If this is the case, this seemed a generic issue. Assume a two node
cluster, node 1 updated global bitmap, and the transaction for this
update have been written into node 1's journal. Then node 2 updated
global bitmap, after that, node 1 crash and node 2 replay node 1's
journal and will overwrite global bitmap to old one. Do i miss some point?
Thanks,
Junxiao.
>
> I'm not sure if ext4 has the same case (can it be deployed on LUN?).
> But for ocfs2, I don't think the error can be omitted.
> Any ideas about this?
>
> Thanks,
> Joseph
>
>
> _______________________________________________
> Ocfs2-devel mailing list
> Ocfs2-devel@oss.oracle.com
> https://oss.oracle.com/mailman/listinfo/ocfs2-devel
>
next prev parent reply other threads:[~2015-06-03 2:42 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-02 7:47 ocfs2 inconsistent when updating journal superblock failed Joseph Qi
2015-06-03 2:40 ` Junxiao Bi [this message]
2015-06-03 3:52 ` [Ocfs2-devel] " Joseph Qi
2015-06-03 6:58 ` Junxiao Bi
2015-06-03 7:27 ` [Ocfs2-devel] " Joseph Qi
2015-06-03 7:38 ` Junxiao Bi
2015-06-04 11:26 ` Joseph Qi
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=556E6903.9090800@oracle.com \
--to=junxiao.bi@oracle.com \
--cc=joseph.qi@huawei.com \
--cc=linux-ext4@vger.kernel.org \
--cc=ocfs2-devel@oss.oracle.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).