From: NeilBrown <neilb@suse.com>
To: Anton Altaparmakov <anton@tuxera.com>,
Christoph Hellwig <hch@infradead.org>
Cc: Jens Axboe <axboe@fb.com>, Theodore Ts'o <tytso@mit.edu>,
linux-fsdevel <linux-fsdevel@vger.kernel.org>,
Sougata Santra <sougata@tuxera.com>
Subject: Re: Device removal crash problems
Date: Fri, 08 Jul 2016 11:03:48 +1000 [thread overview]
Message-ID: <87eg75eya3.fsf@notabene.neil.brown.name> (raw)
In-Reply-To: <1D4D1B85-F772-4795-A938-852F46969499@tuxera.com>
[-- Attachment #1: Type: text/plain, Size: 1389 bytes --]
On Mon, Jun 13 2016, Anton Altaparmakov wrote:
> Hi Christoph,
>
> I think the reason the storage unplug crashes came back in 4.1 kernel after your work in 4.0 kernel to fix them is this commit: 6cd18e711dd8 "block: destroy bdi before blockdev is unregistered."
>
> The fix was to basically violate the lifetime rules/reference counting you put in place and destroy the bdi before the reference count reaches zero which means we are back at square one! The whole point of the reference count was specifically so that devices are not destroyed before the reference count becomes zero. Or at least that was my understanding/assumption...
>
> The solution should have perhaps been to fix MD and Loop drivers rather than to break the entire kernel all over again and then patch up ext4 again (commit bdfe0cbd746aa9b2509c2f6d6be17193cf7facd7).
>
> The check in ext4 is not perfect because it is a race condition - if you unplug at same time as the check is happening you can still get the kernel to crash. I grant you it is a very small race window but it is there.
>
> What do you think?
Is this problem fixed by
Commit: b02176f30cd3 ("block: don't release bdi while request_queue has live references")
(in 4.3-rc7)?
With that patch the unregistering is done early enough for md and loop,
but the freeing should be done late enough to not inconvenience
filesystems.
Thanks,
NeilBrown
[-- Attachment #2: signature.asc --]
[-- Type: application/pgp-signature, Size: 818 bytes --]
prev parent reply other threads:[~2016-07-08 1:04 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-06-13 1:26 Device removal crash problems Anton Altaparmakov
2016-06-15 13:07 ` Christoph Hellwig
2016-07-08 1:03 ` NeilBrown [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=87eg75eya3.fsf@notabene.neil.brown.name \
--to=neilb@suse.com \
--cc=anton@tuxera.com \
--cc=axboe@fb.com \
--cc=hch@infradead.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=sougata@tuxera.com \
--cc=tytso@mit.edu \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).