All of lore.kernel.org
 help / color / mirror / Atom feed
From: Marc MERLIN <marc@merlins.org>
To: Eric Wheeler <bcache@lists.ewheeler.net>
Cc: Zhu Yanhai <zhu.yanhai@gmail.com>,
	Christoph Nelles <evilazrael@evilazrael.de>,
	linux-bcache@vger.kernel.org
Subject: Re: BUG: drivers/md/bcache/writeback.c:237
Date: Thu, 25 Feb 2016 16:17:13 -0800	[thread overview]
Message-ID: <20160226001713.GC29543@merlins.org> (raw)
In-Reply-To: <alpine.LRH.2.11.1602252341070.3635@mail.ewheeler.net>

On Thu, Feb 25, 2016 at 11:44:01PM +0000, Eric Wheeler wrote:
> 
> On Thu, 25 Feb 2016, Marc MERLIN wrote:
> 
> > On Thu, Feb 25, 2016 at 06:18:03PM +0800, Zhu Yanhai wrote:
> > > Marc,
> > > When did you *first* see this BUG_ON? During boot up or far after the
> > > whole system is up?
> > 
> > I setup bcache, had it work a bit.
> > During shutdown, it crapped out when syncing/unmounting
> 
> Do you have more information about what crapped out on shutdown?  
>    Memory?
>    IO Error?
>    Just hung so the reset button was pushed?
 
Sadly, I didn't record that carefully. I remember a kernel traceback and
system hang, but that's about it.

> Was there a backtrace?

Yes, but not captured :(

> It might be a good idea to use netconsole and point it at a syslog 
> server to catch the whole backtrace.

Next time sure, but too late now, and no netconsole if the system crashes
before I can even bring the ethernet up.

> > After each subsequent boot, the system crashed as soon as the module loaded
> > and scanned my disks.
> 
> So just to clarify, the current BUG_ON discussed is happening at boot, and 
> is not the mid-shutdown error that first happened?

Correct. The only reason my system is booting right now is that I hid the
partition where the bcache cache is, so it doesn't get seen at boot.
If it's there, as soon as the kernel boots and bcache activates, it crashes.
I'm keeping it in case you want me to try a patch to see if it'll stop the
crashing at boot.

Hence my request for a way to turn off bcache as a kernel command line option 
to allow for recovery in such cases.

Sadly even if we don't have perfect state on how we got there, bad data
shouldn't cause bcache to crash the kernel at boot. It could refuse to make
the cache active, log an error, and move on.

Marc
-- 
"A mouse is a device used to point at the xterm you want to type in" - A.S.R.
Microsoft is to operating systems ....
                                      .... what McDonalds is to gourmet cooking
Home page: http://marc.merlins.org/  

  reply	other threads:[~2016-02-26  0:17 UTC|newest]

Thread overview: 28+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-02-15  6:04 echo dev > /sys/fs/bcache/register gives page allocation failure: order:4, mode:0x2040d0 Marc MERLIN
2016-02-15 12:02 ` Johannes Thumshirn
2016-02-15 15:32   ` Marc MERLIN
2016-02-15 15:45     ` Christoph Nelles
2016-02-23 16:32       ` Marc MERLIN
2016-02-23 20:57         ` Marc MERLIN
2016-02-24 20:45       ` BUG: drivers/md/bcache/writeback.c:237 Marc MERLIN
2016-02-25  0:58         ` Eric Wheeler
2016-02-25  6:41           ` Eric Wheeler
2016-02-25  7:36             ` Eric Wheeler
2016-02-25 10:08               ` Zhu Yanhai
2016-02-26  2:38                 ` Eric Wheeler
2016-02-26  2:46                   ` Marc MERLIN
2016-02-26  3:19                     ` Marc MERLIN
2016-02-26  4:55                       ` Eric Wheeler
2016-02-26 16:27                         ` Marc MERLIN
2016-02-26 21:17                           ` Eric Wheeler
2016-03-03  4:17                             ` Eric Wheeler
2016-03-03  4:25                               ` Marc MERLIN
2016-02-25 10:18         ` Zhu Yanhai
2016-02-25 15:20           ` Marc MERLIN
2016-02-25 23:44             ` Eric Wheeler
2016-02-26  0:17               ` Marc MERLIN [this message]
2016-02-15 12:11 ` echo dev > /sys/fs/bcache/register gives page allocation failure: order:4, mode:0x2040d0 Kent Overstreet
2016-02-24  6:53 ` Eric Wheeler
2016-02-24 16:37   ` Disabling bcache from boot when it crashes? Marc MERLIN
2016-02-24 19:10     ` Eric Wheeler
2016-02-25  5:48       ` Marc MERLIN

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20160226001713.GC29543@merlins.org \
    --to=marc@merlins.org \
    --cc=bcache@lists.ewheeler.net \
    --cc=evilazrael@evilazrael.de \
    --cc=linux-bcache@vger.kernel.org \
    --cc=zhu.yanhai@gmail.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.