From: "Kani, Toshimitsu" <toshi.kani@hpe.com>
To: "dan.j.williams@intel.com" <dan.j.williams@intel.com>,
	"vishal.l.verma@intel.com" <vishal.l.verma@intel.com>
Cc: "linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
	"linux-nvdimm@lists.01.org" <linux-nvdimm@lists.01.org>
Subject: Re: [PATCH] libnvdimm: rework region badblocks clearing
Date: Mon, 1 May 2017 16:45:37 +0000
Message-ID: <1493657135.30303.22.camel@hpe.com>
In-Reply-To: <1493656965.4424.0.camel@intel.com>

On Mon, 2017-05-01 at 16:42 +0000, Verma, Vishal L wrote:
> On Mon, 2017-05-01 at 09:38 -0700, Dan Williams wrote:
> > On Mon, May 1, 2017 at 9:20 AM, Kani, Toshimitsu
> > <toshi.kani@hpe.com> wrote:
> > > > > > > > Hi Dan,
> > > > > > > > 
> > > > > > > > I was testing the change with CONFIG_DEBUG_ATOMIC_SLEEP
> > > > > > > > set this time, and hit the following BUG with
> > > > > > > > BTT.  This is a separate issue (not introduced by this
> > > > > > > > patch), but it shows that we have an issue with the DSM
> > > > > > > > call path as well.
> > > > > > > 
> > > > > > > Ah, great find, thanks! We don't see this in the unit
> > > > > > > tests because the nfit_test infrastructure takes no
> > > > > > > sleeping actions in its simulated DSM path. Outside of
> > > > > > > converting btt to use sleeping locks I'm not sure I see a
> > > > > > > path forward. I wonder how bad the performance impact of
> > > > > > > that would be? Perhaps with opportunistic spinning it
> > > > > > > won't be so bad, but I don't see another choice.
> > > > > > 
> > > > > > It's worse than that. Part of the performance optimization
> > > > > > of BTT I/O was to avoid locking altogether when we could
> > > > > > rely on a BTT lane percpu, so that would also need to be
> > > > > > removed.
> > > > > 
> > > > > I do not have a good idea either, but I'd rather disable this
> > > > > clearing in the regular BTT write path than adding sleeping
> > > > > locks to BTT. Clearing a bad block in the BTT write path is
> > > > > difficult/challenging since it allocates a new block.
> > > > 
> > > > Actually, that may make things easier. Can we teach BTT to
> > > > track error blocks and clear them before they are reassigned?
> > > 
> > > I was thinking the same after sending it.  I think we should be
> > > able to do that.
> > 
> > Ok, but we obviously can't develop something that detailed while
> > the merge window is open, so I think that means we need to revert
> > commit e88da7998d7d "Revert 'libnvdimm: band aid btt vs clear
> > poison locking'" and leave BTT I/O-error-clearing disabled for this
> > cycle and try again for 4.13.
> 
> Agreed, I'll work on something to track badblocks and clear them
> outside the IO path.

Great!  Thanks Vishal!
-Toshi
_______________________________________________
Linux-nvdimm mailing list
Linux-nvdimm@lists.01.org
https://lists.01.org/mailman/listinfo/linux-nvdimm
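
Editorial sketch of the idea agreed on above (track error blocks in the
I/O path, clear them later from a context that may sleep). This is a
userspace illustration only, not the BTT implementation: the names
pending_errors, note_error_block, has_pending_error, clear_pending_errors
and fake_clear_poison are all hypothetical, and the fixed-size array and
the absence of locking are simplifying assumptions.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

#define PENDING_MAX 64 /* hypothetical capacity, chosen for the sketch */

/* Blocks that hit an I/O error and still need their poison cleared. */
struct pending_errors {
	uint64_t block[PENDING_MAX];
	size_t count;
};

/*
 * I/O path side: only records the block number and never calls the
 * (possibly sleeping) clear operation. Returns false if the table is full.
 * Real kernel code would also need synchronization that is safe in
 * atomic context; that is omitted here.
 */
static bool note_error_block(struct pending_errors *p, uint64_t blk)
{
	for (size_t i = 0; i < p->count; i++)
		if (p->block[i] == blk)
			return true; /* already tracked */
	if (p->count == PENDING_MAX)
		return false;
	p->block[p->count++] = blk;
	return true;
}

/* True if blk is tracked and must not be reassigned before it is cleared. */
static bool has_pending_error(const struct pending_errors *p, uint64_t blk)
{
	for (size_t i = 0; i < p->count; i++)
		if (p->block[i] == blk)
			return true;
	return false;
}

/* Stand-in for the clear-error call; returns 0 on success. */
typedef int (*clear_fn)(uint64_t blk);

/*
 * Deferred side: runs outside the I/O path, where sleeping is allowed.
 * Blocks that fail to clear stay tracked. Returns the number cleared.
 */
static size_t clear_pending_errors(struct pending_errors *p, clear_fn clear)
{
	size_t kept = 0, cleared = 0;

	assert(p->count <= PENDING_MAX);
	for (size_t i = 0; i < p->count; i++) {
		if (clear(p->block[i]) == 0)
			cleared++;
		else
			p->block[kept++] = p->block[i];
	}
	p->count = kept;
	return cleared;
}

/* Fake clear operation for demonstration: block 7 always fails. */
static int fake_clear_poison(uint64_t blk)
{
	return blk == 7 ? -1 : 0;
}
```

In this sketch a write path would call note_error_block() when it sees an
error, an allocator would check has_pending_error() before handing a block
out again, and clear_pending_errors() would run from a worker that is
allowed to sleep.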
