From: Mike Snitzer <snitzer@redhat.com>
To: Mikulas Patocka <mpatocka@redhat.com>
Cc: dm-devel@redhat.com, Alasdair G Kergon <agk@redhat.com>,
linux-fsdevel@vger.kernel.org, linux-ext4@vger.kernel.org,
axboe@kernel.dk, dmonakhov@openvz.org
Subject: Re: [PATCH 4/4] Support discard if at least one underlying device supports it
Date: Fri, 2 Jul 2010 16:47:09 -0400 [thread overview]
Message-ID: <20100702204709.GB21915@redhat.com> (raw)
In-Reply-To: <Pine.LNX.4.64.1007021554550.11102@hs20-bc2-1.build.redhat.com>
On Fri, Jul 02 2010 at 4:00pm -0400,
Mikulas Patocka <mpatocka@redhat.com> wrote:
>
>
> On Fri, 2 Jul 2010, Mikulas Patocka wrote:
>
> > > As we discussed, we have a challenge where we need DM to avoid issuing
> > > a barrier before the discard IFF a target doesn't support the discard
> > > (which the barrier is paired with).
> > >
> > > My understanding is that blkdev_issue_discard() only cares if the
> > > discard was supported. Barrier is used just to decorate the discard
> > > (for correctness). So by returning -EOPNOTSUPP we're saying the discard
> > > isn't supported; we're not making any claims about the implict barrier,
> > > so best to avoid the barrier entirely.
> > >
> > > Otherwise we'll be issuing unnecessary barriers (and associated
> > > performance loss).
> > >
> > > So yet another TODO item... Anyway:
> > >
> > > Acked-by: Mike Snitzer <snitzer@redhat.com>
> >
> > Unnecessary barriers are issued anyway. With each freed extent.
> >
> > The code must issue a "SYNCHRONIZE CACHE" to flush cache for previous
> > writes, then "UNMAP" and then another "SYNCHRONIZE CACHE" to commit that
> > unmap to disk. And this in loop for all extents in
> > "release_blocks_on_commit".
> >
> > One idea behind "discard barriers" was to submit a discard request and not
> > wait for it. Then the request would need a barrier so that it doesn't get
> > reordered with further writes (that may potentially write to the same area
> > as the discarded area). But discard isn't used this way anyway,
> > sb_issue_discard waits for completion, so the barrier isn't needed.
> >
> > Even if ext4 developers wanted asynchronous discard requests, they should
> > fire all the discards at once and then submit one zero-sized barrier. Not
> > barrier with each discard request.
> >
> > This is up to ext4 developers to optimize and remove the barriers and we
> > can't do anything with it. Just send "SYNCHRONIZE
> > CACHE"+"UNMAP"+"SYNCHRONIZE CACHE" like the barrier specification wants...
> >
> > Mikulas
>
> BTW. I understand that the current dm implementation will send two useless
> consecutive "SYNCHRONIZE CACHE" commands discard is directed to the part
> of the device that doesn't support it.
Issue 1 ^^^
> But the problem is that when you use discard on a part of the device that
> supports discard, it also sends two useless "SYNCHRONIZE CACHE" commands
> --- they are useless for functionality, but mandated by the barrier
> specification.
Issue 2 ^^^
Those are 2 different issues. Please don't join them as if they are one
in the same. DM should treat a discard as a first class request (which
may or may not have a barrier). If a region doesn't support the discard
DM has no business processing anything related to the discard (barriers
included). It is as simple as that.
> The fix is supposedly this:
>
> ---
> include/linux/blkdev.h | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> Index: linux-2.6.35-rc3-fast/include/linux/blkdev.h
> ===================================================================
> --- linux-2.6.35-rc3-fast.orig/include/linux/blkdev.h 2010-07-02 21:59:21.000000000 +0200
> +++ linux-2.6.35-rc3-fast/include/linux/blkdev.h 2010-07-02 21:59:37.000000000 +0200
> @@ -1021,7 +1021,7 @@ static inline int sb_issue_discard(struc
> block <<= (sb->s_blocksize_bits - 9);
> nr_blocks <<= (sb->s_blocksize_bits - 9);
> return blkdev_issue_discard(sb->s_bdev, block, nr_blocks, GFP_KERNEL,
> - BLKDEV_IFL_WAIT | BLKDEV_IFL_BARRIER);
> + BLKDEV_IFL_WAIT);
> }
>
> extern int blk_verify_command(unsigned char *cmd, fmode_t has_write_perm);
Hmm, older kernels use DISCARD_FL_BARRIER which merely mapped to
BLKDEV_IFL_BARRIER.
Seems you've stumbled onto a bug in the conversion that commit
"blkdev: generalize flags for blkdev_issue_fn functions"
(fbd9b09a177a481eda) performed?
That commit seems to have incorrectly replaced DISCARD_FL_BARRIER with
both: BLKDEV_IFL_WAIT | BLKDEV_IFL_BARRIER
Dmitry and/or Jens was this intended?
Mike
next prev parent reply other threads:[~2010-07-02 20:47 UTC|newest]
Thread overview: 26+ messages / expand[flat|nested] mbox.gz Atom feed top
2010-07-02 15:12 [PATCH 0/4] discard updates Mikulas Patocka
2010-07-02 15:17 ` [PATCH 1/4] Check that the target supports discard Mikulas Patocka
2010-07-02 15:18 ` [PATCH 2/4] Clear the discard flag if the device loses discard capability Mikulas Patocka
2010-07-02 15:19 ` [PATCH 3/4] Support discard for multiple devices Mikulas Patocka
2010-07-02 15:19 ` [PATCH 4/4] Support discard if at least one underlying device supports it Mikulas Patocka
2010-07-02 18:14 ` Mike Snitzer
2010-07-02 19:49 ` Mikulas Patocka
2010-07-02 20:00 ` Mikulas Patocka
2010-07-02 20:08 ` GFP_KERNEL in ext4 (was: [PATCH 4/4] Support discard if at least one underlying device supports it) Mikulas Patocka
2010-07-06 16:11 ` [PATCH] disallow FS recursion from sb_issue_discard allocation Mike Snitzer
2010-07-27 13:44 ` Ted Ts'o
2010-07-27 15:33 ` Mike Snitzer
2010-07-28 18:12 ` Jens Axboe
2010-07-28 23:15 ` Ted Ts'o
2010-07-02 20:47 ` Mike Snitzer [this message]
2010-07-02 20:54 ` [PATCH 4/4] Support discard if at least one underlying device supports it Alasdair G Kergon
2010-07-05 7:03 ` Dmitry Monakhov
2010-07-05 11:32 ` Mikulas Patocka
2010-07-02 20:29 ` Mike Snitzer
2010-07-08 0:07 ` Alasdair G Kergon
2010-07-02 17:50 ` [PATCH 3/4] Support discard for multiple devices Mike Snitzer
2010-07-07 23:23 ` Alasdair G Kergon
2010-07-02 17:51 ` [PATCH 2/4] Clear the discard flag if the device loses discard capability Mike Snitzer
2010-07-07 23:18 ` Alasdair G Kergon
2010-07-02 18:05 ` [PATCH 1/4] Check that the target supports discard Mike Snitzer
2010-07-07 23:14 ` Alasdair G Kergon
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20100702204709.GB21915@redhat.com \
--to=snitzer@redhat.com \
--cc=agk@redhat.com \
--cc=axboe@kernel.dk \
--cc=dm-devel@redhat.com \
--cc=dmonakhov@openvz.org \
--cc=linux-ext4@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=mpatocka@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.