From: James Bottomley <James.Bottomley@HansenPartnership.com>
To: Jens Axboe <jens.axboe@oracle.com>
Cc: David Woodhouse <dwmw2@infradead.org>,
Matthew Wilcox <matthew@wil.cx>, Tejun Heo <teheo@suse.de>,
Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
Nick Piggin <npiggin@suse.de>,
IDE/ATA development list <linux-ide@vger.kernel.org>,
Jeff Garzik <jeff@garzik.org>, Dongjun Shin <djshin90@gmail.com>,
chris.mason@oracle.com
Subject: Re: about TRIM/DISCARD support and barriers
Date: Mon, 24 Nov 2008 14:08:23 -0500 [thread overview]
Message-ID: <1227553703.25499.40.camel@localhost.localdomain> (raw)
In-Reply-To: <20081124185747.GN26308@kernel.dk>
On Mon, 2008-11-24 at 19:57 +0100, Jens Axboe wrote:
> On Mon, Nov 24 2008, David Woodhouse wrote:
> > On Mon, 2008-11-24 at 13:42 -0500, James Bottomley wrote:
> > > On Mon, 2008-11-24 at 09:03 +0000, David Woodhouse wrote:
> > > > On Mon, 2008-11-24 at 07:52 +0900, James Bottomley wrote:
> > > > > On Sun, 2008-11-23 at 13:39 +0000, David Woodhouse wrote:
> > > > > > > We don't attempt to put non-contiguous ranges into a single TRIM yet.
> > > > > >
> > > > > > We don't even merge contiguous ranges -- I still need to fix the
> > > > > > elevators to stop writes crossing writes,
> > > > >
> > > > > I don't think we want to do that ... it's legal if the write isn't a
> > > > > barrier and it will inhibit merging. That may be just fine for a SSD,
> > > > > but it's not for spinning media since they get better performance out of
> > > > > merged writes.
> > > >
> > > > No, I just mean writes _to the same sector_. At the moment, we happily
> > > > let those cross each other in the queue.
> > ...
> > > It's not a bug ... but changing it might be feasible ... as long as it
> > > doesn't affect write performance too much (which I don't think it will),
> > > since it is in the critical path.
> >
> > We could argue about how much sense it makes to let two writes to the
> > same sector actually happen in reverse order.
> >
> > Especially given the fact that we actually _do_ preserve ordering in
> > some cases; just not in others. (We preserve ordering only if the start
> > and end of the duplicate writes are _precisely_ matching; if it's just
> > overlapping (which may well happen in the presence of merges), then this
> > check doesn't trigger.
> >
> > But that's just semantics. Yes, changing it should be feasible. I talked
> > to Jens about that at the kernel summit, and we agreed that it should
> > probably be done.
>
> The way this currently works is that we sort based on the first sector
> in the request. So if you have have an overlap condition between rq1 and
> rq2 and then a write gets merged into rq1, then you may have passing
> writes. Linux has never guarenteed any write ordering for non-barriers,
> so we've never attempted to handle it. Direct aliases (matching first
> sectors) are handled as you mention, but that's more of an algorithmic
> thing than by design.
>
> My main worry is that this will add considerable overhead to request
> sorting. For the rbtree based sorting, we'd have to do a rb_next/rb_prev
> to look at adjacent requests. For CFQ it's even worse, since there's no
> per-queue big rbtree for sorting.
Which is why I suggest special casing: Only invoke the expensive
overlap checking if one of the requests is a discard. Otherwise use the
standard path for writes.
James
next prev parent reply other threads:[~2008-11-24 19:08 UTC|newest]
Thread overview: 24+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-11-23 4:46 about TRIM/DISCARD support and barriers Tejun Heo
2008-11-23 4:46 ` Tejun Heo
2008-11-23 7:11 ` Tejun Heo
2008-11-23 7:11 ` Tejun Heo
2008-11-23 7:57 ` Tejun Heo
2008-11-23 7:57 ` Tejun Heo
2008-11-24 5:40 ` Dongjun Shin
2008-11-24 5:45 ` Tejun Heo
2008-11-24 5:57 ` James Bottomley
2008-11-23 12:35 ` Matthew Wilcox
2008-11-23 13:39 ` David Woodhouse
2008-11-23 22:52 ` James Bottomley
2008-11-24 9:03 ` David Woodhouse
2008-11-24 18:42 ` James Bottomley
2008-11-24 18:52 ` David Woodhouse
2008-11-24 18:57 ` Jens Axboe
2008-11-24 19:08 ` James Bottomley [this message]
2008-11-25 9:16 ` Jens Axboe
2008-11-24 19:09 ` James Bottomley
2008-11-25 3:28 ` Matthew Wilcox
2008-11-25 9:15 ` Jens Axboe
2008-11-24 3:01 ` Theodore Tso
2008-11-28 13:21 ` Raz Ben-Yehuda
2008-11-29 22:57 ` Tejun Heo
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1227553703.25499.40.camel@localhost.localdomain \
--to=james.bottomley@hansenpartnership.com \
--cc=chris.mason@oracle.com \
--cc=djshin90@gmail.com \
--cc=dwmw2@infradead.org \
--cc=jeff@garzik.org \
--cc=jens.axboe@oracle.com \
--cc=linux-ide@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=matthew@wil.cx \
--cc=npiggin@suse.de \
--cc=teheo@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.