From: Jens Axboe <axboe@suse.de>
To: Kai Makisara <Kai.Makisara@kolumbus.fi>
Cc: James Bottomley <James.Bottomley@SteelEye.com>,
Saeed Bishara <Saeed.Bishara@il.marvell.com>,
SCSI Mailing List <linux-scsi@vger.kernel.org>
Subject: Re: IO buffer alignment for block devices
Date: Mon, 13 Sep 2004 22:23:48 +0200 [thread overview]
Message-ID: <20040913202348.GH18883@suse.de> (raw)
In-Reply-To: <Pine.LNX.4.58.0409132217370.1972@kai.makisara.local>
On Mon, Sep 13 2004, Kai Makisara wrote:
> On Mon, 13 Sep 2004, Jens Axboe wrote:
>
> > On Mon, Sep 13 2004, Kai Makisara wrote:
> ...
> > > Any driver can change this restriction in slave_configure(). I have for
> > > some time had the following patch in my system to change the alignment for
> > > my tape drive to 8 bytes:
> >
> > I'm not sure 8 is a safe alignment at all, at least for ide-cd there
> > were cases that simply didn't work well with 8-byte alignment.
> >
> I agree. That is why I have not objected to the patch that changed the
> default back to 512. That is why I reviewed sym53c8xx_2 and, to me, it
> seemed safe to to try 8 byte alignment with this driver. The proper way is
> to have safe default and let the LLDDs relax the alignment. The only
> practical problem is how to get this done but we probably have to live
> with this. Besides we were talking about SCSI.
This seemed to be a device problem, not an adapter problem.
I'm curious where the alignment is really documented, seems to be we are
stabbing in the dark.
> > > --- linux-2.6.9-rc1-bk1/drivers/scsi/sym53c8xx_2/sym_glue.c~ 2004-08-27 19:33:34.000000000 +0300
> > > +++ linux-2.6.9-rc1-bk1/drivers/scsi/sym53c8xx_2/sym_glue.c 2004-08-28 12:54:49.000000000 +0300
> > > @@ -1097,6 +1097,8 @@
> > >
> > > spi_dv_device(device);
> > >
> > > + blk_queue_dma_alignment(device->request_queue, 7);
> > > +
> > > return 0;
> > > }
> >
> > Why don't you just do this in st? Not that it matters, since it's just
> > your private hack - just wondering.
> >
> Because the high-level driver should not guess what the low-level driver
> requirements are.
>
> > And finally (and most importantly), why do you care so much?
>
> I do care because the non-bounced transfers give measurable decrease in
> cpu time and bus use. And I don't like the situation where portable
> software has to have special alignment code just for Linux when there is
> no real technical reason for that. In practice, this code often does not
> exist. Then the question is, why does writing to tape in Linux use, say,
> 30 % CPU time when the same thing in xxx Unix uses 1 % CPU time.
Please share how you arrived at those numbers, if you get that bad
performance you are doing something horribly wrong elsewhere. Doing
50MiB/sec single transfer from a hard drive with either bio_map_user()
or bio_copy_user() backing shows at most 1% sys time difference on a
dual P3-800MHz. How fast are these tape drives?
There's no questioning that bio_map_user() will be faster for larger
transfers (4kb and up, I'm guessing), but there's no way that you can
claim 30% sys time for a tape drive without backing that up. Where was
this time spent?
> > rip some of that manual data mapping out of st completely. Add a
> > scsi_rq_map_user or something that returns a struct scsi_request, and
> > uses blk_rq_map_user (which will do bio_map_user() or bio_copy_user())
> > to map the user data into your SCSI request. I seriously doubt that st
> > will see any performance difference between the two at all, and st
> > really should be using proper infrastructure for this. sg as well, but
> > since it's dying anyways...
> >
> The manual data mapping exists in st and sg because when the code was
> made, nothing better was available. It was meant to be a proof of concept
> and it still is. However, it has at least one good property: it works.
Nothing as permanent as temporary driver code :)
> I don't think you can get around the alignment by using bio_map_user(): it
> tests alignment for both the buffer start and the transfer length. So,
> converting the data mapping does not change the alignment handling which
> is what we were talking about.
Of course not, bio_map_user() data must be aligned, that's the whole
premise of if. But bio_copy_data() will align the data for you. Lookup
blk_rq_map_user() like I described.
> I have sometimes looked at the bio code and thought about changing to use
> it. The problem has been that it is too limited for use in st. It has
> become better but it still is not good enough to replace both the data
> mapping and the driver's buffering. A quick look through the code suggests
> that it still can't handle big requests (up to 6 MB). Please tell me if I
> am wrong.
I don't think you should. As I wrote, write a scsi mid layer helper that
will map the data for you into a scsi structure. I think it's a mistake
that st/sg attempts to handle this business themselves.
You can't map 6MB into a single bio, but you can string them together if
you want such a big request.
--
Jens Axboe
next prev parent reply other threads:[~2004-09-13 20:24 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2004-09-13 12:48 IO buffer alignment for block devices Saeed Bishara
2004-09-13 14:14 ` James Bottomley
2004-09-13 15:16 ` Saeed Bishara
2004-09-13 15:28 ` James Bottomley
2004-09-13 17:12 ` Kai Makisara
2004-09-13 19:02 ` Jens Axboe
2004-09-13 19:56 ` Kai Makisara
2004-09-13 20:23 ` Jens Axboe [this message]
2004-09-13 22:08 ` Kai Makisara
2004-09-14 7:44 ` Jens Axboe
2004-09-14 17:39 ` Kai Makisara
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20040913202348.GH18883@suse.de \
--to=axboe@suse.de \
--cc=James.Bottomley@SteelEye.com \
--cc=Kai.Makisara@kolumbus.fi \
--cc=Saeed.Bishara@il.marvell.com \
--cc=linux-scsi@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox