* optimal value for blksize_size
@ 2002-11-27 23:24 Alex Ryan
2002-11-28 10:21 ` Jens Axboe
0 siblings, 1 reply; 2+ messages in thread
From: Alex Ryan @ 2002-11-27 23:24 UTC (permalink / raw)
To: linux-kernel
Hello,
I am writing a Linux block driver for our RAID firmware, and I am
very confused about blksize_size.
The documentation simply says that blksize_size should be the size
of the block used by the device in bytes.
Now, for my device(hard disk), the only restriction is that calls
must be a multiple of 512 bytes(1 sector).
I thought the natural choice for blksize_size would be 512, but I
saw that if I make it as 512 then the upper layer breaks up all
calls into buffer heads , each of size 512.
I think that is bad for sequential performance, even though my
device has scatter gather capability.
And if I make blksize_size of a higher value(e.g 4K), then the
upper layer gives calls of 4k size even for 512 byte reads.
Making blksize_size greater than PAGE_SIZE results in kernel
panic.
I am really very confused about what blksize_size really means,
and what should be an optimum value to put in there.
One more question about clustering:
All IO requests for consecutive sectors are clustered in the same
request structure, this much I understand. My question is, does
the b_data field of the corresponding bufferheads are also
sequential in the physical memory? In other words, can I satisfy a
request if I simply transfer req->nr_sectors amount of data to
req->buffer?
-Alex
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: optimal value for blksize_size
2002-11-27 23:24 optimal value for blksize_size Alex Ryan
@ 2002-11-28 10:21 ` Jens Axboe
0 siblings, 0 replies; 2+ messages in thread
From: Jens Axboe @ 2002-11-28 10:21 UTC (permalink / raw)
To: Alex Ryan; +Cc: linux-kernel
On Wed, Nov 27 2002, Alex Ryan wrote:
> Hello,
>
> I am writing a Linux block driver for our RAID firmware, and I am
> very confused about blksize_size.
>
> The documentation simply says that blksize_size should be the size
> of the block used by the device in bytes.
> Now, for my device(hard disk), the only restriction is that calls
> must be a multiple of 512 bytes(1 sector).
>
> I thought the natural choice for blksize_size would be 512, but I
> saw that if I make it as 512 then the upper layer breaks up all
> calls into buffer heads , each of size 512.
> I think that is bad for sequential performance, even though my
> device has scatter gather capability.
>
> And if I make blksize_size of a higher value(e.g 4K), then the
> upper layer gives calls of 4k size even for 512 byte reads.
>
> Making blksize_size greater than PAGE_SIZE results in kernel
> panic.
>
> I am really very confused about what blksize_size really means,
> and what should be an optimum value to put in there.
You should not put anything there, just leave room for someone else to
fill it in. It's the soft block size, and file systems typically set it
at mount time.
If 512b is the minimum request size you can do, you need to set
hardsect_size to that and trust that you wont get requests below that
size.
That's all you need to worry about.
> One more question about clustering:
> All IO requests for consecutive sectors are clustered in the same
> request structure, this much I understand. My question is, does
> the b_data field of the corresponding bufferheads are also
> sequential in the physical memory? In other words, can I satisfy a
> request if I simply transfer req->nr_sectors amount of data to
> req->buffer?
No this is very wrong. First of all, if you are setting up sg tables for
a request you never ever want to look at rq->buffer. Ever. You need to
loop through all the buffer heads attached to the request and setup an
sg entry for each of them.
A clustered request just means that it is contig on disk, not in memory.
Of course it can happen that some buffer_heads are also contig in memory
and you can coalesc these segments into one sg entry, but that all
depends on your hardware capabilities and you need to detect this
yourself.
BTW, I'm assuming a 2.4 kernel. In 2.5 this is all automated for you.
--
Jens Axboe
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2002-11-28 10:14 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-11-27 23:24 optimal value for blksize_size Alex Ryan
2002-11-28 10:21 ` Jens Axboe
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.