* IO merge in engine
@ 2015-10-02 14:04 Fabrice Bacchella
2015-10-02 15:59 ` Jens Axboe
0 siblings, 1 reply; 3+ messages in thread
From: Fabrice Bacchella @ 2015-10-02 14:04 UTC (permalink / raw)
To: fio
When writing my new hdfs engine, I met a problem with IO merge.
If I submit IO that are bigger that what the hdfs can manage, I return them to fio as incomplete IO, the net engine works the same way :
in engines/net.c, line 668+
io_u->resid = io_u->xfer_buflen - ret;
io_u->error = 0;
return FIO_Q_COMPLETED;
But the fio count them as two or more fast IO. I'm not sure it's a good measurement because if I'm simulating an application with fio, I expect to get the full IO latency and operations/s count. I don't mesure to mesure sub-io for that. For example, reducing the maximum transfer size will increase IO/s and reduce latency, even if the simulated application really sees reduced performance, because more of it's high level IO operation generated more real IO.
Is there a way to prevent that in fio, or is that up to my engine to manage that and merge IO ?
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: IO merge in engine
2015-10-02 14:04 IO merge in engine Fabrice Bacchella
@ 2015-10-02 15:59 ` Jens Axboe
2015-10-06 15:36 ` Fabrice Bacchella
0 siblings, 1 reply; 3+ messages in thread
From: Jens Axboe @ 2015-10-02 15:59 UTC (permalink / raw)
To: Fabrice Bacchella, fio
On 10/02/2015 04:04 PM, Fabrice Bacchella wrote:
> When writing my new hdfs engine, I met a problem with IO merge.
>
> If I submit IO that are bigger that what the hdfs can manage, I return them to fio as incomplete IO, the net engine works the same way :
> in engines/net.c, line 668+
> io_u->resid = io_u->xfer_buflen - ret;
> io_u->error = 0;
> return FIO_Q_COMPLETED;
>
> But the fio count them as two or more fast IO. I'm not sure it's a good measurement because if I'm simulating an application with fio, I expect to get the full IO latency and operations/s count. I don't mesure to mesure sub-io for that. For example, reducing the maximum transfer size will increase IO/s and reduce latency, even if the simulated application really sees reduced performance, because more of it's high level IO operation generated more real IO.
>
> Is there a way to prevent that in fio, or is that up to my engine to manage that and merge IO ?--
It sounds like a bug in the fio accounting, for the case of short
reads/writes. That doesn't happen very often elsewhere, so not
unreasonable to expect that is the case. Feel free to poke around and
figure it out. Let me know if that doesn't work out, and I'll take a
stab at fixing it up.
--
Jens Axboe
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: IO merge in engine
2015-10-02 15:59 ` Jens Axboe
@ 2015-10-06 15:36 ` Fabrice Bacchella
0 siblings, 0 replies; 3+ messages in thread
From: Fabrice Bacchella @ 2015-10-06 15:36 UTC (permalink / raw)
To: Jens Axboe; +Cc: fio
> Le 2 oct. 2015 à 17:59, Jens Axboe <axboe@kernel.dk> a écrit :
>
> On 10/02/2015 04:04 PM, Fabrice Bacchella wrote:
>> When writing my new hdfs engine, I met a problem with IO merge.
>>
>>
>> Is there a way to prevent that in fio, or is that up to my engine to manage that and merge IO ?--
>
> It sounds like a bug in the fio accounting, for the case of short reads/writes. That doesn't happen very often elsewhere, so not unreasonable to expect that is the case. Feel free to poke around and figure it out. Let me know if that doesn't work out, and I'll take a stab at fixing it up.
>
I'm using this simple fio script:
[global]
size=1m
ioengine=net
hostname=localhost
port=8765
filename=localhost,8765,tcp
[job1]
rw=write
bs=<something>
numjobs=1
And launching a fio listener with :
./fio --server
With ./fio --debug=io,file sample.fio and bs=654820, I got:
io 15106 ->prep(0x779f40)=0
io 15106 queue: io_u 0x779f40: off=0/len=654820/ddir=1/localhost,8765,tcp
io 15106 io complete: io_u 0x779f40: off=0/len=654820/ddir=1/localhost,8765,tcp
...
io 15705 fill_io_u: io_u 0x1926f40: off=654820/len=654820/ddir=1/localhost,8765,tcp
io 15705 prep: io_u 0x1926f40: off=654820/len=654820/ddir=1/localhost,8765,tcp
issued : total=r=0/w=2/d=0, short=r=0/w=0/d=0, drop=r=0/w=0/d=0
It fails for the second IO with
job1: (groupid=0, jobs=1): err=104 (file:engines/net.c:684, func=xfer, error=Connection reset by peer): pid=15106: Tue Oct 6 16:11:59 2015,
but that's not the problem. It does 2 654820 bytes IO, as requested
With bs=654821 and adding strace -f -e trace=sendto , I got:
io 15157 prep: io_u 0x1660f40: off=0/len=654821/ddir=1/localhost,8765,tcp
io 15157 ->prep(0x1660f40)=0
io 15157 queue: io_u 0x1660f40: off=0/len=654821/ddir=1/localhost,8765,tcp
[pid 15157] sendto(3, "\220\240@\6\371\341\277>\0\0\0\0\0\0\0\0\202\2\7\320\343\6H\26P\340\277\370\330\30e\30"..., 654821, MSG_MORE, NULL, 0) = 654820
io 15157 requeue 0x1660f40
io 15157 io_u_queued_completed: min=1
io 15157 getevents: 0
io 15157 prep: io_u 0x1660f40: off=654820/len=654821/ddir=1/localhost,8765,tcp
io 15157 ->prep(0x1660f40)=0
io 15157 queue: io_u 0x1660f40: off=654820/len=654821/ddir=1/localhost,8765,tcp
and it still failling, fio is doing another IO, at the good offset, but wrong length, I think it should now try to read 1 byte. It try a second IO but not to finish the first one.
With bs=1m, I got:
io 15205 prep: io_u 0x862f40: off=0/len=1048576/ddir=1/localhost,8765,tcp
io 15205 ->prep(0x862f40)=0
io 15205 queue: io_u 0x862f40: off=0/len=1048576/ddir=1/localhost,8765,tcp
[pid 15205] sendto(3, "\220\240@\6\371\341\277>\22\24\200\320\36y\313\26\202\2\7\320\343\6H\26P\340\277\370\330\30e\30"..., 1048576, 0, NULL, 0) = 654820
io 15205 requeue 0x862f40
io 15205 io_u_queued_completed: min=1
issued : total=r=0/w=1/d=0, short=r=0/w=1/d=0, drop=r=0/w=0/d=0
So only 654820 bytes are read, the IO is requeued, but not send.
For sequential IO, the result is not totally wrong and the net engine won't do random IO :
fio: network IO can't be random
But with my libhdfs engine, I'm getting wrong results, because it can do random IO and segmented random IO are very different than segmented serial IO. Other engine like rbd might have the same problem.
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2015-10-06 15:36 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2015-10-02 14:04 IO merge in engine Fabrice Bacchella
2015-10-02 15:59 ` Jens Axboe
2015-10-06 15:36 ` Fabrice Bacchella
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.