From mboxrd@z Thu Jan 1 00:00:00 1970
From: Mark Kirkwood
Subject: Re: fio rbd hang for block sizes > 1M
Date: Sat, 25 Oct 2014 11:45:17 +1300
Message-ID: <544AD67D.4030603@catalyst.net.nz>
References: <5449BBB3.7090109@catalyst.net.nz> <5449E50E.7000808@kernel.dk> <5449EEF1.1060407@catalyst.net.nz> <544A51C7.40803@gmail.com> <544A5DA6.2010709@gmail.com>
Mime-Version: 1.0
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 7bit
Return-path:
In-Reply-To: <544A5DA6.2010709-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Sender: fio-owner-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
To: Mark Nelson, Jens Axboe, fio-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Cc: "d.gollub-+tb+GG71Y8CELgA04lAiVw@public.gmane.org >> Daniel Gollub", "xan.peng", "ceph-devel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org"
List-Id: ceph-devel.vger.kernel.org

Interestingly, I first encountered this on (what I think is) 0.86
release (0.86-1precise). I wonder if you had a bigger rbd cache on the
release cluster you tested? As mentioned in the same named thread on
-users, disabling the rbd cache stops the hang.

Regards

Mark

On 25/10/14 03:09, Mark Nelson wrote:
> More info:
>
> I went back and tested fio versions back to 2.1.10 and still encountered
> the issue. I then went back and tested the v0.86 release versus giant
> and was able to get through a 4MB read test without error. I suspect
> this is not an fio problem. I'll try to narrow down the commit after
> 0.86 that is causing this.
>
> Mark
>
> On 10/24/2014 08:19 AM, Mark Nelson wrote:
>> FWIW we are seeing this at Redhat/Inktank with recent fio from master
>> and ceph giant branch as well.
>>
>> Mark
>>
>> On 10/24/2014 01:17 AM, Mark Kirkwood wrote:
>>> On 24/10/14 18:35, Jens Axboe wrote:
>>>> CC'ing relevant parties, leaving email intact.
>>>>
>>>
>>> Note that the 'Killed' is because I killed the run - it hangs and
>>> appears to be non interruptable. I missed that when pasting, sorry!
>>>
>>>
>>>>> $ fio read-test.fio # attached
>>>>> rbd_thread: (g=0): rw=read, bs=2M-2M/2M-2M/2M-2M, ioengine=rbd,
>>>>> iodepth=32
>>>>> fio-2.1.13-88-gb2ee7
>>>>> Starting 1 process
>>>>> rbd engine: RBD version: 0.1.8
>>>>> Killed1 (f=1): [R(1)] [inf% done] [0KB/0KB/0KB /s] [0/0/0 iops] [eta
>>>>> 1158050441d:06h:59m:33s]
>>>
>>> --
>>> To unsubscribe from this list: send the line "unsubscribe fio" in
>>> the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
>>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>>
>
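
For anyone trying to reproduce the hang discussed above, here is a rough sketch of how the two inputs might be generated. The job name, rw=read, bs=2M, ioengine=rbd and iodepth=32 are taken from the fio output quoted in the thread. The attached read-test.fio is not part of this message, so the clientname, pool and rbdname values below are placeholders (assumptions), as are the helper names fio_job and ceph_client_conf. The "rbd cache" setting is the Ceph client option referred to as the workaround.

```python
def fio_job(block_size: str) -> str:
    """Return fio job-file text for a sequential read through the rbd engine."""
    options = {
        "ioengine": "rbd",
        "clientname": "admin",   # assumption: placeholder Ceph client name
        "pool": "rbd",           # assumption: placeholder pool name
        "rbdname": "fio_test",   # assumption: placeholder image name
        "rw": "read",
        "bs": block_size,        # the thread reports hangs for sizes > 1M
        "iodepth": "32",
    }
    lines = ["[rbd_thread]"] + [f"{key}={value}" for key, value in options.items()]
    return "\n".join(lines) + "\n"


def ceph_client_conf(rbd_cache: bool) -> str:
    """Return a ceph.conf [client] fragment that switches the rbd cache on or off."""
    value = "true" if rbd_cache else "false"
    return f"[client]\nrbd cache = {value}\n"


if __name__ == "__main__":
    # Job file resembling the run quoted in the thread (bs=2M).
    print(fio_job("2M"))
    # Workaround reported in the thread: run with the rbd cache disabled.
    print(ceph_client_conf(rbd_cache=False))
```

Saving the first string as a job file and the second into the client's ceph.conf should give a setup along the lines of the one described, but the exact job used in the original report may differ.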