From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1JfHRB-0007pO-Qr for qemu-devel@nongnu.org; Fri, 28 Mar 2008 12:28:17 -0400 Received: from exim by lists.gnu.org with spam-scanned (Exim 4.43) id 1JfHRA-0007oj-Si for qemu-devel@nongnu.org; Fri, 28 Mar 2008 12:28:17 -0400 Received: from [199.232.76.173] (helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1JfHRA-0007od-Jz for qemu-devel@nongnu.org; Fri, 28 Mar 2008 12:28:16 -0400 Received: from kanga.kvack.org ([66.96.29.28]) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1JfHRA-0004K1-FS for qemu-devel@nongnu.org; Fri, 28 Mar 2008 12:28:16 -0400 Date: Fri, 28 Mar 2008 13:31:16 -0300 From: Marcelo Tosatti Subject: Re: [kvm-devel] [Qemu-devel] [PATCH] QEMU: fsync AIO writes on flush request Message-ID: <20080328163116.GA18853@dmt> References: <20080328150517.GA18077@dmt> <20080328150703.GA19624@shareable.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20080328150703.GA19624@shareable.org> Reply-To: qemu-devel@nongnu.org List-Id: qemu-devel.nongnu.org List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Jamie Lokier Cc: kvm-devel , qemu-devel@nongnu.org On Fri, Mar 28, 2008 at 03:07:03PM +0000, Jamie Lokier wrote: > Marcelo Tosatti wrote: > > Its necessary to guarantee that pending AIO writes have reached stable > > storage when the flush request returns. > > > > Also change fsync() to fdatasync(), since the modification time is not > > critical data. > > + if (aio_fsync(O_DSYNC, &acb->aiocb) < 0) { > > > BDRVRawState *s = bs->opaque; > > - fsync(s->fd); > > + raw_aio_flush(bs); > > + fdatasync(s->fd); > > + > > + /* We rely on the fact that no other AIO will be submitted > > + * in parallel, but this should be fixed by per-device > > + * AIO queues when allowing multiple CPU's to process IO > > + * in QEMU. > > + */ > > + qemu_aio_flush(); > > I'm a bit confused by this. Why do you need aio_fsync(O_DSYNC) _and_ > synchronous fdatasync() calls? Aren't they equivalent? fdatasync() will write and wait for completion of dirty file data present in memory. aio_write() only queues data for submission: The "asynchronous" means that this call returns as soon as the request has been enqueued; the write may or may not have completed when the call returns. One tests for completion using aio_error(3). So fdatasync() is not enough because data written via AIO may not have been reflected as "dirty file data" through write() by the time raw_flush() is called. The SCSI and IDE drivers use flush() in response to a "flush cache" request, which is used by the guest OS to implement barriers, for example.