* Re: [Qemu-devel] [Nbd] NBD_CMD_DISC [not found] ` <56CF18E3-554D-4CF2-9DC3-6831D739A846@alex.org.uk> @ 2016-04-09 23:23 ` Eric Blake 0 siblings, 0 replies; 3+ messages in thread From: Eric Blake @ 2016-04-09 23:23 UTC (permalink / raw) To: Alex Bligh Cc: Wouter Verhelst, nbd-general@lists.sourceforge.net, qemu-devel@nongnu.org, Daniel P. Berrange [-- Attachment #1: Type: text/plain, Size: 901 bytes --] [adding qemu list and Dan into the mix] On 04/09/2016 05:02 PM, Alex Bligh wrote: > > On 9 Apr 2016, at 22:12, Eric Blake <eblake@redhat.com> wrote: > >>> How would the client know that? I'm using Go's TLS library, and there is >>> no way (as far as I can tell) to ensure that. >> >> Likewise - if it's qemu's fault for not flushing the outgoing queue, >> then what's the right way to get that NBD_CMD_DISC flushed? > > You use GnuTLS. Having just (tonight) written something with > GnuTLS, I note that > gnutls_bye > is not being called (in qemu) before the connection is closed > (indeed it's not being called anywhere in qemu). Fixing that > might help. Dan, sounds like your QIOChannel code should start using gnutls_bye(), as something to get into 2.6... -- Eric Blake eblake redhat com +1-919-301-3266 Libvirt virtualization library http://libvirt.org [-- Attachment #2: OpenPGP digital signature --] [-- Type: application/pgp-signature, Size: 604 bytes --] ^ permalink raw reply [flat|nested] 3+ messages in thread
[parent not found: <20160409102833.GL19023@grep.be>]
[parent not found: <FDFBD171-A214-4780-B3E8-7E25FB41FACB@alex.org.uk>]
[parent not found: <20160409113225.GR19023@grep.be>]
[parent not found: <F405A775-6A76-4776-9182-2431293A3A60@alex.org.uk>]
[parent not found: <20160410082710.GA28660@grep.be>]
[parent not found: <91E6DE09-E8F3-489E-B1F2-B3ECC358BB75@alex.org.uk>]
[parent not found: <20160412094820.GA8122@redhat.com>]
* Re: [Qemu-devel] [Nbd] NBD_CMD_DISC [not found] ` <20160412094820.GA8122@redhat.com> @ 2016-04-12 10:34 ` Daniel P. Berrange 2016-04-12 11:14 ` Alex Bligh 0 siblings, 1 reply; 3+ messages in thread From: Daniel P. Berrange @ 2016-04-12 10:34 UTC (permalink / raw) To: Alex Bligh; +Cc: Wouter Verhelst, nbd-general@lists.sourceforge.net, qemu-devel On Tue, Apr 12, 2016 at 10:48:20AM +0100, Daniel P. Berrange wrote: > On Sun, Apr 10, 2016 at 10:49:00AM +0100, Alex Bligh wrote: > > (Daniel: if you want to replicate the issue, just run qemu-img info > > against my gonbdserver with TLS. Every fifth NBD_CMD_DISC doesn't > > get through before the TCP session closes). > > Hmm, I'll have a look at this - I'm not sure its caused by > the lack of gnutls_bye, as opposed to incorrect handling > of EAGAIN when the block layer sends CMD_DISC I have tried to reproduce with your server yet, but I did look at the code and identified one potential flaw that might cause the behaviour you mention. Can you try testing with the following change applied to QEMU: diff --git a/nbd/common.c b/nbd/common.c index 8ddb2dd..47757b6 100644 --- a/nbd/common.c +++ b/nbd/common.c @@ -50,7 +50,7 @@ ssize_t nbd_wr_syncv(QIOChannel *ioc, * qio_channel_yield() that works with AIO contexts * and consider using that in this branch */ qemu_coroutine_yield(); - } else if (done) { + } else if (done || !do_read) { /* XXX this is needed by nbd_reply_ready. */ qio_channel_wait(ioc, do_read ? G_IO_IN : G_IO_OUT); The current code will cause it to silently not send CMD_DISC if the socket returns EAGAIN on send(). This change will cause it to do a poll to wait and retry the send(). That said I'd be pretty surprised if we do actually get EAGAIN in this scenario, as I'd expect the outgoing TCP buffers to be empty at the point in which we close the client connection, but this fix is worth a try. Regards, Daniel -- |: http://berrange.com -o- http://www.flickr.com/photos/dberrange/ :| |: http://libvirt.org -o- http://virt-manager.org :| |: http://autobuild.org -o- http://search.cpan.org/~danberr/ :| |: http://entangle-photo.org -o- http://live.gnome.org/gtk-vnc :| ^ permalink raw reply related [flat|nested] 3+ messages in thread
* Re: [Qemu-devel] [Nbd] NBD_CMD_DISC 2016-04-12 10:34 ` Daniel P. Berrange @ 2016-04-12 11:14 ` Alex Bligh 0 siblings, 0 replies; 3+ messages in thread From: Alex Bligh @ 2016-04-12 11:14 UTC (permalink / raw) To: Daniel P. Berrange Cc: Alex Bligh, Wouter Verhelst, nbd-general@lists.sourceforge.net, qemu-devel@nongnu.org On 12 Apr 2016, at 11:34, Daniel P. Berrange <berrange@redhat.com> wrote: > On Tue, Apr 12, 2016 at 10:48:20AM +0100, Daniel P. Berrange wrote: >> On Sun, Apr 10, 2016 at 10:49:00AM +0100, Alex Bligh wrote: >>> (Daniel: if you want to replicate the issue, just run qemu-img info >>> against my gonbdserver with TLS. Every fifth NBD_CMD_DISC doesn't >>> get through before the TCP session closes). >> >> Hmm, I'll have a look at this - I'm not sure its caused by >> the lack of gnutls_bye, as opposed to incorrect handling >> of EAGAIN when the block layer sends CMD_DISC > > I have tried to reproduce with your server yet, but I did look at the > code and identified one potential flaw that might cause the behaviour > you mention. Can you try testing with the following change applied > to QEMU: > > diff --git a/nbd/common.c b/nbd/common.c > index 8ddb2dd..47757b6 100644 > --- a/nbd/common.c > +++ b/nbd/common.c > @@ -50,7 +50,7 @@ ssize_t nbd_wr_syncv(QIOChannel *ioc, > * qio_channel_yield() that works with AIO contexts > * and consider using that in this branch */ > qemu_coroutine_yield(); > - } else if (done) { > + } else if (done || !do_read) { > /* XXX this is needed by nbd_reply_ready. */ > qio_channel_wait(ioc, > do_read ? G_IO_IN : G_IO_OUT); > > > The current code will cause it to silently not send CMD_DISC if the socket > returns EAGAIN on send(). This change will cause it to do a poll to wait > and retry the send(). That said I'd be pretty surprised if we do actually > get EAGAIN in this scenario, as I'd expect the outgoing TCP buffers to be > empty at the point in which we close the client connection, but this fix > is worth a try. Well, what with pulling to the latest qemu master I now can't reproduce the original problem against my server :-/ However, I have verified that against my server it works with your patch (as well as without). I have verified each time that I am receiving NBD_CMD_DISC. It's also now working (with and without your patch) against nbd-server.c (reference implementation) with my GnuTLS patches. This should be exactly the same as before. I suspect the issue may be timing related. Unless there's something else fixed in master, the main change I've made is rebooted my MBP, which was running very slowly. I think you should probably still do a gnutls_bye() though. -- Alex Bligh ^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2016-04-12 11:14 UTC | newest] Thread overview: 3+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- [not found] <D4F43553-381E-42EE-838E-AC8729E3D356@alex.org.uk> [not found] ` <5705727B.1000802@redhat.com> [not found] ` <20160409091634.GB19023@grep.be> [not found] ` <C0E9BDD0-C8B6-412E-9835-A696A1279935@alex.org.uk> [not found] ` <57097020.7030003@redhat.com> [not found] ` <56CF18E3-554D-4CF2-9DC3-6831D739A846@alex.org.uk> 2016-04-09 23:23 ` [Qemu-devel] [Nbd] NBD_CMD_DISC Eric Blake [not found] ` <20160409102833.GL19023@grep.be> [not found] ` <FDFBD171-A214-4780-B3E8-7E25FB41FACB@alex.org.uk> [not found] ` <20160409113225.GR19023@grep.be> [not found] ` <F405A775-6A76-4776-9182-2431293A3A60@alex.org.uk> [not found] ` <20160410082710.GA28660@grep.be> [not found] ` <91E6DE09-E8F3-489E-B1F2-B3ECC358BB75@alex.org.uk> [not found] ` <20160412094820.GA8122@redhat.com> 2016-04-12 10:34 ` Daniel P. Berrange 2016-04-12 11:14 ` Alex Bligh
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).