* Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) @ 2016-07-07 10:49 Mkrtchyan, Tigran 2016-07-12 17:16 ` Adamson, Andy 0 siblings, 1 reply; 10+ messages in thread From: Mkrtchyan, Tigran @ 2016-07-07 10:49 UTC (permalink / raw) To: Linux NFS Mailing List; +Cc: Andy Adamson, Trond Myklebust, Steve Dickson Dear NFS folks, we observe orphan open-states on our deployment with nfsv4.1. Our setup - two client nodes, running RHEL-7.2 with kernel 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like a dropbox) which nfsv4.1 mounts to dCache storage. Some clients connected to node1, others to node2. Time-to-time we see some 'active' transfers on data our DS which do nothing. There is a corresponding state on MDS. I have traced one one such cases: - node1 uploads the file. - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE - node2 sends OPEN+LAYOUTGET - there is no open file on node2 which points to it. - CLOSE never send to the server. - node1 eventually removes the removes the file We have many other cases where file is not removed, but this one I was able to trace. The link to capture files: https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). Tigran. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-07-07 10:49 Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) Mkrtchyan, Tigran @ 2016-07-12 17:16 ` Adamson, Andy 2016-07-13 16:49 ` Mkrtchyan, Tigran 0 siblings, 1 reply; 10+ messages in thread From: Adamson, Andy @ 2016-07-12 17:16 UTC (permalink / raw) To: Mkrtchyan, Tigran Cc: Linux NFS Mailing List, Adamson, Andy, Trond Myklebust, Steve Dickson SGkgVGlncmFuDQoNCkNhbiB5b3UgdGVzdCB3aXRoIGFuIHVwc3RyZWFtIGtlcm5lbD8gT2xnYSBo YXMgc2VlbiBpc3N1ZXMgYXJvdW5kIG5vIENMT1NFIGJlaW5nIHNlbnQgLSBpdCBpcyByZWFsbHkg aGFyZCB0byByZXByb2R1Y2XigKYuDQoNCuKAlD5BbmR5DQoNCg0KPiBPbiBKdWwgNywgMjAxNiwg YXQgNjo0OSBBTSwgTWtydGNoeWFuLCBUaWdyYW4gPHRpZ3Jhbi5ta3J0Y2h5YW5AZGVzeS5kZT4g d3JvdGU6DQo+IA0KPiANCj4gDQo+IERlYXIgTkZTIGZvbGtzLA0KPiANCj4gd2Ugb2JzZXJ2ZSBv cnBoYW4gb3Blbi1zdGF0ZXMgb24gb3VyIGRlcGxveW1lbnQgd2l0aCBuZnN2NC4xLg0KPiBPdXIg c2V0dXAgLSB0d28gY2xpZW50IG5vZGVzLCBydW5uaW5nIFJIRUwtNy4yIHdpdGgga2VybmVsDQo+ IDMuMTAuMC0zMjcuMjIuMi5lbDcueDg2XzY0LiBCb3RoIG5vZGVzIHJ1bm5pbmcgb3duQ2xvdWQg KGxpa2UNCj4gYSBkcm9wYm94KSB3aGljaCBuZnN2NC4xIG1vdW50cyB0byBkQ2FjaGUgc3RvcmFn ZS4gU29tZSBjbGllbnRzDQo+IGNvbm5lY3RlZCB0byBub2RlMSwgb3RoZXJzIHRvIG5vZGUyLg0K PiANCj4gVGltZS10by10aW1lIHdlIHNlZSBzb21lICdhY3RpdmUnIHRyYW5zZmVycyBvbiBkYXRh IG91ciBEUw0KPiB3aGljaCBkbyBub3RoaW5nLiBUaGVyZSBpcyBhIGNvcnJlc3BvbmRpbmcgc3Rh dGUgb24gTURTLg0KPiANCj4gSSBoYXZlIHRyYWNlZCBvbmUgb25lIHN1Y2ggY2FzZXM6DQo+IA0K PiAgLSBub2RlMSB1cGxvYWRzIHRoZSBmaWxlLg0KPiAgLSBub2RlMiByZWFkcyB0aGUgZmlsZSBj b3VwbGUgb2YgdGltZXMsIE9QRU4rTEFZT1VUR0VUK0NMT1NFDQo+ICAtIG5vZGUyIHNlbmRzIE9Q RU4rTEFZT1VUR0VUDQo+ICAtIHRoZXJlIGlzIG5vIG9wZW4gZmlsZSBvbiBub2RlMiB3aGljaCBw b2ludHMgdG8gaXQuDQo+ICAtIENMT1NFIG5ldmVyIHNlbmQgdG8gdGhlIHNlcnZlci4gDQo+ICAt IG5vZGUxIGV2ZW50dWFsbHkgcmVtb3ZlcyB0aGUgcmVtb3ZlcyB0aGUgZmlsZQ0KPiANCj4gV2Ug aGF2ZSBtYW55IG90aGVyIGNhc2VzIHdoZXJlIGZpbGUgaXMgbm90IHJlbW92ZWQsIGJ1dCB0aGlz IG9uZSBJIHdhcw0KPiBhYmxlIHRvIHRyYWNlLiBUaGUgbGluayB0byBjYXB0dXJlIGZpbGVzOg0K PiANCj4gaHR0cHM6Ly9kZXN5Y2xvdWQuZGVzeS5kZS9pbmRleC5waHAvcy9ZbGRvd2NSelRHSmVM Yk4NCj4gDQo+IFdlIGhhZCB+IDEwXjYgdHJhbnNmZXJzIGluIGxhc3QgMiBkYXlzIGFuZCAyOSBm aWxlcyBpbiBzdWNoIHN0YXRlICh+MC4wMDI5JSkuDQo+IA0KPiBUaWdyYW4uDQoNCg== ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-07-12 17:16 ` Adamson, Andy @ 2016-07-13 16:49 ` Mkrtchyan, Tigran 2016-07-14 14:52 ` Olga Kornievskaia 0 siblings, 1 reply; 10+ messages in thread From: Mkrtchyan, Tigran @ 2016-07-13 16:49 UTC (permalink / raw) To: Andy Adamson; +Cc: Linux NFS Mailing List, Trond Myklebust, Steve Dickson Hi Andy, I will try to get upstream kernel on one of the nodes. It will take some time as we need to add a new host into the cluster and get some traffic go through it. In the mean while, with RHEL7 we get it easy reproduced - about 10 such cases per day. Is there any tool that will help us to see where it happens? Some traces points? Call trace from vfs close to NFS close? There is a one comment in the kernel code, which sounds similar: (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) nfs4proc.c: 2954 ==== /* * It is possible for data to be read/written from a mem-mapped file * after the sys_close call (which hits the vfs layer as a flush). * This means that we can't safely call nfsv4 close on a file until * the inode is cleared. This in turn means that we are not good * NFSv4 citizens - we do not indicate to the server to update the file's * share state even when we are done with one of the three share * stateid's in the inode. * * NOTE: Caller must be holding the sp->so_owner semaphore! */ int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) ==== Tigran. ----- Original Message ----- > From: "Andy Adamson" <William.Adamson@netapp.com> > To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> > Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" <William.Adamson@netapp.com>, "Trond Myklebust" > <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> > Sent: Tuesday, July 12, 2016 7:16:19 PM > Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > Hi Tigran > > Can you test with an upstream kernel? Olga has seen issues around no CLOSE being > sent - it is really hard to reproduce…. > > —>Andy > > >> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >> >> >> >> Dear NFS folks, >> >> we observe orphan open-states on our deployment with nfsv4.1. >> Our setup - two client nodes, running RHEL-7.2 with kernel >> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >> connected to node1, others to node2. >> >> Time-to-time we see some 'active' transfers on data our DS >> which do nothing. There is a corresponding state on MDS. >> >> I have traced one one such cases: >> >> - node1 uploads the file. >> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >> - node2 sends OPEN+LAYOUTGET >> - there is no open file on node2 which points to it. >> - CLOSE never send to the server. >> - node1 eventually removes the removes the file >> >> We have many other cases where file is not removed, but this one I was >> able to trace. The link to capture files: >> >> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >> >> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >> > > Tigran. ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-07-13 16:49 ` Mkrtchyan, Tigran @ 2016-07-14 14:52 ` Olga Kornievskaia 2016-08-01 11:08 ` Mkrtchyan, Tigran 0 siblings, 1 reply; 10+ messages in thread From: Olga Kornievskaia @ 2016-07-14 14:52 UTC (permalink / raw) To: Mkrtchyan, Tigran Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson Hi Tigran, On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: > > > Hi Andy, > > I will try to get upstream kernel on one of the nodes. It will take > some time as we need to add a new host into the cluster and get > some traffic go through it. > > In the mean while, with RHEL7 we get it easy reproduced - about 10 > such cases per day. Is there any tool that will help us to see where > it happens? Some traces points? Call trace from vfs close to NFS close? There are NFS tracepoints but I don't know think there are VFS tracepoints. Unfortunately, there was a bug in the OPEN tracepoints that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says it's fixed in the later kernel (.381) but it's currently not back ported to RHEL7.2z but hopefully will be soon (just chatted with Steve about getting the fix into zstream). I made no progress in figuring out what could be causing the lack of CLOSE and it was hard for me to reproduce. Just recently Trond fixed a problem where a CLOSE that was suppose to be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I wonder if that can be fixing this problem.... > There is a one comment in the kernel code, which sounds similar: > (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) > > nfs4proc.c: 2954 > ==== > > /* > * It is possible for data to be read/written from a mem-mapped file > * after the sys_close call (which hits the vfs layer as a flush). > * This means that we can't safely call nfsv4 close on a file until > * the inode is cleared. This in turn means that we are not good > * NFSv4 citizens - we do not indicate to the server to update the file's > * share state even when we are done with one of the three share > * stateid's in the inode. > * > * NOTE: Caller must be holding the sp->so_owner semaphore! > */ > int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) > I'm not sure if the comment means to say that there is a possibility that NFS won't send a CLOSE (or at least I hope not). I thought that because we keep a reference count on the inode and send the CLOSE when it goes down to 0. Basically the last WRITE will trigger the nfs close not the vfs_close. > ==== > > > Tigran. > > > ----- Original Message ----- >> From: "Andy Adamson" <William.Adamson@netapp.com> >> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" <William.Adamson@netapp.com>, "Trond Myklebust" >> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >> Sent: Tuesday, July 12, 2016 7:16:19 PM >> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > >> Hi Tigran >> >> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >> sent - it is really hard to reproduce…. >> >> —>Andy >> >> >>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>> >>> >>> >>> Dear NFS folks, >>> >>> we observe orphan open-states on our deployment with nfsv4.1. >>> Our setup - two client nodes, running RHEL-7.2 with kernel >>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>> connected to node1, others to node2. >>> >>> Time-to-time we see some 'active' transfers on data our DS >>> which do nothing. There is a corresponding state on MDS. >>> >>> I have traced one one such cases: >>> >>> - node1 uploads the file. >>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>> - node2 sends OPEN+LAYOUTGET >>> - there is no open file on node2 which points to it. >>> - CLOSE never send to the server. >>> - node1 eventually removes the removes the file >>> >>> We have many other cases where file is not removed, but this one I was >>> able to trace. The link to capture files: >>> >>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>> >>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>> >> > Tigran. > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-07-14 14:52 ` Olga Kornievskaia @ 2016-08-01 11:08 ` Mkrtchyan, Tigran 2016-08-01 21:22 ` Olga Kornievskaia 0 siblings, 1 reply; 10+ messages in thread From: Mkrtchyan, Tigran @ 2016-08-01 11:08 UTC (permalink / raw) To: Olga Kornievskaia Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson Hi Olga, we have installed kernel 4.7.0 on one of the nodes and don't see missing closes from that node. Nevertheless, I don't think that the commit you have mentioned is fixing that, as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The OPEN_DOWNGRADE is not expected - file is already closed when a second open is sent and both requests using the same session slot. Have you seen a similar issue on vanilla or rhel kernel? Thanks a lot, Tigran. ----- Original Message ----- > From: "Olga Kornievskaia" <aglo@umich.edu> > To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> > Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" > <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> > Sent: Thursday, July 14, 2016 4:52:59 PM > Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > Hi Tigran, > > On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran > <tigran.mkrtchyan@desy.de> wrote: >> >> >> Hi Andy, >> >> I will try to get upstream kernel on one of the nodes. It will take >> some time as we need to add a new host into the cluster and get >> some traffic go through it. >> >> In the mean while, with RHEL7 we get it easy reproduced - about 10 >> such cases per day. Is there any tool that will help us to see where >> it happens? Some traces points? Call trace from vfs close to NFS close? > > There are NFS tracepoints but I don't know think there are VFS > tracepoints. Unfortunately, there was a bug in the OPEN tracepoints > that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says > it's fixed in the later kernel (.381) but it's currently not back > ported to RHEL7.2z but hopefully will be soon (just chatted with Steve > about getting the fix into zstream). I made no progress in figuring > out what could be causing the lack of CLOSE and it was hard for me to > reproduce. > > Just recently Trond fixed a problem where a CLOSE that was suppose to > be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I > wonder if that can be fixing this problem.... > >> There is a one comment in the kernel code, which sounds similar: >> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >> >> nfs4proc.c: 2954 >> ==== >> >> /* >> * It is possible for data to be read/written from a mem-mapped file >> * after the sys_close call (which hits the vfs layer as a flush). >> * This means that we can't safely call nfsv4 close on a file until >> * the inode is cleared. This in turn means that we are not good >> * NFSv4 citizens - we do not indicate to the server to update the file's >> * share state even when we are done with one of the three share >> * stateid's in the inode. >> * >> * NOTE: Caller must be holding the sp->so_owner semaphore! >> */ >> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >> > > I'm not sure if the comment means to say that there is a possibility > that NFS won't send a CLOSE (or at least I hope not). I thought that > because we keep a reference count on the inode and send the CLOSE when > it goes down to 0. Basically the last WRITE will trigger the nfs close > not the vfs_close. > > >> ==== >> >> >> Tigran. >> >> >> ----- Original Message ----- >>> From: "Andy Adamson" <William.Adamson@netapp.com> >>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>> <William.Adamson@netapp.com>, "Trond Myklebust" >>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >> >>> Hi Tigran >>> >>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>> sent - it is really hard to reproduce…. >>> >>> —>Andy >>> >>> >>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>> >>>> >>>> >>>> Dear NFS folks, >>>> >>>> we observe orphan open-states on our deployment with nfsv4.1. >>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>> connected to node1, others to node2. >>>> >>>> Time-to-time we see some 'active' transfers on data our DS >>>> which do nothing. There is a corresponding state on MDS. >>>> >>>> I have traced one one such cases: >>>> >>>> - node1 uploads the file. >>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>> - node2 sends OPEN+LAYOUTGET >>>> - there is no open file on node2 which points to it. >>>> - CLOSE never send to the server. >>>> - node1 eventually removes the removes the file >>>> >>>> We have many other cases where file is not removed, but this one I was >>>> able to trace. The link to capture files: >>>> >>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>> >>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>> >>> > Tigran. >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-08-01 11:08 ` Mkrtchyan, Tigran @ 2016-08-01 21:22 ` Olga Kornievskaia 2016-08-04 15:04 ` Mkrtchyan, Tigran 0 siblings, 1 reply; 10+ messages in thread From: Olga Kornievskaia @ 2016-08-01 21:22 UTC (permalink / raw) To: Mkrtchyan, Tigran Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson On Mon, Aug 1, 2016 at 7:08 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: > Hi Olga, > > we have installed kernel 4.7.0 on one of the nodes and don't see missing > closes from that node. > > Nevertheless, I don't think that the commit you have mentioned is fixing that, > as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The > OPEN_DOWNGRADE is not expected - file is already closed when a second open > is sent and both requests using the same session slot. > > Have you seen a similar issue on vanilla or rhel kernel? I had a hard time triggering it consistently. I believe I have seen it on RHEL7.2 kernel but I think I was more consistently seeing it on some upstream (Trond's) kernel version (I think it was around 4.2). The problem was seen by Netapp QA on 4.3-rc7 version. Thanks for testing on the 4.7 version. I'll see what else went in that might explain the failure on the older kernel. > > Thanks a lot, > Tigran. > > ----- Original Message ----- >> From: "Olga Kornievskaia" <aglo@umich.edu> >> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" >> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >> Sent: Thursday, July 14, 2016 4:52:59 PM >> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > >> Hi Tigran, >> >> On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran >> <tigran.mkrtchyan@desy.de> wrote: >>> >>> >>> Hi Andy, >>> >>> I will try to get upstream kernel on one of the nodes. It will take >>> some time as we need to add a new host into the cluster and get >>> some traffic go through it. >>> >>> In the mean while, with RHEL7 we get it easy reproduced - about 10 >>> such cases per day. Is there any tool that will help us to see where >>> it happens? Some traces points? Call trace from vfs close to NFS close? >> >> There are NFS tracepoints but I don't know think there are VFS >> tracepoints. Unfortunately, there was a bug in the OPEN tracepoints >> that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says >> it's fixed in the later kernel (.381) but it's currently not back >> ported to RHEL7.2z but hopefully will be soon (just chatted with Steve >> about getting the fix into zstream). I made no progress in figuring >> out what could be causing the lack of CLOSE and it was hard for me to >> reproduce. >> >> Just recently Trond fixed a problem where a CLOSE that was suppose to >> be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I >> wonder if that can be fixing this problem.... >> >>> There is a one comment in the kernel code, which sounds similar: >>> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >>> >>> nfs4proc.c: 2954 >>> ==== >>> >>> /* >>> * It is possible for data to be read/written from a mem-mapped file >>> * after the sys_close call (which hits the vfs layer as a flush). >>> * This means that we can't safely call nfsv4 close on a file until >>> * the inode is cleared. This in turn means that we are not good >>> * NFSv4 citizens - we do not indicate to the server to update the file's >>> * share state even when we are done with one of the three share >>> * stateid's in the inode. >>> * >>> * NOTE: Caller must be holding the sp->so_owner semaphore! >>> */ >>> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >>> >> >> I'm not sure if the comment means to say that there is a possibility >> that NFS won't send a CLOSE (or at least I hope not). I thought that >> because we keep a reference count on the inode and send the CLOSE when >> it goes down to 0. Basically the last WRITE will trigger the nfs close >> not the vfs_close. >> >> >>> ==== >>> >>> >>> Tigran. >>> >>> >>> ----- Original Message ----- >>>> From: "Andy Adamson" <William.Adamson@netapp.com> >>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>>> <William.Adamson@netapp.com>, "Trond Myklebust" >>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>> >>>> Hi Tigran >>>> >>>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>>> sent - it is really hard to reproduce…. >>>> >>>> —>Andy >>>> >>>> >>>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>>> >>>>> >>>>> >>>>> Dear NFS folks, >>>>> >>>>> we observe orphan open-states on our deployment with nfsv4.1. >>>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>>> connected to node1, others to node2. >>>>> >>>>> Time-to-time we see some 'active' transfers on data our DS >>>>> which do nothing. There is a corresponding state on MDS. >>>>> >>>>> I have traced one one such cases: >>>>> >>>>> - node1 uploads the file. >>>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>>> - node2 sends OPEN+LAYOUTGET >>>>> - there is no open file on node2 which points to it. >>>>> - CLOSE never send to the server. >>>>> - node1 eventually removes the removes the file >>>>> >>>>> We have many other cases where file is not removed, but this one I was >>>>> able to trace. The link to capture files: >>>>> >>>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>>> >>>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>>> >>>> > Tigran. >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-08-01 21:22 ` Olga Kornievskaia @ 2016-08-04 15:04 ` Mkrtchyan, Tigran 2016-08-04 19:00 ` Olga Kornievskaia 0 siblings, 1 reply; 10+ messages in thread From: Mkrtchyan, Tigran @ 2016-08-04 15:04 UTC (permalink / raw) To: Olga Kornievskaia Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson [-- Attachment #1: Type: text/plain, Size: 7867 bytes --] Hi Olga et al. Finally I was able to to create a reproducer (attached)! It looks like, if on close client application is interrupted by Ctrl+C or SIGINT, then nfs client does not sends CLOSE. I can 100% reproduce it on RHEL7 and Fedora24 with 4.6 kernel. The 4.7 kernel works (side effect of some other change?). The attached application reads file in a loop. On the second iteration a thread is started, which will send SIGINT to itself. When CLOSE is lost, you still can read the file. Client even won't send any OPEN. So it looks like that some where file is marked as open, but corresponding process does not exist any more. Even re-mount does not help. Best regards, Tigran. ----- Original Message ----- > From: "Olga Kornievskaia" <aglo@umich.edu> > To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> > Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" > <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> > Sent: Monday, August 1, 2016 11:22:10 PM > Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > On Mon, Aug 1, 2016 at 7:08 AM, Mkrtchyan, Tigran > <tigran.mkrtchyan@desy.de> wrote: >> Hi Olga, >> >> we have installed kernel 4.7.0 on one of the nodes and don't see missing >> closes from that node. >> >> Nevertheless, I don't think that the commit you have mentioned is fixing that, >> as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The >> OPEN_DOWNGRADE is not expected - file is already closed when a second open >> is sent and both requests using the same session slot. >> >> Have you seen a similar issue on vanilla or rhel kernel? > > I had a hard time triggering it consistently. I believe I have seen it > on RHEL7.2 kernel but I think I was more consistently seeing it on > some upstream (Trond's) kernel version (I think it was around 4.2). > The problem was seen by Netapp QA on 4.3-rc7 version. > > Thanks for testing on the 4.7 version. I'll see what else went in that > might explain the failure on the older kernel. > >> >> Thanks a lot, >> Tigran. >> >> ----- Original Message ----- >>> From: "Olga Kornievskaia" <aglo@umich.edu> >>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" >>> <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>> Sent: Thursday, July 14, 2016 4:52:59 PM >>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >> >>> Hi Tigran, >>> >>> On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran >>> <tigran.mkrtchyan@desy.de> wrote: >>>> >>>> >>>> Hi Andy, >>>> >>>> I will try to get upstream kernel on one of the nodes. It will take >>>> some time as we need to add a new host into the cluster and get >>>> some traffic go through it. >>>> >>>> In the mean while, with RHEL7 we get it easy reproduced - about 10 >>>> such cases per day. Is there any tool that will help us to see where >>>> it happens? Some traces points? Call trace from vfs close to NFS close? >>> >>> There are NFS tracepoints but I don't know think there are VFS >>> tracepoints. Unfortunately, there was a bug in the OPEN tracepoints >>> that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says >>> it's fixed in the later kernel (.381) but it's currently not back >>> ported to RHEL7.2z but hopefully will be soon (just chatted with Steve >>> about getting the fix into zstream). I made no progress in figuring >>> out what could be causing the lack of CLOSE and it was hard for me to >>> reproduce. >>> >>> Just recently Trond fixed a problem where a CLOSE that was suppose to >>> be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I >>> wonder if that can be fixing this problem.... >>> >>>> There is a one comment in the kernel code, which sounds similar: >>>> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >>>> >>>> nfs4proc.c: 2954 >>>> ==== >>>> >>>> /* >>>> * It is possible for data to be read/written from a mem-mapped file >>>> * after the sys_close call (which hits the vfs layer as a flush). >>>> * This means that we can't safely call nfsv4 close on a file until >>>> * the inode is cleared. This in turn means that we are not good >>>> * NFSv4 citizens - we do not indicate to the server to update the file's >>>> * share state even when we are done with one of the three share >>>> * stateid's in the inode. >>>> * >>>> * NOTE: Caller must be holding the sp->so_owner semaphore! >>>> */ >>>> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >>>> >>> >>> I'm not sure if the comment means to say that there is a possibility >>> that NFS won't send a CLOSE (or at least I hope not). I thought that >>> because we keep a reference count on the inode and send the CLOSE when >>> it goes down to 0. Basically the last WRITE will trigger the nfs close >>> not the vfs_close. >>> >>> >>>> ==== >>>> >>>> >>>> Tigran. >>>> >>>> >>>> ----- Original Message ----- >>>>> From: "Andy Adamson" <William.Adamson@netapp.com> >>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>>>> <William.Adamson@netapp.com>, "Trond Myklebust" >>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>> >>>>> Hi Tigran >>>>> >>>>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>>>> sent - it is really hard to reproduce…. >>>>> >>>>> —>Andy >>>>> >>>>> >>>>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>>>> >>>>>> >>>>>> >>>>>> Dear NFS folks, >>>>>> >>>>>> we observe orphan open-states on our deployment with nfsv4.1. >>>>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>>>> connected to node1, others to node2. >>>>>> >>>>>> Time-to-time we see some 'active' transfers on data our DS >>>>>> which do nothing. There is a corresponding state on MDS. >>>>>> >>>>>> I have traced one one such cases: >>>>>> >>>>>> - node1 uploads the file. >>>>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>>>> - node2 sends OPEN+LAYOUTGET >>>>>> - there is no open file on node2 which points to it. >>>>>> - CLOSE never send to the server. >>>>>> - node1 eventually removes the removes the file >>>>>> >>>>>> We have many other cases where file is not removed, but this one I was >>>>>> able to trace. The link to capture files: >>>>>> >>>>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>>>> >>>>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>>>> >>>>> > Tigran. >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html [-- Warning: decoded text below may be mangled, UTF-8 assumed --] [-- Attachment #2: nfs_leak_close.c --] [-- Type: text/x-c++src; name="nfs_leak_close.c", Size: 1170 bytes --] #include <stdio.h> #include <stdlib.h> #include <fcntl.h> #include <unistd.h> #include <sys/stat.h> #include <signal.h> #include <pthread.h> #define BSIZE 512 void *kill_task(void *arg) { kill(getpid(), SIGINT); return NULL; } int io_task(const char *fname, off_t size) { int fd; int res; char buf[BSIZE]; int n = 0; pthread_t tid; while (1) { fd = open(fname, O_RDONLY); if (fd < 0) { perror("faild to open file:"); return fd; } res = lseek(fd, random() % size, SEEK_SET); if (res < 0) { perror("stat"); return res; } if (n) { /* try to simulate system call interrupt */ pthread_create(&tid, NULL, kill_task, NULL); } read(fd, buf, BSIZE); pthread_yield(); res = close(fd); if (res < 0) { perror("faild to close file:"); return res; } n++; } return 0; } int main(int argc, char *argv[]) { struct stat sbuf; const char *fname; if (argc != 2) { printf("Usage: %s: <fname>\n", argv[0]); exit(1); } fname = argv[1]; if (stat(fname, &sbuf) < 0) { perror("stat"); exit(1); } io_task((void *) fname, sbuf.st_size); return 0; } ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-08-04 15:04 ` Mkrtchyan, Tigran @ 2016-08-04 19:00 ` Olga Kornievskaia 2016-08-04 21:20 ` Olga Kornievskaia 0 siblings, 1 reply; 10+ messages in thread From: Olga Kornievskaia @ 2016-08-04 19:00 UTC (permalink / raw) To: Mkrtchyan, Tigran Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson On Thu, Aug 4, 2016 at 11:04 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: > > Hi Olga et al. > > Finally I was able to to create a reproducer (attached)! > > It looks like, if on close client application is interrupted by > Ctrl+C or SIGINT, then nfs client does not sends CLOSE. I can > 100% reproduce it on RHEL7 and Fedora24 with 4.6 kernel. The 4.7 > kernel works (side effect of some other change?). > > The attached application reads file in a loop. On the second > iteration a thread is started, which will send SIGINT > to itself. When CLOSE is lost, you still can read the > file. Client even won't send any OPEN. So it looks like > that some where file is marked as open, but corresponding > process does not exist any more. Even re-mount does not help. > Thank you Tigran for a reproducer, I'll check it out and get back to you. > Best regards, > Tigran. > > ----- Original Message ----- >> From: "Olga Kornievskaia" <aglo@umich.edu> >> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" >> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >> Sent: Monday, August 1, 2016 11:22:10 PM >> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > >> On Mon, Aug 1, 2016 at 7:08 AM, Mkrtchyan, Tigran >> <tigran.mkrtchyan@desy.de> wrote: >>> Hi Olga, >>> >>> we have installed kernel 4.7.0 on one of the nodes and don't see missing >>> closes from that node. >>> >>> Nevertheless, I don't think that the commit you have mentioned is fixing that, >>> as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The >>> OPEN_DOWNGRADE is not expected - file is already closed when a second open >>> is sent and both requests using the same session slot. >>> >>> Have you seen a similar issue on vanilla or rhel kernel? >> >> I had a hard time triggering it consistently. I believe I have seen it >> on RHEL7.2 kernel but I think I was more consistently seeing it on >> some upstream (Trond's) kernel version (I think it was around 4.2). >> The problem was seen by Netapp QA on 4.3-rc7 version. >> >> Thanks for testing on the 4.7 version. I'll see what else went in that >> might explain the failure on the older kernel. >> >>> >>> Thanks a lot, >>> Tigran. >>> >>> ----- Original Message ----- >>>> From: "Olga Kornievskaia" <aglo@umich.edu> >>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" >>>> <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>> Sent: Thursday, July 14, 2016 4:52:59 PM >>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>> >>>> Hi Tigran, >>>> >>>> On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran >>>> <tigran.mkrtchyan@desy.de> wrote: >>>>> >>>>> >>>>> Hi Andy, >>>>> >>>>> I will try to get upstream kernel on one of the nodes. It will take >>>>> some time as we need to add a new host into the cluster and get >>>>> some traffic go through it. >>>>> >>>>> In the mean while, with RHEL7 we get it easy reproduced - about 10 >>>>> such cases per day. Is there any tool that will help us to see where >>>>> it happens? Some traces points? Call trace from vfs close to NFS close? >>>> >>>> There are NFS tracepoints but I don't know think there are VFS >>>> tracepoints. Unfortunately, there was a bug in the OPEN tracepoints >>>> that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says >>>> it's fixed in the later kernel (.381) but it's currently not back >>>> ported to RHEL7.2z but hopefully will be soon (just chatted with Steve >>>> about getting the fix into zstream). I made no progress in figuring >>>> out what could be causing the lack of CLOSE and it was hard for me to >>>> reproduce. >>>> >>>> Just recently Trond fixed a problem where a CLOSE that was suppose to >>>> be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I >>>> wonder if that can be fixing this problem.... >>>> >>>>> There is a one comment in the kernel code, which sounds similar: >>>>> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >>>>> >>>>> nfs4proc.c: 2954 >>>>> ==== >>>>> >>>>> /* >>>>> * It is possible for data to be read/written from a mem-mapped file >>>>> * after the sys_close call (which hits the vfs layer as a flush). >>>>> * This means that we can't safely call nfsv4 close on a file until >>>>> * the inode is cleared. This in turn means that we are not good >>>>> * NFSv4 citizens - we do not indicate to the server to update the file's >>>>> * share state even when we are done with one of the three share >>>>> * stateid's in the inode. >>>>> * >>>>> * NOTE: Caller must be holding the sp->so_owner semaphore! >>>>> */ >>>>> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >>>>> >>>> >>>> I'm not sure if the comment means to say that there is a possibility >>>> that NFS won't send a CLOSE (or at least I hope not). I thought that >>>> because we keep a reference count on the inode and send the CLOSE when >>>> it goes down to 0. Basically the last WRITE will trigger the nfs close >>>> not the vfs_close. >>>> >>>> >>>>> ==== >>>>> >>>>> >>>>> Tigran. >>>>> >>>>> >>>>> ----- Original Message ----- >>>>>> From: "Andy Adamson" <William.Adamson@netapp.com> >>>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>>>>> <William.Adamson@netapp.com>, "Trond Myklebust" >>>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>>> >>>>>> Hi Tigran >>>>>> >>>>>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>>>>> sent - it is really hard to reproduce…. >>>>>> >>>>>> —>Andy >>>>>> >>>>>> >>>>>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>>>>> >>>>>>> >>>>>>> >>>>>>> Dear NFS folks, >>>>>>> >>>>>>> we observe orphan open-states on our deployment with nfsv4.1. >>>>>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>>>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>>>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>>>>> connected to node1, others to node2. >>>>>>> >>>>>>> Time-to-time we see some 'active' transfers on data our DS >>>>>>> which do nothing. There is a corresponding state on MDS. >>>>>>> >>>>>>> I have traced one one such cases: >>>>>>> >>>>>>> - node1 uploads the file. >>>>>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>>>>> - node2 sends OPEN+LAYOUTGET >>>>>>> - there is no open file on node2 which points to it. >>>>>>> - CLOSE never send to the server. >>>>>>> - node1 eventually removes the removes the file >>>>>>> >>>>>>> We have many other cases where file is not removed, but this one I was >>>>>>> able to trace. The link to capture files: >>>>>>> >>>>>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>>>>> >>>>>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>>>>> >>>>>> > Tigran. >>>>> -- >>>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>>> the body of a message to majordomo@vger.kernel.org >>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >> -- >> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-08-04 19:00 ` Olga Kornievskaia @ 2016-08-04 21:20 ` Olga Kornievskaia 2016-08-09 10:57 ` Mkrtchyan, Tigran 0 siblings, 1 reply; 10+ messages in thread From: Olga Kornievskaia @ 2016-08-04 21:20 UTC (permalink / raw) To: Mkrtchyan, Tigran Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson On Thu, Aug 4, 2016 at 3:00 PM, Olga Kornievskaia <aglo@umich.edu> wrote: > On Thu, Aug 4, 2016 at 11:04 AM, Mkrtchyan, Tigran > <tigran.mkrtchyan@desy.de> wrote: >> >> Hi Olga et al. >> >> Finally I was able to to create a reproducer (attached)! >> >> It looks like, if on close client application is interrupted by >> Ctrl+C or SIGINT, then nfs client does not sends CLOSE. I can >> 100% reproduce it on RHEL7 and Fedora24 with 4.6 kernel. The 4.7 >> kernel works (side effect of some other change?). >> >> The attached application reads file in a loop. On the second >> iteration a thread is started, which will send SIGINT >> to itself. When CLOSE is lost, you still can read the >> file. Client even won't send any OPEN. So it looks like >> that some where file is marked as open, but corresponding >> process does not exist any more. Even re-mount does not help. >> > > Thank you Tigran for a reproducer, I'll check it out and get back to you. I tried your example. I ran this on 3.10.0-327.4.4. In my run, I see OPEN, GETLAYOUT, GETDEVINFO, READ, CLOSE. Since I get a delegation, then when the file is read again, there won't be able an OPEN on the network trace in the next loop iteration. There won't be a read as it'll be read from the cache. So I'm not should what ctrl-c is accomplishing. What am I missing? Thanks. > >> Best regards, >> Tigran. >> >> ----- Original Message ----- >>> From: "Olga Kornievskaia" <aglo@umich.edu> >>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>> Sent: Monday, August 1, 2016 11:22:10 PM >>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >> >>> On Mon, Aug 1, 2016 at 7:08 AM, Mkrtchyan, Tigran >>> <tigran.mkrtchyan@desy.de> wrote: >>>> Hi Olga, >>>> >>>> we have installed kernel 4.7.0 on one of the nodes and don't see missing >>>> closes from that node. >>>> >>>> Nevertheless, I don't think that the commit you have mentioned is fixing that, >>>> as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The >>>> OPEN_DOWNGRADE is not expected - file is already closed when a second open >>>> is sent and both requests using the same session slot. >>>> >>>> Have you seen a similar issue on vanilla or rhel kernel? >>> >>> I had a hard time triggering it consistently. I believe I have seen it >>> on RHEL7.2 kernel but I think I was more consistently seeing it on >>> some upstream (Trond's) kernel version (I think it was around 4.2). >>> The problem was seen by Netapp QA on 4.3-rc7 version. >>> >>> Thanks for testing on the 4.7 version. I'll see what else went in that >>> might explain the failure on the older kernel. >>> >>>> >>>> Thanks a lot, >>>> Tigran. >>>> >>>> ----- Original Message ----- >>>>> From: "Olga Kornievskaia" <aglo@umich.edu> >>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" >>>>> <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>> Sent: Thursday, July 14, 2016 4:52:59 PM >>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>> >>>>> Hi Tigran, >>>>> >>>>> On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran >>>>> <tigran.mkrtchyan@desy.de> wrote: >>>>>> >>>>>> >>>>>> Hi Andy, >>>>>> >>>>>> I will try to get upstream kernel on one of the nodes. It will take >>>>>> some time as we need to add a new host into the cluster and get >>>>>> some traffic go through it. >>>>>> >>>>>> In the mean while, with RHEL7 we get it easy reproduced - about 10 >>>>>> such cases per day. Is there any tool that will help us to see where >>>>>> it happens? Some traces points? Call trace from vfs close to NFS close? >>>>> >>>>> There are NFS tracepoints but I don't know think there are VFS >>>>> tracepoints. Unfortunately, there was a bug in the OPEN tracepoints >>>>> that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says >>>>> it's fixed in the later kernel (.381) but it's currently not back >>>>> ported to RHEL7.2z but hopefully will be soon (just chatted with Steve >>>>> about getting the fix into zstream). I made no progress in figuring >>>>> out what could be causing the lack of CLOSE and it was hard for me to >>>>> reproduce. >>>>> >>>>> Just recently Trond fixed a problem where a CLOSE that was suppose to >>>>> be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I >>>>> wonder if that can be fixing this problem.... >>>>> >>>>>> There is a one comment in the kernel code, which sounds similar: >>>>>> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >>>>>> >>>>>> nfs4proc.c: 2954 >>>>>> ==== >>>>>> >>>>>> /* >>>>>> * It is possible for data to be read/written from a mem-mapped file >>>>>> * after the sys_close call (which hits the vfs layer as a flush). >>>>>> * This means that we can't safely call nfsv4 close on a file until >>>>>> * the inode is cleared. This in turn means that we are not good >>>>>> * NFSv4 citizens - we do not indicate to the server to update the file's >>>>>> * share state even when we are done with one of the three share >>>>>> * stateid's in the inode. >>>>>> * >>>>>> * NOTE: Caller must be holding the sp->so_owner semaphore! >>>>>> */ >>>>>> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >>>>>> >>>>> >>>>> I'm not sure if the comment means to say that there is a possibility >>>>> that NFS won't send a CLOSE (or at least I hope not). I thought that >>>>> because we keep a reference count on the inode and send the CLOSE when >>>>> it goes down to 0. Basically the last WRITE will trigger the nfs close >>>>> not the vfs_close. >>>>> >>>>> >>>>>> ==== >>>>>> >>>>>> >>>>>> Tigran. >>>>>> >>>>>> >>>>>> ----- Original Message ----- >>>>>>> From: "Andy Adamson" <William.Adamson@netapp.com> >>>>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>>>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>>>>>> <William.Adamson@netapp.com>, "Trond Myklebust" >>>>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>>>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>>>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>>>> >>>>>>> Hi Tigran >>>>>>> >>>>>>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>>>>>> sent - it is really hard to reproduce…. >>>>>>> >>>>>>> —>Andy >>>>>>> >>>>>>> >>>>>>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> Dear NFS folks, >>>>>>>> >>>>>>>> we observe orphan open-states on our deployment with nfsv4.1. >>>>>>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>>>>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>>>>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>>>>>> connected to node1, others to node2. >>>>>>>> >>>>>>>> Time-to-time we see some 'active' transfers on data our DS >>>>>>>> which do nothing. There is a corresponding state on MDS. >>>>>>>> >>>>>>>> I have traced one one such cases: >>>>>>>> >>>>>>>> - node1 uploads the file. >>>>>>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>>>>>> - node2 sends OPEN+LAYOUTGET >>>>>>>> - there is no open file on node2 which points to it. >>>>>>>> - CLOSE never send to the server. >>>>>>>> - node1 eventually removes the removes the file >>>>>>>> >>>>>>>> We have many other cases where file is not removed, but this one I was >>>>>>>> able to trace. The link to capture files: >>>>>>>> >>>>>>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>>>>>> >>>>>>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>>>>>> >>>>>>> > Tigran. >>>>>> -- >>>>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>>>> the body of a message to majordomo@vger.kernel.org >>>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>>> -- >>>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>>> the body of a message to majordomo@vger.kernel.org >>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>> -- >>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>> the body of a message to majordomo@vger.kernel.org >>> More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
* Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) 2016-08-04 21:20 ` Olga Kornievskaia @ 2016-08-09 10:57 ` Mkrtchyan, Tigran 0 siblings, 0 replies; 10+ messages in thread From: Mkrtchyan, Tigran @ 2016-08-09 10:57 UTC (permalink / raw) To: Olga Kornievskaia Cc: Andy Adamson, Linux NFS Mailing List, Trond Myklebust, Steve Dickson I cant reproduce it with 4.6.5 kernel any more. The nfs related changes are: NFS: Fix another OPEN_DOWNGRADE bug (e547f2628327fec6afd2e03b46f113f614cca05b) NFS: Fix a double page unlock (cbebaf897e5c4862567eb799dc84acc5d7ee2678) The second one fixes the source of missing close in 4.6. I will try to chase it RHEL7. Tigran. ----- Original Message ----- > From: "Olga Kornievskaia" <aglo@umich.edu> > To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> > Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Trond Myklebust" > <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> > Sent: Thursday, August 4, 2016 11:20:54 PM > Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) > On Thu, Aug 4, 2016 at 3:00 PM, Olga Kornievskaia <aglo@umich.edu> wrote: >> On Thu, Aug 4, 2016 at 11:04 AM, Mkrtchyan, Tigran >> <tigran.mkrtchyan@desy.de> wrote: >>> >>> Hi Olga et al. >>> >>> Finally I was able to to create a reproducer (attached)! >>> >>> It looks like, if on close client application is interrupted by >>> Ctrl+C or SIGINT, then nfs client does not sends CLOSE. I can >>> 100% reproduce it on RHEL7 and Fedora24 with 4.6 kernel. The 4.7 >>> kernel works (side effect of some other change?). >>> >>> The attached application reads file in a loop. On the second >>> iteration a thread is started, which will send SIGINT >>> to itself. When CLOSE is lost, you still can read the >>> file. Client even won't send any OPEN. So it looks like >>> that some where file is marked as open, but corresponding >>> process does not exist any more. Even re-mount does not help. >>> >> >> Thank you Tigran for a reproducer, I'll check it out and get back to you. > > I tried your example. I ran this on 3.10.0-327.4.4. In my run, I see > OPEN, GETLAYOUT, GETDEVINFO, READ, CLOSE. Since I get a delegation, > then when the file is read again, there won't be able an OPEN on the > network trace in the next loop iteration. There won't be a read as > it'll be read from the cache. So I'm not should what ctrl-c is > accomplishing. What am I missing? > > Thanks. > >> >>> Best regards, >>> Tigran. >>> >>> ----- Original Message ----- >>>> From: "Olga Kornievskaia" <aglo@umich.edu> >>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" >>>> <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>> Sent: Monday, August 1, 2016 11:22:10 PM >>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>> >>>> On Mon, Aug 1, 2016 at 7:08 AM, Mkrtchyan, Tigran >>>> <tigran.mkrtchyan@desy.de> wrote: >>>>> Hi Olga, >>>>> >>>>> we have installed kernel 4.7.0 on one of the nodes and don't see missing >>>>> closes from that node. >>>>> >>>>> Nevertheless, I don't think that the commit you have mentioned is fixing that, >>>>> as it fixes OPEN_DOWNGRADE, but we have a sequence of OPEN->CLOSE->OPEN. The >>>>> OPEN_DOWNGRADE is not expected - file is already closed when a second open >>>>> is sent and both requests using the same session slot. >>>>> >>>>> Have you seen a similar issue on vanilla or rhel kernel? >>>> >>>> I had a hard time triggering it consistently. I believe I have seen it >>>> on RHEL7.2 kernel but I think I was more consistently seeing it on >>>> some upstream (Trond's) kernel version (I think it was around 4.2). >>>> The problem was seen by Netapp QA on 4.3-rc7 version. >>>> >>>> Thanks for testing on the 4.7 version. I'll see what else went in that >>>> might explain the failure on the older kernel. >>>> >>>>> >>>>> Thanks a lot, >>>>> Tigran. >>>>> >>>>> ----- Original Message ----- >>>>>> From: "Olga Kornievskaia" <aglo@umich.edu> >>>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>>> Cc: "Andy Adamson" <William.Adamson@netapp.com>, "Linux NFS Mailing List" >>>>>> <linux-nfs@vger.kernel.org>, "Trond Myklebust" >>>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>>> Sent: Thursday, July 14, 2016 4:52:59 PM >>>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>>> >>>>>> Hi Tigran, >>>>>> >>>>>> On Wed, Jul 13, 2016 at 12:49 PM, Mkrtchyan, Tigran >>>>>> <tigran.mkrtchyan@desy.de> wrote: >>>>>>> >>>>>>> >>>>>>> Hi Andy, >>>>>>> >>>>>>> I will try to get upstream kernel on one of the nodes. It will take >>>>>>> some time as we need to add a new host into the cluster and get >>>>>>> some traffic go through it. >>>>>>> >>>>>>> In the mean while, with RHEL7 we get it easy reproduced - about 10 >>>>>>> such cases per day. Is there any tool that will help us to see where >>>>>>> it happens? Some traces points? Call trace from vfs close to NFS close? >>>>>> >>>>>> There are NFS tracepoints but I don't know think there are VFS >>>>>> tracepoints. Unfortunately, there was a bug in the OPEN tracepoints >>>>>> that caused a kernel crash. I had a bugzilla out for RHEL7.2. It says >>>>>> it's fixed in the later kernel (.381) but it's currently not back >>>>>> ported to RHEL7.2z but hopefully will be soon (just chatted with Steve >>>>>> about getting the fix into zstream). I made no progress in figuring >>>>>> out what could be causing the lack of CLOSE and it was hard for me to >>>>>> reproduce. >>>>>> >>>>>> Just recently Trond fixed a problem where a CLOSE that was suppose to >>>>>> be sent as an OPEN_DOWNGRADE wasn't sent (commit 0979bc2a59) . I >>>>>> wonder if that can be fixing this problem.... >>>>>> >>>>>>> There is a one comment in the kernel code, which sounds similar: >>>>>>> (http://git.linux-nfs.org/?p=trondmy/linux-nfs.git;a=blob;f=fs/nfs/nfs4proc.c;h=519368b987622ea23bea210929bebfd0c327e14e;hb=refs/heads/linux-next#l2955) >>>>>>> >>>>>>> nfs4proc.c: 2954 >>>>>>> ==== >>>>>>> >>>>>>> /* >>>>>>> * It is possible for data to be read/written from a mem-mapped file >>>>>>> * after the sys_close call (which hits the vfs layer as a flush). >>>>>>> * This means that we can't safely call nfsv4 close on a file until >>>>>>> * the inode is cleared. This in turn means that we are not good >>>>>>> * NFSv4 citizens - we do not indicate to the server to update the file's >>>>>>> * share state even when we are done with one of the three share >>>>>>> * stateid's in the inode. >>>>>>> * >>>>>>> * NOTE: Caller must be holding the sp->so_owner semaphore! >>>>>>> */ >>>>>>> int nfs4_do_close(struct nfs4_state *state, gfp_t gfp_mask, int wait) >>>>>>> >>>>>> >>>>>> I'm not sure if the comment means to say that there is a possibility >>>>>> that NFS won't send a CLOSE (or at least I hope not). I thought that >>>>>> because we keep a reference count on the inode and send the CLOSE when >>>>>> it goes down to 0. Basically the last WRITE will trigger the nfs close >>>>>> not the vfs_close. >>>>>> >>>>>> >>>>>>> ==== >>>>>>> >>>>>>> >>>>>>> Tigran. >>>>>>> >>>>>>> >>>>>>> ----- Original Message ----- >>>>>>>> From: "Andy Adamson" <William.Adamson@netapp.com> >>>>>>>> To: "Mkrtchyan, Tigran" <tigran.mkrtchyan@desy.de> >>>>>>>> Cc: "Linux NFS Mailing List" <linux-nfs@vger.kernel.org>, "Andy Adamson" >>>>>>>> <William.Adamson@netapp.com>, "Trond Myklebust" >>>>>>>> <trond.myklebust@primarydata.com>, "Steve Dickson" <steved@redhat.com> >>>>>>>> Sent: Tuesday, July 12, 2016 7:16:19 PM >>>>>>>> Subject: Re: Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) >>>>>>> >>>>>>>> Hi Tigran >>>>>>>> >>>>>>>> Can you test with an upstream kernel? Olga has seen issues around no CLOSE being >>>>>>>> sent - it is really hard to reproduce…. >>>>>>>> >>>>>>>> —>Andy >>>>>>>> >>>>>>>> >>>>>>>>> On Jul 7, 2016, at 6:49 AM, Mkrtchyan, Tigran <tigran.mkrtchyan@desy.de> wrote: >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> Dear NFS folks, >>>>>>>>> >>>>>>>>> we observe orphan open-states on our deployment with nfsv4.1. >>>>>>>>> Our setup - two client nodes, running RHEL-7.2 with kernel >>>>>>>>> 3.10.0-327.22.2.el7.x86_64. Both nodes running ownCloud (like >>>>>>>>> a dropbox) which nfsv4.1 mounts to dCache storage. Some clients >>>>>>>>> connected to node1, others to node2. >>>>>>>>> >>>>>>>>> Time-to-time we see some 'active' transfers on data our DS >>>>>>>>> which do nothing. There is a corresponding state on MDS. >>>>>>>>> >>>>>>>>> I have traced one one such cases: >>>>>>>>> >>>>>>>>> - node1 uploads the file. >>>>>>>>> - node2 reads the file couple of times, OPEN+LAYOUTGET+CLOSE >>>>>>>>> - node2 sends OPEN+LAYOUTGET >>>>>>>>> - there is no open file on node2 which points to it. >>>>>>>>> - CLOSE never send to the server. >>>>>>>>> - node1 eventually removes the removes the file >>>>>>>>> >>>>>>>>> We have many other cases where file is not removed, but this one I was >>>>>>>>> able to trace. The link to capture files: >>>>>>>>> >>>>>>>>> https://desycloud.desy.de/index.php/s/YldowcRzTGJeLbN >>>>>>>>> >>>>>>>>> We had ~ 10^6 transfers in last 2 days and 29 files in such state (~0.0029%). >>>>>>>>> >>>>>>>> > Tigran. >>>>>>> -- >>>>>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>>>>> the body of a message to majordomo@vger.kernel.org >>>>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>>>> -- >>>>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>>>> the body of a message to majordomo@vger.kernel.org >>>>>> More majordomo info at http://vger.kernel.org/majordomo-info.html >>>> -- >>>> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in >>>> the body of a message to majordomo@vger.kernel.org >>>> More majordomo info at http://vger.kernel.org/majordomo-info.html > -- > To unsubscribe from this list: send the line "unsubscribe linux-nfs" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html ^ permalink raw reply [flat|nested] 10+ messages in thread
end of thread, other threads:[~2016-08-09 10:57 UTC | newest] Thread overview: 10+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2016-07-07 10:49 Lost CLOSE with NFSv4.1 on RHEL7 ( and bejond?) Mkrtchyan, Tigran 2016-07-12 17:16 ` Adamson, Andy 2016-07-13 16:49 ` Mkrtchyan, Tigran 2016-07-14 14:52 ` Olga Kornievskaia 2016-08-01 11:08 ` Mkrtchyan, Tigran 2016-08-01 21:22 ` Olga Kornievskaia 2016-08-04 15:04 ` Mkrtchyan, Tigran 2016-08-04 19:00 ` Olga Kornievskaia 2016-08-04 21:20 ` Olga Kornievskaia 2016-08-09 10:57 ` Mkrtchyan, Tigran
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).