* Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: R, Sricharan @ 2011-09-09 13:10 UTC
To: linux-omap, linux-nfs
Cc: trond.myklebust
Sorry, resending. My mailer settings mangled my earlier email.

Hi,

A kernel crash is observed on the 3.1-rc4 kernel when HIGHMEM is enabled and
the kernel is booted with NFS on omap4430sdp. The issue happens in the
scenario below.

In file net/sunrpc/xprtsock.c:

	static int xs_send_pagedata( xxx, struct xdr_buf *xdr, ..)
	{
		struct page **ppage;
		....
		ppage = xdr->pages + (base >> PAGE_SHIFT);
		....
		err = sock->ops->sendpage(sock, *ppage, base, len, flags);
		....
	}

1) In the above code, the *ppage value passed to ops->sendpage is
   eventually handed to kmap() by the lower-level code to get the
   virtual address of the page.
2) In some corner cases the value of *ppage is NULL.
3) When highmem is enabled and a NULL pointer is passed to kmap(),
   kmap() crashes. When highmem is disabled, kmap() returns a junk
   value for the NULL pointer instead.

   Highmem enabled,  kmap(NULL) -----> kernel crashes.
   Highmem disabled, kmap(NULL) -----> junk value is returned.
                     Subsequently this message is observed on
                     the console:

                     "RPC call returned error 14"

4) Now the question is: why is *ppage == NULL passed from the above
   code to the lower layers? Should this code not have handled
   *ppage == NULL, so that kmap() never receives a NULL pointer?

Thanks,
Sricharan
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
* Re: Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: Trond Myklebust @ 2011-09-09 18:51 UTC
To: R, Sricharan
Cc: linux-omap, linux-nfs

On Fri, 2011-09-09 at 18:40 +0530, R, Sricharan wrote:
> [...]
> 2) In some corner cases the value of *ppage pointer is NULL.
> [...]
> 4) Now the question is why is the value of *ppage = NULL is passed
> from the above piece of code to lower layers.
> Should that not have handled *ppage = NULL? and kmap should not
> have received a NULL pointer?

I wouldn't expect *ppage to be NULL under any circumstances, so I'm
really curious as to what is happening here.

Could you perhaps add a printk() to that section of code to print out
the values of 'xdr->page_base', 'xdr->page_len', 'len' and 'remainder'
in the case where *ppage == NULL?

Cheers
  Trond
--
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com
* RE: Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: Sricharan R @ 2011-09-12 6:16 UTC
To: Trond Myklebust
Cc: linux-omap, linux-nfs

Hi Trond,
[....]
>I wouldn't expect *ppage to be NULL under any circumstances, so I'm
>really curious as to what is happening here.
>
>Could you perhaps add a printk() to that section of code to print out
>the values of 'xdr->page_base', 'xdr->page_len', 'len' and 'remainder'
>in the case where *ppage == NULL?

Thanks for the response.
I added a printk just before the call
	err = sock->ops->sendpage(sock, *ppage, base, len, flags);
Here are the values when *ppage is NULL:

	xdr->page_base = 0xCE9
	xdr->page_len  = 0x400
	len            = 0xE9
	remainder      = 0x0

Thanks,
Sricharan
* RE: Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: Trond Myklebust @ 2011-09-12 14:41 UTC
To: Sricharan R
Cc: linux-omap, linux-nfs

On Mon, 2011-09-12 at 11:46 +0530, Sricharan R wrote:
> Thanks for the response.
> I added a printk just before err = sock->ops->sendpage(sock, *ppage, base,
> len, flags);
> So here are values when *ppage is NULL.
>
> xdr->page_base= 0xCE9 xdr->page_len=0x400 len=0xE9 remainder=0x0.
>
> Thanks,
> Sricharan

Can you please tell me what the mount options are for this setup? Are
you running any applications that might be using O_DIRECT writes?

Cheers
  Trond
--
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com
* RE: Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: Trond Myklebust @ 2011-09-12 15:54 UTC
To: Sricharan R
Cc: linux-omap, linux-nfs

On Mon, 2011-09-12 at 10:41 -0400, Trond Myklebust wrote:
> On Mon, 2011-09-12 at 11:46 +0530, Sricharan R wrote:
> > Thanks for the response.
> > I added a printk just before err = sock->ops->sendpage(sock, *ppage, base,
> > len, flags);
> > So here are values when *ppage is NULL.
> >
> > xdr->page_base= 0xCE9 xdr->page_len=0x400 len=0xE9 remainder=0x0.
> >
> > Thanks,
> > Sricharan
>
> Can you please tell me what the mount options are for this setup?

I'm guessing you've got wsize=1024, in which case, can you please try
the following patch?

Cheers
  Trond
8<--------------------------------------------------------------------------
From 7b4a9c76b55dd254431902552528137a2ea5e55d Mon Sep 17 00:00:00 2001
From: Trond Myklebust <Trond.Myklebust@netapp.com>
Date: Mon, 12 Sep 2011 11:47:53 -0400
Subject: [PATCH] NFS: Fix a typo in nfs_flush_multi

Fix a typo which causes an Oops in the RPC layer, when using wsize < 4k.

Signed-off-by: Trond Myklebust <Trond.Myklebust@netapp.com>
---
 fs/nfs/write.c |    2 +-
 1 files changed, 1 insertions(+), 1 deletions(-)

diff --git a/fs/nfs/write.c b/fs/nfs/write.c
index b39b37f..c9bd2a6 100644
--- a/fs/nfs/write.c
+++ b/fs/nfs/write.c
@@ -958,7 +958,7 @@ static int nfs_flush_multi(struct nfs_pageio_descriptor *desc, struct list_head
 		if (!data)
 			goto out_bad;
 		data->pagevec[0] = page;
-		nfs_write_rpcsetup(req, data, wsize, offset, desc->pg_ioflags);
+		nfs_write_rpcsetup(req, data, len, offset, desc->pg_ioflags);
 		list_add(&data->list, res);
 		requests++;
 		nbytes -= len;
--
1.7.6

--
Trond Myklebust
Linux NFS client maintainer

NetApp
Trond.Myklebust@netapp.com
www.netapp.com
* RE: Regression seen when HIGHMEM enabled with NFS on 3.1rc4 kernel
From: Sricharan R @ 2011-09-13 6:41 UTC
To: Trond Myklebust
Cc: linux-omap, linux-nfs, Santosh Shilimkar

[..]
>> Can you please tell me what the mount options are for this setup?
>
>I'm guessing you've got wsize=1024, in which case, can you please try
>the following patch?

The mount option for NFS is rw. Yes, in my setup wsize=1024 when the
issue happened. I tried your patch and was no longer able to reproduce
the issue, whereas without the patch it happened quite frequently. So I
think the patch fixes the issue.

Thanks a lot for your help.