* Re: [Fwd: Re: Linux 2.6.25-rc2]
[not found] ` <1203282165.2929.7.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
@ 2008-02-18 17:45 ` J. Bruce Fields
2008-02-18 18:18 ` Tom Tucker
0 siblings, 1 reply; 4+ messages in thread
From: J. Bruce Fields @ 2008-02-18 17:45 UTC (permalink / raw)
To: Trond Myklebust; +Cc: linux-nfs, Tom Tucker
On Sun, Feb 17, 2008 at 04:02:45PM -0500, Trond Myklebust wrote:
> Hi Bruce,
>
> Here is a question for you.
>
> Why does svc_close_all() get away with deleting xprt->xpt_ready
> without holding the pool->sp_lock?
>From a quick look--I think the intention is that the code that calls it
(in svc_destroy()) is only called after all other server threads have
exited, and that there can't be anyone else monkeying with that service
any more. But I haven't verified that really carefully.
> I ask because the strange Oopses that appear in the attached bugreport
> often involve the sunrpc server code. They would appear to indicate
> corruption in the pool->sp_sockets code...
But obviously this needs a hard look. OK, thanks. I could really use
some help if there's someone that has time to look into this....
--b.
>
> Cheers
> Trond
Content-Description: Forwarded message - Re: Linux 2.6.25-rc2
> From: "Rafael J. Wysocki" <rjw@sisk.pl>
> To: Torsten Kaiser <just.for.lkml@googlemail.com>
> Subject: Re: Linux 2.6.25-rc2
> Date: Sun, 17 Feb 2008 21:25:55 +0100
> Cc: Linus Torvalds <torvalds@linux-foundation.org>,
> Linux Kernel Mailing List <linux-kernel@vger.kernel.org>
>
> On Saturday, 16 of February 2008, Torsten Kaiser wrote:
> > On Feb 15, 2008 10:23 PM, Linus Torvalds <torvalds@linux-foundation.org> wrote:
> > >
> > > Ok,
> > > this kernel is a winner.
> >
> > Sadly not for me:
> > [ 5282.056415] ------------[ cut here ]------------
> > [ 5282.059757] kernel BUG at lib/list_debug.c:33!
> > [ 5282.062055] invalid opcode: 0000 [1] SMP
> > [ 5282.062055] CPU 3
> > [ 5282.062055] Modules linked in: radeon drm w83792d ipv6 tuner
> > tea5767 tda8290 tuner_xc2028 tda9887 tuner_simple mt20xx tea5761
> > tvaudio msp3400 bttv videodev v4l1_compat ir_common compat_ioctl32
> > v4l2_common videobuf_dma_sg videobuf_core btcx_risc tveeprom usbhid
> > pata_amd i2c_nforce2 hid sg
> > [ 5282.062055] Pid: 12937, comm: sed Not tainted 2.6.25-rc2 #1
> > [ 5282.062055] RIP: 0010:[<ffffffff803bffe4>]
> > -> then the output from the serial console stopped. I was in X, so I
> > could not see, if there was anything more on the real console.
> >
> > (gdb) list *0xffffffff803bffe4
> > 0xffffffff803bffe4 is in __list_add (lib/list_debug.c:33).
> > 28 }
> > 29 if (unlikely(prev->next != next)) {
> > 30 printk(KERN_ERR "list_add corruption.
> > prev->next should be "
> > 31 "next (%p), but was %p. (prev=%p).\n",
> > 32 next, prev->next, prev);
> > 33 BUG();
> > 34 }
> > 35 next->prev = new;
> > 36 new->next = next;
> > 37 new->prev = prev;
> >
> > For more on this problem see http://marc.info/?l=linux-kernel&m=120293042005445
>
> There's the Bugzilla entry for it at
> http://bugzilla.kernel.org/show_bug.cgi?id=9973
>
> Please update it with the current information.
>
> Thanks,
> Rafael
> --
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Fwd: Re: Linux 2.6.25-rc2]
2008-02-18 17:45 ` [Fwd: Re: Linux 2.6.25-rc2] J. Bruce Fields
@ 2008-02-18 18:18 ` Tom Tucker
[not found] ` <1203358682.24272.31.camel-SMNkleLxa3ZimH42XvhXlA@public.gmane.org>
0 siblings, 1 reply; 4+ messages in thread
From: Tom Tucker @ 2008-02-18 18:18 UTC (permalink / raw)
To: J. Bruce Fields; +Cc: Trond Myklebust, linux-nfs
Bruce:
I'll take a look...
Tom
On Mon, 2008-02-18 at 12:45 -0500, J. Bruce Fields wrote:
> On Sun, Feb 17, 2008 at 04:02:45PM -0500, Trond Myklebust wrote:
> > Hi Bruce,
> >
> > Here is a question for you.
> >
> > Why does svc_close_all() get away with deleting xprt->xpt_ready
> > without holding the pool->sp_lock?
>
> >From a quick look--I think the intention is that the code that calls it
> (in svc_destroy()) is only called after all other server threads have
> exited, and that there can't be anyone else monkeying with that service
> any more. But I haven't verified that really carefully.
>
> > I ask because the strange Oopses that appear in the attached bugreport
> > often involve the sunrpc server code. They would appear to indicate
> > corruption in the pool->sp_sockets code...
>
> But obviously this needs a hard look. OK, thanks. I could really use
> some help if there's someone that has time to look into this....
>
> --b.
>
> >
> > Cheers
> > Trond
>
> Content-Description: Forwarded message - Re: Linux 2.6.25-rc2
> > From: "Rafael J. Wysocki" <rjw@sisk.pl>
> > To: Torsten Kaiser <just.for.lkml@googlemail.com>
> > Subject: Re: Linux 2.6.25-rc2
> > Date: Sun, 17 Feb 2008 21:25:55 +0100
> > Cc: Linus Torvalds <torvalds@linux-foundation.org>,
> > Linux Kernel Mailing List <linux-kernel@vger.kernel.org>
> >
> > On Saturday, 16 of February 2008, Torsten Kaiser wrote:
> > > On Feb 15, 2008 10:23 PM, Linus Torvalds <torvalds@linux-foundation.org> wrote:
> > > >
> > > > Ok,
> > > > this kernel is a winner.
> > >
> > > Sadly not for me:
> > > [ 5282.056415] ------------[ cut here ]------------
> > > [ 5282.059757] kernel BUG at lib/list_debug.c:33!
> > > [ 5282.062055] invalid opcode: 0000 [1] SMP
> > > [ 5282.062055] CPU 3
> > > [ 5282.062055] Modules linked in: radeon drm w83792d ipv6 tuner
> > > tea5767 tda8290 tuner_xc2028 tda9887 tuner_simple mt20xx tea5761
> > > tvaudio msp3400 bttv videodev v4l1_compat ir_common compat_ioctl32
> > > v4l2_common videobuf_dma_sg videobuf_core btcx_risc tveeprom usbhid
> > > pata_amd i2c_nforce2 hid sg
> > > [ 5282.062055] Pid: 12937, comm: sed Not tainted 2.6.25-rc2 #1
> > > [ 5282.062055] RIP: 0010:[<ffffffff803bffe4>]
> > > -> then the output from the serial console stopped. I was in X, so I
> > > could not see, if there was anything more on the real console.
> > >
> > > (gdb) list *0xffffffff803bffe4
> > > 0xffffffff803bffe4 is in __list_add (lib/list_debug.c:33).
> > > 28 }
> > > 29 if (unlikely(prev->next != next)) {
> > > 30 printk(KERN_ERR "list_add corruption.
> > > prev->next should be "
> > > 31 "next (%p), but was %p. (prev=%p).\n",
> > > 32 next, prev->next, prev);
> > > 33 BUG();
> > > 34 }
> > > 35 next->prev = new;
> > > 36 new->next = next;
> > > 37 new->prev = prev;
> > >
> > > For more on this problem see http://marc.info/?l=linux-kernel&m=120293042005445
> >
> > There's the Bugzilla entry for it at
> >
http://bugzilla.kernel.org/show_bug.cgi?id=9973
> >
> > Please update it with the current information.
> >
> > Thanks,
> > Rafael
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> > the body of a message to majordomo@vger.kernel.org
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> > Please read the FAQ at http://www.tux.org/lkml/
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@vger.kernel.org
> More majordomo info at http://vger.kernel.org/majordomo-info.html
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Fwd: Re: Linux 2.6.25-rc2]
[not found] ` <1203358682.24272.31.camel-SMNkleLxa3ZimH42XvhXlA@public.gmane.org>
@ 2008-02-19 8:00 ` Greg Banks
[not found] ` <47BA8CB3.4030703-cP1dWloDopni96+mSzHFpQC/G2K4zDHf@public.gmane.org>
0 siblings, 1 reply; 4+ messages in thread
From: Greg Banks @ 2008-02-19 8:00 UTC (permalink / raw)
To: Tom Tucker; +Cc: J. Bruce Fields, Trond Myklebust, linux-nfs
Tom Tucker wrote:
> Bruce:
>
> I'll take a look...
>
> Tom
>
> On Mon, 2008-02-18 at 12:45 -0500, J. Bruce Fields wrote:
>
>> On Sun, Feb 17, 2008 at 04:02:45PM -0500, Trond Myklebust wrote:
>>
>>> Hi Bruce,
>>>
>>> Here is a question for you.
>>>
>>> Why does svc_close_all() get away with deleting xprt->xpt_ready
>>> without holding the pool->sp_lock?
>>>
>> >From a quick look--I think the intention is that the code that calls it
>> (in svc_destroy()) is only called after all other server threads have
>> exited, and that there can't be anyone else monkeying with that service
>> any more. But I haven't verified that really carefully.
>>
That's certainly the intention. The serv->sv_nrthreads fields is used
as a refcount, which counts 1 for each nfsd thread and sometimes 1 to
guard some short-term manipulations. This refcount should not drop to
zero until the last thread exits. So by the time svc_close_all() is
called, no thread can be looking at a pool's sp_sockets list. Each
xprt could still be racily added to that list from softirq mode data
ready handlers calling svc_xprt_enqueue(), but only until the xprt's
xpo_detach method is called (which removes any data ready callbacks).
At that point, no code should be modifying the xpt_ready field, and
it may or may not be used to link the xprt into some pool->sp_sockets
but we don't care because all the pools are about to be destroyed
anyway.
That's the way it's been working since 2.6.19, and I don't think any
of Tom's patches changed that.
I can think of a couple of things that could be wrong:
* serv->sv_nrthreads is used in a few places, and there might
be bugs in that which are getting that count wrong (when I left
the code, all increments and decrements of that field went through
svc_get() and svc_destroy(), but other changes have crept in).
* Currently running data ready callbacks might be racing with xpo_detach.
Moving that call inside the spin_lock_bh() critical section just
after it might help.
>>>> For more on this problem see http://marc.info/?l=linux-kernel&m=120293042005445
>>>>
>>> There's the Bugzilla entry for it at
>>>
>>>
>
>
> http://bugzilla.kernel.org/show_bug.cgi?id=9973
>
It's not clear from the bugzilla that NFS is at fault here.
--
Greg Banks, R&D Software Engineer, SGI Australian Software Group.
The cake is *not* a lie.
I don't speak for SGI.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [Fwd: Re: Linux 2.6.25-rc2]
[not found] ` <47BA8CB3.4030703-cP1dWloDopni96+mSzHFpQC/G2K4zDHf@public.gmane.org>
@ 2008-02-20 18:52 ` J. Bruce Fields
0 siblings, 0 replies; 4+ messages in thread
From: J. Bruce Fields @ 2008-02-20 18:52 UTC (permalink / raw)
To: Greg Banks; +Cc: Tom Tucker, Trond Myklebust, linux-nfs
On Tue, Feb 19, 2008 at 07:00:51PM +1100, Greg Banks wrote:
> Tom Tucker wrote:
> > Bruce:
> >
> > I'll take a look...
> >
> > Tom
> >
> > On Mon, 2008-02-18 at 12:45 -0500, J. Bruce Fields wrote:
> >
> >> On Sun, Feb 17, 2008 at 04:02:45PM -0500, Trond Myklebust wrote:
> >>
> >>> Hi Bruce,
> >>>
> >>> Here is a question for you.
> >>>
> >>> Why does svc_close_all() get away with deleting xprt->xpt_ready
> >>> without holding the pool->sp_lock?
> >>>
> >> >From a quick look--I think the intention is that the code that calls it
> >> (in svc_destroy()) is only called after all other server threads have
> >> exited, and that there can't be anyone else monkeying with that service
> >> any more. But I haven't verified that really carefully.
> >>
> That's certainly the intention. The serv->sv_nrthreads fields is used
> as a refcount, which counts 1 for each nfsd thread and sometimes 1 to
> guard some short-term manipulations. This refcount should not drop to
> zero until the last thread exits. So by the time svc_close_all() is
> called, no thread can be looking at a pool's sp_sockets list. Each
> xprt could still be racily added to that list from softirq mode data
> ready handlers calling svc_xprt_enqueue(), but only until the xprt's
> xpo_detach method is called (which removes any data ready callbacks).
> At that point, no code should be modifying the xpt_ready field, and
> it may or may not be used to link the xprt into some pool->sp_sockets
> but we don't care because all the pools are about to be destroyed
> anyway.
OK, thanks very much for the confirmation. It looks like they've
finally decided the problem was actually with the memory allocator:
http://bugzilla.kernel.org/show_bug.cgi?id=9973
so I think we're OK.
> That's the way it's been working since 2.6.19, and I don't think any
> of Tom's patches changed that.
>
> I can think of a couple of things that could be wrong:
>
> * serv->sv_nrthreads is used in a few places, and there might
> be bugs in that which are getting that count wrong (when I left
> the code, all increments and decrements of that field went through
> svc_get() and svc_destroy(), but other changes have crept in).
>
> * Currently running data ready callbacks might be racing with xpo_detach.
> Moving that call inside the spin_lock_bh() critical section just
> after it might help.
But if you were unsure about these two things then I suppose it might be
worth doing a closer audit and seeing if there's any refactoring or
documentation that could be added to make the rules clearer.
--b.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2008-02-20 18:52 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <1203282165.2929.7.camel@heimdal.trondhjem.org>
[not found] ` <1203282165.2929.7.camel-rJ7iovZKK19ZJLDQqaL3InhyD016LWXt@public.gmane.org>
2008-02-18 17:45 ` [Fwd: Re: Linux 2.6.25-rc2] J. Bruce Fields
2008-02-18 18:18 ` Tom Tucker
[not found] ` <1203358682.24272.31.camel-SMNkleLxa3ZimH42XvhXlA@public.gmane.org>
2008-02-19 8:00 ` Greg Banks
[not found] ` <47BA8CB3.4030703-cP1dWloDopni96+mSzHFpQC/G2K4zDHf@public.gmane.org>
2008-02-20 18:52 ` J. Bruce Fields
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.