* [PATCH] nfs lockd: detect grace_list corruption
@ 2009-04-24 3:09 Wang Chen
2009-04-24 5:15 ` Bian Naimeng
` (2 more replies)
0 siblings, 3 replies; 13+ messages in thread
From: Wang Chen @ 2009-04-24 3:09 UTC (permalink / raw)
To: bfields, neilb, Trond.Myklebust; +Cc: linux-nfs
Although I can't reproduce it now, it really happened that some lock manager
started grace period but didn't end it.
This causes an lm entry be left in grace_list, and when service nfs restart,
the same lm will be added again into the list.
As you know, adding an entry, which is in the list, to a list will leads to
list corruption.
The following is the dmesg information:
------------[ cut here ]------------
WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c()
Hardware name: Presario M2000 (PT365PA#AB2)
list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128).
Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode]
Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3
Call Trace:
[<c042d5b5>] warn_slowpath+0x71/0xa0
[<c0422a96>] ? update_curr+0x11d/0x125
[<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150
[<c044b270>] ? trace_hardirqs_on+0xb/0xd
[<c051c61a>] ? _raw_spin_lock+0x53/0xfa
[<c051c89f>] __list_add+0x27/0x5c
[<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd]
[<ef8f34da>] set_grace_period+0x39/0x53 [lockd]
[<c06b8921>] ? lock_kernel+0x1c/0x28
[<ef8f3558>] lockd+0x64/0x164 [lockd]
[<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150
[<c04227b0>] ? complete+0x34/0x3e
[<ef8f34f4>] ? lockd+0x0/0x164 [lockd]
[<ef8f34f4>] ? lockd+0x0/0x164 [lockd]
[<c043dd42>] kthread+0x45/0x6b
[<c043dcfd>] ? kthread+0x0/0x6b
[<c0403c23>] kernel_thread_helper+0x7/0x10
---[ end trace fa484bd6d19ade89 ]---
NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory
NFSD: starting 90-second grace period
------------------------
To prevent list corruption, don't add entry which is already in the list
and do WARN.
Signed-off-by: Wang Chen <wangchen@cn.fujitsu.com>
---
fs/lockd/grace.c | 9 ++++++++++
1 files changed, 9 insertions(+), 0 deletions(-)
diff --git a/fs/lockd/grace.c b/fs/lockd/grace.c
index 183cc1f..80becb7 100644
--- a/fs/lockd/grace.c
+++ b/fs/lockd/grace.c
@@ -21,8 +21,17 @@ static DEFINE_SPINLOCK(grace_lock);
*/
void locks_start_grace(struct lock_manager *lm)
{
+ struct lock_manager *each;
+
spin_lock(&grace_lock);
+ if (!list_empty(&lm->list)) {
+ WARN(1, "lock manager(%p) exists. locks_start_grace() "
+ "and locks_end_grace() were not called pairly."
+ "\n", lm);
+ goto unlock;
+ }
list_add(&lm->list, &grace_list);
+unlock:
spin_unlock(&grace_lock);
}
EXPORT_SYMBOL_GPL(locks_start_grace);
--
1.6.0.6
^ permalink raw reply related [flat|nested] 13+ messages in thread* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-04-24 3:09 [PATCH] nfs lockd: detect grace_list corruption Wang Chen @ 2009-04-24 5:15 ` Bian Naimeng 2009-04-24 7:35 ` [PATCH V2] " Wang Chen 2009-04-24 23:12 ` [PATCH] " J. Bruce Fields 2 siblings, 0 replies; 13+ messages in thread From: Bian Naimeng @ 2009-04-24 5:15 UTC (permalink / raw) To: Wang Chen; +Cc: bfields, neilb, Trond.Myklebust, linux-nfs > Although I can't reproduce it now, it really happened that some lock manager > started grace period but didn't end it. > This causes an lm entry be left in grace_list, and when service nfs restart, > the same lm will be added again into the list. > As you know, adding an entry, which is in the list, to a list will leads to > list corruption. > > The following is the dmesg information: > ------------[ cut here ]------------ > WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Hardware name: Presario M2000 (PT365PA#AB2) > list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Call Trace: > [<c042d5b5>] warn_slowpath+0x71/0xa0 > [<c0422a96>] ? update_curr+0x11d/0x125 > [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > [<c044b270>] ? trace_hardirqs_on+0xb/0xd > [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > [<c051c89f>] __list_add+0x27/0x5c > [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > [<c06b8921>] ? lock_kernel+0x1c/0x28 > [<ef8f3558>] lockd+0x64/0x164 [lockd] > [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > [<c04227b0>] ? complete+0x34/0x3e > [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > [<c043dd42>] kthread+0x45/0x6b > [<c043dcfd>] ? kthread+0x0/0x6b > [<c0403c23>] kernel_thread_helper+0x7/0x10 > ---[ end trace fa484bd6d19ade89 ]--- > NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > NFSD: starting 90-second grace period > ------------------------ > > To prevent list corruption, don't add entry which is already in the list > and do WARN. > > Signed-off-by: Wang Chen <wangchen@cn.fujitsu.com> > --- > fs/lockd/grace.c | 9 ++++++++++ > 1 files changed, 9 insertions(+), 0 deletions(-) > > diff --git a/fs/lockd/grace.c b/fs/lockd/grace.c > index 183cc1f..80becb7 100644 > --- a/fs/lockd/grace.c > +++ b/fs/lockd/grace.c > @@ -21,8 +21,17 @@ static DEFINE_SPINLOCK(grace_lock); > */ > void locks_start_grace(struct lock_manager *lm) > { > + struct lock_manager *each; > + The line "struct lock_manager *each" should be removed. > spin_lock(&grace_lock); > + if (!list_empty(&lm->list)) { It should INIT_LIST_HEAD(lm->list) before used list_empty. It sounds "lockd_manager" and "nfsd4_manager" never init themselves, when first used by locks_start_grace, so we should init them after defined, right? > + WARN(1, "lock manager(%p) exists. locks_start_grace() " > + "and locks_end_grace() were not called pairly." > + "\n", lm); > + goto unlock; > + } > list_add(&lm->list, &grace_list); > +unlock: > spin_unlock(&grace_lock); > } > EXPORT_SYMBOL_GPL(locks_start_grace); ^ permalink raw reply [flat|nested] 13+ messages in thread
* [PATCH V2] nfs lockd: detect grace_list corruption 2009-04-24 3:09 [PATCH] nfs lockd: detect grace_list corruption Wang Chen 2009-04-24 5:15 ` Bian Naimeng @ 2009-04-24 7:35 ` Wang Chen 2009-04-24 23:12 ` [PATCH] " J. Bruce Fields 2 siblings, 0 replies; 13+ messages in thread From: Wang Chen @ 2009-04-24 7:35 UTC (permalink / raw) To: bfields, neilb, Trond.Myklebust; +Cc: linux-nfs, FNST-Bian Naimeng V1->V2: - remove useless variable which was dirty code - use INIT_LIST_HEAD(&lm->list) to initialize lm->list >From 6dd710c220ea72af41d7679b42e4e6eddff051b5 Mon Sep 17 00:00:00 2001 From: Wang Chen <wangchen@cn.fujitsu.com> Date: Fri, 24 Apr 2009 15:09:55 +0800 Subject: [PATCH] nfs lockd: detect grace_list corruption Although I can't reproduce it now, it really happened that some lock manager started grace period but didn't end it. This causes an lm entry be left in grace_list, and when service nfs restart, the same lm will be added again into the list. As you know, adding an entry, which is in the list, to a list will leads to list corruption. The following is the dmesg information: ------------[ cut here ]------------ WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Hardware name: Presario M2000 (PT365PA#AB2) list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 Call Trace: [<c042d5b5>] warn_slowpath+0x71/0xa0 [<c0422a96>] ? update_curr+0x11d/0x125 [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 [<c044b270>] ? trace_hardirqs_on+0xb/0xd [<c051c61a>] ? _raw_spin_lock+0x53/0xfa [<c051c89f>] __list_add+0x27/0x5c [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] [<c06b8921>] ? lock_kernel+0x1c/0x28 [<ef8f3558>] lockd+0x64/0x164 [lockd] [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 [<c04227b0>] ? complete+0x34/0x3e [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] [<c043dd42>] kthread+0x45/0x6b [<c043dcfd>] ? kthread+0x0/0x6b [<c0403c23>] kernel_thread_helper+0x7/0x10 ---[ end trace fa484bd6d19ade89 ]--- NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory NFSD: starting 90-second grace period ------------------------ To prevent list corruption, don't add entry which is already in the list and do WARN. Signed-off-by: Wang Chen <wangchen@cn.fujitsu.com> Reviewed-by: Bian Naimeng <biannm@cn.fujitsu.com> --- fs/lockd/grace.c | 7 +++++++ fs/lockd/svc.c | 2 ++ fs/nfsd/nfs4state.c | 1 + 3 files changed, 10 insertions(+), 0 deletions(-) diff --git a/fs/lockd/grace.c b/fs/lockd/grace.c index 183cc1f..da52c35 100644 --- a/fs/lockd/grace.c +++ b/fs/lockd/grace.c @@ -22,7 +22,14 @@ static DEFINE_SPINLOCK(grace_lock); void locks_start_grace(struct lock_manager *lm) { spin_lock(&grace_lock); + if (!list_empty(&lm->list)) { + WARN(1, "lock manager(%p) exists. locks_start_grace() " + "and locks_end_grace() were not called pairly." + "\n", lm); + goto unlock; + } list_add(&lm->list, &grace_list); +unlock: spin_unlock(&grace_lock); } EXPORT_SYMBOL_GPL(locks_start_grace); diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c index abf8388..76a1f61 100644 --- a/fs/lockd/svc.c +++ b/fs/lockd/svc.c @@ -113,6 +113,8 @@ lockd(void *vrqstp) int err = 0, preverr = 0; struct svc_rqst *rqstp = vrqstp; + INIT_LIST_HEAD(&lockd_manager.list); + /* try_to_freeze() is called from svc_recv() */ set_freezable(); diff --git a/fs/nfsd/nfs4state.c b/fs/nfsd/nfs4state.c index c65a27b..86b9afd 100644 --- a/fs/nfsd/nfs4state.c +++ b/fs/nfsd/nfs4state.c @@ -4079,6 +4079,7 @@ nfs4_state_start(void) { if (nfs4_init) return; + INIT_LIST_HEAD(&nfsd4_manager.list); nfsd4_load_reboot_recovery_data(); __nfs4_state_start(); nfs4_init = 1; -- 1.6.0.6 ^ permalink raw reply related [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-04-24 3:09 [PATCH] nfs lockd: detect grace_list corruption Wang Chen 2009-04-24 5:15 ` Bian Naimeng 2009-04-24 7:35 ` [PATCH V2] " Wang Chen @ 2009-04-24 23:12 ` J. Bruce Fields 2009-05-06 9:17 ` Wang Chen 2 siblings, 1 reply; 13+ messages in thread From: J. Bruce Fields @ 2009-04-24 23:12 UTC (permalink / raw) To: Wang Chen; +Cc: neilb, Trond.Myklebust, linux-nfs On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: > Although I can't reproduce it now, it really happened that some lock manager > started grace period but didn't end it. > This causes an lm entry be left in grace_list, and when service nfs restart, > the same lm will be added again into the list. > As you know, adding an entry, which is in the list, to a list will leads to > list corruption. I'd really like to understand why locks_end_grace() isn't being called. I'm probably overlooking something obvious, but I just can't see how lockd or nfsd can be shut down right now without locks_end_grace() being called. --b. > > The following is the dmesg information: > ------------[ cut here ]------------ > WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Hardware name: Presario M2000 (PT365PA#AB2) > list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Call Trace: > [<c042d5b5>] warn_slowpath+0x71/0xa0 > [<c0422a96>] ? update_curr+0x11d/0x125 > [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > [<c044b270>] ? trace_hardirqs_on+0xb/0xd > [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > [<c051c89f>] __list_add+0x27/0x5c > [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > [<c06b8921>] ? lock_kernel+0x1c/0x28 > [<ef8f3558>] lockd+0x64/0x164 [lockd] > [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > [<c04227b0>] ? complete+0x34/0x3e > [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > [<c043dd42>] kthread+0x45/0x6b > [<c043dcfd>] ? kthread+0x0/0x6b > [<c0403c23>] kernel_thread_helper+0x7/0x10 > ---[ end trace fa484bd6d19ade89 ]--- > NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > NFSD: starting 90-second grace period > ------------------------ > > To prevent list corruption, don't add entry which is already in the list > and do WARN. > > Signed-off-by: Wang Chen <wangchen@cn.fujitsu.com> > --- > fs/lockd/grace.c | 9 ++++++++++ > 1 files changed, 9 insertions(+), 0 deletions(-) > > diff --git a/fs/lockd/grace.c b/fs/lockd/grace.c > index 183cc1f..80becb7 100644 > --- a/fs/lockd/grace.c > +++ b/fs/lockd/grace.c > @@ -21,8 +21,17 @@ static DEFINE_SPINLOCK(grace_lock); > */ > void locks_start_grace(struct lock_manager *lm) > { > + struct lock_manager *each; > + > spin_lock(&grace_lock); > + if (!list_empty(&lm->list)) { > + WARN(1, "lock manager(%p) exists. locks_start_grace() " > + "and locks_end_grace() were not called pairly." > + "\n", lm); > + goto unlock; > + } > list_add(&lm->list, &grace_list); > +unlock: > spin_unlock(&grace_lock); > } > EXPORT_SYMBOL_GPL(locks_start_grace); > -- > 1.6.0.6 > > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-04-24 23:12 ` [PATCH] " J. Bruce Fields @ 2009-05-06 9:17 ` Wang Chen 2009-05-06 20:32 ` J. Bruce Fields 0 siblings, 1 reply; 13+ messages in thread From: Wang Chen @ 2009-05-06 9:17 UTC (permalink / raw) To: J. Bruce Fields; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng J. Bruce Fields said the following on 2009-4-25 7:12: > On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: >> Although I can't reproduce it now, it really happened that some lock manager >> started grace period but didn't end it. >> This causes an lm entry be left in grace_list, and when service nfs restart, >> the same lm will be added again into the list. >> As you know, adding an entry, which is in the list, to a list will leads to >> list corruption. > > I'd really like to understand why locks_end_grace() isn't being called. > I'm probably overlooking something obvious, but I just can't see how > lockd or nfsd can be shut down right now without locks_end_grace() being > called. > Me neither can figure out why locks_end_grace() isn't being called. But do locks_start_grace() twice can trigger this warning too. You can do 1. service nfs restart 2. (immediately) kill -s SIGKILL lockd this can trigger --- lockd(void *vrqstp) ... if (signalled()) { flush_signals(current); if (nlmsvc_ops) { nlmsvc_invalidate_all(); set_grace_period(); --- and makes locks_start_grace() be called twice without locks_end_grace(). So I still suggest to do something to protect the lm list. :) ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-06 9:17 ` Wang Chen @ 2009-05-06 20:32 ` J. Bruce Fields 2009-05-07 6:51 ` Wang Chen 0 siblings, 1 reply; 13+ messages in thread From: J. Bruce Fields @ 2009-05-06 20:32 UTC (permalink / raw) To: Wang Chen; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: > J. Bruce Fields said the following on 2009-4-25 7:12: > > On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: > >> Although I can't reproduce it now, it really happened that some lock manager > >> started grace period but didn't end it. > >> This causes an lm entry be left in grace_list, and when service nfs restart, > >> the same lm will be added again into the list. > >> As you know, adding an entry, which is in the list, to a list will leads to > >> list corruption. > > > > I'd really like to understand why locks_end_grace() isn't being called. > > I'm probably overlooking something obvious, but I just can't see how > > lockd or nfsd can be shut down right now without locks_end_grace() being > > called. > > > > Me neither can figure out why locks_end_grace() isn't being called. > > But do locks_start_grace() twice can trigger this warning too. > You can do > 1. service nfs restart > 2. (immediately) kill -s SIGKILL lockd > this can trigger > --- > lockd(void *vrqstp) > ... > if (signalled()) { > flush_signals(current); > if (nlmsvc_ops) { > nlmsvc_invalidate_all(); > set_grace_period(); > --- > and makes locks_start_grace() be called twice without locks_end_grace(). Ah-hah! > So I still suggest to do something to protect the lm list. :) I wouldn't be opposed to a simple WARN_ON(!list_empty()) in locks_start_grace(), but I'm mainly worried about fixing the original bug. How about the following? --b. diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c index abf8388..1a54ae1 100644 --- a/fs/lockd/svc.c +++ b/fs/lockd/svc.c @@ -104,6 +104,16 @@ static void set_grace_period(void) schedule_delayed_work(&grace_period_end, grace_period); } +static void restart_grace(void) +{ + if (nlmsvc_ops) { + cancel_delayed_work_sync(&grace_period_end); + locks_end_grace(&lockd_manager); + nlmsvc_invalidate_all(); + set_grace_period(); + } +} + /* * This is the lockd kernel thread */ @@ -149,10 +159,7 @@ lockd(void *vrqstp) if (signalled()) { flush_signals(current); - if (nlmsvc_ops) { - nlmsvc_invalidate_all(); - set_grace_period(); - } + restart_grace(); continue; } ^ permalink raw reply related [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-06 20:32 ` J. Bruce Fields @ 2009-05-07 6:51 ` Wang Chen 2009-05-08 18:26 ` J. Bruce Fields 0 siblings, 1 reply; 13+ messages in thread From: Wang Chen @ 2009-05-07 6:51 UTC (permalink / raw) To: J. Bruce Fields; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng J. Bruce Fields said the following on 2009-5-7 4:32: > On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: >> J. Bruce Fields said the following on 2009-4-25 7:12: >>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: >>>> Although I can't reproduce it now, it really happened that some lock manager >>>> started grace period but didn't end it. >>>> This causes an lm entry be left in grace_list, and when service nfs restart, >>>> the same lm will be added again into the list. >>>> As you know, adding an entry, which is in the list, to a list will leads to >>>> list corruption. >>> I'd really like to understand why locks_end_grace() isn't being called. >>> I'm probably overlooking something obvious, but I just can't see how >>> lockd or nfsd can be shut down right now without locks_end_grace() being >>> called. >>> >> Me neither can figure out why locks_end_grace() isn't being called. >> >> But do locks_start_grace() twice can trigger this warning too. >> You can do >> 1. service nfs restart >> 2. (immediately) kill -s SIGKILL lockd >> this can trigger >> --- >> lockd(void *vrqstp) >> ... >> if (signalled()) { >> flush_signals(current); >> if (nlmsvc_ops) { >> nlmsvc_invalidate_all(); >> set_grace_period(); >> --- >> and makes locks_start_grace() be called twice without locks_end_grace(). > > Ah-hah! > >> So I still suggest to do something to protect the lm list. :) > > I wouldn't be opposed to a simple WARN_ON(!list_empty()) in > locks_start_grace(), but I'm mainly worried about fixing the original > bug. How about the following? > Yeah, the following fix is OK to me, although it only fixed "start_grace again after start_grace" case. The bug about "quit lockd without end_grace", which I encountered before incidentally, maybe is still there. > --b. > > diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c > index abf8388..1a54ae1 100644 > --- a/fs/lockd/svc.c > +++ b/fs/lockd/svc.c > @@ -104,6 +104,16 @@ static void set_grace_period(void) > schedule_delayed_work(&grace_period_end, grace_period); > } > > +static void restart_grace(void) > +{ > + if (nlmsvc_ops) { > + cancel_delayed_work_sync(&grace_period_end); > + locks_end_grace(&lockd_manager); > + nlmsvc_invalidate_all(); > + set_grace_period(); > + } > +} > + > /* > * This is the lockd kernel thread > */ > @@ -149,10 +159,7 @@ lockd(void *vrqstp) > > if (signalled()) { > flush_signals(current); > - if (nlmsvc_ops) { > - nlmsvc_invalidate_all(); > - set_grace_period(); > - } > + restart_grace(); > continue; > } > > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-07 6:51 ` Wang Chen @ 2009-05-08 18:26 ` J. Bruce Fields 2009-05-11 6:13 ` Wang Chen 0 siblings, 1 reply; 13+ messages in thread From: J. Bruce Fields @ 2009-05-08 18:26 UTC (permalink / raw) To: Wang Chen; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: > J. Bruce Fields said the following on 2009-5-7 4:32: > > On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: > >> J. Bruce Fields said the following on 2009-4-25 7:12: > >>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: > >>>> Although I can't reproduce it now, it really happened that some lock manager > >>>> started grace period but didn't end it. > >>>> This causes an lm entry be left in grace_list, and when service nfs restart, > >>>> the same lm will be added again into the list. > >>>> As you know, adding an entry, which is in the list, to a list will leads to > >>>> list corruption. > >>> I'd really like to understand why locks_end_grace() isn't being called. > >>> I'm probably overlooking something obvious, but I just can't see how > >>> lockd or nfsd can be shut down right now without locks_end_grace() being > >>> called. > >>> > >> Me neither can figure out why locks_end_grace() isn't being called. > >> > >> But do locks_start_grace() twice can trigger this warning too. > >> You can do > >> 1. service nfs restart > >> 2. (immediately) kill -s SIGKILL lockd > >> this can trigger > >> --- > >> lockd(void *vrqstp) > >> ... > >> if (signalled()) { > >> flush_signals(current); > >> if (nlmsvc_ops) { > >> nlmsvc_invalidate_all(); > >> set_grace_period(); > >> --- > >> and makes locks_start_grace() be called twice without locks_end_grace(). > > > > Ah-hah! > > > >> So I still suggest to do something to protect the lm list. :) > > > > I wouldn't be opposed to a simple WARN_ON(!list_empty()) in > > locks_start_grace(), but I'm mainly worried about fixing the original > > bug. How about the following? > > > > Yeah, the following fix is OK to me, although it only fixed > "start_grace again after start_grace" case. OK, thanks. > The bug about "quit lockd without end_grace", which I encountered before > incidentally, maybe is still there. You're talking about the report that started this thread?: http://marc.info/?l=linux-nfs&m=124054262421444&w=2 It looks to me like that could be explained by two start_grace's in a row. --b. > > > --b. > > > > diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c > > index abf8388..1a54ae1 100644 > > --- a/fs/lockd/svc.c > > +++ b/fs/lockd/svc.c > > @@ -104,6 +104,16 @@ static void set_grace_period(void) > > schedule_delayed_work(&grace_period_end, grace_period); > > } > > > > +static void restart_grace(void) > > +{ > > + if (nlmsvc_ops) { > > + cancel_delayed_work_sync(&grace_period_end); > > + locks_end_grace(&lockd_manager); > > + nlmsvc_invalidate_all(); > > + set_grace_period(); > > + } > > +} > > + > > /* > > * This is the lockd kernel thread > > */ > > @@ -149,10 +159,7 @@ lockd(void *vrqstp) > > > > if (signalled()) { > > flush_signals(current); > > - if (nlmsvc_ops) { > > - nlmsvc_invalidate_all(); > > - set_grace_period(); > > - } > > + restart_grace(); > > continue; > > } > > > > > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-08 18:26 ` J. Bruce Fields @ 2009-05-11 6:13 ` Wang Chen 2009-05-11 20:57 ` J. Bruce Fields 0 siblings, 1 reply; 13+ messages in thread From: Wang Chen @ 2009-05-11 6:13 UTC (permalink / raw) To: J. Bruce Fields; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng J. Bruce Fields said the following on 2009-5-9 2:26: > On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: >> J. Bruce Fields said the following on 2009-5-7 4:32: >>> On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: >>>> J. Bruce Fields said the following on 2009-4-25 7:12: >>>>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: >>>>>> Although I can't reproduce it now, it really happened that some lock manager >>>>>> started grace period but didn't end it. >>>>>> This causes an lm entry be left in grace_list, and when service nfs restart, >>>>>> the same lm will be added again into the list. >>>>>> As you know, adding an entry, which is in the list, to a list will leads to >>>>>> list corruption. >>>>> I'd really like to understand why locks_end_grace() isn't being called. >>>>> I'm probably overlooking something obvious, but I just can't see how >>>>> lockd or nfsd can be shut down right now without locks_end_grace() being >>>>> called. >>>>> >>>> Me neither can figure out why locks_end_grace() isn't being called. >>>> >>>> But do locks_start_grace() twice can trigger this warning too. >>>> You can do >>>> 1. service nfs restart >>>> 2. (immediately) kill -s SIGKILL lockd >>>> this can trigger >>>> --- >>>> lockd(void *vrqstp) >>>> ... >>>> if (signalled()) { >>>> flush_signals(current); >>>> if (nlmsvc_ops) { >>>> nlmsvc_invalidate_all(); >>>> set_grace_period(); >>>> --- >>>> and makes locks_start_grace() be called twice without locks_end_grace(). >>> Ah-hah! >>> >>>> So I still suggest to do something to protect the lm list. :) >>> I wouldn't be opposed to a simple WARN_ON(!list_empty()) in >>> locks_start_grace(), but I'm mainly worried about fixing the original >>> bug. How about the following? >>> >> Yeah, the following fix is OK to me, although it only fixed >> "start_grace again after start_grace" case. > > OK, thanks. > >> The bug about "quit lockd without end_grace", which I encountered before >> incidentally, maybe is still there. > > You're talking about the report that started this thread?: > > http://marc.info/?l=linux-nfs&m=124054262421444&w=2 > Yes. I mean this. > It looks to me like that could be explained by two start_grace's in a > row. > But in that report, I didn't post the total message. Here are something show that: 1. not only lockd has the problem, but nfsd also. 2. every time I do "service nfs restart", I got the warning, so this is not "two start_grace's in a row" problem. Following is more message I got on last month. ------------------------------------------------------ Apr 16 16:35:41 localhost mountd[15061]: Caught signal 15, un-registering and exiting. Apr 16 16:35:42 localhost kernel: nfsd: last server has exited, flushing export cache Apr 16 16:35:43 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 16 16:35:43 localhost kernel: ------------[ cut here ]------------ Apr 16 16:35:43 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 16 16:35:43 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 16 16:35:43 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 16 16:35:43 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 16 16:35:43 localhost kernel: Pid: 17455, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 Apr 16 16:35:43 localhost kernel: Call Trace: Apr 16 16:35:43 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 16 16:35:43 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] Apr 16 16:35:43 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 16 16:35:43 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 Apr 16 16:35:43 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 16 16:35:43 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 16 16:35:43 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 16 16:35:43 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] Apr 16 16:35:43 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] Apr 16 16:35:43 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] Apr 16 16:35:43 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] Apr 16 16:35:43 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] Apr 16 16:35:43 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] Apr 16 16:35:43 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad Apr 16 16:35:43 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 Apr 16 16:35:43 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c Apr 16 16:35:43 localhost kernel: ---[ end trace fa484bd6d19ade87 ]--- Apr 16 16:35:43 localhost kernel: NFSD: starting 90-second grace period ...snip... Apr 17 13:02:54 localhost mountd[17468]: Caught signal 15, un-registering and exiting. Apr 17 13:02:54 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 13:02:55 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 13:02:55 localhost kernel: ------------[ cut here ]------------ Apr 17 13:02:55 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 13:02:55 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 13:02:55 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 13:02:55 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 13:02:55 localhost kernel: Pid: 22642, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 Apr 17 13:02:55 localhost kernel: Call Trace: Apr 17 13:02:55 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 13:02:55 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] Apr 17 13:02:55 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 13:02:55 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 Apr 17 13:02:55 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 13:02:55 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 13:02:55 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 13:02:55 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] Apr 17 13:02:55 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] Apr 17 13:02:55 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] Apr 17 13:02:55 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] Apr 17 13:02:55 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] Apr 17 13:02:55 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] Apr 17 13:02:55 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad Apr 17 13:02:55 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 Apr 17 13:02:55 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c Apr 17 13:02:55 localhost kernel: ---[ end trace fa484bd6d19ade88 ]--- Apr 17 13:02:55 localhost kernel: NFSD: starting 90-second grace period Apr 17 13:04:07 localhost mountd[22655]: Caught signal 15, un-registering and exiting. Apr 17 13:04:07 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 13:04:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 13:04:07 localhost kernel: NFSD: starting 90-second grace period Apr 17 13:05:04 localhost mountd[22760]: Caught signal 15, un-registering and exiting. Apr 17 13:05:04 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 13:05:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 13:05:05 localhost kernel: NFSD: starting 90-second grace period Apr 17 13:06:10 localhost mountd[22859]: Caught signal 15, un-registering and exiting. Apr 17 13:06:10 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 13:06:10 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 13:06:10 localhost kernel: NFSD: starting 90-second grace period Apr 17 13:08:07 localhost mountd[22960]: Caught signal 15, un-registering and exiting. Apr 17 13:08:07 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 13:08:07 localhost kernel: ------------[ cut here ]------------ Apr 17 13:08:07 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 13:08:07 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 13:08:07 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 13:08:07 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 13:08:07 localhost kernel: Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 13:08:07 localhost kernel: Call Trace: Apr 17 13:08:07 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 13:08:07 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 13:08:07 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 13:08:07 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 13:08:07 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 13:08:07 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 13:08:07 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 13:08:07 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 13:08:07 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 13:08:07 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 13:08:07 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 13:08:07 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 13:08:07 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 13:08:07 localhost kernel: ---[ end trace fa484bd6d19ade89 ]--- Apr 17 13:08:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 13:08:07 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:39:45 localhost mountd[23074]: Caught signal 15, un-registering and exiting. Apr 17 14:39:45 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:39:45 localhost kernel: ------------[ cut here ]------------ Apr 17 14:39:45 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 14:39:45 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 14:39:45 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 14:39:45 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 14:39:45 localhost kernel: Pid: 24287, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 14:39:45 localhost kernel: Call Trace: Apr 17 14:39:45 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 14:39:45 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 14:39:45 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 14:39:45 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 14:39:45 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 14:39:45 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 14:39:45 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 14:39:45 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 14:39:45 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 14:39:45 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 14:39:45 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 14:39:45 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 14:39:45 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 14:39:45 localhost kernel: ---[ end trace fa484bd6d19ade8a ]--- Apr 17 14:39:45 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:39:45 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:41:32 localhost mountd[24299]: Caught signal 15, un-registering and exiting. Apr 17 14:41:32 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:41:33 localhost kernel: ------------[ cut here ]------------ Apr 17 14:41:33 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 14:41:33 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 14:41:33 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 14:41:33 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 14:41:33 localhost kernel: Pid: 24399, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 14:41:33 localhost kernel: Call Trace: Apr 17 14:41:33 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 14:41:33 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 14:41:33 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 14:41:33 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 14:41:33 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 14:41:33 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 14:41:33 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 14:41:33 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 14:41:33 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 14:41:33 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 14:41:33 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 14:41:33 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 14:41:33 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 14:41:33 localhost kernel: ---[ end trace fa484bd6d19ade8b ]--- Apr 17 14:41:33 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:41:33 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:42:16 localhost mountd[24411]: Caught signal 15, un-registering and exiting. Apr 17 14:42:17 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:42:17 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:42:17 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:42:52 localhost mountd[24508]: Caught signal 15, un-registering and exiting. Apr 17 14:42:52 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:42:53 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:42:53 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:43:28 localhost mountd[24602]: Caught signal 15, un-registering and exiting. Apr 17 14:43:28 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:43:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:43:29 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:43:59 localhost mountd[24697]: Caught signal 15, un-registering and exiting. Apr 17 14:43:59 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:44:00 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:44:00 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:44:28 localhost mountd[24791]: Caught signal 15, un-registering and exiting. Apr 17 14:44:28 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:44:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:44:29 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:45:33 localhost mountd[24885]: Caught signal 15, un-registering and exiting. Apr 17 14:45:33 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:45:34 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:45:34 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:46:05 localhost mountd[24988]: Caught signal 15, un-registering and exiting. Apr 17 14:46:05 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:46:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:46:05 localhost kernel: NFSD: starting 90-second grace period Apr 17 14:46:34 localhost mountd[25082]: Caught signal 15, un-registering and exiting. Apr 17 14:46:34 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 14:46:35 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 14:46:35 localhost kernel: NFSD: starting 90-second grace period Apr 17 15:35:01 localhost mountd[25176]: Caught signal 15, un-registering and exiting. Apr 17 15:35:02 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 15:35:02 localhost kernel: ------------[ cut here ]------------ Apr 17 15:35:02 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 15:35:02 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 15:35:02 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 15:35:02 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 15:35:02 localhost kernel: Pid: 25883, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 15:35:02 localhost kernel: Call Trace: Apr 17 15:35:02 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 15:35:02 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 15:35:02 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 15:35:02 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 15:35:02 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 15:35:02 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 15:35:02 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 15:35:02 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 15:35:02 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 15:35:02 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 15:35:02 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 15:35:02 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 15:35:02 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 15:35:02 localhost kernel: ---[ end trace fa484bd6d19ade8c ]--- Apr 17 15:35:02 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 15:35:02 localhost kernel: NFSD: starting 90-second grace period Apr 17 15:55:22 localhost mountd[25895]: Caught signal 15, un-registering and exiting. Apr 17 15:55:22 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 15:55:23 localhost kernel: ------------[ cut here ]------------ Apr 17 15:55:23 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 15:55:23 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 15:55:23 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 15:55:23 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 15:55:23 localhost kernel: Pid: 26230, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 15:55:23 localhost kernel: Call Trace: Apr 17 15:55:23 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 15:55:23 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 15:55:23 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 15:55:23 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 15:55:23 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 15:55:23 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 15:55:23 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 15:55:23 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 15:55:23 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 15:55:23 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 15:55:23 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 15:55:23 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 15:55:23 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 15:55:23 localhost kernel: ---[ end trace fa484bd6d19ade8d ]--- Apr 17 15:55:23 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 15:55:23 localhost kernel: NFSD: starting 90-second grace period Apr 17 16:54:27 localhost mountd[26242]: Caught signal 15, un-registering and exiting. Apr 17 16:54:27 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 16:54:28 localhost kernel: ------------[ cut here ]------------ Apr 17 16:54:28 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 16:54:28 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 16:54:28 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 16:54:28 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 16:54:28 localhost kernel: Pid: 27044, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 16:54:28 localhost kernel: Call Trace: Apr 17 16:54:28 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 16:54:28 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 16:54:28 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 16:54:28 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 16:54:28 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 16:54:28 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 16:54:28 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 16:54:28 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 16:54:28 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 16:54:28 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 16:54:28 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 16:54:28 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 16:54:28 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 16:54:28 localhost kernel: ---[ end trace fa484bd6d19ade8e ]--- Apr 17 16:54:28 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 16:54:28 localhost kernel: NFSD: starting 90-second grace period Apr 17 16:59:55 localhost mountd[27056]: Caught signal 15, un-registering and exiting. Apr 17 16:59:55 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 16:59:56 localhost kernel: ------------[ cut here ]------------ Apr 17 16:59:56 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 16:59:56 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 16:59:56 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 16:59:56 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 16:59:56 localhost kernel: Pid: 27197, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 16:59:56 localhost kernel: Call Trace: Apr 17 16:59:56 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 16:59:56 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 16:59:56 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 16:59:56 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 16:59:56 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 16:59:56 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 16:59:56 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 16:59:56 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 16:59:56 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 16:59:56 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 16:59:56 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 16:59:56 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 16:59:56 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 16:59:56 localhost kernel: ---[ end trace fa484bd6d19ade8f ]--- Apr 17 16:59:56 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 16:59:56 localhost kernel: NFSD: starting 90-second grace period Apr 17 17:02:50 localhost mountd[27209]: Caught signal 15, un-registering and exiting. Apr 17 17:02:50 localhost kernel: nfsd: last server has exited, flushing export cache Apr 17 17:02:51 localhost kernel: ------------[ cut here ]------------ Apr 17 17:02:51 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() Apr 17 17:02:51 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) Apr 17 17:02:51 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). Apr 17 17:02:51 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] Apr 17 17:02:51 localhost kernel: Pid: 27349, comm: lockd Tainted: G W 2.6.30-rc2 #3 Apr 17 17:02:51 localhost kernel: Call Trace: Apr 17 17:02:51 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 Apr 17 17:02:51 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 17:02:51 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd Apr 17 17:02:51 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa Apr 17 17:02:51 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c Apr 17 17:02:51 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] Apr 17 17:02:51 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] Apr 17 17:02:51 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 Apr 17 17:02:51 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 Apr 17 17:02:51 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] Apr 17 17:02:51 localhost kernel: [<c043dd42>] kthread+0x45/0x6b Apr 17 17:02:51 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b Apr 17 17:02:51 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 Apr 17 17:02:51 localhost kernel: ---[ end trace fa484bd6d19ade90 ]--- Apr 17 17:02:51 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory Apr 17 17:02:51 localhost kernel: NFSD: starting 90-second grace period Apr 17 17:08:09 localhost mountd[27361]: authenticated mount request from 10.167.141.101:695 for /tmp/nfs3 (/tmp/nfs3) > --b. > >>> --b. >>> >>> diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c >>> index abf8388..1a54ae1 100644 >>> --- a/fs/lockd/svc.c >>> +++ b/fs/lockd/svc.c >>> @@ -104,6 +104,16 @@ static void set_grace_period(void) >>> schedule_delayed_work(&grace_period_end, grace_period); >>> } >>> >>> +static void restart_grace(void) >>> +{ >>> + if (nlmsvc_ops) { >>> + cancel_delayed_work_sync(&grace_period_end); >>> + locks_end_grace(&lockd_manager); >>> + nlmsvc_invalidate_all(); >>> + set_grace_period(); >>> + } >>> +} >>> + >>> /* >>> * This is the lockd kernel thread >>> */ >>> @@ -149,10 +159,7 @@ lockd(void *vrqstp) >>> >>> if (signalled()) { >>> flush_signals(current); >>> - if (nlmsvc_ops) { >>> - nlmsvc_invalidate_all(); >>> - set_grace_period(); >>> - } >>> + restart_grace(); >>> continue; >>> } >>> >>> ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-11 6:13 ` Wang Chen @ 2009-05-11 20:57 ` J. Bruce Fields 2009-05-12 0:43 ` Wang Chen 0 siblings, 1 reply; 13+ messages in thread From: J. Bruce Fields @ 2009-05-11 20:57 UTC (permalink / raw) To: Wang Chen; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng On Mon, May 11, 2009 at 02:13:50PM +0800, Wang Chen wrote: > J. Bruce Fields said the following on 2009-5-9 2:26: > > On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: > >> J. Bruce Fields said the following on 2009-5-7 4:32: > >>> On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: > >>>> J. Bruce Fields said the following on 2009-4-25 7:12: > >>>>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: > >>>>>> Although I can't reproduce it now, it really happened that some lock manager > >>>>>> started grace period but didn't end it. > >>>>>> This causes an lm entry be left in grace_list, and when service nfs restart, > >>>>>> the same lm will be added again into the list. > >>>>>> As you know, adding an entry, which is in the list, to a list will leads to > >>>>>> list corruption. > >>>>> I'd really like to understand why locks_end_grace() isn't being called. > >>>>> I'm probably overlooking something obvious, but I just can't see how > >>>>> lockd or nfsd can be shut down right now without locks_end_grace() being > >>>>> called. > >>>>> > >>>> Me neither can figure out why locks_end_grace() isn't being called. > >>>> > >>>> But do locks_start_grace() twice can trigger this warning too. > >>>> You can do > >>>> 1. service nfs restart > >>>> 2. (immediately) kill -s SIGKILL lockd > >>>> this can trigger > >>>> --- > >>>> lockd(void *vrqstp) > >>>> ... > >>>> if (signalled()) { > >>>> flush_signals(current); > >>>> if (nlmsvc_ops) { > >>>> nlmsvc_invalidate_all(); > >>>> set_grace_period(); > >>>> --- > >>>> and makes locks_start_grace() be called twice without locks_end_grace(). > >>> Ah-hah! > >>> > >>>> So I still suggest to do something to protect the lm list. :) > >>> I wouldn't be opposed to a simple WARN_ON(!list_empty()) in > >>> locks_start_grace(), but I'm mainly worried about fixing the original > >>> bug. How about the following? > >>> > >> Yeah, the following fix is OK to me, although it only fixed > >> "start_grace again after start_grace" case. > > > > OK, thanks. > > > >> The bug about "quit lockd without end_grace", which I encountered before > >> incidentally, maybe is still there. > > > > You're talking about the report that started this thread?: > > > > http://marc.info/?l=linux-nfs&m=124054262421444&w=2 > > > > Yes. I mean this. > > > It looks to me like that could be explained by two start_grace's in a > > row. > > > > But in that report, I didn't post the total message. > Here are something show that: > 1. not only lockd has the problem, but nfsd also. > 2. every time I do "service nfs restart", I got the warning, so this is not > "two start_grace's in a row" problem. Once the list is corrupted, it stays corrupted, so that's expected; the only interesting warning is the first one. --b. > Following is more message I got on last month. > ------------------------------------------------------ > Apr 16 16:35:41 localhost mountd[15061]: Caught signal 15, un-registering and exiting. > Apr 16 16:35:42 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 16 16:35:43 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 16 16:35:43 localhost kernel: ------------[ cut here ]------------ > Apr 16 16:35:43 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 16 16:35:43 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 16 16:35:43 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 16 16:35:43 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 16 16:35:43 localhost kernel: Pid: 17455, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 > Apr 16 16:35:43 localhost kernel: Call Trace: > Apr 16 16:35:43 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 16 16:35:43 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] > Apr 16 16:35:43 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 16 16:35:43 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 > Apr 16 16:35:43 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 > Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 16 16:35:43 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 16 16:35:43 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 16 16:35:43 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] > Apr 16 16:35:43 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] > Apr 16 16:35:43 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] > Apr 16 16:35:43 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] > Apr 16 16:35:43 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] > Apr 16 16:35:43 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] > Apr 16 16:35:43 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad > Apr 16 16:35:43 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 > Apr 16 16:35:43 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c > Apr 16 16:35:43 localhost kernel: ---[ end trace fa484bd6d19ade87 ]--- > Apr 16 16:35:43 localhost kernel: NFSD: starting 90-second grace period > ...snip... > Apr 17 13:02:54 localhost mountd[17468]: Caught signal 15, un-registering and exiting. > Apr 17 13:02:54 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 13:02:55 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 13:02:55 localhost kernel: ------------[ cut here ]------------ > Apr 17 13:02:55 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 13:02:55 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 13:02:55 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 13:02:55 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 13:02:55 localhost kernel: Pid: 22642, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 > Apr 17 13:02:55 localhost kernel: Call Trace: > Apr 17 13:02:55 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 13:02:55 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] > Apr 17 13:02:55 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 13:02:55 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 > Apr 17 13:02:55 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 > Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 13:02:55 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 13:02:55 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 13:02:55 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] > Apr 17 13:02:55 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] > Apr 17 13:02:55 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] > Apr 17 13:02:55 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] > Apr 17 13:02:55 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] > Apr 17 13:02:55 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] > Apr 17 13:02:55 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad > Apr 17 13:02:55 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 > Apr 17 13:02:55 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c > Apr 17 13:02:55 localhost kernel: ---[ end trace fa484bd6d19ade88 ]--- > Apr 17 13:02:55 localhost kernel: NFSD: starting 90-second grace period > Apr 17 13:04:07 localhost mountd[22655]: Caught signal 15, un-registering and exiting. > Apr 17 13:04:07 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 13:04:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 13:04:07 localhost kernel: NFSD: starting 90-second grace period > Apr 17 13:05:04 localhost mountd[22760]: Caught signal 15, un-registering and exiting. > Apr 17 13:05:04 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 13:05:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 13:05:05 localhost kernel: NFSD: starting 90-second grace period > Apr 17 13:06:10 localhost mountd[22859]: Caught signal 15, un-registering and exiting. > Apr 17 13:06:10 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 13:06:10 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 13:06:10 localhost kernel: NFSD: starting 90-second grace period > Apr 17 13:08:07 localhost mountd[22960]: Caught signal 15, un-registering and exiting. > Apr 17 13:08:07 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 13:08:07 localhost kernel: ------------[ cut here ]------------ > Apr 17 13:08:07 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 13:08:07 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 13:08:07 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 13:08:07 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 13:08:07 localhost kernel: Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 13:08:07 localhost kernel: Call Trace: > Apr 17 13:08:07 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 13:08:07 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 13:08:07 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 13:08:07 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 13:08:07 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 13:08:07 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 13:08:07 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 13:08:07 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 13:08:07 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 13:08:07 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 13:08:07 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 13:08:07 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 13:08:07 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 13:08:07 localhost kernel: ---[ end trace fa484bd6d19ade89 ]--- > Apr 17 13:08:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 13:08:07 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:39:45 localhost mountd[23074]: Caught signal 15, un-registering and exiting. > Apr 17 14:39:45 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:39:45 localhost kernel: ------------[ cut here ]------------ > Apr 17 14:39:45 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 14:39:45 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 14:39:45 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 14:39:45 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 14:39:45 localhost kernel: Pid: 24287, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 14:39:45 localhost kernel: Call Trace: > Apr 17 14:39:45 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 14:39:45 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 14:39:45 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 14:39:45 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 14:39:45 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 14:39:45 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 14:39:45 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 14:39:45 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 14:39:45 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 14:39:45 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 14:39:45 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 14:39:45 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 14:39:45 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 14:39:45 localhost kernel: ---[ end trace fa484bd6d19ade8a ]--- > Apr 17 14:39:45 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:39:45 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:41:32 localhost mountd[24299]: Caught signal 15, un-registering and exiting. > Apr 17 14:41:32 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:41:33 localhost kernel: ------------[ cut here ]------------ > Apr 17 14:41:33 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 14:41:33 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 14:41:33 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 14:41:33 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 14:41:33 localhost kernel: Pid: 24399, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 14:41:33 localhost kernel: Call Trace: > Apr 17 14:41:33 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 14:41:33 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 14:41:33 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 14:41:33 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 14:41:33 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 14:41:33 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 14:41:33 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 14:41:33 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 14:41:33 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 14:41:33 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 14:41:33 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 14:41:33 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 14:41:33 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 14:41:33 localhost kernel: ---[ end trace fa484bd6d19ade8b ]--- > Apr 17 14:41:33 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:41:33 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:42:16 localhost mountd[24411]: Caught signal 15, un-registering and exiting. > Apr 17 14:42:17 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:42:17 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:42:17 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:42:52 localhost mountd[24508]: Caught signal 15, un-registering and exiting. > Apr 17 14:42:52 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:42:53 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:42:53 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:43:28 localhost mountd[24602]: Caught signal 15, un-registering and exiting. > Apr 17 14:43:28 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:43:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:43:29 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:43:59 localhost mountd[24697]: Caught signal 15, un-registering and exiting. > Apr 17 14:43:59 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:44:00 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:44:00 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:44:28 localhost mountd[24791]: Caught signal 15, un-registering and exiting. > Apr 17 14:44:28 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:44:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:44:29 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:45:33 localhost mountd[24885]: Caught signal 15, un-registering and exiting. > Apr 17 14:45:33 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:45:34 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:45:34 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:46:05 localhost mountd[24988]: Caught signal 15, un-registering and exiting. > Apr 17 14:46:05 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:46:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:46:05 localhost kernel: NFSD: starting 90-second grace period > Apr 17 14:46:34 localhost mountd[25082]: Caught signal 15, un-registering and exiting. > Apr 17 14:46:34 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 14:46:35 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 14:46:35 localhost kernel: NFSD: starting 90-second grace period > Apr 17 15:35:01 localhost mountd[25176]: Caught signal 15, un-registering and exiting. > Apr 17 15:35:02 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 15:35:02 localhost kernel: ------------[ cut here ]------------ > Apr 17 15:35:02 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 15:35:02 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 15:35:02 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 15:35:02 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 15:35:02 localhost kernel: Pid: 25883, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 15:35:02 localhost kernel: Call Trace: > Apr 17 15:35:02 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 15:35:02 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 15:35:02 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 15:35:02 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 15:35:02 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 15:35:02 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 15:35:02 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 15:35:02 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 15:35:02 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 15:35:02 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 15:35:02 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 15:35:02 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 15:35:02 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 15:35:02 localhost kernel: ---[ end trace fa484bd6d19ade8c ]--- > Apr 17 15:35:02 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 15:35:02 localhost kernel: NFSD: starting 90-second grace period > Apr 17 15:55:22 localhost mountd[25895]: Caught signal 15, un-registering and exiting. > Apr 17 15:55:22 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 15:55:23 localhost kernel: ------------[ cut here ]------------ > Apr 17 15:55:23 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 15:55:23 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 15:55:23 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 15:55:23 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 15:55:23 localhost kernel: Pid: 26230, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 15:55:23 localhost kernel: Call Trace: > Apr 17 15:55:23 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 15:55:23 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 15:55:23 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 15:55:23 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 15:55:23 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 15:55:23 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 15:55:23 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 15:55:23 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 15:55:23 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 15:55:23 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 15:55:23 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 15:55:23 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 15:55:23 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 15:55:23 localhost kernel: ---[ end trace fa484bd6d19ade8d ]--- > Apr 17 15:55:23 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 15:55:23 localhost kernel: NFSD: starting 90-second grace period > Apr 17 16:54:27 localhost mountd[26242]: Caught signal 15, un-registering and exiting. > Apr 17 16:54:27 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 16:54:28 localhost kernel: ------------[ cut here ]------------ > Apr 17 16:54:28 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 16:54:28 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 16:54:28 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 16:54:28 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 16:54:28 localhost kernel: Pid: 27044, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 16:54:28 localhost kernel: Call Trace: > Apr 17 16:54:28 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 16:54:28 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 16:54:28 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 16:54:28 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 16:54:28 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 16:54:28 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 16:54:28 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 16:54:28 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 16:54:28 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 16:54:28 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 16:54:28 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 16:54:28 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 16:54:28 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 16:54:28 localhost kernel: ---[ end trace fa484bd6d19ade8e ]--- > Apr 17 16:54:28 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 16:54:28 localhost kernel: NFSD: starting 90-second grace period > Apr 17 16:59:55 localhost mountd[27056]: Caught signal 15, un-registering and exiting. > Apr 17 16:59:55 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 16:59:56 localhost kernel: ------------[ cut here ]------------ > Apr 17 16:59:56 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 16:59:56 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 16:59:56 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 16:59:56 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 16:59:56 localhost kernel: Pid: 27197, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 16:59:56 localhost kernel: Call Trace: > Apr 17 16:59:56 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 16:59:56 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 16:59:56 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 16:59:56 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 16:59:56 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 16:59:56 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 16:59:56 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 16:59:56 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 16:59:56 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 16:59:56 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 16:59:56 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 16:59:56 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 16:59:56 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 16:59:56 localhost kernel: ---[ end trace fa484bd6d19ade8f ]--- > Apr 17 16:59:56 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 16:59:56 localhost kernel: NFSD: starting 90-second grace period > Apr 17 17:02:50 localhost mountd[27209]: Caught signal 15, un-registering and exiting. > Apr 17 17:02:50 localhost kernel: nfsd: last server has exited, flushing export cache > Apr 17 17:02:51 localhost kernel: ------------[ cut here ]------------ > Apr 17 17:02:51 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > Apr 17 17:02:51 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > Apr 17 17:02:51 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > Apr 17 17:02:51 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > Apr 17 17:02:51 localhost kernel: Pid: 27349, comm: lockd Tainted: G W 2.6.30-rc2 #3 > Apr 17 17:02:51 localhost kernel: Call Trace: > Apr 17 17:02:51 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > Apr 17 17:02:51 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 17:02:51 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > Apr 17 17:02:51 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > Apr 17 17:02:51 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > Apr 17 17:02:51 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > Apr 17 17:02:51 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > Apr 17 17:02:51 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > Apr 17 17:02:51 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > Apr 17 17:02:51 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > Apr 17 17:02:51 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > Apr 17 17:02:51 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > Apr 17 17:02:51 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > Apr 17 17:02:51 localhost kernel: ---[ end trace fa484bd6d19ade90 ]--- > Apr 17 17:02:51 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > Apr 17 17:02:51 localhost kernel: NFSD: starting 90-second grace period > Apr 17 17:08:09 localhost mountd[27361]: authenticated mount request from 10.167.141.101:695 for /tmp/nfs3 (/tmp/nfs3) > > > --b. > > > >>> --b. > >>> > >>> diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c > >>> index abf8388..1a54ae1 100644 > >>> --- a/fs/lockd/svc.c > >>> +++ b/fs/lockd/svc.c > >>> @@ -104,6 +104,16 @@ static void set_grace_period(void) > >>> schedule_delayed_work(&grace_period_end, grace_period); > >>> } > >>> > >>> +static void restart_grace(void) > >>> +{ > >>> + if (nlmsvc_ops) { > >>> + cancel_delayed_work_sync(&grace_period_end); > >>> + locks_end_grace(&lockd_manager); > >>> + nlmsvc_invalidate_all(); > >>> + set_grace_period(); > >>> + } > >>> +} > >>> + > >>> /* > >>> * This is the lockd kernel thread > >>> */ > >>> @@ -149,10 +159,7 @@ lockd(void *vrqstp) > >>> > >>> if (signalled()) { > >>> flush_signals(current); > >>> - if (nlmsvc_ops) { > >>> - nlmsvc_invalidate_all(); > >>> - set_grace_period(); > >>> - } > >>> + restart_grace(); > >>> continue; > >>> } > >>> > >>> > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-11 20:57 ` J. Bruce Fields @ 2009-05-12 0:43 ` Wang Chen 2009-05-12 19:13 ` J. Bruce Fields 0 siblings, 1 reply; 13+ messages in thread From: Wang Chen @ 2009-05-12 0:43 UTC (permalink / raw) To: J. Bruce Fields; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng J. Bruce Fields said the following on 2009-5-12 4:57: > On Mon, May 11, 2009 at 02:13:50PM +0800, Wang Chen wrote: >> J. Bruce Fields said the following on 2009-5-9 2:26: >>> On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: >>>> J. Bruce Fields said the following on 2009-5-7 4:32: >>>>> On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: >>>>>> J. Bruce Fields said the following on 2009-4-25 7:12: >>>>>>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: >>>>>>>> Although I can't reproduce it now, it really happened that some lock manager >>>>>>>> started grace period but didn't end it. >>>>>>>> This causes an lm entry be left in grace_list, and when service nfs restart, >>>>>>>> the same lm will be added again into the list. >>>>>>>> As you know, adding an entry, which is in the list, to a list will leads to >>>>>>>> list corruption. >>>>>>> I'd really like to understand why locks_end_grace() isn't being called. >>>>>>> I'm probably overlooking something obvious, but I just can't see how >>>>>>> lockd or nfsd can be shut down right now without locks_end_grace() being >>>>>>> called. >>>>>>> >>>>>> Me neither can figure out why locks_end_grace() isn't being called. >>>>>> >>>>>> But do locks_start_grace() twice can trigger this warning too. >>>>>> You can do >>>>>> 1. service nfs restart >>>>>> 2. (immediately) kill -s SIGKILL lockd >>>>>> this can trigger >>>>>> --- >>>>>> lockd(void *vrqstp) >>>>>> ... >>>>>> if (signalled()) { >>>>>> flush_signals(current); >>>>>> if (nlmsvc_ops) { >>>>>> nlmsvc_invalidate_all(); >>>>>> set_grace_period(); >>>>>> --- >>>>>> and makes locks_start_grace() be called twice without locks_end_grace(). >>>>> Ah-hah! >>>>> >>>>>> So I still suggest to do something to protect the lm list. :) >>>>> I wouldn't be opposed to a simple WARN_ON(!list_empty()) in >>>>> locks_start_grace(), but I'm mainly worried about fixing the original >>>>> bug. How about the following? >>>>> >>>> Yeah, the following fix is OK to me, although it only fixed >>>> "start_grace again after start_grace" case. >>> OK, thanks. >>> >>>> The bug about "quit lockd without end_grace", which I encountered before >>>> incidentally, maybe is still there. >>> You're talking about the report that started this thread?: >>> >>> http://marc.info/?l=linux-nfs&m=124054262421444&w=2 >>> >> Yes. I mean this. >> >>> It looks to me like that could be explained by two start_grace's in a >>> row. >>> >> But in that report, I didn't post the total message. >> Here are something show that: >> 1. not only lockd has the problem, but nfsd also. >> 2. every time I do "service nfs restart", I got the warning, so this is not >> "two start_grace's in a row" problem. > > Once the list is corrupted, it stays corrupted, so that's expected; the > only interesting warning is the first one. > But as you see the logs, nfsd made list corrupted first. Your fix only "two start_grace's in a row" of lockd . > --b. > >> Following is more message I got on last month. >> ------------------------------------------------------ >> Apr 16 16:35:41 localhost mountd[15061]: Caught signal 15, un-registering and exiting. >> Apr 16 16:35:42 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 16 16:35:43 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 16 16:35:43 localhost kernel: ------------[ cut here ]------------ >> Apr 16 16:35:43 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 16 16:35:43 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 16 16:35:43 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 16 16:35:43 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 16 16:35:43 localhost kernel: Pid: 17455, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 >> Apr 16 16:35:43 localhost kernel: Call Trace: >> Apr 16 16:35:43 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 16 16:35:43 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] >> Apr 16 16:35:43 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 16 16:35:43 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 >> Apr 16 16:35:43 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 >> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 16 16:35:43 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 16 16:35:43 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 16 16:35:43 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] >> Apr 16 16:35:43 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] >> Apr 16 16:35:43 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] >> Apr 16 16:35:43 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] >> Apr 16 16:35:43 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] >> Apr 16 16:35:43 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] >> Apr 16 16:35:43 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad >> Apr 16 16:35:43 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 >> Apr 16 16:35:43 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c >> Apr 16 16:35:43 localhost kernel: ---[ end trace fa484bd6d19ade87 ]--- >> Apr 16 16:35:43 localhost kernel: NFSD: starting 90-second grace period >> ...snip... >> Apr 17 13:02:54 localhost mountd[17468]: Caught signal 15, un-registering and exiting. >> Apr 17 13:02:54 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 13:02:55 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 13:02:55 localhost kernel: ------------[ cut here ]------------ >> Apr 17 13:02:55 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 13:02:55 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 13:02:55 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 13:02:55 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 13:02:55 localhost kernel: Pid: 22642, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 13:02:55 localhost kernel: Call Trace: >> Apr 17 13:02:55 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 13:02:55 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] >> Apr 17 13:02:55 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 13:02:55 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 >> Apr 17 13:02:55 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 >> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 13:02:55 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 13:02:55 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 13:02:55 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] >> Apr 17 13:02:55 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] >> Apr 17 13:02:55 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] >> Apr 17 13:02:55 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] >> Apr 17 13:02:55 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] >> Apr 17 13:02:55 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] >> Apr 17 13:02:55 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad >> Apr 17 13:02:55 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 >> Apr 17 13:02:55 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c >> Apr 17 13:02:55 localhost kernel: ---[ end trace fa484bd6d19ade88 ]--- >> Apr 17 13:02:55 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 13:04:07 localhost mountd[22655]: Caught signal 15, un-registering and exiting. >> Apr 17 13:04:07 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 13:04:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 13:04:07 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 13:05:04 localhost mountd[22760]: Caught signal 15, un-registering and exiting. >> Apr 17 13:05:04 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 13:05:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 13:05:05 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 13:06:10 localhost mountd[22859]: Caught signal 15, un-registering and exiting. >> Apr 17 13:06:10 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 13:06:10 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 13:06:10 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 13:08:07 localhost mountd[22960]: Caught signal 15, un-registering and exiting. >> Apr 17 13:08:07 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 13:08:07 localhost kernel: ------------[ cut here ]------------ >> Apr 17 13:08:07 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 13:08:07 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 13:08:07 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 13:08:07 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 13:08:07 localhost kernel: Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 13:08:07 localhost kernel: Call Trace: >> Apr 17 13:08:07 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 13:08:07 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 13:08:07 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 13:08:07 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 13:08:07 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 13:08:07 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 13:08:07 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 13:08:07 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 13:08:07 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 13:08:07 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 13:08:07 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 13:08:07 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 13:08:07 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 13:08:07 localhost kernel: ---[ end trace fa484bd6d19ade89 ]--- >> Apr 17 13:08:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 13:08:07 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:39:45 localhost mountd[23074]: Caught signal 15, un-registering and exiting. >> Apr 17 14:39:45 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:39:45 localhost kernel: ------------[ cut here ]------------ >> Apr 17 14:39:45 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 14:39:45 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 14:39:45 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 14:39:45 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 14:39:45 localhost kernel: Pid: 24287, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 14:39:45 localhost kernel: Call Trace: >> Apr 17 14:39:45 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 14:39:45 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 14:39:45 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 14:39:45 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 14:39:45 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 14:39:45 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 14:39:45 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 14:39:45 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 14:39:45 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 14:39:45 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 14:39:45 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 14:39:45 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 14:39:45 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 14:39:45 localhost kernel: ---[ end trace fa484bd6d19ade8a ]--- >> Apr 17 14:39:45 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:39:45 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:41:32 localhost mountd[24299]: Caught signal 15, un-registering and exiting. >> Apr 17 14:41:32 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:41:33 localhost kernel: ------------[ cut here ]------------ >> Apr 17 14:41:33 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 14:41:33 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 14:41:33 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 14:41:33 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 14:41:33 localhost kernel: Pid: 24399, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 14:41:33 localhost kernel: Call Trace: >> Apr 17 14:41:33 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 14:41:33 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 14:41:33 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 14:41:33 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 14:41:33 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 14:41:33 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 14:41:33 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 14:41:33 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 14:41:33 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 14:41:33 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 14:41:33 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 14:41:33 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 14:41:33 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 14:41:33 localhost kernel: ---[ end trace fa484bd6d19ade8b ]--- >> Apr 17 14:41:33 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:41:33 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:42:16 localhost mountd[24411]: Caught signal 15, un-registering and exiting. >> Apr 17 14:42:17 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:42:17 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:42:17 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:42:52 localhost mountd[24508]: Caught signal 15, un-registering and exiting. >> Apr 17 14:42:52 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:42:53 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:42:53 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:43:28 localhost mountd[24602]: Caught signal 15, un-registering and exiting. >> Apr 17 14:43:28 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:43:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:43:29 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:43:59 localhost mountd[24697]: Caught signal 15, un-registering and exiting. >> Apr 17 14:43:59 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:44:00 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:44:00 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:44:28 localhost mountd[24791]: Caught signal 15, un-registering and exiting. >> Apr 17 14:44:28 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:44:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:44:29 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:45:33 localhost mountd[24885]: Caught signal 15, un-registering and exiting. >> Apr 17 14:45:33 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:45:34 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:45:34 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:46:05 localhost mountd[24988]: Caught signal 15, un-registering and exiting. >> Apr 17 14:46:05 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:46:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:46:05 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 14:46:34 localhost mountd[25082]: Caught signal 15, un-registering and exiting. >> Apr 17 14:46:34 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 14:46:35 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 14:46:35 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 15:35:01 localhost mountd[25176]: Caught signal 15, un-registering and exiting. >> Apr 17 15:35:02 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 15:35:02 localhost kernel: ------------[ cut here ]------------ >> Apr 17 15:35:02 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 15:35:02 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 15:35:02 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 15:35:02 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 15:35:02 localhost kernel: Pid: 25883, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 15:35:02 localhost kernel: Call Trace: >> Apr 17 15:35:02 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 15:35:02 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 15:35:02 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 15:35:02 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 15:35:02 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 15:35:02 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 15:35:02 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 15:35:02 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 15:35:02 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 15:35:02 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 15:35:02 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 15:35:02 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 15:35:02 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 15:35:02 localhost kernel: ---[ end trace fa484bd6d19ade8c ]--- >> Apr 17 15:35:02 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 15:35:02 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 15:55:22 localhost mountd[25895]: Caught signal 15, un-registering and exiting. >> Apr 17 15:55:22 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 15:55:23 localhost kernel: ------------[ cut here ]------------ >> Apr 17 15:55:23 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 15:55:23 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 15:55:23 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 15:55:23 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 15:55:23 localhost kernel: Pid: 26230, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 15:55:23 localhost kernel: Call Trace: >> Apr 17 15:55:23 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 15:55:23 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 15:55:23 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 15:55:23 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 15:55:23 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 15:55:23 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 15:55:23 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 15:55:23 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 15:55:23 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 15:55:23 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 15:55:23 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 15:55:23 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 15:55:23 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 15:55:23 localhost kernel: ---[ end trace fa484bd6d19ade8d ]--- >> Apr 17 15:55:23 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 15:55:23 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 16:54:27 localhost mountd[26242]: Caught signal 15, un-registering and exiting. >> Apr 17 16:54:27 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 16:54:28 localhost kernel: ------------[ cut here ]------------ >> Apr 17 16:54:28 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 16:54:28 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 16:54:28 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 16:54:28 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 16:54:28 localhost kernel: Pid: 27044, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 16:54:28 localhost kernel: Call Trace: >> Apr 17 16:54:28 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 16:54:28 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 16:54:28 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 16:54:28 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 16:54:28 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 16:54:28 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 16:54:28 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 16:54:28 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 16:54:28 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 16:54:28 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 16:54:28 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 16:54:28 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 16:54:28 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 16:54:28 localhost kernel: ---[ end trace fa484bd6d19ade8e ]--- >> Apr 17 16:54:28 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 16:54:28 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 16:59:55 localhost mountd[27056]: Caught signal 15, un-registering and exiting. >> Apr 17 16:59:55 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 16:59:56 localhost kernel: ------------[ cut here ]------------ >> Apr 17 16:59:56 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 16:59:56 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 16:59:56 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 16:59:56 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 16:59:56 localhost kernel: Pid: 27197, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 16:59:56 localhost kernel: Call Trace: >> Apr 17 16:59:56 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 16:59:56 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 16:59:56 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 16:59:56 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 16:59:56 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 16:59:56 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 16:59:56 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 16:59:56 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 16:59:56 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 16:59:56 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 16:59:56 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 16:59:56 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 16:59:56 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 16:59:56 localhost kernel: ---[ end trace fa484bd6d19ade8f ]--- >> Apr 17 16:59:56 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 16:59:56 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 17:02:50 localhost mountd[27209]: Caught signal 15, un-registering and exiting. >> Apr 17 17:02:50 localhost kernel: nfsd: last server has exited, flushing export cache >> Apr 17 17:02:51 localhost kernel: ------------[ cut here ]------------ >> Apr 17 17:02:51 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >> Apr 17 17:02:51 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >> Apr 17 17:02:51 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >> Apr 17 17:02:51 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >> Apr 17 17:02:51 localhost kernel: Pid: 27349, comm: lockd Tainted: G W 2.6.30-rc2 #3 >> Apr 17 17:02:51 localhost kernel: Call Trace: >> Apr 17 17:02:51 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >> Apr 17 17:02:51 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 17:02:51 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >> Apr 17 17:02:51 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >> Apr 17 17:02:51 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >> Apr 17 17:02:51 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >> Apr 17 17:02:51 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >> Apr 17 17:02:51 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >> Apr 17 17:02:51 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >> Apr 17 17:02:51 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >> Apr 17 17:02:51 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >> Apr 17 17:02:51 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >> Apr 17 17:02:51 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >> Apr 17 17:02:51 localhost kernel: ---[ end trace fa484bd6d19ade90 ]--- >> Apr 17 17:02:51 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >> Apr 17 17:02:51 localhost kernel: NFSD: starting 90-second grace period >> Apr 17 17:08:09 localhost mountd[27361]: authenticated mount request from 10.167.141.101:695 for /tmp/nfs3 (/tmp/nfs3) >> >>> --b. >>> >>>>> --b. >>>>> >>>>> diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c >>>>> index abf8388..1a54ae1 100644 >>>>> --- a/fs/lockd/svc.c >>>>> +++ b/fs/lockd/svc.c >>>>> @@ -104,6 +104,16 @@ static void set_grace_period(void) >>>>> schedule_delayed_work(&grace_period_end, grace_period); >>>>> } >>>>> >>>>> +static void restart_grace(void) >>>>> +{ >>>>> + if (nlmsvc_ops) { >>>>> + cancel_delayed_work_sync(&grace_period_end); >>>>> + locks_end_grace(&lockd_manager); >>>>> + nlmsvc_invalidate_all(); >>>>> + set_grace_period(); >>>>> + } >>>>> +} >>>>> + >>>>> /* >>>>> * This is the lockd kernel thread >>>>> */ >>>>> @@ -149,10 +159,7 @@ lockd(void *vrqstp) >>>>> >>>>> if (signalled()) { >>>>> flush_signals(current); >>>>> - if (nlmsvc_ops) { >>>>> - nlmsvc_invalidate_all(); >>>>> - set_grace_period(); >>>>> - } >>>>> + restart_grace(); >>>>> continue; >>>>> } >>>>> >>>>> ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-12 0:43 ` Wang Chen @ 2009-05-12 19:13 ` J. Bruce Fields 2009-05-13 2:24 ` Wang Chen 0 siblings, 1 reply; 13+ messages in thread From: J. Bruce Fields @ 2009-05-12 19:13 UTC (permalink / raw) To: Wang Chen; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng On Tue, May 12, 2009 at 08:43:41AM +0800, Wang Chen wrote: > J. Bruce Fields said the following on 2009-5-12 4:57: > > On Mon, May 11, 2009 at 02:13:50PM +0800, Wang Chen wrote: > >> J. Bruce Fields said the following on 2009-5-9 2:26: > >>> On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: > >>>> J. Bruce Fields said the following on 2009-5-7 4:32: > >>>>> On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: > >>>>>> J. Bruce Fields said the following on 2009-4-25 7:12: > >>>>>>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: > >>>>>>>> Although I can't reproduce it now, it really happened that some lock manager > >>>>>>>> started grace period but didn't end it. > >>>>>>>> This causes an lm entry be left in grace_list, and when service nfs restart, > >>>>>>>> the same lm will be added again into the list. > >>>>>>>> As you know, adding an entry, which is in the list, to a list will leads to > >>>>>>>> list corruption. > >>>>>>> I'd really like to understand why locks_end_grace() isn't being called. > >>>>>>> I'm probably overlooking something obvious, but I just can't see how > >>>>>>> lockd or nfsd can be shut down right now without locks_end_grace() being > >>>>>>> called. > >>>>>>> > >>>>>> Me neither can figure out why locks_end_grace() isn't being called. > >>>>>> > >>>>>> But do locks_start_grace() twice can trigger this warning too. > >>>>>> You can do > >>>>>> 1. service nfs restart > >>>>>> 2. (immediately) kill -s SIGKILL lockd > >>>>>> this can trigger > >>>>>> --- > >>>>>> lockd(void *vrqstp) > >>>>>> ... > >>>>>> if (signalled()) { > >>>>>> flush_signals(current); > >>>>>> if (nlmsvc_ops) { > >>>>>> nlmsvc_invalidate_all(); > >>>>>> set_grace_period(); > >>>>>> --- > >>>>>> and makes locks_start_grace() be called twice without locks_end_grace(). > >>>>> Ah-hah! > >>>>> > >>>>>> So I still suggest to do something to protect the lm list. :) > >>>>> I wouldn't be opposed to a simple WARN_ON(!list_empty()) in > >>>>> locks_start_grace(), but I'm mainly worried about fixing the original > >>>>> bug. How about the following? > >>>>> > >>>> Yeah, the following fix is OK to me, although it only fixed > >>>> "start_grace again after start_grace" case. > >>> OK, thanks. > >>> > >>>> The bug about "quit lockd without end_grace", which I encountered before > >>>> incidentally, maybe is still there. > >>> You're talking about the report that started this thread?: > >>> > >>> http://marc.info/?l=linux-nfs&m=124054262421444&w=2 > >>> > >> Yes. I mean this. > >> > >>> It looks to me like that could be explained by two start_grace's in a > >>> row. > >>> > >> But in that report, I didn't post the total message. > >> Here are something show that: > >> 1. not only lockd has the problem, but nfsd also. > >> 2. every time I do "service nfs restart", I got the warning, so this is not > >> "two start_grace's in a row" problem. > > > > Once the list is corrupted, it stays corrupted, so that's expected; the > > only interesting warning is the first one. > > > > But as you see the logs, nfsd made list corrupted first. Are you sure? You may be right, I just don't understand why. If you look at the definitions in lib/list_debug.c and include/linux/list.h, and trace through what would happen e.g. in a sequence like: list_add(item, head); list_add(item, head); list_del(item); list_add(item, head); where would corruption first be reported? I don't think it would be a the first place where the corruption was actually *created*. In fact I *think* the warning would first occur on the third list_add(), resulting in exactly the behavior seen in your logs--but someone should check that. --b. > Your fix only "two start_grace's in a row" of lockd . > > > --b. > > > >> Following is more message I got on last month. > >> ------------------------------------------------------ > >> Apr 16 16:35:41 localhost mountd[15061]: Caught signal 15, un-registering and exiting. > >> Apr 16 16:35:42 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 16 16:35:43 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 16 16:35:43 localhost kernel: ------------[ cut here ]------------ > >> Apr 16 16:35:43 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 16 16:35:43 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 16 16:35:43 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 16 16:35:43 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 16 16:35:43 localhost kernel: Pid: 17455, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 > >> Apr 16 16:35:43 localhost kernel: Call Trace: > >> Apr 16 16:35:43 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 16 16:35:43 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 16 16:35:43 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 > >> Apr 16 16:35:43 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 > >> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 16 16:35:43 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 16 16:35:43 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 16 16:35:43 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] > >> Apr 16 16:35:43 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad > >> Apr 16 16:35:43 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 > >> Apr 16 16:35:43 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c > >> Apr 16 16:35:43 localhost kernel: ---[ end trace fa484bd6d19ade87 ]--- > >> Apr 16 16:35:43 localhost kernel: NFSD: starting 90-second grace period > >> ...snip... > >> Apr 17 13:02:54 localhost mountd[17468]: Caught signal 15, un-registering and exiting. > >> Apr 17 13:02:54 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 13:02:55 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 13:02:55 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 13:02:55 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 13:02:55 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 13:02:55 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 13:02:55 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 13:02:55 localhost kernel: Pid: 22642, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 13:02:55 localhost kernel: Call Trace: > >> Apr 17 13:02:55 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 13:02:55 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 13:02:55 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 > >> Apr 17 13:02:55 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 > >> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 13:02:55 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 13:02:55 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 13:02:55 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] > >> Apr 17 13:02:55 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad > >> Apr 17 13:02:55 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 > >> Apr 17 13:02:55 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c > >> Apr 17 13:02:55 localhost kernel: ---[ end trace fa484bd6d19ade88 ]--- > >> Apr 17 13:02:55 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 13:04:07 localhost mountd[22655]: Caught signal 15, un-registering and exiting. > >> Apr 17 13:04:07 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 13:04:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 13:04:07 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 13:05:04 localhost mountd[22760]: Caught signal 15, un-registering and exiting. > >> Apr 17 13:05:04 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 13:05:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 13:05:05 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 13:06:10 localhost mountd[22859]: Caught signal 15, un-registering and exiting. > >> Apr 17 13:06:10 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 13:06:10 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 13:06:10 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 13:08:07 localhost mountd[22960]: Caught signal 15, un-registering and exiting. > >> Apr 17 13:08:07 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 13:08:07 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 13:08:07 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 13:08:07 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 13:08:07 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 13:08:07 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 13:08:07 localhost kernel: Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 13:08:07 localhost kernel: Call Trace: > >> Apr 17 13:08:07 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 13:08:07 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 13:08:07 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 13:08:07 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 13:08:07 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 13:08:07 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 13:08:07 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 13:08:07 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 13:08:07 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 13:08:07 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 13:08:07 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 13:08:07 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 13:08:07 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 13:08:07 localhost kernel: ---[ end trace fa484bd6d19ade89 ]--- > >> Apr 17 13:08:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 13:08:07 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:39:45 localhost mountd[23074]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:39:45 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:39:45 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 14:39:45 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 14:39:45 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 14:39:45 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 14:39:45 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 14:39:45 localhost kernel: Pid: 24287, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 14:39:45 localhost kernel: Call Trace: > >> Apr 17 14:39:45 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 14:39:45 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 14:39:45 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 14:39:45 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 14:39:45 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 14:39:45 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 14:39:45 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 14:39:45 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 14:39:45 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 14:39:45 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 14:39:45 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 14:39:45 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 14:39:45 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 14:39:45 localhost kernel: ---[ end trace fa484bd6d19ade8a ]--- > >> Apr 17 14:39:45 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:39:45 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:41:32 localhost mountd[24299]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:41:32 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:41:33 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 14:41:33 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 14:41:33 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 14:41:33 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 14:41:33 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 14:41:33 localhost kernel: Pid: 24399, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 14:41:33 localhost kernel: Call Trace: > >> Apr 17 14:41:33 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 14:41:33 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 14:41:33 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 14:41:33 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 14:41:33 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 14:41:33 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 14:41:33 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 14:41:33 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 14:41:33 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 14:41:33 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 14:41:33 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 14:41:33 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 14:41:33 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 14:41:33 localhost kernel: ---[ end trace fa484bd6d19ade8b ]--- > >> Apr 17 14:41:33 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:41:33 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:42:16 localhost mountd[24411]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:42:17 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:42:17 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:42:17 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:42:52 localhost mountd[24508]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:42:52 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:42:53 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:42:53 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:43:28 localhost mountd[24602]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:43:28 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:43:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:43:29 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:43:59 localhost mountd[24697]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:43:59 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:44:00 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:44:00 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:44:28 localhost mountd[24791]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:44:28 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:44:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:44:29 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:45:33 localhost mountd[24885]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:45:33 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:45:34 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:45:34 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:46:05 localhost mountd[24988]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:46:05 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:46:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:46:05 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 14:46:34 localhost mountd[25082]: Caught signal 15, un-registering and exiting. > >> Apr 17 14:46:34 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 14:46:35 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 14:46:35 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 15:35:01 localhost mountd[25176]: Caught signal 15, un-registering and exiting. > >> Apr 17 15:35:02 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 15:35:02 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 15:35:02 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 15:35:02 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 15:35:02 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 15:35:02 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 15:35:02 localhost kernel: Pid: 25883, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 15:35:02 localhost kernel: Call Trace: > >> Apr 17 15:35:02 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 15:35:02 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 15:35:02 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 15:35:02 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 15:35:02 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 15:35:02 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 15:35:02 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 15:35:02 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 15:35:02 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 15:35:02 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 15:35:02 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 15:35:02 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 15:35:02 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 15:35:02 localhost kernel: ---[ end trace fa484bd6d19ade8c ]--- > >> Apr 17 15:35:02 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 15:35:02 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 15:55:22 localhost mountd[25895]: Caught signal 15, un-registering and exiting. > >> Apr 17 15:55:22 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 15:55:23 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 15:55:23 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 15:55:23 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 15:55:23 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 15:55:23 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 15:55:23 localhost kernel: Pid: 26230, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 15:55:23 localhost kernel: Call Trace: > >> Apr 17 15:55:23 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 15:55:23 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 15:55:23 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 15:55:23 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 15:55:23 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 15:55:23 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 15:55:23 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 15:55:23 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 15:55:23 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 15:55:23 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 15:55:23 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 15:55:23 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 15:55:23 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 15:55:23 localhost kernel: ---[ end trace fa484bd6d19ade8d ]--- > >> Apr 17 15:55:23 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 15:55:23 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 16:54:27 localhost mountd[26242]: Caught signal 15, un-registering and exiting. > >> Apr 17 16:54:27 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 16:54:28 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 16:54:28 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 16:54:28 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 16:54:28 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 16:54:28 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 16:54:28 localhost kernel: Pid: 27044, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 16:54:28 localhost kernel: Call Trace: > >> Apr 17 16:54:28 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 16:54:28 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 16:54:28 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 16:54:28 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 16:54:28 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 16:54:28 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 16:54:28 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 16:54:28 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 16:54:28 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 16:54:28 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 16:54:28 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 16:54:28 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 16:54:28 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 16:54:28 localhost kernel: ---[ end trace fa484bd6d19ade8e ]--- > >> Apr 17 16:54:28 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 16:54:28 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 16:59:55 localhost mountd[27056]: Caught signal 15, un-registering and exiting. > >> Apr 17 16:59:55 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 16:59:56 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 16:59:56 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 16:59:56 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 16:59:56 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 16:59:56 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 16:59:56 localhost kernel: Pid: 27197, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 16:59:56 localhost kernel: Call Trace: > >> Apr 17 16:59:56 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 16:59:56 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 16:59:56 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 16:59:56 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 16:59:56 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 16:59:56 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 16:59:56 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 16:59:56 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 16:59:56 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 16:59:56 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 16:59:56 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 16:59:56 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 16:59:56 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 16:59:56 localhost kernel: ---[ end trace fa484bd6d19ade8f ]--- > >> Apr 17 16:59:56 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 16:59:56 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 17:02:50 localhost mountd[27209]: Caught signal 15, un-registering and exiting. > >> Apr 17 17:02:50 localhost kernel: nfsd: last server has exited, flushing export cache > >> Apr 17 17:02:51 localhost kernel: ------------[ cut here ]------------ > >> Apr 17 17:02:51 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() > >> Apr 17 17:02:51 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) > >> Apr 17 17:02:51 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). > >> Apr 17 17:02:51 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] > >> Apr 17 17:02:51 localhost kernel: Pid: 27349, comm: lockd Tainted: G W 2.6.30-rc2 #3 > >> Apr 17 17:02:51 localhost kernel: Call Trace: > >> Apr 17 17:02:51 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 > >> Apr 17 17:02:51 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 > >> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 17:02:51 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd > >> Apr 17 17:02:51 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa > >> Apr 17 17:02:51 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c > >> Apr 17 17:02:51 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] > >> Apr 17 17:02:51 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] > >> Apr 17 17:02:51 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 > >> Apr 17 17:02:51 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] > >> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 > >> Apr 17 17:02:51 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e > >> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] > >> Apr 17 17:02:51 localhost kernel: [<c043dd42>] kthread+0x45/0x6b > >> Apr 17 17:02:51 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b > >> Apr 17 17:02:51 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 > >> Apr 17 17:02:51 localhost kernel: ---[ end trace fa484bd6d19ade90 ]--- > >> Apr 17 17:02:51 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory > >> Apr 17 17:02:51 localhost kernel: NFSD: starting 90-second grace period > >> Apr 17 17:08:09 localhost mountd[27361]: authenticated mount request from 10.167.141.101:695 for /tmp/nfs3 (/tmp/nfs3) > >> > >>> --b. > >>> > >>>>> --b. > >>>>> > >>>>> diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c > >>>>> index abf8388..1a54ae1 100644 > >>>>> --- a/fs/lockd/svc.c > >>>>> +++ b/fs/lockd/svc.c > >>>>> @@ -104,6 +104,16 @@ static void set_grace_period(void) > >>>>> schedule_delayed_work(&grace_period_end, grace_period); > >>>>> } > >>>>> > >>>>> +static void restart_grace(void) > >>>>> +{ > >>>>> + if (nlmsvc_ops) { > >>>>> + cancel_delayed_work_sync(&grace_period_end); > >>>>> + locks_end_grace(&lockd_manager); > >>>>> + nlmsvc_invalidate_all(); > >>>>> + set_grace_period(); > >>>>> + } > >>>>> +} > >>>>> + > >>>>> /* > >>>>> * This is the lockd kernel thread > >>>>> */ > >>>>> @@ -149,10 +159,7 @@ lockd(void *vrqstp) > >>>>> > >>>>> if (signalled()) { > >>>>> flush_signals(current); > >>>>> - if (nlmsvc_ops) { > >>>>> - nlmsvc_invalidate_all(); > >>>>> - set_grace_period(); > >>>>> - } > >>>>> + restart_grace(); > >>>>> continue; > >>>>> } > >>>>> > >>>>> > ^ permalink raw reply [flat|nested] 13+ messages in thread
* Re: [PATCH] nfs lockd: detect grace_list corruption 2009-05-12 19:13 ` J. Bruce Fields @ 2009-05-13 2:24 ` Wang Chen 0 siblings, 0 replies; 13+ messages in thread From: Wang Chen @ 2009-05-13 2:24 UTC (permalink / raw) To: J. Bruce Fields; +Cc: neilb, Trond.Myklebust, linux-nfs, FNST-Bian Naimeng J. Bruce Fields said the following on 2009-5-13 3:13: > On Tue, May 12, 2009 at 08:43:41AM +0800, Wang Chen wrote: >> J. Bruce Fields said the following on 2009-5-12 4:57: >>> On Mon, May 11, 2009 at 02:13:50PM +0800, Wang Chen wrote: >>>> J. Bruce Fields said the following on 2009-5-9 2:26: >>>>> On Thu, May 07, 2009 at 02:51:23PM +0800, Wang Chen wrote: >>>>>> J. Bruce Fields said the following on 2009-5-7 4:32: >>>>>>> On Wed, May 06, 2009 at 05:17:20PM +0800, Wang Chen wrote: >>>>>>>> J. Bruce Fields said the following on 2009-4-25 7:12: >>>>>>>>> On Fri, Apr 24, 2009 at 11:09:44AM +0800, Wang Chen wrote: >>>>>>>>>> Although I can't reproduce it now, it really happened that some lock manager >>>>>>>>>> started grace period but didn't end it. >>>>>>>>>> This causes an lm entry be left in grace_list, and when service nfs restart, >>>>>>>>>> the same lm will be added again into the list. >>>>>>>>>> As you know, adding an entry, which is in the list, to a list will leads to >>>>>>>>>> list corruption. >>>>>>>>> I'd really like to understand why locks_end_grace() isn't being called. >>>>>>>>> I'm probably overlooking something obvious, but I just can't see how >>>>>>>>> lockd or nfsd can be shut down right now without locks_end_grace() being >>>>>>>>> called. >>>>>>>>> >>>>>>>> Me neither can figure out why locks_end_grace() isn't being called. >>>>>>>> >>>>>>>> But do locks_start_grace() twice can trigger this warning too. >>>>>>>> You can do >>>>>>>> 1. service nfs restart >>>>>>>> 2. (immediately) kill -s SIGKILL lockd >>>>>>>> this can trigger >>>>>>>> --- >>>>>>>> lockd(void *vrqstp) >>>>>>>> ... >>>>>>>> if (signalled()) { >>>>>>>> flush_signals(current); >>>>>>>> if (nlmsvc_ops) { >>>>>>>> nlmsvc_invalidate_all(); >>>>>>>> set_grace_period(); >>>>>>>> --- >>>>>>>> and makes locks_start_grace() be called twice without locks_end_grace(). >>>>>>> Ah-hah! >>>>>>> >>>>>>>> So I still suggest to do something to protect the lm list. :) >>>>>>> I wouldn't be opposed to a simple WARN_ON(!list_empty()) in >>>>>>> locks_start_grace(), but I'm mainly worried about fixing the original >>>>>>> bug. How about the following? >>>>>>> >>>>>> Yeah, the following fix is OK to me, although it only fixed >>>>>> "start_grace again after start_grace" case. >>>>> OK, thanks. >>>>> >>>>>> The bug about "quit lockd without end_grace", which I encountered before >>>>>> incidentally, maybe is still there. >>>>> You're talking about the report that started this thread?: >>>>> >>>>> http://marc.info/?l=linux-nfs&m=124054262421444&w=2 >>>>> >>>> Yes. I mean this. >>>> >>>>> It looks to me like that could be explained by two start_grace's in a >>>>> row. >>>>> >>>> But in that report, I didn't post the total message. >>>> Here are something show that: >>>> 1. not only lockd has the problem, but nfsd also. >>>> 2. every time I do "service nfs restart", I got the warning, so this is not >>>> "two start_grace's in a row" problem. >>> Once the list is corrupted, it stays corrupted, so that's expected; the >>> only interesting warning is the first one. >>> >> But as you see the logs, nfsd made list corrupted first. > > Are you sure? You may be right, I just don't understand why. If you > look at the definitions in lib/list_debug.c and include/linux/list.h, > and trace through what would happen e.g. in a sequence like: > > list_add(item, head); > list_add(item, head); > list_del(item); > list_add(item, head); > > where would corruption first be reported? > > I don't think it would be a the first place where the corruption was > actually *created*. In fact I *think* the warning would first occur on > the third list_add(), resulting in exactly the behavior seen in your > logs--but someone should check that. > Yes. The warning will be reported on the third list_add(). I was wrong. So, I am completely has no question about your previous patch. Thanks Bruce. > --b. > >> Your fix only "two start_grace's in a row" of lockd . > >>> --b. >>> >>>> Following is more message I got on last month. >>>> ------------------------------------------------------ >>>> Apr 16 16:35:41 localhost mountd[15061]: Caught signal 15, un-registering and exiting. >>>> Apr 16 16:35:42 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 16 16:35:43 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 16 16:35:43 localhost kernel: ------------[ cut here ]------------ >>>> Apr 16 16:35:43 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 16 16:35:43 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 16 16:35:43 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 16 16:35:43 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 16 16:35:43 localhost kernel: Pid: 17455, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 16 16:35:43 localhost kernel: Call Trace: >>>> Apr 16 16:35:43 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 16 16:35:43 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 16 16:35:43 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 >>>> Apr 16 16:35:43 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 >>>> Apr 16 16:35:43 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 16 16:35:43 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 16 16:35:43 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 16 16:35:43 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] >>>> Apr 16 16:35:43 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad >>>> Apr 16 16:35:43 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 >>>> Apr 16 16:35:43 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c >>>> Apr 16 16:35:43 localhost kernel: ---[ end trace fa484bd6d19ade87 ]--- >>>> Apr 16 16:35:43 localhost kernel: NFSD: starting 90-second grace period >>>> ...snip... >>>> Apr 17 13:02:54 localhost mountd[17468]: Caught signal 15, un-registering and exiting. >>>> Apr 17 13:02:54 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 13:02:55 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 13:02:55 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 13:02:55 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 13:02:55 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 13:02:55 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 13:02:55 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 13:02:55 localhost kernel: Pid: 22642, comm: rpc.nfsd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 13:02:55 localhost kernel: Call Trace: >>>> Apr 17 13:02:55 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 13:02:55 localhost kernel: [<efc17dec>] ? nfsd4_build_namelist+0x0/0x8e [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 13:02:55 localhost kernel: [<c04a3f66>] ? mntput_no_expire+0x1c/0x101 >>>> Apr 17 13:02:55 localhost kernel: [<c04a0062>] ? dput+0x35/0x103 >>>> Apr 17 13:02:55 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 13:02:55 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 13:02:55 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 13:02:55 localhost kernel: [<efc13a49>] nfs4_state_start+0x7a/0xdd [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<efbfd5be>] nfsd_svc+0x57/0xf9 [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<efbfdff6>] ? write_threads+0x0/0x59 [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<efbfe02b>] write_threads+0x35/0x59 [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<efbfd84d>] nfsctl_transaction_write+0x3b/0x58 [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<efbfd812>] ? nfsctl_transaction_write+0x0/0x58 [nfsd] >>>> Apr 17 13:02:55 localhost kernel: [<c04927af>] vfs_write+0x7c/0xad >>>> Apr 17 13:02:55 localhost kernel: [<c0492879>] sys_write+0x3b/0x60 >>>> Apr 17 13:02:55 localhost kernel: [<c0403148>] sysenter_do_call+0x12/0x3c >>>> Apr 17 13:02:55 localhost kernel: ---[ end trace fa484bd6d19ade88 ]--- >>>> Apr 17 13:02:55 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 13:04:07 localhost mountd[22655]: Caught signal 15, un-registering and exiting. >>>> Apr 17 13:04:07 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 13:04:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 13:04:07 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 13:05:04 localhost mountd[22760]: Caught signal 15, un-registering and exiting. >>>> Apr 17 13:05:04 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 13:05:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 13:05:05 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 13:06:10 localhost mountd[22859]: Caught signal 15, un-registering and exiting. >>>> Apr 17 13:06:10 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 13:06:10 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 13:06:10 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 13:08:07 localhost mountd[22960]: Caught signal 15, un-registering and exiting. >>>> Apr 17 13:08:07 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 13:08:07 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 13:08:07 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 13:08:07 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 13:08:07 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 13:08:07 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 13:08:07 localhost kernel: Pid: 23062, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 13:08:07 localhost kernel: Call Trace: >>>> Apr 17 13:08:07 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 13:08:07 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 13:08:07 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 13:08:07 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 13:08:07 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 13:08:07 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 13:08:07 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 13:08:07 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 13:08:07 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 13:08:07 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 13:08:07 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 13:08:07 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 13:08:07 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 13:08:07 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 13:08:07 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 13:08:07 localhost kernel: ---[ end trace fa484bd6d19ade89 ]--- >>>> Apr 17 13:08:07 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 13:08:07 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:39:45 localhost mountd[23074]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:39:45 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:39:45 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 14:39:45 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 14:39:45 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 14:39:45 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 14:39:45 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 14:39:45 localhost kernel: Pid: 24287, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 14:39:45 localhost kernel: Call Trace: >>>> Apr 17 14:39:45 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 14:39:45 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 14:39:45 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 14:39:45 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 14:39:45 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 14:39:45 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 14:39:45 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 14:39:45 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 14:39:45 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 14:39:45 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 14:39:45 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 14:39:45 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 14:39:45 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 14:39:45 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 14:39:45 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 14:39:45 localhost kernel: ---[ end trace fa484bd6d19ade8a ]--- >>>> Apr 17 14:39:45 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:39:45 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:41:32 localhost mountd[24299]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:41:32 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:41:33 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 14:41:33 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 14:41:33 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 14:41:33 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 14:41:33 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 14:41:33 localhost kernel: Pid: 24399, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 14:41:33 localhost kernel: Call Trace: >>>> Apr 17 14:41:33 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 14:41:33 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 14:41:33 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 14:41:33 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 14:41:33 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 14:41:33 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 14:41:33 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 14:41:33 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 14:41:33 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 14:41:33 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 14:41:33 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 14:41:33 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 14:41:33 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 14:41:33 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 14:41:33 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 14:41:33 localhost kernel: ---[ end trace fa484bd6d19ade8b ]--- >>>> Apr 17 14:41:33 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:41:33 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:42:16 localhost mountd[24411]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:42:17 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:42:17 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:42:17 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:42:52 localhost mountd[24508]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:42:52 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:42:53 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:42:53 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:43:28 localhost mountd[24602]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:43:28 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:43:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:43:29 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:43:59 localhost mountd[24697]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:43:59 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:44:00 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:44:00 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:44:28 localhost mountd[24791]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:44:28 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:44:29 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:44:29 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:45:33 localhost mountd[24885]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:45:33 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:45:34 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:45:34 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:46:05 localhost mountd[24988]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:46:05 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:46:05 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:46:05 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 14:46:34 localhost mountd[25082]: Caught signal 15, un-registering and exiting. >>>> Apr 17 14:46:34 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 14:46:35 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 14:46:35 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 15:35:01 localhost mountd[25176]: Caught signal 15, un-registering and exiting. >>>> Apr 17 15:35:02 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 15:35:02 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 15:35:02 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 15:35:02 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 15:35:02 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 15:35:02 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 15:35:02 localhost kernel: Pid: 25883, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 15:35:02 localhost kernel: Call Trace: >>>> Apr 17 15:35:02 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 15:35:02 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 15:35:02 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 15:35:02 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 15:35:02 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 15:35:02 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 15:35:02 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 15:35:02 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 15:35:02 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 15:35:02 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 15:35:02 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 15:35:02 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 15:35:02 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 15:35:02 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 15:35:02 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 15:35:02 localhost kernel: ---[ end trace fa484bd6d19ade8c ]--- >>>> Apr 17 15:35:02 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 15:35:02 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 15:55:22 localhost mountd[25895]: Caught signal 15, un-registering and exiting. >>>> Apr 17 15:55:22 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 15:55:23 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 15:55:23 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 15:55:23 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 15:55:23 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 15:55:23 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 15:55:23 localhost kernel: Pid: 26230, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 15:55:23 localhost kernel: Call Trace: >>>> Apr 17 15:55:23 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 15:55:23 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 15:55:23 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 15:55:23 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 15:55:23 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 15:55:23 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 15:55:23 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 15:55:23 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 15:55:23 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 15:55:23 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 15:55:23 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 15:55:23 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 15:55:23 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 15:55:23 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 15:55:23 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 15:55:23 localhost kernel: ---[ end trace fa484bd6d19ade8d ]--- >>>> Apr 17 15:55:23 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 15:55:23 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 16:54:27 localhost mountd[26242]: Caught signal 15, un-registering and exiting. >>>> Apr 17 16:54:27 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 16:54:28 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 16:54:28 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 16:54:28 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 16:54:28 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 16:54:28 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 16:54:28 localhost kernel: Pid: 27044, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 16:54:28 localhost kernel: Call Trace: >>>> Apr 17 16:54:28 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 16:54:28 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 16:54:28 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 16:54:28 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 16:54:28 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 16:54:28 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 16:54:28 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 16:54:28 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 16:54:28 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 16:54:28 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 16:54:28 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 16:54:28 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 16:54:28 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 16:54:28 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 16:54:28 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 16:54:28 localhost kernel: ---[ end trace fa484bd6d19ade8e ]--- >>>> Apr 17 16:54:28 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 16:54:28 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 16:59:55 localhost mountd[27056]: Caught signal 15, un-registering and exiting. >>>> Apr 17 16:59:55 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 16:59:56 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 16:59:56 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 16:59:56 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 16:59:56 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 16:59:56 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 16:59:56 localhost kernel: Pid: 27197, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 16:59:56 localhost kernel: Call Trace: >>>> Apr 17 16:59:56 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 16:59:56 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 16:59:56 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 16:59:56 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 16:59:56 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 16:59:56 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 16:59:56 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 16:59:56 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 16:59:56 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 16:59:56 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 16:59:56 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 16:59:56 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 16:59:56 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 16:59:56 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 16:59:56 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 16:59:56 localhost kernel: ---[ end trace fa484bd6d19ade8f ]--- >>>> Apr 17 16:59:56 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 16:59:56 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 17:02:50 localhost mountd[27209]: Caught signal 15, un-registering and exiting. >>>> Apr 17 17:02:50 localhost kernel: nfsd: last server has exited, flushing export cache >>>> Apr 17 17:02:51 localhost kernel: ------------[ cut here ]------------ >>>> Apr 17 17:02:51 localhost kernel: WARNING: at lib/list_debug.c:26 __list_add+0x27/0x5c() >>>> Apr 17 17:02:51 localhost kernel: Hardware name: Presario M2000 (PT365PA#AB2) >>>> Apr 17 17:02:51 localhost kernel: list_add corruption. next->prev should be prev (ef8fe958), but was ef8ff128. (next=ef8ff128). >>>> Apr 17 17:02:51 localhost kernel: Modules linked in: fuse i915 drm i2c_algo_bit nfsd lockd nfs_acl auth_rpcgss exportfs sunrpc ipv6 p4_clockmod dm_multipath uinput snd_intel8x0m snd_intel8x0 snd_seq_dummy snd_ac97_codec ac97_bus snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd soundcore 8139cp firewire_ohci firewire_core snd_page_alloc tifm_7xx1 i2c_i801 iTCO_wdt 8139too tifm_core i2c_core yenta_socket crc_itu_t iTCO_vendor_support pcspkr mii rsrc_nonstatic wmi video output ata_generic pata_acpi [last unloaded: microcode] >>>> Apr 17 17:02:51 localhost kernel: Pid: 27349, comm: lockd Tainted: G W 2.6.30-rc2 #3 >>>> Apr 17 17:02:51 localhost kernel: Call Trace: >>>> Apr 17 17:02:51 localhost kernel: [<c042d5b5>] warn_slowpath+0x71/0xa0 >>>> Apr 17 17:02:51 localhost kernel: [<c0422a96>] ? update_curr+0x11d/0x125 >>>> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 17:02:51 localhost kernel: [<c044b270>] ? trace_hardirqs_on+0xb/0xd >>>> Apr 17 17:02:51 localhost kernel: [<c051c61a>] ? _raw_spin_lock+0x53/0xfa >>>> Apr 17 17:02:51 localhost kernel: [<c051c89f>] __list_add+0x27/0x5c >>>> Apr 17 17:02:51 localhost kernel: [<ef8f6daa>] locks_start_grace+0x22/0x30 [lockd] >>>> Apr 17 17:02:51 localhost kernel: [<ef8f34da>] set_grace_period+0x39/0x53 [lockd] >>>> Apr 17 17:02:51 localhost kernel: [<c06b8921>] ? lock_kernel+0x1c/0x28 >>>> Apr 17 17:02:51 localhost kernel: [<ef8f3558>] lockd+0x64/0x164 [lockd] >>>> Apr 17 17:02:51 localhost kernel: [<c044b12d>] ? trace_hardirqs_on_caller+0x18/0x150 >>>> Apr 17 17:02:51 localhost kernel: [<c04227b0>] ? complete+0x34/0x3e >>>> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 17:02:51 localhost kernel: [<ef8f34f4>] ? lockd+0x0/0x164 [lockd] >>>> Apr 17 17:02:51 localhost kernel: [<c043dd42>] kthread+0x45/0x6b >>>> Apr 17 17:02:51 localhost kernel: [<c043dcfd>] ? kthread+0x0/0x6b >>>> Apr 17 17:02:51 localhost kernel: [<c0403c23>] kernel_thread_helper+0x7/0x10 >>>> Apr 17 17:02:51 localhost kernel: ---[ end trace fa484bd6d19ade90 ]--- >>>> Apr 17 17:02:51 localhost kernel: NFSD: Using /var/lib/nfs/v4recovery as the NFSv4 state recovery directory >>>> Apr 17 17:02:51 localhost kernel: NFSD: starting 90-second grace period >>>> Apr 17 17:08:09 localhost mountd[27361]: authenticated mount request from 10.167.141.101:695 for /tmp/nfs3 (/tmp/nfs3) >>>> >>>>> --b. >>>>> >>>>>>> --b. >>>>>>> >>>>>>> diff --git a/fs/lockd/svc.c b/fs/lockd/svc.c >>>>>>> index abf8388..1a54ae1 100644 >>>>>>> --- a/fs/lockd/svc.c >>>>>>> +++ b/fs/lockd/svc.c >>>>>>> @@ -104,6 +104,16 @@ static void set_grace_period(void) >>>>>>> schedule_delayed_work(&grace_period_end, grace_period); >>>>>>> } >>>>>>> >>>>>>> +static void restart_grace(void) >>>>>>> +{ >>>>>>> + if (nlmsvc_ops) { >>>>>>> + cancel_delayed_work_sync(&grace_period_end); >>>>>>> + locks_end_grace(&lockd_manager); >>>>>>> + nlmsvc_invalidate_all(); >>>>>>> + set_grace_period(); >>>>>>> + } >>>>>>> +} >>>>>>> + >>>>>>> /* >>>>>>> * This is the lockd kernel thread >>>>>>> */ >>>>>>> @@ -149,10 +159,7 @@ lockd(void *vrqstp) >>>>>>> >>>>>>> if (signalled()) { >>>>>>> flush_signals(current); >>>>>>> - if (nlmsvc_ops) { >>>>>>> - nlmsvc_invalidate_all(); >>>>>>> - set_grace_period(); >>>>>>> - } >>>>>>> + restart_grace(); >>>>>>> continue; >>>>>>> } >>>>>>> >>>>>>> ^ permalink raw reply [flat|nested] 13+ messages in thread
end of thread, other threads:[~2009-05-13 2:25 UTC | newest] Thread overview: 13+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2009-04-24 3:09 [PATCH] nfs lockd: detect grace_list corruption Wang Chen 2009-04-24 5:15 ` Bian Naimeng 2009-04-24 7:35 ` [PATCH V2] " Wang Chen 2009-04-24 23:12 ` [PATCH] " J. Bruce Fields 2009-05-06 9:17 ` Wang Chen 2009-05-06 20:32 ` J. Bruce Fields 2009-05-07 6:51 ` Wang Chen 2009-05-08 18:26 ` J. Bruce Fields 2009-05-11 6:13 ` Wang Chen 2009-05-11 20:57 ` J. Bruce Fields 2009-05-12 0:43 ` Wang Chen 2009-05-12 19:13 ` J. Bruce Fields 2009-05-13 2:24 ` Wang Chen
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox