From: Dirk Morris <dmorris@metavize.com>
To: Rusty Russell <rusty@rustcorp.com.au>
Cc: akpm@osdl.org, linux-kernel@vger.kernel.org
Subject: Re: [2.6.2] Badness in futex_wait revisited
Date: Tue, 17 Feb 2004 11:55:30 -0800 [thread overview]
Message-ID: <403271B2.3040107@metavize.com> (raw)
In-Reply-To: <20040217051911.6AC112C066@lists.samba.org>
I installed this patch. (which gives be lots of Badness in sched.c errors)
Here is the dump of what I believe is the offending awakening:
Feb 17 10:49:09 timmy kernel: iptables 0 waking foobar: 898 898
Feb 17 10:49:09 timmy kernel: Badness in try_to_wake_up at
kernel/sched.c:666
Feb 17 10:49:09 timmy kernel: Call Trace:
Feb 17 10:49:09 timmy kernel: [<c0117e79>] try_to_wake_up+0x1b9/0x1c0
Feb 17 10:49:09 timmy kernel: [<c0117eba>] wake_up_state+0x1a/0x20
Feb 17 10:49:09 timmy kernel: [<c0126358>] do_notify_parent+0x3f8/0x530
Feb 17 10:49:09 timmy kernel: [<c013fb03>] unmap_page_range+0x43/0x70
Feb 17 10:49:09 timmy kernel: [<c014ede9>] __fput+0x99/0xf0
Feb 17 10:49:09 timmy kernel: [<c011d831>] exit_notify+0x211/0x750
Feb 17 10:49:09 timmy kernel: [<c011d31e>] put_files_struct+0x8e/0xc0
Feb 17 10:49:09 timmy kernel: [<c011df0c>] do_exit+0x19c/0x370
Feb 17 10:49:09 timmy kernel: [<c011e113>] sys_exit+0x13/0x20
Feb 17 10:49:09 timmy kernel: [<c010922b>] syscall_call+0x7/0xb
foobar makes a system("iptables -t nat -L") like call from one pthread.
Around this time... Some other thread (or possibly the same one?) gets
an EINTR from sem_wait.
I am still unable to replicate this in a simple test program.
It also seems that the Badness in futex_wait message is only printed
some of the time.
This is with SMP and preempt off. (again, doesnt happen in old 2.2.x libc)
I also have pages worth of other taking debug messages if anyone would
like them.
Let me know if I can help further.
-Dirk
Rusty Russell wrote:
>In message <40311703.8070309@metavize.com> you write:
>
>
>>Please send me the patch, and I'll give you some updated information.
>>
>>
>
>Here it is: Andrew's patch updated and fixed.
>
>Thanks!
>Rusty.
>--
> Anyone who quotes me in their sig is an idiot. -- Rusty Russell.
>
>Name: Who's Spuriously Waking Futexes?
>Author: Andrew Morton, Rusty Russell
>Status: Tested on 2.6.3-bk1
>
>Someone is triggering the WARN_ON() in futex.c. We know that software
>suspend could do it, in theory. But noone else should be.
>
>This code adds a PF_FUTEX_DEBUG flag, which is set in the futex code
>when we sleep, and also when we wake up. If a task with
>PF_FUTEX_DEBUG is woken by a task without PF_FUTEX_DEBUG, we have
>found our culprit.
>
>diff -urpN --exclude TAGS -X /home/rusty/devel/kernel/kernel-patches/current-dontdiff --minimal .6375-linux-2.6.3-rc3-bk1/include/linux/sched.h .6375-linux-2.6.3-rc3-bk1.updated/include/linux/sched.h
>--- .6375-linux-2.6.3-rc3-bk1/include/linux/sched.h 2004-02-15 18:17:21.000000000 +1100
>+++ .6375-linux-2.6.3-rc3-bk1.updated/include/linux/sched.h 2004-02-17 12:01:47.000000000 +1100
>@@ -500,6 +500,7 @@ do { if (atomic_dec_and_test(&(tsk)->usa
> #define PF_SWAPOFF 0x00080000 /* I am in swapoff */
> #define PF_LESS_THROTTLE 0x00100000 /* Throttle me less: I clean memory */
> #define PF_SYNCWRITE 0x00200000 /* I am doing a sync write */
>+#define PF_FUTEX_DEBUG 0x00400000
>
> #ifdef CONFIG_SMP
> extern int set_cpus_allowed(task_t *p, cpumask_t new_mask);
>diff -urpN --exclude TAGS -X /home/rusty/devel/kernel/kernel-patches/current-dontdiff --minimal .6375-linux-2.6.3-rc3-bk1/kernel/futex.c .6375-linux-2.6.3-rc3-bk1.updated/kernel/futex.c
>--- .6375-linux-2.6.3-rc3-bk1/kernel/futex.c 2004-02-15 18:17:21.000000000 +1100
>+++ .6375-linux-2.6.3-rc3-bk1.updated/kernel/futex.c 2004-02-17 12:01:47.000000000 +1100
>@@ -269,7 +269,11 @@ static void wake_futex(struct futex_q *q
> * The lock in wake_up_all() is a crucial memory barrier after the
> * list_del_init() and also before assigning to q->lock_ptr.
> */
>+
>+ current->flags |= PF_FUTEX_DEBUG;
> wake_up_all(&q->waiters);
>+ current->flags &= ~PF_FUTEX_DEBUG;
>+
> /*
> * The waiting task can free the futex_q as soon as this is written,
> * without taking any locks. This must come last.
>@@ -490,8 +494,11 @@ static int futex_wait(unsigned long uadd
> * !list_empty() is safe here without any lock.
> * q.lock_ptr != 0 is not safe, because of ordering against wakeup.
> */
>- if (likely(!list_empty(&q.list)))
>+ if (likely(!list_empty(&q.list))) {
>+ current->flags |= PF_FUTEX_DEBUG;
> time = schedule_timeout(time);
>+ current->flags &= ~PF_FUTEX_DEBUG;
>+ }
> __set_current_state(TASK_RUNNING);
>
> /*
>diff -urpN --exclude TAGS -X /home/rusty/devel/kernel/kernel-patches/current-dontdiff --minimal .6375-linux-2.6.3-rc3-bk1/kernel/sched.c .6375-linux-2.6.3-rc3-bk1.updated/kernel/sched.c
>--- .6375-linux-2.6.3-rc3-bk1/kernel/sched.c 2004-02-15 18:17:22.000000000 +1100
>+++ .6375-linux-2.6.3-rc3-bk1.updated/kernel/sched.c 2004-02-17 12:02:24.000000000 +1100
>@@ -658,6 +658,14 @@ static int try_to_wake_up(task_t * p, un
> long old_state;
> runqueue_t *rq;
>
>+ if ((p->flags & PF_FUTEX_DEBUG)
>+ && !(current->flags & PF_FUTEX_DEBUG)) {
>+ printk("%s %i waking %s: %i %i\n",
>+ current->comm, (int)in_interrupt(),
>+ p->comm, p->tgid, p->pid);
>+ WARN_ON(1);
>+ }
>+
> repeat_lock_task:
> rq = task_rq_lock(p, &flags);
> old_state = p->state;
>
>
next prev parent reply other threads:[~2004-02-17 19:55 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <40311703.8070309@metavize.com>
2004-02-17 4:39 ` [2.6.2] Badness in futex_wait revisited Rusty Russell
2004-02-17 5:27 ` Andrew Morton
2004-02-18 4:14 ` Rusty Russell
2004-02-17 19:55 ` Dirk Morris [this message]
2004-03-31 16:56 ` Jamie Lokier
2004-03-31 17:38 ` Dirk Morris
2004-03-31 18:32 ` Jamie Lokier
2004-03-31 18:59 ` Dirk Morris
2004-04-01 2:16 ` Rusty Russell
2004-04-01 8:34 ` Andrew Morton
2004-04-01 9:24 ` Andrew Morton
2004-04-01 1:57 ` Rusty Russell
2004-02-13 21:13 Dirk Morris
2004-02-16 11:42 ` Rusty Russell
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=403271B2.3040107@metavize.com \
--to=dmorris@metavize.com \
--cc=akpm@osdl.org \
--cc=linux-kernel@vger.kernel.org \
--cc=rusty@rustcorp.com.au \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox