From: Johannes Weiner <hannes@cmpxchg.org>
To: Oleg Nesterov <oleg@redhat.com>
Cc: Chris Mason <chris.mason@oracle.com>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Matthew Wilcox <matthew@wil.cx>, Chuck Lever <cel@citi.umich.edu>,
Nick Piggin <nickpiggin@yahoo.com.au>,
Andrew Morton <akpm@linux-foundation.org>,
linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Ingo Molnar <mingo@elte.hu>
Subject: Re: [RFC v4] wait: prevent waiter starvation in __wait_on_bit_lock
Date: Fri, 23 Jan 2009 20:24:55 +0100 [thread overview]
Message-ID: <20090123192454.GA23107@cmpxchg.org> (raw)
In-Reply-To: <20090123113541.GB12684@redhat.com>
On Fri, Jan 23, 2009 at 12:35:41PM +0100, Oleg Nesterov wrote:
> On 01/23, Johannes Weiner wrote:
> >
> > On Thu, Jan 22, 2009 at 09:25:50PM +0100, Oleg Nesterov wrote:
> > > On 01/21, Johannes Weiner wrote:
> > > >
> > > int finish_wait_xxx(wait_queue_head_t *q, wait_queue_t *wait)
> > > {
> > > unsigned long flags;
> > > int woken;
> > >
> > > __set_current_state(TASK_RUNNING);
> > > spin_lock_irqsave(&q->lock, flags);
> > > woken = list_empty(&wait->task_list);
> > > list_del_init(&wait->task_list);
> > > spin_unlock_irqrestore(&q->lock, flags);
> > >
> > > return woken;
> > > }
> >
> > Hehe, there is only n solutions to this problem. I had thought about
> > that too, even written it down. But I was not sure if taking the
> > spinlock, toggling irqs and (re)storing the flags is better than an
> > untaken branch. ;)
>
> Yes. Fortunately, this is "unlikely" path.
>
> > > if (test_bit(q->key.bit_nr, q->key.flags)) {
> > > if ((ret = (*action)(q->key.flags))) {
> > > if (finish_wait_xxx(...))
> > > __wake_up_bit(...);
> > > return ret;
> > > }
> > > }
> >
> > If you don't mind putting a second finish_wait() in there (you still
> > need the one after the loop, right?), we can fix up my version to not
> > check ret twice but do finish_wait() as you describe and then the
> > test_bit() && wake up:
> >
> > do {
> > if (test_bit())
> > if ((ret = action())) {
> > finish_wait()
> > smp_rmb()
> > if (!test_bit())
> > __wake_up_bit()
>
> Yes sure. Except this wakeup can be false.
>
> > > int finish_wait_yyy(wait_queue_head_t *q, wait_queue_t *wait,
> > > int mode, void *key)
> > > {
> > > unsigned long flags;
> > > int woken;
> > >
> > > __set_current_state(TASK_RUNNING);
> > > spin_lock_irqsave(&q->lock, flags);
> > > woken = list_empty(&wait->task_list);
> > > if (woken)
> > > __wake_up_common(q, mode, 1, key);
> > > else
> > > list_del_init(&wait->task_list);
> > > spin_unlock_irqrestore(&q->lock, flags);
> > >
> > > return woken;
> > > }
> > >
> > > Perhaps a bit too much for this particular case, but I am thinking
> > > about other cases when we need to abort the exclusive wait.
> > >
> > > For example, don't we have the similar problems with
> > > wait_event_interruptible_exclusive() ?
> >
> > Yeah, we do IIUC. Then having finish_wait() extended is probably a
> > good idea.
>
> Yes.
>
> It is no that I think this new helper is really needed for this
> particular case, personally I agree with the patch you sent.
>
> But if we have other places with the similar problem, then perhaps
> it is better to introduce the special finish_wait_exclusive() or
> whatever.
Agreed. I will whip up another series that adds
finish_wait_exclusive() and adjusts the problematic callsites.
Hannes
WARNING: multiple messages have this Message-ID (diff)
From: Johannes Weiner <hannes@cmpxchg.org>
To: Oleg Nesterov <oleg@redhat.com>
Cc: Chris Mason <chris.mason@oracle.com>,
Peter Zijlstra <a.p.zijlstra@chello.nl>,
Matthew Wilcox <matthew@wil.cx>, Chuck Lever <cel@citi.umich.edu>,
Nick Piggin <nickpiggin@yahoo.com.au>,
Andrew Morton <akpm@linux-foundation.org>,
linux-kernel@vger.kernel.org, linux-mm@kvack.org,
Ingo Molnar <mingo@elte.hu>
Subject: Re: [RFC v4] wait: prevent waiter starvation in __wait_on_bit_lock
Date: Fri, 23 Jan 2009 20:24:55 +0100 [thread overview]
Message-ID: <20090123192454.GA23107@cmpxchg.org> (raw)
In-Reply-To: <20090123113541.GB12684@redhat.com>
On Fri, Jan 23, 2009 at 12:35:41PM +0100, Oleg Nesterov wrote:
> On 01/23, Johannes Weiner wrote:
> >
> > On Thu, Jan 22, 2009 at 09:25:50PM +0100, Oleg Nesterov wrote:
> > > On 01/21, Johannes Weiner wrote:
> > > >
> > > int finish_wait_xxx(wait_queue_head_t *q, wait_queue_t *wait)
> > > {
> > > unsigned long flags;
> > > int woken;
> > >
> > > __set_current_state(TASK_RUNNING);
> > > spin_lock_irqsave(&q->lock, flags);
> > > woken = list_empty(&wait->task_list);
> > > list_del_init(&wait->task_list);
> > > spin_unlock_irqrestore(&q->lock, flags);
> > >
> > > return woken;
> > > }
> >
> > Hehe, there is only n solutions to this problem. I had thought about
> > that too, even written it down. But I was not sure if taking the
> > spinlock, toggling irqs and (re)storing the flags is better than an
> > untaken branch. ;)
>
> Yes. Fortunately, this is "unlikely" path.
>
> > > if (test_bit(q->key.bit_nr, q->key.flags)) {
> > > if ((ret = (*action)(q->key.flags))) {
> > > if (finish_wait_xxx(...))
> > > __wake_up_bit(...);
> > > return ret;
> > > }
> > > }
> >
> > If you don't mind putting a second finish_wait() in there (you still
> > need the one after the loop, right?), we can fix up my version to not
> > check ret twice but do finish_wait() as you describe and then the
> > test_bit() && wake up:
> >
> > do {
> > if (test_bit())
> > if ((ret = action())) {
> > finish_wait()
> > smp_rmb()
> > if (!test_bit())
> > __wake_up_bit()
>
> Yes sure. Except this wakeup can be false.
>
> > > int finish_wait_yyy(wait_queue_head_t *q, wait_queue_t *wait,
> > > int mode, void *key)
> > > {
> > > unsigned long flags;
> > > int woken;
> > >
> > > __set_current_state(TASK_RUNNING);
> > > spin_lock_irqsave(&q->lock, flags);
> > > woken = list_empty(&wait->task_list);
> > > if (woken)
> > > __wake_up_common(q, mode, 1, key);
> > > else
> > > list_del_init(&wait->task_list);
> > > spin_unlock_irqrestore(&q->lock, flags);
> > >
> > > return woken;
> > > }
> > >
> > > Perhaps a bit too much for this particular case, but I am thinking
> > > about other cases when we need to abort the exclusive wait.
> > >
> > > For example, don't we have the similar problems with
> > > wait_event_interruptible_exclusive() ?
> >
> > Yeah, we do IIUC. Then having finish_wait() extended is probably a
> > good idea.
>
> Yes.
>
> It is no that I think this new helper is really needed for this
> particular case, personally I agree with the patch you sent.
>
> But if we have other places with the similar problem, then perhaps
> it is better to introduce the special finish_wait_exclusive() or
> whatever.
Agreed. I will whip up another series that adds
finish_wait_exclusive() and adjusts the problematic callsites.
Hannes
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2009-01-23 19:26 UTC|newest]
Thread overview: 55+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-01-17 21:51 + lock_page_killable-avoid-lost-wakeups.patch added to -mm tree Oleg Nesterov
2009-01-18 1:38 ` [PATCH v3] wait: prevent waiter starvation in __wait_on_bit_lock Johannes Weiner
2009-01-18 1:38 ` Johannes Weiner
2009-01-18 2:32 ` Oleg Nesterov
2009-01-18 2:32 ` Oleg Nesterov
2009-01-20 20:31 ` Johannes Weiner
2009-01-20 20:31 ` Johannes Weiner
2009-01-21 14:36 ` Oleg Nesterov
2009-01-21 14:36 ` Oleg Nesterov
2009-01-21 21:38 ` [RFC v4] " Johannes Weiner
2009-01-21 21:38 ` Johannes Weiner
2009-01-22 20:25 ` Oleg Nesterov
2009-01-22 20:25 ` Oleg Nesterov
2009-01-23 0:26 ` Dmitry Adamushko
2009-01-23 0:26 ` Dmitry Adamushko
2009-01-23 0:47 ` Oleg Nesterov
2009-01-23 0:47 ` Oleg Nesterov
2009-01-23 10:07 ` Dmitry Adamushko
2009-01-23 10:07 ` Dmitry Adamushko
2009-01-23 11:05 ` Oleg Nesterov
2009-01-23 11:05 ` Oleg Nesterov
2009-01-23 12:36 ` Dmitry Adamushko
2009-01-23 12:36 ` Dmitry Adamushko
2009-01-23 9:59 ` Johannes Weiner
2009-01-23 9:59 ` Johannes Weiner
2009-01-23 11:35 ` Oleg Nesterov
2009-01-23 11:35 ` Oleg Nesterov
2009-01-23 13:30 ` Oleg Nesterov
2009-01-23 13:30 ` Oleg Nesterov
2009-01-26 21:59 ` [RFC v5] wait: prevent exclusive waiter starvation Johannes Weiner
2009-01-26 21:59 ` Johannes Weiner
2009-01-27 3:23 ` Oleg Nesterov
2009-01-27 3:23 ` Oleg Nesterov
2009-01-27 19:34 ` [RFC v6] " Johannes Weiner
2009-01-27 19:34 ` Johannes Weiner
2009-01-27 20:05 ` Oleg Nesterov
2009-01-27 20:05 ` Oleg Nesterov
2009-01-27 22:31 ` Johannes Weiner
2009-01-27 22:31 ` Johannes Weiner
2009-01-28 9:14 ` [RFC v7] " Johannes Weiner
2009-01-28 9:14 ` Johannes Weiner
2009-01-29 4:42 ` Oleg Nesterov
2009-01-29 4:42 ` Oleg Nesterov
2009-01-29 7:37 ` Andrew Morton
2009-01-29 7:37 ` Andrew Morton
2009-01-29 8:31 ` Oleg Nesterov
2009-01-29 8:31 ` Oleg Nesterov
2009-01-29 9:11 ` Andrew Morton
2009-01-29 9:11 ` Andrew Morton
2009-01-29 14:34 ` Chris Mason
2009-01-29 14:34 ` Chris Mason
2009-02-02 15:47 ` Chris Mason
2009-02-02 15:47 ` Chris Mason
2009-01-23 19:24 ` Johannes Weiner [this message]
2009-01-23 19:24 ` [RFC v4] wait: prevent waiter starvation in __wait_on_bit_lock Johannes Weiner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20090123192454.GA23107@cmpxchg.org \
--to=hannes@cmpxchg.org \
--cc=a.p.zijlstra@chello.nl \
--cc=akpm@linux-foundation.org \
--cc=cel@citi.umich.edu \
--cc=chris.mason@oracle.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=matthew@wil.cx \
--cc=mingo@elte.hu \
--cc=nickpiggin@yahoo.com.au \
--cc=oleg@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.