From: Greg KH <gregkh@suse.de>
To: linux-kernel@vger.kernel.org, stable@kernel.org
Cc: stable-review@kernel.org, torvalds@linux-foundation.org,
akpm@linux-foundation.org, alan@lxorguk.ukuu.org.uk,
Darren Hart <dvhltc@us.ibm.com>,
Peter Zijlstra <peterz@infradead.org>,
Steven Rostedt <rostedt@goodmis.org>,
Eric Dumazet <eric.dumazet@gmail.com>,
Dinakar Guniguntala <dino@in.ibm.com>,
John Stultz <johnstul@us.ibm.com>, Ingo Molnar <mingo@elte.hu>,
Greg Kroah-Hartman <gregkh@suse.de>
Subject: [22/46] futex: Fix wakeup race by setting TASK_INTERRUPTIBLE before queue_me()
Date: Fri, 16 Oct 2009 10:10:15 -0700 [thread overview]
Message-ID: <20091016171158.014986323@linux.site> (raw)
In-Reply-To: <20091016171422.GA13339@kroah.com>
[-- Attachment #1: futex-fix-wakeup-race-by-setting-task_interruptible-before-queue_me.patch --]
[-- Type: text/plain, Size: 2824 bytes --]
2.6.31-stable review patch. If anyone has any objections, please let us know.
------------------
From: Darren Hart <dvhltc@us.ibm.com>
commit 0729e196147692d84d4c099fcff056eba2ed61d8 upstream.
PI futexes do not use the same plist_node_empty() test for wakeup.
It was possible for the waiter (in futex_wait_requeue_pi()) to set
TASK_INTERRUPTIBLE after the waker assigned the rtmutex to the
waiter. The waiter would then note the plist was not empty and call
schedule(). The task would not be found by any subsequeuent futex
wakeups, resulting in a userspace hang.
By moving the setting of TASK_INTERRUPTIBLE to before the call to
queue_me(), the race with the waker is eliminated. Since we no
longer call get_user() from within queue_me(), there is no need to
delay the setting of TASK_INTERRUPTIBLE until after the call to
queue_me().
The FUTEX_LOCK_PI operation is not affected as futex_lock_pi()
relies entirely on the rtmutex code to handle schedule() and
wakeup. The requeue PI code is affected because the waiter starts
as a non-PI waiter and is woken on a PI futex.
Remove the crusty old comment about holding spinlocks() across
get_user() as we no longer do that. Correct the locking statement
with a description of why the test is performed.
Signed-off-by: Darren Hart <dvhltc@us.ibm.com>
Acked-by: Peter Zijlstra <peterz@infradead.org>
Cc: Steven Rostedt <rostedt@goodmis.org>
Cc: Eric Dumazet <eric.dumazet@gmail.com>
Cc: Dinakar Guniguntala <dino@in.ibm.com>
Cc: John Stultz <johnstul@us.ibm.com>
LKML-Reference: <20090922053038.8717.97838.stgit@Aeon>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>
---
kernel/futex.c | 15 +++------------
1 file changed, 3 insertions(+), 12 deletions(-)
--- a/kernel/futex.c
+++ b/kernel/futex.c
@@ -1638,17 +1638,8 @@ out:
static void futex_wait_queue_me(struct futex_hash_bucket *hb, struct futex_q *q,
struct hrtimer_sleeper *timeout)
{
- queue_me(q, hb);
-
- /*
- * There might have been scheduling since the queue_me(), as we
- * cannot hold a spinlock across the get_user() in case it
- * faults, and we cannot just set TASK_INTERRUPTIBLE state when
- * queueing ourselves into the futex hash. This code thus has to
- * rely on the futex_wake() code removing us from hash when it
- * wakes us up.
- */
set_current_state(TASK_INTERRUPTIBLE);
+ queue_me(q, hb);
/* Arm the timer */
if (timeout) {
@@ -1658,8 +1649,8 @@ static void futex_wait_queue_me(struct f
}
/*
- * !plist_node_empty() is safe here without any lock.
- * q.lock_ptr != 0 is not safe, because of ordering against wakeup.
+ * If we have been removed from the hash list, then another task
+ * has tried to wake us, and we can skip the call to schedule().
*/
if (likely(!plist_node_empty(&q->list))) {
/*
next prev parent reply other threads:[~2009-10-16 17:18 UTC|newest]
Thread overview: 51+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <20091016170953.128828149@linux.site>
2009-10-16 17:14 ` [00/46] 2.6.31.5-stable review Greg KH
2009-10-16 17:09 ` [01/46] SCSI: Retry ADD_TO_MLQUEUE return value for EH commands Greg KH
2009-10-16 17:09 ` [02/46] SCSI: Fix protection scsi_data_buffer leak Greg KH
2009-10-16 17:09 ` [03/46] SCSI: sg: Free data buffers after calling blk_rq_unmap_user Greg KH
2009-10-16 17:09 ` [04/46] ARM: pxa: workaround errata #37 by not using half turbo switching Greg KH
2009-10-16 17:09 ` [05/46] tracing/filters: Fix memory leak when setting a filter Greg KH
2009-10-16 17:09 ` [06/46] x86/paravirt: Use normal calling sequences for irq enable/disable Greg KH
2009-10-16 17:10 ` [07/46] USB: ftdi_sio: remove tty->low_latency Greg KH
2009-10-16 17:10 ` [08/46] USB: ftdi_sio: remove unused rx_byte counter Greg KH
2009-10-16 17:10 ` [09/46] USB: ftdi_sio: clean up read completion handler Greg KH
2009-10-16 17:10 ` [10/46] USB: ftdi_sio: re-implement read processing Greg KH
2009-10-16 17:10 ` [11/46] USB: pl2303: fix error characters not being reported to ldisc Greg KH
2009-10-16 17:10 ` [12/46] USB: digi_acceleport: Fix broken unthrottle Greg KH
2009-10-16 17:10 ` [13/46] USB: serial: dont call release without attach Greg KH
2009-10-16 17:10 ` [14/46] USB: option: Toshiba G450 device id Greg KH
2009-10-16 17:10 ` [15/46] USB: ipaq: fix oops when device is plugged in Greg KH
2009-10-16 17:10 ` [16/46] USB: cp210x: Add support for the DW700 UART Greg KH
2009-10-16 17:10 ` [17/46] USB: Fix throttling in generic usbserial driver Greg KH
2009-10-16 17:10 ` [18/46] USB: storage: When a device returns no sense data, call it a Hardware Error Greg KH
2009-10-16 17:10 ` [19/46] arm, cris, mips, sparc, powerpc, um, xtensa: fix build with bash 4.0 Greg KH
2009-10-16 17:10 ` [20/46] intel-iommu: Cope with broken HP DC7900 BIOS Greg KH
2009-10-16 17:10 ` [21/46] futex: Detect mismatched requeue targets Greg KH
2009-10-16 17:10 ` Greg KH [this message]
2009-10-16 17:10 ` [23/46] tpm-fixup-pcrs-sysfs-file-update Greg KH
2009-10-16 17:10 ` [24/46] TPM: fix pcrread Greg KH
2009-10-16 17:10 ` [25/46] Bluetooth: Disconnect HIDRAW devices on disconnect Greg KH
2009-10-16 17:10 ` [26/46] Bluetooth: Add extra device reference counting for connections Greg KH
2009-10-16 17:10 ` [27/46] Bluetooth: Let HIDP grab the device reference " Greg KH
2009-10-16 17:10 ` [28/46] connector: Keep the skb in cn_callback_data Greg KH
2009-10-16 17:10 ` [29/46] connector: Provide the senders credentials to the callback Greg KH
2009-10-16 17:10 ` [30/46] connector: Removed the destruct_data callback since it is always kfree_skb() Greg KH
2009-10-16 17:10 ` [31/46] dm/connector: Only process connector packages from privileged processes Greg KH
2009-10-16 17:10 ` [32/46] dst/connector: Disallow unpliviged users to configure dst Greg KH
2009-10-16 17:10 ` [33/46] pohmelfs/connector: Disallow unpliviged users to configure pohmelfs Greg KH
2009-10-16 17:10 ` [34/46] uvesafb/connector: Disallow unpliviged users to send netlink packets Greg KH
2009-10-16 17:10 ` [35/46] e1000e: swap max hw supported frame size between 82574 and 82583 Greg KH
2009-10-16 17:10 ` [36/46] MAINTAINERS: Fix Riku Voipios address Greg KH
2009-10-16 17:10 ` [37/46] macintosh: Dont assume i2c device probing always succeeds Greg KH
2009-10-16 17:10 ` [38/46] i2c: Hide probe errors caused by ACPI resource conflicts Greg KH
2009-10-16 17:10 ` [39/46] ALSA: Dont assume i2c device probing always succeeds Greg KH
2009-10-16 17:10 ` [40/46] bsdacct: switch credentials for writing to the accounting file Greg KH
2009-10-16 17:10 ` [41/46] sysfs: Allow sysfs_notify_dirent to be called from interrupt context Greg KH
2009-10-16 17:10 ` [42/46] Staging: rt2860sta: prevent a panic when disabling when associated Greg KH
2009-10-16 17:10 ` [43/46] usb-storage: Workaround devices with bogus sense size Greg KH
2009-10-16 17:10 ` [44/46] iwlwifi: incorrect method used for finding valid OTP blocks Greg KH
2009-10-16 17:10 ` [45/46] mac80211: fix vlan and optimise RX Greg KH
2009-10-16 17:10 ` [46/46] tty: Make flush_to_ldisc() locking more robust Greg KH
2009-10-16 17:57 ` [00/46] 2.6.31.5-stable review Massimo Cetra
2009-10-16 18:03 ` Greg KH
2009-10-17 5:20 ` Massimo Cetra
2009-11-05 23:12 ` Greg KH
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20091016171158.014986323@linux.site \
--to=gregkh@suse.de \
--cc=akpm@linux-foundation.org \
--cc=alan@lxorguk.ukuu.org.uk \
--cc=dino@in.ibm.com \
--cc=dvhltc@us.ibm.com \
--cc=eric.dumazet@gmail.com \
--cc=johnstul@us.ibm.com \
--cc=linux-kernel@vger.kernel.org \
--cc=mingo@elte.hu \
--cc=peterz@infradead.org \
--cc=rostedt@goodmis.org \
--cc=stable-review@kernel.org \
--cc=stable@kernel.org \
--cc=torvalds@linux-foundation.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox