public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Greg KH <gregkh@suse.de>
To: linux-kernel@vger.kernel.org, stable@kernel.org, jejb@kernel.org
Cc: Justin Forbes <jmforbes@linuxtx.org>,
	Zwane Mwaikambo <zwane@arm.linux.org.uk>,
	"Theodore Ts'o" <tytso@mit.edu>,
	Randy Dunlap <rdunlap@xenotime.net>,
	Dave Jones <davej@redhat.com>,
	Chuck Wolber <chuckw@quantumlinux.com>,
	Chris Wedgwood <reviews@ml.cw.f00f.org>,
	Michael Krufky <mkrufky@linuxtv.org>,
	Chuck Ebbert <cebbert@redhat.com>,
	Domenico Andreoli <cavokz@gmail.com>, Willy Tarreau <w@1wt.eu>,
	Rodrigo Rubira Branco <rbranco@la.checkpoint.com>,
	Jake Edge <jake@lwn.net>, Eugene Teo <eteo@redhat.com>,
	torvalds@linux-foundation.org, akpm@linux-foundation.org,
	alan@lxorguk.ukuu.org.uk,
	"Paul E. McKenney" <paulmck@linux.vnet.ibm.com>,
	Peter Zijlstra <a.p.zijlstra@chello.nl>,
	Josh Triplett <josh@kernel.org>,
	Dipankar Sarma <dipankar@in.ibm.com>, Ingo Molnar <mingo@elte.hu>
Subject: [patch 16/62] rcu: fix rcu_try_flip_waitack_needed() to prevent grace-period stall
Date: Wed, 30 Jul 2008 16:58:21 -0700	[thread overview]
Message-ID: <20080730235821.GP12896@suse.de> (raw)
In-Reply-To: <20080730234915.GA12426@suse.de>

[-- Attachment #1: rcu-fix-rcu_try_flip_waitack_needed-to-prevent-grace-period-stall.patch --]
[-- Type: text/plain, Size: 3058 bytes --]

2.6.26 -stable review patch.  If anyone has any objections, please let
us know.

------------------
From: Paul E. McKenney <paulmck@linux.vnet.ibm.com>

commit d7c0651390b6a03ad53f99faec0ba88109d7191d upstream

The comment was correct -- need to make the code match the comment.
Without this patch, if a CPU goes dynticks idle (and stays there forever)
in just the right phase of preemptible-RCU grace-period processing,
grace periods stall.  The offending sequence of events (courtesy
of Promela/spin, at least after I got the liveness criterion coded
correctly...) is as follows:

o	CPU 0 is in dynticks-idle mode.  Its dynticks_progress_counter
	is (say) 10.

o	CPU 0 takes an interrupt, so rcu_irq_enter() increments CPU 0's
	dynticks_progress_counter to 11.

o	CPU 1 is doing RCU grace-period processing in rcu_try_flip_idle(),
	sees rcu_pending(), so invokes dyntick_save_progress_counter(),
	which in turn takes a snapshot of CPU 0's dynticks_progress_counter
	into CPU 0's rcu_dyntick_snapshot -- now set to 11.  CPU 1 then
	updates the RCU grace-period state to rcu_try_flip_waitack().

o	CPU 0 returns from its interrupt, so rcu_irq_exit() increments
	CPU 0's dynticks_progress_counter to 12.

o	CPU 1 later invokes rcu_try_flip_waitack(), which notices that
	CPU 0 has not yet responded, and hence in turn invokes
	rcu_try_flip_waitack_needed().  This function examines the
	state of CPU 0's dynticks_progress_counter and rcu_dyntick_snapshot
	variables, which it copies to curr (== 12) and snap (== 11),
	respectively.

	Because curr!=snap, the first condition fails.

	Because curr-snap is only 1 and snap is odd, the second
	condition fails.

	rcu_try_flip_waitack_needed() therefore incorrectly concludes
	that it must wait for CPU 0 to explicitly acknowledge the
	counter flip.

o	CPU 0 remains forever in dynticks-idle mode, never taking
	any more hardware interrupts or any NMIs, and never running
	any more tasks.  (Of course, -something- will usually eventually
	happen, which might be why we haven't seen this one in the
	wild.  Still should be fixed!)

Therefore the grace period never ends.  Fix is to make the code match
the comment, as shown below.  With this fix, the above scenario
would be satisfied with curr being even, and allow the grace period
to proceed.

Signed-off-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Cc: Josh Triplett <josh@kernel.org>
Cc: Dipankar Sarma <dipankar@in.ibm.com>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>

---
 kernel/rcupreempt.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

--- a/kernel/rcupreempt.c
+++ b/kernel/rcupreempt.c
@@ -567,7 +567,7 @@ rcu_try_flip_waitack_needed(int cpu)
 	 * that this CPU already acknowledged the counter.
 	 */
 
-	if ((curr - snap) > 2 || (snap & 0x1) == 0)
+	if ((curr - snap) > 2 || (curr & 0x1) == 0)
 		return 0;
 
 	/* We need this CPU to explicitly acknowledge the counter flip. */

-- 

  parent reply	other threads:[~2008-07-31  0:10 UTC|newest]

Thread overview: 64+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <20080730233050.332789722@mini.kroah.org>
2008-07-30 23:49 ` [patch 00/62] 2.6.26-stable review Greg KH
2008-07-30 23:57   ` [patch 01/62] pxamci: trivial fix of DMA alignment register bit clearing Greg KH
2008-07-30 23:57   ` [patch 02/62] udplite: Protection against coverage value wrap-around Greg KH
2008-07-30 23:57   ` [patch 03/62] ipv6: use timer pending Greg KH
2008-07-30 23:57   ` [patch 04/62] ipv6: __KERNEL__ ifdef struct ipv6_devconf Greg KH
2008-07-30 23:57   ` [patch 05/62] hdlcdrv: Fix CRC calculation Greg KH
2008-07-30 23:57   ` [patch 06/62] quota: fix possible infinite loop in quota code Greg KH
2008-07-30 23:58   ` [patch 07/62] isofs: fix minor filesystem corruption Greg KH
2008-07-30 23:58   ` [patch 08/62] KVM: VMX: Fix a wrong usage of vmcs_config Greg KH
2008-07-30 23:58   ` [patch 09/62] KVM: SVM: fix suspend/resume support Greg KH
2008-07-30 23:58   ` [patch 10/62] KVM: mmu_shrink: kvm_mmu_zap_page requires slots_lock to be held Greg KH
2008-07-30 23:58   ` [patch 11/62] KVM: VMX: Add ept_sync_context in flush_tlb Greg KH
2008-07-30 23:58   ` [patch 12/62] KVM: x86 emulator: Fix HLT instruction Greg KH
2008-07-30 23:58   ` [patch 13/62] KVM: MMU: nuke shadowed pgtable pages and ptes on memslot destruction Greg KH
2008-07-30 23:58   ` [patch 14/62] KVM: MMU: Fix potential race setting upper shadow ptes on nonpae hosts Greg KH
2008-07-30 23:58   ` [patch 15/62] Patch Upstream: x86 ptrace: fix PTRACE_GETFPXREGS error Greg KH
2008-07-30 23:58   ` Greg KH [this message]
2008-07-30 23:58   ` [patch 17/62] Fix typos from signal_32/64.h merge Greg KH
2008-07-30 23:58   ` [patch 18/62] x86 reboot quirks: add Dell Precision WorkStation T5400 Greg KH
2008-07-30 23:58   ` [patch 19/62] USB: fix usb serial pm counter decrement for disconnected interfaces Greg KH
2008-07-30 23:58   ` [patch 20/62] x86, suspend, acpi: enter Big Real Mode Greg KH
2008-08-05 12:15     ` Pavel Machek
2008-07-30 23:58   ` [patch 21/62] markers: fix duplicate modpost entry Greg KH
2008-07-30 23:58   ` [patch 22/62] Fix build on COMPAT platforms when CONFIG_EPOLL is disabled Greg KH
2008-07-30 23:58   ` [patch 24/62] cpusets: fix wrong domain attr updates Greg KH
2008-07-30 23:58   ` [patch 25/62] x86: fix crash due to missing debugctlmsr on AMD K6-3 Greg KH
2008-07-30 23:58   ` [patch 23/62] proc: fix /proc/*/pagemap Greg KH
2008-07-30 23:58   ` [patch 26/62] ide-cd: fix oops when using growisofs Greg KH
2008-07-30 23:58   ` [patch 27/62] rtc-at91rm9200: avoid spurious irqs Greg KH
2008-07-30 23:58   ` [patch 28/62] vmlinux.lds: move __attribute__((__cold__)) functions back into final .text section Greg KH
2008-07-30 23:58   ` [patch 29/62] ARM: fix fls() for 64-bit arguments Greg KH
2008-07-30 23:58   ` [patch 30/62] tcp: Clear probes_out more aggressively in tcp_ack() Greg KH
2008-07-30 23:58   ` [patch 31/62] sparc64: Fix lockdep issues in LDC protocol layer Greg KH
2008-07-30 23:59   ` [patch 32/62] sparc64: Fix cpufreq notifier registry Greg KH
2008-07-30 23:59   ` [patch 33/62] sparc64: Do not define BIO_VMERGE_BOUNDARY Greg KH
2008-07-30 23:59   ` [patch 34/62] iop-adma: fix platform driver hotplug/coldplug Greg KH
2008-07-30 23:59   ` [patch 35/62] myri10ge: do not forget to setup the single slice pointers Greg KH
2008-07-30 23:59   ` [patch 36/62] myri10ge: do not use mgp->max_intr_slots before loading the firmware Greg KH
2008-07-30 23:59   ` [patch 37/62] ALSA: trident - pause s/pdif output Greg KH
2008-07-30 23:59   ` [patch 38/62] V4L: cx18: Upgrade to newer firmware & update documentation Greg KH
2008-07-30 23:59   ` [patch 39/62] DVB: dib0700: add support for Hauppauge Nova-TD Stick 52009 Greg KH
2008-07-30 23:59   ` [patch 40/62] V4L: uvcvideo: Fix a buffer overflow in format descriptor parsing Greg KH
2008-07-30 23:59   ` [patch 41/62] V4L: uvcvideo: Use GFP_NOIO when allocating memory during resume Greg KH
2008-07-30 23:59   ` [patch 42/62] V4L: uvcvideo: Dont free URB buffers on suspend Greg KH
2008-07-30 23:59   ` [patch 43/62] V4L: uvcvideo: Make input device support optional Greg KH
2008-07-30 23:59   ` [patch 44/62] V4L: uvcvideo: Add support for Medion Akoya Mini E1210 integrated webcam Greg KH
2008-07-30 23:59   ` [patch 45/62] V4L: saa7134: Copy tuner data earlier to avoid overwriting manual tuner type Greg KH
2008-07-30 23:59   ` [patch 46/62] V4L: cx23885: Bugfix for concurrent use of /dev/video0 and /dev/video1 Greg KH
2008-07-30 23:59   ` [patch 47/62] DVB: cx23885: Ensure PAD_CTRL is always reset to a sensible default Greg KH
2008-07-30 23:59   ` [patch 48/62] DVB: cx23885: DVB Transport cards using DVB port VIDB/TS1 did not stream Greg KH
2008-07-30 23:59   ` [patch 49/62] DVB: cx23885: Reallocated the sram to avoid concurrent VIDB/C issues Greg KH
2008-07-30 23:59   ` [patch 50/62] DVB: cx23885: SRAM changes for the 885 and 887 silicon parts Greg KH
2008-07-30 23:59   ` [patch 51/62] x86: fix kernel_physical_mapping_init() for large x86 systems Greg KH
2008-07-30 23:59   ` [patch 52/62] eCryptfs: use page_alloc not kmalloc to get a page of memory Greg KH
2008-07-30 23:59   ` [patch 53/62] UML - Fix boot crash Greg KH
2008-07-30 23:59   ` [patch 54/62] ixgbe: remove device ID for unsupported device Greg KH
2008-07-30 23:59   ` [patch 55/62] mpc52xx_psc_spi: fix block transfer Greg KH
2008-07-30 23:59   ` [patch 56/62] tmpfs: fix kernel BUG in shmem_delete_inode Greg KH
2008-07-30 23:59   ` [patch 57/62] markers: fix markers read barrier for multiple probes Greg KH
2008-07-31  0:00   ` [patch 58/62] VFS: increase pseudo-filesystem block size to PAGE_SIZE Greg KH
2008-07-31  0:00   ` [patch 59/62] cpufreq acpi: only call _PPC after cpufreq ACPI init funcs got called already Greg KH
2008-07-31  0:00   ` [patch 60/62] b43legacy: Release mutex in error handling code Greg KH
2008-07-31  0:00   ` [patch 61/62] ath5k: dont enable MSI, we cannot handle it yet Greg KH
2008-07-31  0:00   ` [patch 62/62] Fix off-by-one error in iov_iter_advance() Greg KH

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20080730235821.GP12896@suse.de \
    --to=gregkh@suse.de \
    --cc=a.p.zijlstra@chello.nl \
    --cc=akpm@linux-foundation.org \
    --cc=alan@lxorguk.ukuu.org.uk \
    --cc=cavokz@gmail.com \
    --cc=cebbert@redhat.com \
    --cc=chuckw@quantumlinux.com \
    --cc=davej@redhat.com \
    --cc=dipankar@in.ibm.com \
    --cc=eteo@redhat.com \
    --cc=jake@lwn.net \
    --cc=jejb@kernel.org \
    --cc=jmforbes@linuxtx.org \
    --cc=josh@kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mingo@elte.hu \
    --cc=mkrufky@linuxtv.org \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=rbranco@la.checkpoint.com \
    --cc=rdunlap@xenotime.net \
    --cc=reviews@ml.cw.f00f.org \
    --cc=stable@kernel.org \
    --cc=torvalds@linux-foundation.org \
    --cc=tytso@mit.edu \
    --cc=w@1wt.eu \
    --cc=zwane@arm.linux.org.uk \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox