public inbox for linux-kernel@vger.kernel.org
 help / color / mirror / Atom feed
From: Nicolas Bareil <nico@chdir.org>
To: cl@linux.com, mel@csn.ul.ie
Cc: akpm@linux-foundation.org, torvalds@linux-foundation.org,
	linux-kernel@vger.kernel.org
Subject: [BISECTED][REGRESSION] INFO: rcu_sched_state detected stall on CPU
Date: Thu, 06 Jan 2011 12:38:15 +0100	[thread overview]
Message-ID: <87mxne6z4o.fsf@puppet.chdir.org> (raw)


Hello

On my two HP Proliant DL160 G6, the system locks up for tens of seconds
when I copy a regular file into a LVM volume with the following command
line:

$ sudo dd if=5gigabytesfile of=/dev/hosts/myvol bs=4096

Logs are filled with call traces and theses messages:

  kernel: INFO: rcu_sched_state detected stall on CPU 5 (t=6000 jiffies)
  kernel: Uhhuh. NMI received for unknown reason 00 on CPU 7.
  kernel: Do you have a strange power saving mode enabled?
  kernel: Dazed and confused, but trying to continue


 My .config is available here : http://chdir.org/~nbareil/aa45484031/config-2.6.37.gz
 The (big!) kern.log is here  : http://chdir.org/~nbareil/aa45484031/kern.log.gz
 My System.map                : http://chdir.org/~nbareil/aa45484031/System.map-2.6.37.gz

After bisection, the culprit is aa45484031, to be 100% sure, I compiled
a 2.6.37 with this commit reverted and it works.

As a reminder, here is the commit:

commit aa45484031ddee09b06350ab8528bfe5b2c76d1c
Author: Christoph Lameter <cl@linux.com>
Date:   Thu Sep 9 16:38:17 2010 -0700

    mm: page allocator: calculate a better estimate of NR_FREE_PAGES
    when memory is low and kswapd is awake
    
    Ordinarily watermark checks are based on the vmstat NR_FREE_PAGES as
    it is cheaper than scanning a number of lists.  To avoid
    synchronization overhead, counter deltas are maintained on a per-cpu
    basis and drained both periodically and when the delta is above a
    threshold.  On large CPU systems, the difference between the
    estimated and real value of NR_FREE_PAGES can be very high.  If
    NR_FREE_PAGES is much higher than number of real free page in buddy,
    the VM can allocate pages below min watermark, at worst reducing the
    real number of pages to zero.  Even if the OOM killer kills some
    victim for freeing memory, it may not free memory if the exit path
    requires a new page resulting in livelock.
    
    This patch introduces a zone_page_state_snapshot() function
    (courtesy of Christoph) that takes a slightly more accurate view of
    an arbitrary vmstat counter.  It is used to read NR_FREE_PAGES while
    kswapd is awake to avoid the watermark being accidentally broken.
    The estimate is not perfect and may result in cache line bounces but
    is expected to be lighter than the IPI calls necessary to
    continually drain the per-cpu counters while kswapd is awake.
    
    Signed-off-by: Christoph Lameter <cl@linux.com>
    Signed-off-by: Mel Gorman <mel@csn.ul.ie>
    Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
    Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>


Let me know if you need anything.

             reply	other threads:[~2011-01-06 11:48 UTC|newest]

Thread overview: 3+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2011-01-06 11:38 Nicolas Bareil [this message]
2011-01-06 11:45 ` [BISECTED][REGRESSION] INFO: rcu_sched_state detected stall on CPU Mel Gorman
2011-01-06 12:00   ` Nicolas Bareil

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=87mxne6z4o.fsf@puppet.chdir.org \
    --to=nico@chdir.org \
    --cc=akpm@linux-foundation.org \
    --cc=cl@linux.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mel@csn.ul.ie \
    --cc=torvalds@linux-foundation.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox