From: hitmoon <zhaoxiaoqiang007@gmail.com>
To: linux-fsdevel@vger.kernel.org
Cc: Andrew Morton <akpm@linux-foundation.org>,
jack@suse.cz, hannes@cmpxchg.org
Subject: fs: clear_inode failed with nrpages not zero!
Date: Wed, 26 Feb 2014 16:40:44 +0800
Message-ID: <530DA88C.7020201@gmail.com>
Hi all:
I am running an official Red Hat 2.6.32-279 kernel. Under heavy
workload and memory pressure (in my case, running the LTP tests for
about 20 hours), a kernel oops happened. Concretely, a testcase process
opens a file, truncates it to 128M, mmaps it, munmaps it and closes the
file, and it was repeating this cycle when the kernel hung. From the
vmcore I found that it stopped at BUG_ON(inode->i_data.nrpages) in
clear_inode(), which means that truncate_inode_pages() failed to
decrease nrpages to 0. I have googled this problem and found no clear
solution, only things that confused me further. The comment on
truncate_inode_pages() says that after it returns, nrpages may not be
zero.
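To make the testcase concrete, here is a minimal sketch of that cycle in
C. It is not the actual LTP test: the helper name mmap_cycle() and its
parameters are hypothetical, and touching one byte per page (so that the
page cache really gets populated) is an assumption on my side.

```c
#include <fcntl.h>
#include <stddef.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper: repeat open / truncate / mmap / munmap / close
 * on one file. Returns 0 on success, -1 on the first failing call.
 */
int mmap_cycle(const char *path, size_t len, int iterations)
{
    long page = sysconf(_SC_PAGESIZE);

    if (page <= 0)
        return -1;

    for (int i = 0; i < iterations; i++) {
        int fd = open(path, O_RDWR | O_CREAT, 0600);
        if (fd < 0)
            return -1;

        /* Set the file size (128M in the report). */
        if (ftruncate(fd, (off_t)len) < 0) {
            close(fd);
            return -1;
        }

        char *p = mmap(NULL, len, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
        if (p == MAP_FAILED) {
            close(fd);
            return -1;
        }

        /* Assumption: write one byte per page to instantiate page cache pages. */
        for (size_t off = 0; off < len; off += (size_t)page)
            p[off] = (char)i;

        if (munmap(p, len) < 0) {
            close(fd);
            return -1;
        }
        if (close(fd) < 0)
            return -1;
    }
    return 0;
}
```

Something like mmap_cycle("/tmp/testfile", 128UL << 20, 1000) called from
a main() would approximate the load. By itself this only exercises the
page cache; the BUG_ON needs the inode to be evicted as well, which in my
case happens under memory pressure.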
My understanding is that page reclaim might still be in the process
of deleting a page. Jan Kara once posted a patch which uses a spin_lock
to synchronize the radix tree and nrpages. This kernel already contains
that patch. Here is the problem: when the kernel hangs, nrpages is not a
small number like 1 or 2 but a much bigger one, more than 500 or 700! So
I think that even if we take some synchronization measures before
clearing the inode, truncate_inode_pages() together with the other
reclaim functions failed to bring nrpages to zero. Dumping the vmcore, I
also found that the radix tree is not empty but still has some slots
left.
Then I think:
1. The fault might be in pagevec_lookup(), which returns no pages
even though the radix tree is in fact not empty. Because the lookup uses
the RCU lock, is it possible that a race condition happened during the
lookup and made the function return unexpectedly? If it is possible,
how does it happen?
2. I found that Johannes Weiner posted a patch
(http://www.spinics.net/lists/linux-fsdevel/msg72395.html) which has
the following code:
+	if (nrpages || nrshadows) {
+		/*
+		 * As truncation uses a lockless tree lookup, cycle
+		 * the tree lock to make sure any ongoing tree
+		 * modification that does not see AS_EXITING is
+		 * completed before starting the final truncate.
+		 */
+		spin_lock_irq(&mapping->tree_lock);
+		spin_unlock_irq(&mapping->tree_lock);
+
+		truncate_inode_pages(mapping, 0);
+	}
which wraps truncate_inode_pages() in a new function,
truncate_inode_pages_final(). Could this be relevant to my problem?
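To check that I understand the lock-cycling idea, here is a small
userspace model of it in C. This is only a sketch and not kernel code:
struct model_mapping, model_add_page() and model_truncate_final() are
hypothetical names, a pthread mutex stands in for tree_lock and an
atomic flag stands in for AS_EXITING.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stdbool.h>

/* Hypothetical stand-in for an address_space. */
struct model_mapping {
    pthread_mutex_t tree_lock;  /* plays the role of mapping->tree_lock */
    atomic_bool exiting;        /* plays the role of AS_EXITING */
    unsigned long nrpages;      /* modified only under tree_lock */
};

void model_init(struct model_mapping *m)
{
    pthread_mutex_init(&m->tree_lock, NULL);
    atomic_init(&m->exiting, false);
    m->nrpages = 0;
}

/* Modifier side: change the "tree" under the lock unless teardown began. */
bool model_add_page(struct model_mapping *m)
{
    bool added = false;

    pthread_mutex_lock(&m->tree_lock);
    if (!atomic_load(&m->exiting)) {
        m->nrpages++;
        added = true;
    }
    pthread_mutex_unlock(&m->tree_lock);
    return added;
}

/* Teardown side: returns the number of pages dropped. */
unsigned long model_truncate_final(struct model_mapping *m)
{
    unsigned long dropped;

    /* Announce teardown first, outside the lock. */
    atomic_store(&m->exiting, true);

    /*
     * Cycle the lock: a modifier that took the lock before it could see
     * the flag must have released it by the time we acquire it here.
     */
    pthread_mutex_lock(&m->tree_lock);
    pthread_mutex_unlock(&m->tree_lock);

    /* Every later modifier sees the flag and backs off, so this is stable. */
    dropped = m->nrpages;
    m->nrpages = 0;
    return dropped;
}
```

In this model, once model_truncate_final() has cycled the lock, the
counter cannot grow any more, so the final truncate sees a stable state.
Whether the same reasoning covers the large nrpages I see on 2.6.32 is
exactly what I am unsure about.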
Any suggestions will be appreciated!