From: hitmoon <zhaoxiaoqiang007@gmail.com>
To: linux-fsdevel@vger.kernel.org
Cc: Andrew Morton <akpm@linux-foundation.org>,
jack@suse.cz, hannes@cmpxchg.org
Subject: fs: clear_inode failed with nrpages not zero!
Date: Wed, 26 Feb 2014 16:40:44 +0800
Message-ID: <530DA88C.7020201@gmail.com>
Hi all:
I am running an official Red Hat 2.6.32-279 kernel. Under heavy
workload and memory pressure (in my case, running the LTP tests for about
20 hours), a kernel oops happened. Concretely, a testcase process opens a
file, truncates it to 128M, mmaps it, munmaps it and closes the file, and
it was repeating this cycle when the kernel hung. From the vmcore, I found
that it stopped at BUG_ON(inode->i_data.nrpages) in function clear_inode,
which means that truncate_inode_pages failed to decrease nrpages to 0. I
have googled this problem and found no clear solution, only material that
left me confused. The comment on function truncate_inode_pages says that
after it returns, nrpages may not be zero.
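For reference, the testcase cycle described above can be sketched in
userspace roughly as follows. This is only an illustration under
assumptions, not the actual LTP testcase: the function name run_cycles,
the file path passed to it, and the single write into the mapping are all
made up for the example.

```c
#include <fcntl.h>
#include <sys/mman.h>
#include <sys/types.h>
#include <unistd.h>

#define FILE_SIZE (128L * 1024 * 1024) /* 128M, as in the description above */

/*
 * Hypothetical helper: repeat the open/truncate/mmap/munmap/close cycle
 * `iterations` times on `path`. Returns 0 on success, -1 on the first
 * failing system call.
 */
int run_cycles(const char *path, int iterations)
{
	for (int i = 0; i < iterations; i++) {
		int fd = open(path, O_RDWR | O_CREAT, 0600);
		if (fd < 0)
			return -1;

		if (ftruncate(fd, FILE_SIZE) < 0) {
			close(fd);
			return -1;
		}

		char *p = mmap(NULL, FILE_SIZE, PROT_READ | PROT_WRITE,
			       MAP_SHARED, fd, 0);
		if (p == MAP_FAILED) {
			close(fd);
			return -1;
		}

		/* Assumption: touch one byte so a page enters the page cache. */
		p[0] = 1;

		if (munmap(p, FILE_SIZE) < 0) {
			close(fd);
			return -1;
		}
		if (close(fd) < 0)
			return -1;
	}
	return 0;
}
```

Calling run_cycles("/tmp/testfile", 1000) from a small main() would repeat
the cycle 1000 times; the path and the count are placeholders.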
My understanding is: page reclaim might still be in the process
of deleting the page. Jan Kara once posted a patch which uses a spin_lock
to sync the radix tree and nrpages. This kernel already contains this
patch. Then the problem comes: when the kernel hangs, nrpages is not a
small number like 1 or 2, but a bigger one, more than 500 or 700! So I
think that even if we take some sync measures before clearing the inode,
the function truncate_inode_pages together with the other reclaim
functions fails to set nrpages to zero. By dumping the vmcore, I also
found that the radix tree is not empty either, but has some slots left.
Then I think:
1. The fault might happen in pagevec_lookup, which returns no pages
even though the radix tree is in fact not empty. Because the lookup uses
the RCU lock, is it possible that a race condition happened in the lookup
process and led the function to return unexpectedly? If that is possible,
how does it happen?
2. I found that Johannes Weiner posted a patch
(http://www.spinics.net/lists/linux-fsdevel/msg72395.html), which has the
following code:
+	if (nrpages || nrshadows) {
+		/*
+		 * As truncation uses a lockless tree lookup, cycle
+		 * the tree lock to make sure any ongoing tree
+		 * modification that does not see AS_EXITING is
+		 * completed before starting the final truncate.
+		 */
+		spin_lock_irq(&mapping->tree_lock);
+		spin_unlock_irq(&mapping->tree_lock);
+
+		truncate_inode_pages(mapping, 0);
+	}
which wraps truncate_inode_pages in the function
truncate_inode_pages_final. Does it make sense for my problem?
Any suggestions will be appreciated!
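To make the lock-cycling idea in the quoted patch easier to discuss, here
is a small userspace analogy. It is not kernel code and is not taken from
the patch: toy_mapping, toy_add_page, toy_truncate and toy_truncate_final
are hypothetical names, a pthread mutex stands in for tree_lock, and an
atomic flag stands in for AS_EXITING. Unlike the patch, the sketch always
cycles the lock instead of first checking nrpages. The assumption, based on
the comment in the patch, is that modifiers check the flag while holding
the lock.

```c
#include <pthread.h>
#include <stdatomic.h>

/* Toy stand-in for an address_space; all names here are hypothetical. */
struct toy_mapping {
	pthread_mutex_t tree_lock; /* plays the role of mapping->tree_lock */
	atomic_int exiting;        /* plays the role of AS_EXITING */
	unsigned long nrpages;     /* entries currently in the "tree" */
};

#define TOY_MAPPING_INIT { PTHREAD_MUTEX_INITIALIZER, 0, 0 }

/* A modifier: adds an entry only if teardown has not started. */
int toy_add_page(struct toy_mapping *m)
{
	int ret = -1;

	pthread_mutex_lock(&m->tree_lock);
	if (!atomic_load(&m->exiting)) {
		m->nrpages++;
		ret = 0;
	}
	pthread_mutex_unlock(&m->tree_lock);
	return ret;
}

/* Remove everything; the count only changes under the lock. */
void toy_truncate(struct toy_mapping *m)
{
	pthread_mutex_lock(&m->tree_lock);
	m->nrpages = 0;
	pthread_mutex_unlock(&m->tree_lock);
}

/* Final teardown: set the flag, cycle the lock, then truncate. */
void toy_truncate_final(struct toy_mapping *m)
{
	atomic_store(&m->exiting, 1);

	/*
	 * Cycle the lock: a modifier that took the lock without seeing
	 * the flag must release it before we can get past this point.
	 */
	pthread_mutex_lock(&m->tree_lock);
	pthread_mutex_unlock(&m->tree_lock);

	toy_truncate(m);
}
```

In this toy model, once the flag is set and the lock has been taken and
released once, any modifier that did not see the flag has already
finished, and any later one sees the flag and backs off, so the final
truncate cannot race with a new insertion.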