From: Jeff Moyer <jmoyer@redhat.com>
To: Dave Chinner <david@fromorbit.com>
Cc: Fengguang Wu <fengguang.wu@intel.com>,
Wanpeng Li <liwp.linux@gmail.com>,
Alexander Viro <viro@zeniv.linux.org.uk>,
linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
Gavin Shan <shangw@linux.vnet.ibm.com>
Subject: Re: [PATCH V2] writeback: fix hung_task alarm when sync block
Date: Tue, 19 Jun 2012 17:09:22 -0400 [thread overview]
Message-ID: <x49vcinhuv1.fsf@segfault.boston.devel.redhat.com> (raw)
In-Reply-To: <20120619210212.GN25389@dastard> (Dave Chinner's message of "Wed, 20 Jun 2012 07:02:12 +1000")
Dave Chinner <david@fromorbit.com> writes:
> On Tue, Jun 19, 2012 at 04:14:16PM -0400, Jeff Moyer wrote:
>> Fengguang Wu <fengguang.wu@intel.com> writes:
>>
>> > Good idea! Yes we can do some estimation and adaptively extend the
>> > hang timeout for the current writeback_inodes_sb_nr()/sync_inodes_sb()
>> > call.
>> >
>> > Note that it's not going to reliably get rid of false warnings due to
>> > estimation errors, which could be pretty large and unavoidable on
>> > change of workload. But still, it would be a net improvement and
>> > perhaps enough to get rid of most false warnings, while still being
>> > able to catch livelock or other kind of task hang.
>>
>> Hi, Fengguang,
>>
>> I didn't see a patch from you for this, so I went ahead and threw
>> something together. Let me know what you think of it. I wasn't sure
>> how to estimate the total I/O that will be issued for syncing out an
>> entire superblock, though, so I didn't do that part.
>
> As I said to the original patch - having a hang check timeout on a
> system that is overloaded w.r.t. IO is an important piece of
> information when it comes to debugging problems. Often the hangcheck
> timer is the first piece of information that we will get that
> indicates a problem somewhere in a production system.
So, you believe that we should always check at 2 minute intervals (or
whatever is configured), even if we know there is more than that much
I/O queued? In case there is any confusion, here, the patch I posted
ensured that we would eventually spew a warning, but only if the process
was blocked for longer than we (the kernel) expected.
> Removing it does not magically fix the underlying problem - it
> simply means that we don't hear about them until someone complains
> that unmount is taking hours....
There isn't necessarily an underlying problem. This is very much a gray
area, Dave. We get plenty of false positives in this code. I was
trying to reduce *that* problem. Do you have a better idea on how to
address the issue?
Maybe this discussion requires looking at specific instances of the
problem so we're all on the same page. What do you think is the best
way forward, here?
Cheers,
Jeff
next prev parent reply other threads:[~2012-06-19 21:09 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2012-06-13 4:42 [PATCH V2] writeback: fix hung_task alarm when sync block Wanpeng Li
2012-06-13 14:27 ` Jeff Moyer
2012-06-13 14:48 ` Fengguang Wu
2012-06-13 14:55 ` Fengguang Wu
2012-06-13 15:34 ` Jeff Moyer
2012-06-14 13:36 ` Fengguang Wu
2012-06-19 20:14 ` Jeff Moyer
2012-06-19 21:02 ` Dave Chinner
2012-06-19 21:09 ` Jeff Moyer [this message]
2012-06-19 21:56 ` Dave Chinner
2012-06-14 1:35 ` Wanpeng Li
2012-06-14 13:26 ` Fengguang Wu
2012-06-15 22:43 ` Dave Chinner
2012-06-14 10:52 ` Wanpeng Li
2012-06-15 22:38 ` Dave Chinner
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=x49vcinhuv1.fsf@segfault.boston.devel.redhat.com \
--to=jmoyer@redhat.com \
--cc=david@fromorbit.com \
--cc=fengguang.wu@intel.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=liwp.linux@gmail.com \
--cc=shangw@linux.vnet.ibm.com \
--cc=viro@zeniv.linux.org.uk \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).