linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
From: "Rafael J. Wysocki" <rafael.j.wysocki@intel.com>
To: "Johannes Weiner" <hannes@cmpxchg.org>,
	"\"Rodolfo García Peñas (kix)\"" <kix@kix.es>
Cc: Oliver Winker <oliverml1@oli1170.net>, Jan Kara <jack@suse.cz>,
	Andrew Morton <akpm@linux-foundation.org>,
	bugzilla-daemon@bugzilla.kernel.org, linux-mm@kvack.org,
	Maxim Patlasov <mpatlasov@parallels.com>,
	Fengguang Wu <fengguang.wu@intel.com>, Tejun Heo <tj@kernel.org>
Subject: Re: [Bug 75101] New: [bisected] s2disk / hibernate blocks on "Saving 506031 image data pages () ..."
Date: Fri, 13 Jun 2014 01:50:47 +0200	[thread overview]
Message-ID: <539A3CD7.6080100@intel.com> (raw)
In-Reply-To: <20140612220200.GA25344@cmpxchg.org>

On 6/13/2014 12:02 AM, Johannes Weiner wrote:
> On Tue, May 06, 2014 at 01:45:01AM +0200, Rafael J. Wysocki wrote:
>> On 5/6/2014 1:33 AM, Johannes Weiner wrote:
>>> Hi Oliver,
>>>
>>> On Mon, May 05, 2014 at 11:00:13PM +0200, Oliver Winker wrote:
>>>> Hello,
>>>>
>>>> 1) Attached a full function-trace log + other SysRq outputs, see [1]
>>>> attached.
>>>>
>>>> I saw bdi_...() calls in the s2disk paths, but didn't check in detail
>>>> Probably more efficient when one of you guys looks directly.
>>> Thanks, this looks interesting.  balance_dirty_pages() wakes up the
>>> bdi_wq workqueue as it should:
>>>
>>> [  249.148009]   s2disk-3327    2.... 48550413us : global_dirty_limits <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : global_dirtyable_memory <-global_dirty_limits
>>> [  249.148009]   s2disk-3327    2.... 48550414us : writeback_in_progress <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : bdi_start_background_writeback <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : mod_delayed_work_on <-balance_dirty_pages_ratelimited
>>> but the worker wakeup doesn't actually do anything:
>>> [  249.148009] kworker/-3466    2d... 48550431us : finish_task_switch <-__schedule
>>> [  249.148009] kworker/-3466    2.... 48550431us : _raw_spin_lock_irq <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550431us : need_to_create_worker <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550432us : worker_enter_idle <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550432us : too_many_workers <-worker_enter_idle
>>> [  249.148009] kworker/-3466    2.... 48550432us : schedule <-worker_thread
>>> [  249.148009] kworker/-3466    2.... 48550432us : __schedule <-worker_thread
>>>
>>> My suspicion is that this fails because the bdi_wq is frozen at this
>>> point and so the flush work never runs until resume, whereas before my
>>> patch the effective dirty limit was high enough so that image could be
>>> written in one go without being throttled; followed by an fsync() that
>>> then writes the pages in the context of the unfrozen s2disk.
>>>
>>> Does this make sense?  Rafael?  Tejun?
>> Well, it does seem to make sense to me.
>  From what I see, this is a deadlock in the userspace suspend model and
> just happened to work by chance in the past.

Well, it had been working for quite a while, so it was a rather large 
opportunity
window it seems. :-)

> Can we patch suspend-utils as follows?

Perhaps we can.  Let's ask the new maintainer.

Rodolfo, do you think you can apply the patch below to suspend-utils?

> Alternatively, suspend-utils
> could clear the dirty limits before it starts writing and restore them
> post-resume.

That (and the patch too) doesn't seem to address the problem with 
existing suspend-utils
binaries, however.

Rafael


> ---
>  From 73d6546d5e264130e3d108c97d8317f86dc11149 Mon Sep 17 00:00:00 2001
> From: Johannes Weiner <hannes@cmpxchg.org>
> Date: Thu, 12 Jun 2014 17:43:05 -0400
> Subject: [patch] s2disk: fix buffered IO throttling deadlock in frozen state
>
> s2disk uses buffered IO when writing the snapshot image to disk.  If
> it runs into the dirty limits, the kernel forces it to wait until the
> flusher threads clean some of the dirty pages.  However, at this point
> s2disk already froze the system, including the flusher infrastructure,
> and the whole operation deadlocks.
>
> Open the resume device with O_SYNC to force flushing any dirty pages
> directly from the write() context before they accumulate and engage
> dirty throttling.
>
> Signed-off-by: Johannes Weiner <hannes@cmpxchg.org>
> ---
>   suspend.c | 2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/suspend.c b/suspend.c
> index 479ce58555f7..1b9bed81f58a 100644
> --- a/suspend.c
> +++ b/suspend.c
> @@ -2436,7 +2436,7 @@ int main(int argc, char *argv[])
>   		suspend_error("Could not create %s/%s.", chroot_path, "resume");
>   		goto Umount;
>   	}
> -	resume_fd = open("resume", O_RDWR);
> +	resume_fd = open("resume", O_RDWR | O_SYNC);
>   	if (resume_fd < 0) {
>   		ret = errno;
>   		suspend_error("Could not open the resume device.");

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2014-06-12 23:51 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-05-05 23:33 [Bug 75101] New: [bisected] s2disk / hibernate blocks on "Saving 506031 image data pages () ..." Johannes Weiner
2014-05-05 23:45 ` Rafael J. Wysocki
2014-06-12 22:02   ` Johannes Weiner
2014-06-12 23:50     ` Rafael J. Wysocki [this message]
2014-06-13  4:55       ` Johannes Weiner
2014-06-16 16:29         ` Rafael J. Wysocki
2019-04-02 23:25           ` Andrew Morton
2019-04-03  3:54             ` Matheus Fillipe
2019-04-03  8:23               ` Rainer Fiebig
2019-04-03  8:34             ` Rainer Fiebig
2019-04-03  9:34             ` Jan Kara
2019-04-03 10:04               ` Rainer Fiebig
2019-04-03 16:59                 ` Matheus Fillipe
2019-04-03 17:55                   ` Rainer Fiebig
2019-04-03 19:08                     ` Matheus Fillipe
     [not found]                     ` <CAFWuBvfxS0S6me_pneXmNzKwObSRUOg08_7=YToAoBg53UtPKg@mail.gmail.com>
2019-04-04 10:48                       ` Rainer Fiebig
2019-04-04 16:04                         ` matheus
2019-04-03 21:43               ` Rafael J. Wysocki
     [not found] <bug-75101-27@https.bugzilla.kernel.org/>
2014-04-29 22:24 ` Andrew Morton
2014-05-05 15:35   ` Johannes Weiner
2014-05-05 16:10     ` Jan Kara
2014-05-05 21:00       ` Oliver Winker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=539A3CD7.6080100@intel.com \
    --to=rafael.j.wysocki@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=bugzilla-daemon@bugzilla.kernel.org \
    --cc=fengguang.wu@intel.com \
    --cc=hannes@cmpxchg.org \
    --cc=jack@suse.cz \
    --cc=kix@kix.es \
    --cc=linux-mm@kvack.org \
    --cc=mpatlasov@parallels.com \
    --cc=oliverml1@oli1170.net \
    --cc=tj@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).