All of lore.kernel.org
 help / color / mirror / Atom feed
From: "Rafael J. Wysocki" <rafael.j.wysocki@intel.com>
To: "Johannes Weiner" <hannes@cmpxchg.org>,
	"\"Rodolfo García Peñas (kix)\"" <kix@kix.es>
Cc: Oliver Winker <oliverml1@oli1170.net>, Jan Kara <jack@suse.cz>,
	Andrew Morton <akpm@linux-foundation.org>,
	bugzilla-daemon@bugzilla.kernel.org, linux-mm@kvack.org,
	Maxim Patlasov <mpatlasov@parallels.com>,
	Fengguang Wu <fengguang.wu@intel.com>, Tejun Heo <tj@kernel.org>
Subject: Re: [Bug 75101] New: [bisected] s2disk / hibernate blocks on "Saving 506031 image data pages () ..."
Date: Fri, 13 Jun 2014 01:50:47 +0200	[thread overview]
Message-ID: <539A3CD7.6080100@intel.com> (raw)
In-Reply-To: <20140612220200.GA25344@cmpxchg.org>

On 6/13/2014 12:02 AM, Johannes Weiner wrote:
> On Tue, May 06, 2014 at 01:45:01AM +0200, Rafael J. Wysocki wrote:
>> On 5/6/2014 1:33 AM, Johannes Weiner wrote:
>>> Hi Oliver,
>>>
>>> On Mon, May 05, 2014 at 11:00:13PM +0200, Oliver Winker wrote:
>>>> Hello,
>>>>
>>>> 1) Attached a full function-trace log + other SysRq outputs, see [1]
>>>> attached.
>>>>
>>>> I saw bdi_...() calls in the s2disk paths, but didn't check in detail
>>>> Probably more efficient when one of you guys looks directly.
>>> Thanks, this looks interesting.  balance_dirty_pages() wakes up the
>>> bdi_wq workqueue as it should:
>>>
>>> [  249.148009]   s2disk-3327    2.... 48550413us : global_dirty_limits <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : global_dirtyable_memory <-global_dirty_limits
>>> [  249.148009]   s2disk-3327    2.... 48550414us : writeback_in_progress <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : bdi_start_background_writeback <-balance_dirty_pages_ratelimited
>>> [  249.148009]   s2disk-3327    2.... 48550414us : mod_delayed_work_on <-balance_dirty_pages_ratelimited
>>> but the worker wakeup doesn't actually do anything:
>>> [  249.148009] kworker/-3466    2d... 48550431us : finish_task_switch <-__schedule
>>> [  249.148009] kworker/-3466    2.... 48550431us : _raw_spin_lock_irq <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550431us : need_to_create_worker <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550432us : worker_enter_idle <-worker_thread
>>> [  249.148009] kworker/-3466    2d... 48550432us : too_many_workers <-worker_enter_idle
>>> [  249.148009] kworker/-3466    2.... 48550432us : schedule <-worker_thread
>>> [  249.148009] kworker/-3466    2.... 48550432us : __schedule <-worker_thread
>>>
>>> My suspicion is that this fails because the bdi_wq is frozen at this
>>> point and so the flush work never runs until resume, whereas before my
>>> patch the effective dirty limit was high enough so that image could be
>>> written in one go without being throttled; followed by an fsync() that
>>> then writes the pages in the context of the unfrozen s2disk.
>>>
>>> Does this make sense?  Rafael?  Tejun?
>> Well, it does seem to make sense to me.
>  From what I see, this is a deadlock in the userspace suspend model and
> just happened to work by chance in the past.

Well, it had been working for quite a while, so it was a rather large 
opportunity
window it seems. :-)

> Can we patch suspend-utils as follows?

Perhaps we can.  Let's ask the new maintainer.

Rodolfo, do you think you can apply the patch below to suspend-utils?

> Alternatively, suspend-utils
> could clear the dirty limits before it starts writing and restore them
> post-resume.

That (and the patch too) doesn't seem to address the problem with 
existing suspend-utils
binaries, however.

Rafael


> ---
>  From 73d6546d5e264130e3d108c97d8317f86dc11149 Mon Sep 17 00:00:00 2001
> From: Johannes Weiner <hannes@cmpxchg.org>
> Date: Thu, 12 Jun 2014 17:43:05 -0400
> Subject: [patch] s2disk: fix buffered IO throttling deadlock in frozen state
>
> s2disk uses buffered IO when writing the snapshot image to disk.  If
> it runs into the dirty limits, the kernel forces it to wait until the
> flusher threads clean some of the dirty pages.  However, at this point
> s2disk already froze the system, including the flusher infrastructure,
> and the whole operation deadlocks.
>
> Open the resume device with O_SYNC to force flushing any dirty pages
> directly from the write() context before they accumulate and engage
> dirty throttling.
>
> Signed-off-by: Johannes Weiner <hannes@cmpxchg.org>
> ---
>   suspend.c | 2 +-
>   1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/suspend.c b/suspend.c
> index 479ce58555f7..1b9bed81f58a 100644
> --- a/suspend.c
> +++ b/suspend.c
> @@ -2436,7 +2436,7 @@ int main(int argc, char *argv[])
>   		suspend_error("Could not create %s/%s.", chroot_path, "resume");
>   		goto Umount;
>   	}
> -	resume_fd = open("resume", O_RDWR);
> +	resume_fd = open("resume", O_RDWR | O_SYNC);
>   	if (resume_fd < 0) {
>   		ret = errno;
>   		suspend_error("Could not open the resume device.");

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

  reply	other threads:[~2014-06-12 23:51 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-05-05 23:33 [Bug 75101] New: [bisected] s2disk / hibernate blocks on "Saving 506031 image data pages () ..." Johannes Weiner
2014-05-05 23:45 ` Rafael J. Wysocki
2014-06-12 22:02   ` Johannes Weiner
2014-06-12 23:50     ` Rafael J. Wysocki [this message]
2014-06-13  4:55       ` Johannes Weiner
2014-06-16 16:29         ` Rafael J. Wysocki
2019-04-02 23:25           ` Andrew Morton
2019-04-03  3:54             ` Matheus Fillipe
2019-04-03  8:23               ` Rainer Fiebig
2019-04-03  8:34             ` Rainer Fiebig
2019-04-03  9:34             ` Jan Kara
2019-04-03 10:04               ` Rainer Fiebig
2019-04-03 16:59                 ` Matheus Fillipe
2019-04-03 17:55                   ` Rainer Fiebig
2019-04-03 19:08                     ` Matheus Fillipe
     [not found]                     ` <CAFWuBvfxS0S6me_pneXmNzKwObSRUOg08_7=YToAoBg53UtPKg@mail.gmail.com>
2019-04-04 10:48                       ` Rainer Fiebig
2019-04-04 16:04                         ` matheus
2019-04-03 21:43               ` Rafael J. Wysocki
     [not found] <bug-75101-27@https.bugzilla.kernel.org/>
2014-04-29 22:24 ` Andrew Morton
2014-05-05 15:35   ` Johannes Weiner
2014-05-05 16:10     ` Jan Kara
2014-05-05 21:00       ` Oliver Winker

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=539A3CD7.6080100@intel.com \
    --to=rafael.j.wysocki@intel.com \
    --cc=akpm@linux-foundation.org \
    --cc=bugzilla-daemon@bugzilla.kernel.org \
    --cc=fengguang.wu@intel.com \
    --cc=hannes@cmpxchg.org \
    --cc=jack@suse.cz \
    --cc=kix@kix.es \
    --cc=linux-mm@kvack.org \
    --cc=mpatlasov@parallels.com \
    --cc=oliverml1@oli1170.net \
    --cc=tj@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.