From: Andrew Morton <akpm@linux-foundation.org>
To: Jan Kara <jack@suse.cz>
Cc: Michal Hocko <mhocko@kernel.org>,
linux-mm@kvack.org, linux-fsdevel@vger.kernel.org,
Tetsuo Handa <penguin-kernel@I-love.SAKURA.ne.jp>,
tj@kernel.org
Subject: Re: [PATCH] writeback: Avoid exhausting allocation reserves under memory pressure
Date: Thu, 5 May 2016 14:37:51 -0700 [thread overview]
Message-ID: <20160505143751.06aa4223e266c1d92b3323a2@linux-foundation.org> (raw)
In-Reply-To: <20160505090750.GD1970@quack2.suse.cz>
On Thu, 5 May 2016 11:07:50 +0200 Jan Kara <jack@suse.cz> wrote:
> On Thu 05-05-16 10:24:33, Michal Hocko wrote:
> > > +/*
> > > + * Check whether the request to writeback some pages can be merged with some
> > > + * other request which is already pending. If yes, merge it and return true.
> > > + * If no, return false.
> > > + */
> > > +static bool wb_merge_request(struct bdi_writeback *wb, long nr_pages,
> > > + struct super_block *sb, bool range_cyclic,
> > > + enum wb_reason reason)
> > > +{
> > > + struct wb_writeback_work *work;
> > > + bool merged = false;
> > > +
> > > + spin_lock_bh(&wb->work_lock);
> > > + list_for_each_entry(work, &wb->work_list, list) {
> >
> > Is the lenght of the list bounded somehow? In other words is it possible
> > that the spinlock would be held for too long to traverse the whole list?
>
> I was thinking about this as well. With the merging enabled, the number of
> entries queued from wb_start_writeback() is essentially limited by the
> number of writeback reasons and there's only a couple of those. What is
> more questionable is the number of entries queued from
> __writeback_inodes_sb_nr(). Generally there should be a couple at maximum
> either but it is hard to give any guarantee since e.g. filesystems use this
> function to reduce amount of delay-allocated data when they are running out
> of space. Hum, maybe we could limit the merging to scan only the last say
> 16 entries. That should give good results in most cases... Thoughts?
If it's possible to cause a search complexity meltdown, someone will
find a way :(
Is there any reason why the requests coming out of
writeback_inodes_sb_nr() cannot also be merged?
Your wb_merge_request() doesn't check ->tagged_writepages?
Why is ->for_sync handled differently? Needs a comment.
Suggest turning this into a separate function. Build a local
wb_writeback_work in wb_start_writeback(), do:
/* comment goes here */
if (new->reason != old->reason)
return false;
/* comment goes here */
if (new->range_cyclic != old->range_cyclic)
retun false;
return true;
then copy wb_start_writeback()'s local wb_writeback_work into the
newly-allocated one if needed (kmemdup()). Or just pass a billion args
into that comparison function.
bdi_split_work_to_wbs() does GFP_ATOMIC as well. Problem? (Why the
heck don't we document the *reasons* for these things, sigh).
I suspect it would be best to be proactive here and use some smarter
data structure. It appears that all the wb_writeback_work fields
except sb can be squeezed into a single word so perhaps a radix-tree.
Or hash them all together and use a chained array or something. Maybe
fiddle at it for an hour or so, see how it's looking? It's a lot of
fuss to avoid one problematic kmalloc(), sigh.
We really don't want there to be *any* pathological workload which
results in merging failures - if that's the case then someone will hit
it. They'll experience the ooms (perhaps) and the search complexity
issues (for sure).
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2016-05-05 21:37 UTC|newest]
Thread overview: 7+ messages / expand[flat|nested] mbox.gz Atom feed top
2016-05-05 8:14 [PATCH] writeback: Avoid exhausting allocation reserves under memory pressure Jan Kara
2016-05-05 8:24 ` Michal Hocko
2016-05-05 9:07 ` Jan Kara
2016-05-05 9:18 ` Michal Hocko
2016-05-05 21:37 ` Andrew Morton [this message]
2016-05-12 16:08 ` Jan Kara
2016-05-16 11:45 ` Michal Hocko
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20160505143751.06aa4223e266c1d92b3323a2@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=jack@suse.cz \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=mhocko@kernel.org \
--cc=penguin-kernel@I-love.SAKURA.ne.jp \
--cc=tj@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).