From: Andrew Morton <akpm@linux-foundation.org>
To: Nick Piggin <npiggin@suse.de>
Cc: linux-mm@kvack.org, davem@davemloft.net
Subject: Re: [patch] radix-tree: avoid atomic allocations for preloaded insertions
Date: Wed, 7 Nov 2007 17:09:23 -0800 [thread overview]
Message-ID: <20071107170923.6cf3c389.akpm@linux-foundation.org> (raw)
In-Reply-To: <20071108004304.GD3227@wotan.suse.de>
> On Thu, 8 Nov 2007 01:43:04 +0100 Nick Piggin <npiggin@suse.de> wrote:
> OK, here's this patch again. This time I come with real failures on real
> systems (in this case, David is running some 'dd' pagecache throughput
> tests).
>
> I haven't got him to retest it yet, but I think the idea is just a no-brainer.
> We significantly reduce maximum tree_lock(W) hold time, and we reduce the
> amount of GFP_ATOMIC allocations.
>
> --
>
> Most pagecache (and some other) radix tree insertions have the great
> opportunity to preallocate a few nodes with relaxed gfp flags. But
> the preallocation is squandered when it comes time to allocate a node,
> we default to first attempting a GFP_ATOMIC allocation -- that doesn't
> normally fail, but it can eat into atomic memory reserves that we
> don't need to be using.
>
> Another upshot of this is that it removes the sometimes highly contended
> zone->lock from underneath tree_lock.
>
> David Miller reports seeing this allocation fail on a highly threaded
> sparc64 system when running a parallel 'dd' test:
>
> [527319.459981] dd: page allocation failure. order:0, mode:0x20
> [527319.460403] Call Trace:
> [527319.460568] [00000000004b71e0] __slab_alloc+0x1b0/0x6a8
> [527319.460636] [00000000004b7bbc] kmem_cache_alloc+0x4c/0xa8
> [527319.460698] [000000000055309c] radix_tree_node_alloc+0x20/0x90
> [527319.460763] [0000000000553238] radix_tree_insert+0x12c/0x260
> [527319.460830] [0000000000495cd0] add_to_page_cache+0x38/0xb0
> [527319.460893] [00000000004e4794] mpage_readpages+0x6c/0x134
> [527319.460955] [000000000049c7fc] __do_page_cache_readahead+0x170/0x280
> [527319.461028] [000000000049cc88] ondemand_readahead+0x208/0x214
> [527319.461094] [0000000000496018] do_generic_mapping_read+0xe8/0x428
> [527319.461152] [0000000000497948] generic_file_aio_read+0x108/0x170
> [527319.461217] [00000000004badac] do_sync_read+0x88/0xd0
> [527319.461292] [00000000004bb5cc] vfs_read+0x78/0x10c
> [527319.461361] [00000000004bb920] sys_read+0x34/0x60
> [527319.461424] [0000000000406294] linux_sparc_syscall32+0x3c/0x40
>
> The calltrace is significant: __do_page_cache_readahead allocates a number
> of pages with GFP_KERNEL, and hence it should have reclaimed sufficient
> memory to satisfy GFP_ATOMIC allocations. However after the list of pages
> goes to mpage_readpages, there can be significant intervals (including
> disk IO) before all the pages are inserted into the radix-tree. So the
> reserves can easily be depleted at that point.
>
So now I've got to re-re-remember why I didn't like this the first time.
Do you recall?
Why not just stomp the warning with __GFP_NOWARN?
Did you consider turning off __GFP_HIGH? (Dunno why)
This change will slow things down - has this been quantified? Probably
it's unmeasurable, but it's still there.
I'd have thought that a superior approach would be to just set
__GFP_NOWARN?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2007-11-08 1:09 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
2007-11-08 0:43 [patch] radix-tree: avoid atomic allocations for preloaded insertions Nick Piggin
2007-11-08 1:09 ` Andrew Morton [this message]
2007-11-08 1:34 ` David Miller, Andrew Morton
2007-11-08 1:41 ` Andrew Morton
2007-11-08 1:45 ` David Miller, Andrew Morton
2007-11-08 1:37 ` Nick Piggin
2007-11-08 3:02 ` Andrew Morton
2007-11-08 3:16 ` Nick Piggin
2007-11-08 4:12 ` Andrew Morton
2007-11-08 4:54 ` Nick Piggin
2007-11-08 5:02 ` Andrew Morton
2007-11-08 5:44 ` Nick Piggin
2007-11-08 6:02 ` Andrew Morton
2007-11-08 6:54 ` Nick Piggin
2007-11-08 6:56 ` [patch] nfs: use GFP_NOFS preloads for radix-tree insertion Nick Piggin
2007-11-13 10:55 ` Peter Zijlstra
2007-11-14 4:20 ` Nick Piggin
2007-11-14 9:06 ` Peter Zijlstra
2007-11-14 15:39 ` Nick Piggin
2007-11-08 11:57 ` [patch] radix-tree: avoid atomic allocations for preloaded insertions Peter Zijlstra
2007-11-08 20:37 ` Nick Piggin
2007-11-08 20:47 ` Peter Zijlstra
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20071107170923.6cf3c389.akpm@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=davem@davemloft.net \
--cc=linux-mm@kvack.org \
--cc=npiggin@suse.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.