* [PATCH] mm/swap: add independed bio pool for swap
@ 2012-12-14 11:01 Konstantin Khlebnikov
2012-12-18 4:45 ` Hugh Dickins
0 siblings, 1 reply; 4+ messages in thread
From: Konstantin Khlebnikov @ 2012-12-14 11:01 UTC (permalink / raw)
To: linux-kernel; +Cc: Andrew Morton, Hugh Dickins
This bio pool guarantees reclaiming progress for anonymous pages.
All avaliable bio in fs_bio_set may be borrowed by writeback which may
never ends, because disk too slow or broken. I have seen this situation in
real life in system where was a lot of bio requests to a loop device which
laying on top of special fuse-based filesystem.
Signed-off-by: Konstantin Khlebnikov <khlebnikov@openvz.org>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Hugh Dickins <hughd@google.com>
---
mm/page_io.c | 13 ++++++++++++-
1 file changed, 12 insertions(+), 1 deletion(-)
diff --git a/mm/page_io.c b/mm/page_io.c
index 78eee32..699f85e 100644
--- a/mm/page_io.c
+++ b/mm/page_io.c
@@ -22,12 +22,14 @@
#include <linux/frontswap.h>
#include <asm/pgtable.h>
+static struct bio_set *swap_bio_set;
+
static struct bio *get_swap_bio(gfp_t gfp_flags,
struct page *page, bio_end_io_t end_io)
{
struct bio *bio;
- bio = bio_alloc(gfp_flags, 1);
+ bio = bio_alloc_bioset(gfp_flags, 1, swap_bio_set);
if (bio) {
bio->bi_sector = map_swap_page(page, &bio->bi_bdev);
bio->bi_sector <<= PAGE_SHIFT - 9;
@@ -290,3 +292,12 @@ int swap_set_page_dirty(struct page *page)
return __set_page_dirty_no_writeback(page);
}
}
+
+static int __init swap_bio_init(void)
+{
+ swap_bio_set = bioset_create(SWAP_CLUSTER_MAX, 0);
+ if (!swap_bio_set)
+ panic("can't allocate swap_bio_set\n");
+ return 0;
+}
+late_initcall(swap_bio_init);
^ permalink raw reply related [flat|nested] 4+ messages in thread
* Re: [PATCH] mm/swap: add independed bio pool for swap
2012-12-14 11:01 [PATCH] mm/swap: add independed bio pool for swap Konstantin Khlebnikov
@ 2012-12-18 4:45 ` Hugh Dickins
2012-12-20 0:53 ` Andrew Morton
0 siblings, 1 reply; 4+ messages in thread
From: Hugh Dickins @ 2012-12-18 4:45 UTC (permalink / raw)
To: Konstantin Khlebnikov; +Cc: linux-kernel, Andrew Morton, Mel Gorman, Jens Axboe
On Fri, 14 Dec 2012, Konstantin Khlebnikov wrote:
> This bio pool guarantees reclaiming progress for anonymous pages.
> All avaliable bio in fs_bio_set may be borrowed by writeback which may
> never ends, because disk too slow or broken. I have seen this situation in
> real life in system where was a lot of bio requests to a loop device which
> laying on top of special fuse-based filesystem.
Hmm, perhaps, I'm not at all sure.
I don't particularly want to fragment off yet another pool if it's
not the right approach. Or maybe it's loop or fuse which should
have the pool, rather than swap.
If the disk is slow, I'd expect us to be okay; but if it's not
responding at all, then yes, those mempools will remain exhausted.
You're imagining swap going to a more reliable disk, but it's being
starved by the unresponding disk, so deserves a separate pool?
Let's Cc Mel and Jens, who will each have plenty of experience of
running out of bios/mempools, and the proper way to avoid or accept it.
(Note that BIO_POOL_SIZE is only 2 nowadays: when mempools were
first introduced, indeed they were sized larger; but once we found so
much memory disappearing into them, they got cut down to the minimum
needed for forward progress - I forget why that's 2 not 1).
Hugh
>
> Signed-off-by: Konstantin Khlebnikov <khlebnikov@openvz.org>
> Cc: Andrew Morton <akpm@linux-foundation.org>
> Cc: Hugh Dickins <hughd@google.com>
> ---
> mm/page_io.c | 13 ++++++++++++-
> 1 file changed, 12 insertions(+), 1 deletion(-)
>
> diff --git a/mm/page_io.c b/mm/page_io.c
> index 78eee32..699f85e 100644
> --- a/mm/page_io.c
> +++ b/mm/page_io.c
> @@ -22,12 +22,14 @@
> #include <linux/frontswap.h>
> #include <asm/pgtable.h>
>
> +static struct bio_set *swap_bio_set;
> +
> static struct bio *get_swap_bio(gfp_t gfp_flags,
> struct page *page, bio_end_io_t end_io)
> {
> struct bio *bio;
>
> - bio = bio_alloc(gfp_flags, 1);
> + bio = bio_alloc_bioset(gfp_flags, 1, swap_bio_set);
> if (bio) {
> bio->bi_sector = map_swap_page(page, &bio->bi_bdev);
> bio->bi_sector <<= PAGE_SHIFT - 9;
> @@ -290,3 +292,12 @@ int swap_set_page_dirty(struct page *page)
> return __set_page_dirty_no_writeback(page);
> }
> }
> +
> +static int __init swap_bio_init(void)
> +{
> + swap_bio_set = bioset_create(SWAP_CLUSTER_MAX, 0);
> + if (!swap_bio_set)
> + panic("can't allocate swap_bio_set\n");
> + return 0;
> +}
> +late_initcall(swap_bio_init);
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] mm/swap: add independed bio pool for swap
2012-12-18 4:45 ` Hugh Dickins
@ 2012-12-20 0:53 ` Andrew Morton
2012-12-20 9:29 ` Konstantin Khlebnikov
0 siblings, 1 reply; 4+ messages in thread
From: Andrew Morton @ 2012-12-20 0:53 UTC (permalink / raw)
To: Hugh Dickins; +Cc: Konstantin Khlebnikov, linux-kernel, Mel Gorman, Jens Axboe
On Mon, 17 Dec 2012 20:45:13 -0800 (PST)
Hugh Dickins <hughd@google.com> wrote:
> On Fri, 14 Dec 2012, Konstantin Khlebnikov wrote:
>
> > This bio pool guarantees reclaiming progress for anonymous pages.
> > All avaliable bio in fs_bio_set may be borrowed by writeback which may
> > never ends, because disk too slow or broken. I have seen this situation in
> > real life in system where was a lot of bio requests to a loop device which
> > laying on top of special fuse-based filesystem.
>
> Hmm, perhaps, I'm not at all sure.
It probably maybe perhaps makes sense. Of course, you're screwed if
one of your swap devices is "disk too slow or broken".
My crystal ball tells me that the 2015 kernel will have a bioset per
physical device...
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: [PATCH] mm/swap: add independed bio pool for swap
2012-12-20 0:53 ` Andrew Morton
@ 2012-12-20 9:29 ` Konstantin Khlebnikov
0 siblings, 0 replies; 4+ messages in thread
From: Konstantin Khlebnikov @ 2012-12-20 9:29 UTC (permalink / raw)
To: Andrew Morton; +Cc: Hugh Dickins, linux-kernel, Mel Gorman, Jens Axboe
Andrew Morton wrote:
> On Mon, 17 Dec 2012 20:45:13 -0800 (PST)
> Hugh Dickins<hughd@google.com> wrote:
>
>> On Fri, 14 Dec 2012, Konstantin Khlebnikov wrote:
>>
>>> This bio pool guarantees reclaiming progress for anonymous pages.
>>> All avaliable bio in fs_bio_set may be borrowed by writeback which may
>>> never ends, because disk too slow or broken. I have seen this situation in
>>> real life in system where was a lot of bio requests to a loop device which
>>> laying on top of special fuse-based filesystem.
>>
>> Hmm, perhaps, I'm not at all sure.
>
> It probably maybe perhaps makes sense. Of course, you're screwed if
> one of your swap devices is "disk too slow or broken".
>
> My crystal ball tells me that the 2015 kernel will have a bioset per
> physical device...
>
I predict that they also will be per-cpu. Well... probably maybe we should
just wait for this shining future. I leave this to your discretion...
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2012-12-20 9:29 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2012-12-14 11:01 [PATCH] mm/swap: add independed bio pool for swap Konstantin Khlebnikov
2012-12-18 4:45 ` Hugh Dickins
2012-12-20 0:53 ` Andrew Morton
2012-12-20 9:29 ` Konstantin Khlebnikov
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.