From: Gao Xiang <xiang@kernel.org>
To: David Howells <dhowells@redhat.com>
Cc: lsf-pc@lists.linux-foundation.org,
Matthew Wilcox <willy@infradead.org>,
netfs@lists.linux.dev, linux-fsdevel@vger.kernel.org,
linux-mm@kvack.org
Subject: Re: [LSF/MM/BPF TOPIC] Large folios, swap and fscache
Date: Sat, 3 Feb 2024 13:13:57 +0800 [thread overview]
Message-ID: <Zb3Llc+dHxu4eggS@debian> (raw)
In-Reply-To: <2701740.1706864989@warthog.procyon.org.uk>
Hi David,
On Fri, Feb 02, 2024 at 09:09:49AM +0000, David Howells wrote:
> Hi,
>
> The topic came up in a recent discussion about how to deal with large folios
> when it comes to swap as a swap device is normally considered a simple array
> of PAGE_SIZE-sized elements that can be indexed by a single integer.
>
> With the advent of large folios, however, we might need to change this in
> order to be better able to swap out a compound page efficiently. Swap
> fragmentation raises its head, as does the need to potentially save multiple
> indices per folio. Does swap need to grow more filesystem features?
>
> Further to this, we have at least two ways to cache data on disk/flash/etc. -
> swap and fscache - and both want to set aside disk space for their operation.
> Might it be possible to combine the two?
>
> One thing I want to look at for fscache is the possibility of switching from a
> file-per-object-based approach to a tagged cache more akin to the way OpenAFS
> does things. In OpenAFS, you have a whole bunch of small files, each
> containing a single block (e.g. 256K) of data, and an index that maps a
> particular {volume,file,version,block} to one of these files in the cache.
>
> Now, I could also consider holding all the data blocks in a single file (or
> blockdev) - and this might work for swap. For fscache, I do, however, need to
> have some sort of integrity across reboots that swap does not require.
If my understanding is correct, I think the old swapfile approach just
works with pinned local fs extents, which means it looks up extents in
advance and it doesn't expect these extents will be moved so the real
swap data I/O paths always work without fses involved. I don't look
into the new SWP_FS_OPS/.swap_rw way and it seems only some network
fses use it but IMHO it might have some deadlock risk if swapout
triggers local fs block allocation. But overall I think it's a good
idea to combine the two.
Just slight off the topic: Recently I had another rough thought. As
you said, even a single fscache block or called a single fscache chunk is like
256K or whatever.
Is it possible to implement an _optional_ partial cached data uptodate
like fscache chunk vs fsblock? For example a bitmap can be attached to
each 256K or 1M chunk. That would be much helpful.
Thanks,
Gao Xiang
>
> David
>
next prev parent reply other threads:[~2024-02-03 5:14 UTC|newest]
Thread overview: 13+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-02-02 9:09 [LSF/MM/BPF TOPIC] Large folios, swap and fscache David Howells
2024-02-02 14:29 ` Matthew Wilcox
2024-02-22 19:02 ` Luis Chamberlain
2024-02-22 19:16 ` Yosry Ahmed
2024-02-22 22:26 ` Chris Li
2024-02-29 19:31 ` Chris Li
2024-02-02 15:57 ` David Howells
2024-02-02 19:22 ` Matthew Wilcox
2024-02-03 5:13 ` Gao Xiang [this message]
2024-02-04 23:45 ` Dave Chinner
2024-02-22 22:45 ` Chris Li
2024-02-23 3:00 ` Andreas Dilger
2024-02-23 3:46 ` Chris Li
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Zb3Llc+dHxu4eggS@debian \
--to=xiang@kernel.org \
--cc=dhowells@redhat.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-mm@kvack.org \
--cc=lsf-pc@lists.linux-foundation.org \
--cc=netfs@lists.linux.dev \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).