From: Christoph Hellwig <hch@infradead.org>
To: "Ritesh Harjani (IBM)" <ritesh.list@gmail.com>
Cc: linux-xfs@vger.kernel.org, linux-fsdevel@vger.kernel.org,
Aravinda Herle <araherle@in.ibm.com>
Subject: Re: [RFCv2 3/3] iomap: Support subpage size dirty tracking to improve write performance
Date: Mon, 30 Jan 2023 09:16:33 -0800 [thread overview]
Message-ID: <Y9f7cZxnXbL7x0p+@infradead.org> (raw)
In-Reply-To: <5e49fa975ce9d719f5b6f765aa5d3a1d44d98d1d.1675093524.git.ritesh.list@gmail.com>
On Mon, Jan 30, 2023 at 09:44:13PM +0530, Ritesh Harjani (IBM) wrote:
> +iomap_page_create(struct inode *inode, struct folio *folio, unsigned int flags,
> + bool from_writeback)
> {
> struct iomap_page *iop = to_iomap_page(folio);
> unsigned int nr_blocks = i_blocks_per_folio(inode, folio);
> @@ -58,12 +59,32 @@ iomap_page_create(struct inode *inode, struct folio *folio, unsigned int flags)
> else
> gfp = GFP_NOFS | __GFP_NOFAIL;
>
> - iop = kzalloc(struct_size(iop, state, BITS_TO_LONGS(nr_blocks)),
> + iop = kzalloc(struct_size(iop, state, BITS_TO_LONGS(2 * nr_blocks)),
> gfp);
> if (iop) {
Please just return early here for the allocation failure case instead of
adding a lot of code with extra indentation.
> spin_lock_init(&iop->state_lock);
> - if (folio_test_uptodate(folio))
> - bitmap_fill(iop->state, nr_blocks);
> + /*
> + * iomap_page_create can get called from writeback after
> + * a truncate_inode_partial_folio operation on a large folio.
> + * For large folio the iop structure is freed in
> + * iomap_invalidate_folio() to ensure we can split the folio.
> + * That means we will have to let go of the optimization of
> + * tracking dirty bits here and set all bits as dirty if
> + * the folio is marked uptodate.
> + */
> + if (from_writeback && folio_test_uptodate(folio))
> + bitmap_fill(iop->state, 2 * nr_blocks);
> + else if (folio_test_uptodate(folio)) {
This code is very confusing. First please only check
folio_test_uptodate one, and then check the from_writeback flag
inside the branch. And as mentioned last time I think you really
need some symbolic constants for dealing with dirty vs uptodate
state and not just do a single fill for them.
> + unsigned start = offset_in_folio(folio,
> + folio_pos(folio)) >> inode->i_blkbits;
> + bitmap_set(iop->state, start, nr_blocks);
Also this code leaves my head scratching. Unless I'm missing something
important
offset_in_folio(folio, folio_pos(folio))
must always return 0.
Also the from_writeback logic is weird. I'd rather have a
"bool is_dirty" argument and then pass true for writeback beause
we know the folio is dirty, false where we know it can't be
dirty and do the folio_test_dirty in the caller where we don't
know the state.
> +bool iomap_dirty_folio(struct address_space *mapping, struct folio *folio)
> +{
> + unsigned int nr_blocks = i_blocks_per_folio(mapping->host, folio);
> + struct iomap_page *iop = iomap_page_create(mapping->host, folio, 0, false);
Please avoid the overly long line. In fact with such long function
calls I'd generally prefer if the initialization was moved out of the
declaration.
> +
> + iomap_set_range_dirty(folio, iop, offset_in_folio(folio, folio_pos(folio)),
Another overly long line here.
next prev parent reply other threads:[~2023-01-30 17:16 UTC|newest]
Thread overview: 23+ messages / expand[flat|nested] mbox.gz Atom feed top
2023-01-30 16:14 [RFCv2 0/3] iomap: Add support for subpage dirty state tracking to improve write performance Ritesh Harjani (IBM)
2023-01-30 16:14 ` [RFCv2 1/3] iomap: Move creation of iomap_page early in __iomap_write_begin Ritesh Harjani (IBM)
2023-01-30 17:02 ` Christoph Hellwig
2023-01-30 20:21 ` Ritesh Harjani (IBM)
2023-01-30 21:00 ` Matthew Wilcox
2023-01-31 18:37 ` Ritesh Harjani (IBM)
2023-01-31 18:48 ` Matthew Wilcox
2023-01-31 20:00 ` Ritesh Harjani (IBM)
2023-01-30 16:14 ` [RFCv2 2/3] iomap: Change uptodate variable name to state Ritesh Harjani (IBM)
2023-01-30 21:56 ` Dave Chinner
2023-01-30 22:24 ` Matthew Wilcox
2023-01-31 15:07 ` Christoph Hellwig
2023-01-31 18:05 ` Ritesh Harjani (IBM)
2023-01-30 16:14 ` [RFCv2 3/3] iomap: Support subpage size dirty tracking to improve write performance Ritesh Harjani (IBM)
2023-01-30 17:16 ` Christoph Hellwig [this message]
2023-01-30 18:01 ` Matthew Wilcox
2023-01-30 20:44 ` Ritesh Harjani (IBM)
2023-01-30 20:27 ` Ritesh Harjani (IBM)
2023-01-30 17:54 ` Matthew Wilcox
2023-01-30 20:34 ` Ritesh Harjani (IBM)
2023-01-30 18:10 ` [RFCv2 0/3] iomap: Add support for subpage dirty state " Matthew Wilcox
2023-01-30 21:01 ` Ritesh Harjani (IBM)
2023-02-02 4:45 ` Ritesh Harjani (IBM)
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=Y9f7cZxnXbL7x0p+@infradead.org \
--to=hch@infradead.org \
--cc=araherle@in.ibm.com \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-xfs@vger.kernel.org \
--cc=ritesh.list@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).