From: Pavel Begunkov <asml.silence@gmail.com>
To: Chenliang Li <cliang01.li@samsung.com>, axboe@kernel.dk
Cc: io-uring@vger.kernel.org, peiwei.li@samsung.com,
joshi.k@samsung.com, kundan.kumar@samsung.com,
anuj20.g@samsung.com, gost.dev@samsung.com
Subject: Re: [PATCH v4 1/4] io_uring/rsrc: add hugepage buffer coalesce helpers
Date: Sun, 16 Jun 2024 19:04:38 +0100 [thread overview]
Message-ID: <1233b470-c190-4b8f-873d-dfbf31b6874d@gmail.com> (raw)
In-Reply-To: <20240514075444.590910-2-cliang01.li@samsung.com>
On 5/14/24 08:54, Chenliang Li wrote:
> Introduce helper functions to check whether a buffer can
> be coalesced or not, and gather folio data for later use.
>
> The coalescing optimizes time and space consumption caused
> by mapping and storing multi-hugepage fixed buffers.
>
> A coalescable multi-hugepage buffer should fully cover its folios
> (except potentially the first and last one), and these folios should
> have the same size. These requirements are for easier later process,
> also we need same size'd chunks in io_import_fixed for fast iov_iter
> adjust.
>
> Signed-off-by: Chenliang Li <cliang01.li@samsung.com>
> ---
> io_uring/rsrc.c | 78 +++++++++++++++++++++++++++++++++++++++++++++++++
> io_uring/rsrc.h | 10 +++++++
> 2 files changed, 88 insertions(+)
>
> diff --git a/io_uring/rsrc.c b/io_uring/rsrc.c
> index 65417c9553b1..d08224c0c5b0 100644
> --- a/io_uring/rsrc.c
> +++ b/io_uring/rsrc.c
> @@ -871,6 +871,84 @@ static int io_buffer_account_pin(struct io_ring_ctx *ctx, struct page **pages,
> return ret;
> }
>
> +static bool __io_sqe_buffer_try_coalesce(struct page **pages, int nr_pages,
> + struct io_imu_folio_data *data)
io_can_coalesce_buffer(), you're not actually trying to
do it here.
> +{
> + struct folio *folio = page_folio(pages[0]);
> + unsigned int count = 1;
> + int i;
> +
> + data->nr_pages_mid = folio_nr_pages(folio);
> + if (data->nr_pages_mid == 1)
> + return false;
> +
> + data->folio_shift = folio_shift(folio);
> + data->folio_size = folio_size(folio);
> + data->nr_folios = 1;
> + /*
> + * Check if pages are contiguous inside a folio, and all folios have
> + * the same page count except for the head and tail.
> + */
> + for (i = 1; i < nr_pages; i++) {
> + if (page_folio(pages[i]) == folio &&
> + pages[i] == pages[i-1] + 1) {
> + count++;
> + continue;
> + }
> +
> + if (data->nr_folios == 1)
> + data->nr_pages_head = count;
> + else if (count != data->nr_pages_mid)
> + return false;
> +
> + folio = page_folio(pages[i]);
> + if (folio_size(folio) != data->folio_size)
> + return false;
> +
> + count = 1;
> + data->nr_folios++;
> + }
> + if (data->nr_folios == 1)
> + data->nr_pages_head = count;
> +
> + return true;
> +}
> +
> +static bool io_sqe_buffer_try_coalesce(struct page **pages, int nr_pages,
> + struct io_imu_folio_data *data)
> +{
> + int i, j;
> +
> + if (nr_pages <= 1 ||
> + !__io_sqe_buffer_try_coalesce(pages, nr_pages, data))
> + return false;
> +
> + /*
> + * The pages are bound to the folio, it doesn't
> + * actually unpin them but drops all but one reference,
> + * which is usually put down by io_buffer_unmap().
> + * Note, needs a better helper.
> + */
> + if (data->nr_pages_head > 1)
> + unpin_user_pages(&pages[1], data->nr_pages_head - 1);
Should be pages[0]. page[1] can be in another folio, and even
though data->nr_pages_head > 1 protects against touching it,
it's still flimsy.
> +
> + j = data->nr_pages_head;
> + nr_pages -= data->nr_pages_head;
> + for (i = 1; i < data->nr_folios; i++) {
> + unsigned int nr_unpin;
> +
> + nr_unpin = min_t(unsigned int, nr_pages - 1,
> + data->nr_pages_mid - 1);
> + if (nr_unpin == 0)
> + break;
> + unpin_user_pages(&pages[j+1], nr_unpin);
same
> + j += data->nr_pages_mid;
And instead of duplicating this voodoo iteration later,
please just assemble a new compacted ->nr_folios sized
page array.
> + nr_pages -= data->nr_pages_mid;
> + }
> +
> + return true;
> +}
> +
> static int io_sqe_buffer_register(struct io_ring_ctx *ctx, struct iovec *iov,
> struct io_mapped_ubuf **pimu,
> struct page **last_hpage)
> diff --git a/io_uring/rsrc.h b/io_uring/rsrc.h
> index c032ca3436ca..b2a9d66b76dd 100644
> --- a/io_uring/rsrc.h
> +++ b/io_uring/rsrc.h
> @@ -50,6 +50,16 @@ struct io_mapped_ubuf {
> struct bio_vec bvec[] __counted_by(nr_bvecs);
> };
>
> +struct io_imu_folio_data {
> + /* Head folio can be partially included in the fixed buf */
> + unsigned int nr_pages_head;
> + /* For non-head/tail folios, has to be fully included */
> + unsigned int nr_pages_mid;
> + unsigned int nr_folios;
> + unsigned int folio_shift;
> + size_t folio_size;
> +};
> +
> void io_rsrc_node_ref_zero(struct io_rsrc_node *node);
> void io_rsrc_node_destroy(struct io_ring_ctx *ctx, struct io_rsrc_node *ref_node);
> struct io_rsrc_node *io_rsrc_node_alloc(struct io_ring_ctx *ctx);
--
Pavel Begunkov
next prev parent reply other threads:[~2024-06-16 18:04 UTC|newest]
Thread overview: 22+ messages / expand[flat|nested] mbox.gz Atom feed top
[not found] <CGME20240514075453epcas5p17974fb62d65a88b1a1b55b97942ee2be@epcas5p1.samsung.com>
2024-05-14 7:54 ` [PATCH v4 0/4] io_uring/rsrc: coalescing multi-hugepage registered buffers Chenliang Li
2024-05-14 7:54 ` [PATCH v4 1/4] io_uring/rsrc: add hugepage buffer coalesce helpers Chenliang Li
2024-05-16 14:07 ` Anuj gupta
2024-06-16 18:04 ` Pavel Begunkov [this message]
2024-06-17 3:12 ` [PATCH v2 0/4] io_uring/rsrc: coalescing multi-hugepage registered buffers Chenliang Li
2024-06-17 12:38 ` Pavel Begunkov
2024-06-18 3:11 ` [PATCH v4 3/4] io_uring/rsrc: add init and account functions for coalesced imus Chenliang Li
2024-05-14 7:54 ` [PATCH v4 2/4] io_uring/rsrc: store folio shift and mask into imu Chenliang Li
2024-05-16 14:08 ` Anuj gupta
2024-05-14 7:54 ` [PATCH v4 3/4] io_uring/rsrc: add init and account functions for coalesced imus Chenliang Li
2024-06-16 17:43 ` Pavel Begunkov
2024-06-17 3:16 ` [PATCH v2 0/4] io_uring/rsrc: coalescing multi-hugepage registered buffers Chenliang Li
2024-06-17 12:22 ` Pavel Begunkov
2024-06-18 3:24 ` [PATCH v4 3/4] io_uring/rsrc: add init and account functions for coalesced imus Chenliang Li
2024-05-14 7:54 ` [PATCH v4 4/4] io_uring/rsrc: enable multi-hugepage buffer coalescing Chenliang Li
2024-05-16 14:09 ` Anuj gupta
2024-05-16 14:01 ` [PATCH v4 0/4] io_uring/rsrc: coalescing multi-hugepage registered buffers Anuj gupta
2024-05-16 14:58 ` Jens Axboe
2024-05-30 5:10 ` Chenliang Li
2024-06-04 13:33 ` Anuj gupta
2024-06-13 2:49 ` Chenliang Li
2024-06-16 2:54 ` Jens Axboe
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1233b470-c190-4b8f-873d-dfbf31b6874d@gmail.com \
--to=asml.silence@gmail.com \
--cc=anuj20.g@samsung.com \
--cc=axboe@kernel.dk \
--cc=cliang01.li@samsung.com \
--cc=gost.dev@samsung.com \
--cc=io-uring@vger.kernel.org \
--cc=joshi.k@samsung.com \
--cc=kundan.kumar@samsung.com \
--cc=peiwei.li@samsung.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox