From: Jens Axboe <axboe@kernel.dk>
To: Liu Wei <liuwei09@cestc.cn>
Cc: akpm@linux-foundation.org, hch@lst.de, jack@suse.cz,
linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
rgoldwyn@suse.com, willy@infradead.org
Subject: Re: [PATCH] mm/filemap: invalidating pages is still necessary when io with IOCB_NOWAIT
Date: Mon, 27 May 2024 09:36:24 -0600 [thread overview]
Message-ID: <c66ca795-da93-437c-bb11-718801f8114a@kernel.dk> (raw)
In-Reply-To: <20240527100908.49913-1-liuwei09@cestc.cn>
On 5/27/24 4:09 AM, Liu Wei wrote:
> I am a newer, thanks for the reminder.
>
>>
>>>> when we issuing AIO with direct I/O and IOCB_NOWAIT on a block device, the
>>>> process context will not be blocked.
>>>>
>>>> However, if the device already has page cache in memory, EAGAIN will be
>>>> returned. And even when trying to reissue the AIO with direct I/O and
>>>> IOCB_NOWAIT again, we consistently receive EAGAIN.
>>
>> -EAGAIN doesn't mean "just try again and it'll work".
>>
>>>> Maybe a better way to deal with it: filemap_fdatawrite_range dirty pages
>>>> with WB_SYNC_NONE flag, and invalidate_mapping_pages unmapped pages at
>>>> the same time.
>>>>
>>>> Signed-off-by: Liu Wei <liuwei09@cestc.cn>
>>>> ---
>>>> mm/filemap.c | 9 ++++++++-
>>>> 1 file changed, 8 insertions(+), 1 deletion(-)
>>>>
>>>> diff --git a/mm/filemap.c b/mm/filemap.c
>>>> index 30de18c4fd28..1852a00caf31 100644
>>>> --- a/mm/filemap.c
>>>> +++ b/mm/filemap.c
>>>> @@ -2697,8 +2697,15 @@ int kiocb_invalidate_pages(struct kiocb *iocb, size_t count)
>>>>
>>>> if (iocb->ki_flags & IOCB_NOWAIT) {
>>>> /* we could block if there are any pages in the range */
>>>> - if (filemap_range_has_page(mapping, pos, end))
>>>> + if (filemap_range_has_page(mapping, pos, end)) {
>>>> + if (mapping_needs_writeback(mapping)) {
>>>> + __filemap_fdatawrite_range(mapping,
>>>> + pos, end, WB_SYNC_NONE);
>>>> + }
>>
>> I don't think WB_SYNC_NONE tells it not to block, it just says not to
>> wait for it... So this won't work as-is.
>
> Yes, but I think an asynchronous writex-back is better than simply
> return EAGAIN. By using __filemap_fdatawrite_range to trigger a
> writeback, subsequent retries may have a higher chance of success.
And what's the application supposed to do, just hammer on the same
IOCB_NOWAIT submission until it then succeeds? The only way this can
reasonably work for that would be if yo can do:
1) Issue IOCB_NOWAIT IO
2) Get -EAGAIN
3) Sync kick off writeback, wait for it to be done
4) Issue IOCB_NOWAIT IO again
5) Success
If you just kick it off, then you'd repeat steps 1..2 ad nauseam until
it works out, not tenable.
And this doesn't even include the other point I mentioned, which is
__filemap_fdatawrite_range() IO issue blocking in the first place.
So no, NAK on this patch.
--
Jens Axboe
WARNING: multiple messages have this Message-ID (diff)
From: Liu Wei <liuwei09@cestc.cn>
To: liuwei09@cestc.cn
Cc: akpm@linux-foundation.org, hch@lst.de, jack@suse.cz,
linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org,
rgoldwyn@suse.com, willy@infradead.org,
Jens Axboe <axboe@kernel.dk>
Subject: Re: [PATCH] mm/filemap: invalidating pages is still necessary when io with IOCB_NOWAIT
Date: Thu, 30 May 2024 10:33:04 +0800 [thread overview]
Message-ID: <c66ca795-da93-437c-bb11-718801f8114a@kernel.dk> (raw)
Message-ID: <20240530023304.8UVNxQNYJoDrWU2wt4MhxUT7rFrHpKKu3sJR4n0FjIU@z> (raw)
In-Reply-To: <20240527100908.49913-1-liuwei09@cestc.cn>
From: Jens Axboe <axboe@kernel.dk>
On 5/27/24 4:09 AM, Liu Wei wrote:
> > I am a newer, thanks for the reminder.
> >
> >>
> >> I don't think WB_SYNC_NONE tells it not to block, it just says not to
> >> wait for it... So this won't work as-is.
> >
> > Yes, but I think an asynchronous writex-back is better than simply
> > return EAGAIN. By using __filemap_fdatawrite_range to trigger a
> > writeback, subsequent retries may have a higher chance of success.
>
> And what's the application supposed to do, just hammer on the same
> IOCB_NOWAIT submission until it then succeeds? The only way this can
> reasonably work for that would be if yo can do:
>
> 1) Issue IOCB_NOWAIT IO
> 2) Get -EAGAIN
> 3) Sync kick off writeback, wait for it to be done
> 4) Issue IOCB_NOWAIT IO again
> 5) Success
>
> If you just kick it off, then you'd repeat steps 1..2 ad nauseam until
> it works out, not tenable.
>
> And this doesn't even include the other point I mentioned, which is
> __filemap_fdatawrite_range() IO issue blocking in the first place.
>
> So no, NAK on this patch.
>
I know, thanks for your patient explanation.
next prev parent reply other threads:[~2024-05-27 15:36 UTC|newest]
Thread overview: 9+ messages / expand[flat|nested] mbox.gz Atom feed top
2024-05-13 13:23 [PATCH] mm/filemap: invalidating pages is still necessary when io with IOCB_NOWAIT Liu Wei
2024-05-23 20:08 ` Andrew Morton
2024-05-23 21:11 ` Matthew Wilcox
2024-05-23 21:17 ` Jens Axboe
2024-05-23 21:08 ` Matthew Wilcox
2024-05-23 21:16 ` Jens Axboe
2024-05-27 10:09 ` Liu Wei
2024-05-27 15:36 ` Jens Axboe [this message]
2024-05-30 2:33 ` Liu Wei
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=c66ca795-da93-437c-bb11-718801f8114a@kernel.dk \
--to=axboe@kernel.dk \
--cc=akpm@linux-foundation.org \
--cc=hch@lst.de \
--cc=jack@suse.cz \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=liuwei09@cestc.cn \
--cc=rgoldwyn@suse.com \
--cc=willy@infradead.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox