From: Matthew Wilcox <willy@infradead.org>
To: Christoph Hellwig <hch@infradead.org>
Cc: Vishal Moola <vishal.moola@gmail.com>,
linux-block@vger.kernel.org, linux-fsdevel@vger.kernel.org,
linux-kernel@vger.kernel.org
Subject: Re: [RFC PATCH] Page Cache Allowing Hard Interrupts
Date: Tue, 10 Aug 2021 15:08:21 +0100 [thread overview]
Message-ID: <YRKIVZIxdirjg7Ih@casper.infradead.org> (raw)
In-Reply-To: <YRJyGMLAFKoB1qUQ@infradead.org>
On Tue, Aug 10, 2021 at 01:33:28PM +0100, Christoph Hellwig wrote:
> On Tue, Aug 10, 2021 at 01:09:45PM +0100, Matthew Wilcox wrote:
> > On Tue, Aug 10, 2021 at 09:15:28AM +0100, Christoph Hellwig wrote:
> > > Stupid question, but where do we ever do page cache interaction from
> > > soft irq context?
> >
> > test_clear_page_writeback() happens in _some_ interrupt context (ie
> > the io completion path). We had been under the impression that it was
> > always actually softirq context, and so this patch was safe. However,
> > it's now clear that some drivers are calling it from hardirq context.
> > Writeback completions are clearly not latency sensitive and so can
> > be delayed from hardirq to softirq context without any problem, so I
> > think fixing this is just going to be a matter of tagging requests as
> > "complete in softirq context" and ensuring that blk_mq_raise_softirq()
> > is called for them.
> >
> > Assuming that DIO write completions _are_ latency-sensitive, of course.
> > Maybe all write completions could be run in softirqs.
>
> I really don't really see any benefit in introducing softirqs into
> the game.
The benefit is not having to disable interrupts while manipulating
the page cache, eg delete_from_page_cache_batch().
> If we want to simplify the locking and do not care too much
> about latency, we should just defer to workqueue/thread context.
It's not a bad idea. I thought BH would be the better place for it
because it wouldn't require scheduling in a task. If we are going to
schedule in a task though, can we make it the task which submitted the I/O
(assuming it still exists), or do we not have the infrastructure for that?
> For example XFS already does that for all writeback except for pure
> overwrites. Those OTOH can be latency critical for O_SYNC writes, but
> you're apparently looking into that already.
To my mind if you've asked for O_SYNC, you've asked for bad performance.
The writethrough improvement that I'm working on skips dirtying the page,
but still marks the page as writeback so that we don't submit overlapping
writes to the device. The O_SYNC write will wait for the writeback to
finish, so it'll still be delayed by one additional scheduling event
... unless we run the write completion in the context of this task.
prev parent reply other threads:[~2021-08-10 14:09 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-07-30 21:36 [RFC PATCH] Page Cache Allowing Hard Interrupts Vishal Moola
2021-07-31 14:31 ` dc1468867f: WARNING:at_mm/workingset.c:#workingset_update_node kernel test robot
2021-08-10 8:15 ` [RFC PATCH] Page Cache Allowing Hard Interrupts Christoph Hellwig
2021-08-10 12:09 ` Matthew Wilcox
2021-08-10 12:33 ` Christoph Hellwig
2021-08-10 14:08 ` Matthew Wilcox [this message]
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=YRKIVZIxdirjg7Ih@casper.infradead.org \
--to=willy@infradead.org \
--cc=hch@infradead.org \
--cc=linux-block@vger.kernel.org \
--cc=linux-fsdevel@vger.kernel.org \
--cc=linux-kernel@vger.kernel.org \
--cc=vishal.moola@gmail.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).