From: Jens Axboe <axboe@kernel.dk>
To: linux-block@vger.kernel.org
Subject: [PATCHSET RFC v3 0/6] Cache issue side time querying
Date: Thu, 18 Jan 2024 12:20:51 -0700
Message-ID: <20240118192343.953539-1-axboe@kernel.dk>

Hi,

When I run my peak testing to see if we've regressed, my test script
always does:

	echo 0 > /sys/block/$DEV/queue/iostats
	echo 2 > /sys/block/$DEV/queue/nomerges

for each device being used. It's unfortunate that we need to disable
iostats, but without doing that, I lose about 12% performance. The main
reason for that is the time querying we need to do when iostats are
enabled. As it turns out, lots of other block code is quite trigger
happy with querying time as well. We do have some nice batching in place
which helps amortize that, but it's not perfect.

This trivial patchset simply caches the current time in struct blk_plug,
on the premise that any issue side time querying can get adequate
granularity through that. Nobody really needs nsec granularity on the
timestamp.
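
To illustrate the idea, here is a minimal user-space sketch. It is not the
code from this series: struct plug_sketch, cached_ns, clock_now_ns(),
time_get_ns_sketch() and plug_invalidate_ts() are hypothetical names, and
clock_gettime() stands in for the kernel's time source. The first time query
under an active plug reads the clock and stores the result; later queries
reuse it until the cached value is invalidated.

```c
#define _POSIX_C_SOURCE 199309L
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical stand-in for a plug; only the cached time is modeled. */
struct plug_sketch {
	uint64_t cached_ns;	/* cached timestamp, 0 means "nothing cached" */
};

/* Counts real clock reads so the saving is observable. */
static unsigned long clock_reads;

/* The expensive query that the cache is meant to avoid repeating. */
static uint64_t clock_now_ns(void)
{
	struct timespec ts;

	clock_gettime(CLOCK_MONOTONIC, &ts);
	clock_reads++;
	return (uint64_t)ts.tv_sec * 1000000000ull + (uint64_t)ts.tv_nsec;
}

/*
 * Return the current time in nanoseconds. With no plug, always read the
 * clock. With a plug, read the clock once and hand out the cached value
 * afterwards.
 */
static uint64_t time_get_ns_sketch(struct plug_sketch *plug)
{
	if (!plug)
		return clock_now_ns();
	if (!plug->cached_ns)
		plug->cached_ns = clock_now_ns();
	return plug->cached_ns;
}

/*
 * Drop the cached value, e.g. when the plug is flushed or the task has
 * been scheduled out and the old timestamp may be stale.
 */
static void plug_invalidate_ts(struct plug_sketch *plug)
{
	assert(plug != NULL);
	plug->cached_ns = 0;
}
```

In this sketch, any number of time_get_ns_sketch() calls between two
invalidations cost a single clock read. The price is that all of them see
the same, slightly old, timestamp.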

Results in patch 2, but tldr is a more than 9% improvement (108M -> 118M
IOPS) for my test case, which doesn't even enable most of the costly
block layer items that you'd typically find in a distro and which would
further increase the number of issue side time calls. This brings iostats
enabled _almost_ to the level of turning it off.

Can also be found in my block-issue-ts branch:

https://git.kernel.dk/cgit/linux/log/?h=block-issue-ts

 block/bfq-cgroup.c        | 14 +++---
 block/bfq-iosched.c       | 22 +++++-----
 block/blk-cgroup.c        |  2 +-
 block/blk-core.c          | 33 ++++++++------
 block/blk-flush.c         |  2 +-
 block/blk-iocost.c        |  6 +--
 block/blk-iolatency.c     |  6 +--
 block/blk-mq.c            | 18 ++++----
 block/blk-throttle.c      |  6 +--
 block/blk-wbt.c           |  5 +--
 drivers/md/raid1-10.c     |  2 +-
 include/linux/blk_types.h | 42 ------------------
 include/linux/blkdev.h    | 92 ++++++++++++++++++++++++++++++++++++---
 include/linux/sched.h     |  2 +-
 kernel/sched/core.c       |  4 +-
 15 files changed, 151 insertions(+), 105 deletions(-)

Changes since v2:
	- Ensure PF_BLOCK_TS is cleared when plug is flushed or
	  invalidated
	- Fix missing raid1-10 conversion
	- Fix missing cgroup timestamp

-- 
Jens Axboe


Thread overview: 7+ messages
2024-01-18 19:20 Jens Axboe [this message]
2024-01-18 19:20 ` [PATCH 1/6] block: move cgroup time handling code into blkdev.h Jens Axboe
2024-01-18 19:20 ` [PATCH 2/6] block: add blk_time_get_ns() helper Jens Axboe
2024-01-18 19:20 ` [PATCH 3/6] block: cache current nsec time in struct blk_plug Jens Axboe
2024-01-18 19:20 ` [PATCH 4/6] block: update cached timestamp post schedule/preemption Jens Axboe
2024-01-18 19:20 ` [PATCH 5/6] block: shrink plug->{nr_ios, rq_count} to unsigned char Jens Axboe
2024-01-18 19:20 ` [PATCH 6/6] block: convert struct blk_plug callback list to hlists Jens Axboe
