linux-block.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
* [PATCH 0/3] Performance enhancements
@ 2017-12-21 20:46 Keith Busch
  2017-12-21 20:46 ` [PATCH 1/3] nvme/pci: Start request after doorbell ring Keith Busch
                   ` (2 more replies)
  0 siblings, 3 replies; 34+ messages in thread
From: Keith Busch @ 2017-12-21 20:46 UTC (permalink / raw)
  To: linux-nvme, linux-block, Christoph Hellwig, Jens Axboe,
	Sagi Grimberg
  Cc: Keith Busch

A few IO micro-optimizations for IO polling and NVMe. I'm really working
to close the performance gap with userspace drivers, and this gets me
halfway there on latency. The fastest hardware I could get measured
roundtrip read latency at 5usec with this series that was previously
measuring 5.7usec.

Note with NVMe, you really need to crank up the interrupt coalescing to
see the completion polling benefit.

Test pre-setup:

  echo performance | tee /sys/devices/system/cpu/cpufreq/policy*/scaling_governor
  echo 0 > /sys/block/nvme0n1/queue/iostats
  echo -1 > /sys/block/nvme0n1/queue/io_poll_delay
  nvme set-feature /dev/nvme0 -f 8 -v 0x4ff

fio profile:

  [global]
  ioengine=pvsync2
  rw=randread
  norandommap
  direct=1
  bs=4k
  hipri

  [hi-pri]
  filename=/dev/nvme0n1
  cpus_allowed=2

Keith Busch (3):
  nvme/pci: Start request after doorbell ring
  nvme/pci: Remove cq_vector check in IO path
  block: Polling completion performance optimization

 drivers/nvme/host/pci.c | 14 +++-----------
 fs/block_dev.c          |  5 ++++-
 2 files changed, 7 insertions(+), 12 deletions(-)

-- 
2.13.6

^ permalink raw reply	[flat|nested] 34+ messages in thread

end of thread, other threads:[~2018-01-23  0:16 UTC | newest]

Thread overview: 34+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-12-21 20:46 [PATCH 0/3] Performance enhancements Keith Busch
2017-12-21 20:46 ` [PATCH 1/3] nvme/pci: Start request after doorbell ring Keith Busch
2017-12-21 20:49   ` Jens Axboe
2017-12-21 20:53     ` Jens Axboe
2017-12-21 21:02       ` Keith Busch
2017-12-21 21:01         ` Jens Axboe
2018-01-03 20:21           ` Keith Busch
2018-01-23  0:16             ` Keith Busch
2017-12-25 10:12         ` Sagi Grimberg
2017-12-29  9:44           ` Christoph Hellwig
2017-12-25 10:11   ` Sagi Grimberg
2017-12-26 20:35     ` Keith Busch
2017-12-27  9:02       ` Sagi Grimberg
2017-12-29  9:44   ` Christoph Hellwig
2017-12-21 20:46 ` [PATCH 2/3] nvme/pci: Remove cq_vector check in IO path Keith Busch
2017-12-21 20:54   ` Jens Axboe
2017-12-25 10:10   ` Sagi Grimberg
2017-12-27 21:01     ` Sagi Grimberg
2017-12-29  9:48       ` Christoph Hellwig
2017-12-29 15:39         ` Keith Busch
2017-12-31 12:30           ` Sagi Grimberg
2018-01-02 16:50             ` Keith Busch
2017-12-21 20:46 ` [PATCH 3/3] block: Polling completion performance optimization Keith Busch
2017-12-21 20:56   ` Scott Bauer
2017-12-21 21:00     ` Jens Axboe
2017-12-21 21:34       ` Keith Busch
2017-12-21 22:17         ` Jens Axboe
2017-12-21 23:10           ` Keith Busch
2017-12-22 15:40             ` Jens Axboe
2017-12-29  9:50         ` Christoph Hellwig
2017-12-29 15:51           ` Keith Busch
2017-12-31 12:48           ` Sagi Grimberg
2017-12-21 20:57   ` Jens Axboe
2017-12-29  9:51   ` Christoph Hellwig

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).