All of lore.kernel.org
 help / color / mirror / Atom feed
From: Robin Murphy <robin.murphy@arm.com>
To: joro@8bytes.org
Cc: iommu@lists.linux-foundation.org, linux-kernel@vger.kernel.org,
	thunder.leizhen@huawei.com, ard.biesheuvel@linaro.org,
	nwatters@codeaurora.org, ray.jui@broadcom.com
Subject: [PATCH v3 0/4] Optimise 64-bit IOVA allocations
Date: Tue, 22 Aug 2017 16:17:41 +0100	[thread overview]
Message-ID: <cover.1503412074.git.robin.murphy@arm.com> (raw)

Hi all,

Just a quick repost of v2[1] with a small fix for the bug reported by Nate.
To recap, whilst this mostly only improves worst-case performance, those
worst-cases have a tendency to be pathologically bad:

Ard reports general desktop performance with Chromium on AMD Seattle going
from ~1-2FPS to perfectly usable.

Leizhen reports gigabit ethernet throughput going from ~6.5Mbit/s to line
speed.

I also inadvertantly found that the HiSilicon hns_dsaf driver was taking ~35s
to probe simply becuase of the number of DMA buffers it maps on startup (perf
shows around 76% of that was spent under the lock in alloc_iova()). With this
series applied it takes a mere ~1s, mostly of unrelated mdelay()s, with
alloc_iova() entirely lost in the noise.

Robin.

[1] https://www.mail-archive.com/iommu@lists.linux-foundation.org/msg19139.html

Robin Murphy (1):
  iommu/iova: Extend rbtree node caching

Zhen Lei (3):
  iommu/iova: Optimise rbtree searching
  iommu/iova: Optimise the padding calculation
  iommu/iova: Make dma_32bit_pfn implicit

 drivers/gpu/drm/tegra/drm.c      |   3 +-
 drivers/gpu/host1x/dev.c         |   3 +-
 drivers/iommu/amd_iommu.c        |   7 +--
 drivers/iommu/dma-iommu.c        |  18 +------
 drivers/iommu/intel-iommu.c      |  11 ++--
 drivers/iommu/iova.c             | 114 +++++++++++++++++----------------------
 drivers/misc/mic/scif/scif_rma.c |   3 +-
 include/linux/iova.h             |   8 +--
 8 files changed, 62 insertions(+), 105 deletions(-)

-- 
2.13.4.dirty

             reply	other threads:[~2017-08-22 15:17 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-08-22 15:17 Robin Murphy [this message]
     [not found] ` <cover.1503412074.git.robin.murphy-5wv7dgnIgG8@public.gmane.org>
2017-08-22 15:17   ` [PATCH v3 1/4] iommu/iova: Optimise rbtree searching Robin Murphy
2017-08-22 15:17     ` Robin Murphy
2017-08-22 15:17   ` [PATCH v3 2/4] iommu/iova: Optimise the padding calculation Robin Murphy
2017-08-22 15:17     ` Robin Murphy
2017-08-22 15:17   ` [PATCH v3 4/4] iommu/iova: Make dma_32bit_pfn implicit Robin Murphy
2017-08-22 15:17     ` Robin Murphy
2017-08-25 18:52   ` [PATCH v3 0/4] Optimise 64-bit IOVA allocations Nate Watterson
2017-08-25 18:52     ` Nate Watterson
2017-08-30 12:14     ` Joerg Roedel
2017-08-22 15:17 ` [PATCH v3 3/4] iommu/iova: Extend rbtree node caching Robin Murphy

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=cover.1503412074.git.robin.murphy@arm.com \
    --to=robin.murphy@arm.com \
    --cc=ard.biesheuvel@linaro.org \
    --cc=iommu@lists.linux-foundation.org \
    --cc=joro@8bytes.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=nwatters@codeaurora.org \
    --cc=ray.jui@broadcom.com \
    --cc=thunder.leizhen@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.