From: Alexander Duyck <alexander.h.duyck@redhat.com>
To: Amir Vadai <amirv@mellanox.com>
Cc: achiad@mellanox.com, Or Gerlitz <ogerlitz@mellanox.com>,
"netdev@vger.kernel.org" <netdev@vger.kernel.org>
Subject: Re: dma_alloc_coherent() to use memory close to cpu
Date: Wed, 13 May 2015 08:49:51 -0700 [thread overview]
Message-ID: <5553729F.3020205@redhat.com> (raw)
In-Reply-To: <55534659.9000606@mellanox.com>
On 05/13/2015 05:40 AM, Amir Vadai wrote:
> Hi Alex,
>
> dma_alloc_coherent() is allocating memory close to the device -
> according to dev_to_node(dev). Sometimes it is better to use memory
> close to the CPU. e.g. when it is a buffer that NIC writes and CPU reads.
Yes, the easiest way to visualize this is do you want to have this
operator under a push or pull model. Either you can have the hardware
push the data to where the interrupt will be processed, or the interrupt
will have to pull the data to the CPU it is being processed on. As long
as there are enough PCIe credits to keep the PCIe link fully utilized
you are usually better off pushing the data to the CPU the interrupt is
on as the reads/writes are usually batched by the hardware.
> It seems that you thought that too, and added a commit to ixgbe driver
> that follows that logic [1].
> You added calls to set_dev_node() before and after the allocation.
> This seems to be prone to races in case multiple process want to alloc
> in parallel. The proper fix seems to be to extend the
> dma_alloc_coherent() to accept a NUMA node as an argument (if device's
> node is not good enough).
I'm not sure how racy it would be since you can really only have one
driver per device and the function that does this is protected by the
RTNL lock as I recall.
> I looked for, but couldn't find any discussion about that - is there a
> special reason not to extend dma_alloc_coherent()?
I think most of that is due to the fact that it is buried in multiple
levels of abstraction and at the time I wrote that code I had only been
working in the kernel drivers for a year or so. I had to revert similar
code from igb as it was buggy so I wasn't really in a place to be
modifying that at that time.
If you are planning to give it a try I would say go for it. The fact is
there are models where you want to have the device memory spread around
since the DMA writes usually are much less expensive to a remote node,
than accessing a remote node from the interrupt handler.
- Alex
next prev parent reply other threads:[~2015-05-13 16:45 UTC|newest]
Thread overview: 3+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-05-13 12:40 dma_alloc_coherent() to use memory close to cpu Amir Vadai
2015-05-13 15:49 ` Alexander Duyck [this message]
2015-05-14 7:15 ` Amir Vadai
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5553729F.3020205@redhat.com \
--to=alexander.h.duyck@redhat.com \
--cc=achiad@mellanox.com \
--cc=amirv@mellanox.com \
--cc=netdev@vger.kernel.org \
--cc=ogerlitz@mellanox.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).