From: Sinan Kaya <okaya@codeaurora.org>
To: David Miller <davem@davemloft.net>
Cc: netdev@vger.kernel.org, timur@codeaurora.org,
sulrich@codeaurora.org, linux-arm-kernel@lists.infradead.org,
linux-arm-msm@vger.kernel.org
Subject: Re: [PATCH v4 00/17] netdev: Eliminate duplicate barriers on weakly-ordered archs
Date: Wed, 21 Mar 2018 14:06:02 -0500 [thread overview]
Message-ID: <652fd716-6ad2-428d-4bfc-e050f9db3e41@codeaurora.org> (raw)
In-Reply-To: <20180321.115655.108053425798020503.davem@davemloft.net>
On 3/21/2018 10:56 AM, David Miller wrote:
> From: Sinan Kaya <okaya@codeaurora.org>
> Date: Mon, 19 Mar 2018 22:42:15 -0400
>
>> Code includes wmb() followed by writel() in multiple places. writel()
>> already has a barrier on some architectures like arm64.
>>
>> This ends up CPU observing two barriers back to back before executing the
>> register write.
>>
>> Since code already has an explicit barrier call, changing writel() to
>> writel_relaxed().
>>
>> I did a regex search for wmb() followed by writel() in each drivers
>> directory.
>> I scrubbed the ones I care about in this series.
>>
>> I considered "ease of change", "popular usage" and "performance critical
>> path" as the determining criteria for my filtering.
>
> I agree that for performance sensitive operations, specifically writing
> doorbell registers in the hot paths or RX and TX packet processing, this
> is a good change.
>
> However, in configuration paths and whatnot, it is much less urgent and
> useful.
>
> Therefore I think it would work better if you concentrated solely on
> hot code path cases.
>
> You can, on a driver by driver basis, submit the other transformations
> in the slow paths, and let the driver maintainers decide whether to
> take those on or not.
>
> Also, please stick exactly to the case where we have:
>
> wmb/mb/etc.
> writel()
>
OK
> Because I see some changes where we have:
>
> writel()
>
> barrier()
>
> writel()
>
barrier() on ARM is a write barrier. Apparently, it is a compiler barrier
on Intel. I briefly discussed the barrier() behavior in rdma mailing list [1].
Our conclusion is that code should have used wmb() if it really needed
to synchronize memory contents to the device and barrier() is already
wrong. It just guarantees that code doesn't move. writel() already has
a compiler barrier inside. It won't move to begin with.
Like you suggested, we decided to leave these changes alone and even
skip those drivers.
I'll take another look at the patches.
> for exmaple, and you are turning that second writel() into a relaxed
> on as well. The above is using a compile barrier, not a memory
> barrier, so effectively it is two writel()'s in sequence which is
> not what this patch set is about.
>
> If anything, that compile barrier() is superfluous and could be
> removed. But that is also a separate change from what this patch
> series is doing here.
>
agreed, I'll remove such changes.
> Finally, it makes it that much easier if we can see the preceeding
> memory barrier in the context of the patch that adjusts the writel
> into a writel_relaxed.
>
> In one case, a macro DOORBELL() is changed to use writel(). This
> makes it so that the patch reviewer has to scan over the entire
> driver in question to see exactly how DOORBELL() is used and whether
> it fits the criteria for the writel_relaxed() transformation.
>
> I would suggest that you adjust the name of the macro in a situation
> like this, f.e. to DOORBELL_RELAXED().
makes sense.
>
> Thank you.
>
[1] https://patchwork.kernel.org/project/LKML/list/?submitter=145491
--
Sinan Kaya
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.
WARNING: multiple messages have this Message-ID (diff)
From: okaya@codeaurora.org (Sinan Kaya)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH v4 00/17] netdev: Eliminate duplicate barriers on weakly-ordered archs
Date: Wed, 21 Mar 2018 14:06:02 -0500 [thread overview]
Message-ID: <652fd716-6ad2-428d-4bfc-e050f9db3e41@codeaurora.org> (raw)
In-Reply-To: <20180321.115655.108053425798020503.davem@davemloft.net>
On 3/21/2018 10:56 AM, David Miller wrote:
> From: Sinan Kaya <okaya@codeaurora.org>
> Date: Mon, 19 Mar 2018 22:42:15 -0400
>
>> Code includes wmb() followed by writel() in multiple places. writel()
>> already has a barrier on some architectures like arm64.
>>
>> This ends up CPU observing two barriers back to back before executing the
>> register write.
>>
>> Since code already has an explicit barrier call, changing writel() to
>> writel_relaxed().
>>
>> I did a regex search for wmb() followed by writel() in each drivers
>> directory.
>> I scrubbed the ones I care about in this series.
>>
>> I considered "ease of change", "popular usage" and "performance critical
>> path" as the determining criteria for my filtering.
>
> I agree that for performance sensitive operations, specifically writing
> doorbell registers in the hot paths or RX and TX packet processing, this
> is a good change.
>
> However, in configuration paths and whatnot, it is much less urgent and
> useful.
>
> Therefore I think it would work better if you concentrated solely on
> hot code path cases.
>
> You can, on a driver by driver basis, submit the other transformations
> in the slow paths, and let the driver maintainers decide whether to
> take those on or not.
>
> Also, please stick exactly to the case where we have:
>
> wmb/mb/etc.
> writel()
>
OK
> Because I see some changes where we have:
>
> writel()
>
> barrier()
>
> writel()
>
barrier() on ARM is a write barrier. Apparently, it is a compiler barrier
on Intel. I briefly discussed the barrier() behavior in rdma mailing list [1].
Our conclusion is that code should have used wmb() if it really needed
to synchronize memory contents to the device and barrier() is already
wrong. It just guarantees that code doesn't move. writel() already has
a compiler barrier inside. It won't move to begin with.
Like you suggested, we decided to leave these changes alone and even
skip those drivers.
I'll take another look at the patches.
> for exmaple, and you are turning that second writel() into a relaxed
> on as well. The above is using a compile barrier, not a memory
> barrier, so effectively it is two writel()'s in sequence which is
> not what this patch set is about.
>
> If anything, that compile barrier() is superfluous and could be
> removed. But that is also a separate change from what this patch
> series is doing here.
>
agreed, I'll remove such changes.
> Finally, it makes it that much easier if we can see the preceeding
> memory barrier in the context of the patch that adjusts the writel
> into a writel_relaxed.
>
> In one case, a macro DOORBELL() is changed to use writel(). This
> makes it so that the patch reviewer has to scan over the entire
> driver in question to see exactly how DOORBELL() is used and whether
> it fits the criteria for the writel_relaxed() transformation.
>
> I would suggest that you adjust the name of the macro in a situation
> like this, f.e. to DOORBELL_RELAXED().
makes sense.
>
> Thank you.
>
[1] https://patchwork.kernel.org/project/LKML/list/?submitter=145491
--
Sinan Kaya
Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc.
Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.
next prev parent reply other threads:[~2018-03-21 19:06 UTC|newest]
Thread overview: 57+ messages / expand[flat|nested] mbox.gz Atom feed top
2018-03-20 2:42 [PATCH v4 00/17] netdev: Eliminate duplicate barriers on weakly-ordered archs Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 01/17] i40e/i40evf: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 02/17] ixgbe: eliminate " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 03/17] igbvf: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 04/17] igb: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 05/17] ixgbevf: keep writel() closer to wmb() Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 06/17] ixgbevf: eliminate duplicate barriers on weakly-ordered archs Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [Intel-wired-lan] [PATCH v4 07/17] fm10k: Eliminate " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 08/17] drivers: net: cxgb: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 09/17] net: qla3xxx: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 10/17] qlcnic: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 11/17] bnx2x: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-22 10:10 ` Kalluru, Sudarsana
2018-03-22 10:10 ` Kalluru, Sudarsana
2018-03-20 2:42 ` [PATCH v4 12/17] net: cxgb4/cxgb4vf: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-21 23:03 ` Casey Leedom
2018-03-21 23:03 ` Casey Leedom
2018-03-22 0:00 ` okaya
2018-03-22 0:00 ` okaya at codeaurora.org
2018-03-20 2:42 ` [PATCH v4 13/17] net: cxgb3: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 14/17] net: qlge: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 15/17] bnxt_en: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 16/17] qed/qede: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-20 2:42 ` [PATCH v4 17/17] net: ena: " Sinan Kaya
2018-03-20 2:42 ` Sinan Kaya
2018-03-25 12:06 ` Belgazal, Netanel
2018-03-25 12:06 ` Belgazal, Netanel
2018-03-25 13:33 ` okaya
2018-03-25 13:33 ` okaya at codeaurora.org
2018-03-21 15:56 ` [PATCH v4 00/17] netdev: " David Miller
2018-03-21 15:56 ` David Miller
2018-03-21 19:06 ` Sinan Kaya [this message]
2018-03-21 19:06 ` Sinan Kaya
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=652fd716-6ad2-428d-4bfc-e050f9db3e41@codeaurora.org \
--to=okaya@codeaurora.org \
--cc=davem@davemloft.net \
--cc=linux-arm-kernel@lists.infradead.org \
--cc=linux-arm-msm@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=sulrich@codeaurora.org \
--cc=timur@codeaurora.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.