From: Leonid Yegoshin <Leonid.Yegoshin@imgtec.com>
To: James Hogan <james.hogan@imgtec.com>, <linux-mips@linux-mips.org>,
<benh@kernel.crashing.org>, <will.deacon@arm.com>,
<linux-kernel@vger.kernel.org>, <ralf@linux-mips.org>,
<markos.chandras@imgtec.com>, <macro@linux-mips.org>,
<Steven.Hill@imgtec.com>, <alexander.h.duyck@redhat.com>,
<davem@davemloft.net>
Subject: Re: [PATCH 2/3] MIPS: enforce LL-SC loop enclosing with SYNC (ACQUIRE and RELEASE)
Date: Tue, 2 Jun 2015 11:43:37 -0700 [thread overview]
Message-ID: <556DF959.1010704@imgtec.com> (raw)
In-Reply-To: <556D95DA.5090305@imgtec.com>
[-- Attachment #1: Type: text/plain, Size: 2909 bytes --]
On 06/02/2015 04:39 AM, James Hogan wrote:
> Hi Leonid,
>
> On 02/06/15 01:09, Leonid Yegoshin wrote:
>
>> CPUs may occasionally have problems in accordance with HW team.
> "have problems in accordance with HW team" is a bit confusing. What do
> you mean?
I wrote about memory barriers and problems may happens without it.
Some details from internal e-mail exchange:
-------------------------------------------
yes, it is possible for the stores to be observed out of order
The SC B can complete if it has ownership and the SW A is still
waiting for ownership. This scenario was also possible on older
cores.
(snip)
-----Original Message-----
From: Leonid Yegoshin
Subject: more SYNC issues
...
I want to know - do XXXX orders stores without SYNC?
Let's consider a scenario:
SW $0, A (cache miss)
...
LL $1, B (cache hit)
..
SC $1, B (cache hit again)
B (is not taken!)
..
SYNC 0x10
Is it possible for other core to get new value of B before new value of A between SC-B and SYNC?
-------------------------------------------------------------
Another mail, another processor:
==========================================
Hi Leonid
I looked into the LSU RTL and I do not see speculation being blocked for
younger loads following LL.
I also ran a testcase to confirm my observation:
LL (cacheable)Miss
LW (cacheable)Miss, different cacheline
Miss request for LW goes out before the Miss request for LL, also the
GPR updated for LW happens before the LL GPR update.
==========================================
> Actually *true*? P5600 you mean? Same in Kconfig help text.
Yes, thank you, it is P5600 and I5600 doesn't exist.
> It feels wrong to be giving the user this option. Can't we just select
> WEAK_REORDERING_BEYOND_LLSC automatically based on the hardware that
> needs to be supported by the kernel configuration (e.g. CPU_MIPSR6 or
> CPU_MIPS32_R2)?
No, we can't - a lot of old processors are in-order and all of that is
still MIPS R2.
> Those who care about mips r2 performance on hardware
> which doesn't strictly need it can always speak up / add an exception.
>
> Out of interest, are futex operations safe with weak llsc ordering, on
> the premise that they're mainly used by userland so ordering with normal
> kernel accesses just doesn't matter in practice?
I think futex is used to communicate between user threads and problem is
theoretically still here.
> This patch does 3 logically separable things:
> 1) add smp_release/smp_acquire based on MIPS_LIGHTWEIGHT_SYNC and weaken
> smp_store_release()/smp_load_acquire() to use them.
> 2) weaken llsc barriers when MIPS_LIGHTWEIGHT_SYNC.
> 3) the MIPS_ENFORCE_WEAK_REORDERING_BEYOND_LLSC Kconfig stuff (or
> whatever method to select WEAK_REORDERING_BEYOND_LLSC more often).
>
> Any reason not to split them, and give a clear description of each?
>
>
I don't see a reason to split it.
- Leonid.
[-- Attachment #2: Type: text/html, Size: 5842 bytes --]
WARNING: multiple messages have this Message-ID (diff)
From: Leonid Yegoshin <Leonid.Yegoshin@imgtec.com>
To: James Hogan <james.hogan@imgtec.com>,
linux-mips@linux-mips.org, benh@kernel.crashing.org,
will.deacon@arm.com, linux-kernel@vger.kernel.org,
ralf@linux-mips.org, markos.chandras@imgtec.com,
macro@linux-mips.org, Steven.Hill@imgtec.com,
alexander.h.duyck@redhat.com, davem@davemloft.net
Subject: Re: [PATCH 2/3] MIPS: enforce LL-SC loop enclosing with SYNC (ACQUIRE and RELEASE)
Date: Tue, 2 Jun 2015 11:43:37 -0700 [thread overview]
Message-ID: <556DF959.1010704@imgtec.com> (raw)
Message-ID: <20150602184337.YyRzgrx5PWO9lg7z0KhqK6dcw1-ixKUfkhNGOm4pCLc@z> (raw)
In-Reply-To: <556D95DA.5090305@imgtec.com>
[-- Attachment #1: Type: text/plain, Size: 2909 bytes --]
On 06/02/2015 04:39 AM, James Hogan wrote:
> Hi Leonid,
>
> On 02/06/15 01:09, Leonid Yegoshin wrote:
>
>> CPUs may occasionally have problems in accordance with HW team.
> "have problems in accordance with HW team" is a bit confusing. What do
> you mean?
I wrote about memory barriers and problems may happens without it.
Some details from internal e-mail exchange:
-------------------------------------------
yes, it is possible for the stores to be observed out of order
The SC B can complete if it has ownership and the SW A is still
waiting for ownership. This scenario was also possible on older
cores.
(snip)
-----Original Message-----
From: Leonid Yegoshin
Subject: more SYNC issues
...
I want to know - do XXXX orders stores without SYNC?
Let's consider a scenario:
SW $0, A (cache miss)
...
LL $1, B (cache hit)
..
SC $1, B (cache hit again)
B (is not taken!)
..
SYNC 0x10
Is it possible for other core to get new value of B before new value of A between SC-B and SYNC?
-------------------------------------------------------------
Another mail, another processor:
==========================================
Hi Leonid
I looked into the LSU RTL and I do not see speculation being blocked for
younger loads following LL.
I also ran a testcase to confirm my observation:
LL (cacheable)Miss
LW (cacheable)Miss, different cacheline
Miss request for LW goes out before the Miss request for LL, also the
GPR updated for LW happens before the LL GPR update.
==========================================
> Actually *true*? P5600 you mean? Same in Kconfig help text.
Yes, thank you, it is P5600 and I5600 doesn't exist.
> It feels wrong to be giving the user this option. Can't we just select
> WEAK_REORDERING_BEYOND_LLSC automatically based on the hardware that
> needs to be supported by the kernel configuration (e.g. CPU_MIPSR6 or
> CPU_MIPS32_R2)?
No, we can't - a lot of old processors are in-order and all of that is
still MIPS R2.
> Those who care about mips r2 performance on hardware
> which doesn't strictly need it can always speak up / add an exception.
>
> Out of interest, are futex operations safe with weak llsc ordering, on
> the premise that they're mainly used by userland so ordering with normal
> kernel accesses just doesn't matter in practice?
I think futex is used to communicate between user threads and problem is
theoretically still here.
> This patch does 3 logically separable things:
> 1) add smp_release/smp_acquire based on MIPS_LIGHTWEIGHT_SYNC and weaken
> smp_store_release()/smp_load_acquire() to use them.
> 2) weaken llsc barriers when MIPS_LIGHTWEIGHT_SYNC.
> 3) the MIPS_ENFORCE_WEAK_REORDERING_BEYOND_LLSC Kconfig stuff (or
> whatever method to select WEAK_REORDERING_BEYOND_LLSC more often).
>
> Any reason not to split them, and give a clear description of each?
>
>
I don't see a reason to split it.
- Leonid.
[-- Attachment #2: Type: text/html, Size: 5842 bytes --]
next prev parent reply other threads:[~2015-06-02 18:52 UTC|newest]
Thread overview: 38+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-06-02 0:09 [PATCH 0/3] MIPS: SMP memory barriers: lightweight sync, acquire-release Leonid Yegoshin
2015-06-02 0:09 ` Leonid Yegoshin
2015-06-02 0:09 ` [PATCH 1/3] MIPS: R6: Use lightweight SYNC instruction in smp_* memory barriers Leonid Yegoshin
2015-06-02 0:09 ` Leonid Yegoshin
2015-06-02 10:08 ` Paul Burton
2015-06-02 10:08 ` Paul Burton
2015-06-02 12:12 ` Luc Van Oostenryck
2015-06-02 12:44 ` Ralf Baechle
2015-06-02 18:20 ` Leonid Yegoshin
2015-06-02 18:20 ` Leonid Yegoshin
2015-06-02 10:48 ` James Hogan
2015-06-02 10:48 ` James Hogan
2015-06-02 16:15 ` Maciej W. Rozycki
2015-06-02 23:56 ` David Daney
2015-06-03 1:56 ` Leonid Yegoshin
2015-06-03 1:56 ` Leonid Yegoshin
2015-06-05 13:10 ` Ralf Baechle
2015-06-05 21:18 ` Maciej W. Rozycki
2016-01-28 2:28 ` Joshua Kinard
2016-01-29 13:32 ` Maciej W. Rozycki
2016-01-29 13:32 ` Maciej W. Rozycki
2016-01-30 16:25 ` Joshua Kinard
2015-06-02 0:09 ` [PATCH 2/3] MIPS: enforce LL-SC loop enclosing with SYNC (ACQUIRE and RELEASE) Leonid Yegoshin
2015-06-02 0:09 ` Leonid Yegoshin
2015-06-02 11:39 ` James Hogan
2015-06-02 11:39 ` James Hogan
2015-06-02 18:43 ` Leonid Yegoshin [this message]
2015-06-02 18:43 ` Leonid Yegoshin
2015-06-02 18:53 ` Leonid Yegoshin
2015-06-02 0:09 ` [PATCH 3/3] MIPS: bugfix - replace smp_mb with release barrier function in unlocks Leonid Yegoshin
2015-06-02 0:09 ` Leonid Yegoshin
2015-06-02 11:42 ` James Hogan
2015-06-02 11:42 ` James Hogan
2015-06-02 13:22 ` Ralf Baechle
2015-06-02 8:41 ` [PATCH 0/3] MIPS: SMP memory barriers: lightweight sync, acquire-release Joshua Kinard
2015-06-02 9:59 ` Ralf Baechle
2015-06-02 18:59 ` Joshua Kinard
2015-06-02 19:19 ` Ralf Baechle
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=556DF959.1010704@imgtec.com \
--to=leonid.yegoshin@imgtec.com \
--cc=Steven.Hill@imgtec.com \
--cc=alexander.h.duyck@redhat.com \
--cc=benh@kernel.crashing.org \
--cc=davem@davemloft.net \
--cc=james.hogan@imgtec.com \
--cc=linux-kernel@vger.kernel.org \
--cc=linux-mips@linux-mips.org \
--cc=macro@linux-mips.org \
--cc=markos.chandras@imgtec.com \
--cc=ralf@linux-mips.org \
--cc=will.deacon@arm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.