* [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways
@ 2023-12-22 6:12 Dan Williams
2023-12-22 15:57 ` Dave Jiang
2023-12-22 20:12 ` Alison Schofield
0 siblings, 2 replies; 5+ messages in thread
From: Dan Williams @ 2023-12-22 6:12 UTC (permalink / raw)
To: linux-cxl; +Cc: stable, Huang, Ying, alison.schofield
From: Huang Ying <ying.huang@intel.com>
The decoder_populate_targets() helper walks all of the targets in a port
and makes sure they can be looked up in @target_map. Where @target_map
is a lookup table from target position to target id (corresponding to a
cxl_dport instance). However @target_map is only responsible for
conveying the active dport instances as conveyed by interleave_ways.
When nr_targets > interleave_ways it results in
decoder_populate_targets() walking off the end of the valid entries in
@target_map. Given target_map is initialized to 0 it results in the
dport lookup failing if position 0 is not mapped to a dport with an id
of 0:
cxl_port port3: Failed to populate active decoder targets
cxl_port port3: Failed to add decoder
cxl_port port3: Failed to add decoder3.0
cxl_bus_probe: cxl_port port3: probe: -6
This bug also highlights that when the decoder's ->targets[] array is
written in cxl_port_setup_targets() it is missing a hold of the
targets_lock to synchronize against sysfs readers of the target list. A
fix for that is saved for a later patch.
Fixes: a5c258021689 ("cxl/bus: Populate the target list at decoder create")
Cc: <stable@vger.kernel.org>
Signed-off-by: "Huang, Ying" <ying.huang@intel.com>
[djbw: rewrite the changelog, find the Fixes: tag]
Co-developed-by: Dan Williams <dan.j.williams@intel.com>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
---
drivers/cxl/core/port.c | 2 +-
1 file changed, 1 insertion(+), 1 deletion(-)
diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
index b7c93bb18f6e..57495cdc181f 100644
--- a/drivers/cxl/core/port.c
+++ b/drivers/cxl/core/port.c
@@ -1644,7 +1644,7 @@ static int decoder_populate_targets(struct cxl_switch_decoder *cxlsd,
return -EINVAL;
write_seqlock(&cxlsd->target_lock);
- for (i = 0; i < cxlsd->nr_targets; i++) {
+ for (i = 0; i < cxlsd->cxld.interleave_ways; i++) {
struct cxl_dport *dport = find_dport(port, target_map[i]);
if (!dport) {
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways
2023-12-22 6:12 [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways Dan Williams
@ 2023-12-22 15:57 ` Dave Jiang
2023-12-22 20:12 ` Alison Schofield
1 sibling, 0 replies; 5+ messages in thread
From: Dave Jiang @ 2023-12-22 15:57 UTC (permalink / raw)
To: Dan Williams, linux-cxl; +Cc: stable, Huang, Ying, alison.schofield
On 12/21/23 23:12, Dan Williams wrote:
> From: Huang Ying <ying.huang@intel.com>
>
> The decoder_populate_targets() helper walks all of the targets in a port
> and makes sure they can be looked up in @target_map. Where @target_map
> is a lookup table from target position to target id (corresponding to a
> cxl_dport instance). However @target_map is only responsible for
> conveying the active dport instances as conveyed by interleave_ways.
>
> When nr_targets > interleave_ways it results in
> decoder_populate_targets() walking off the end of the valid entries in
> @target_map. Given target_map is initialized to 0 it results in the
> dport lookup failing if position 0 is not mapped to a dport with an id
> of 0:
>
> cxl_port port3: Failed to populate active decoder targets
> cxl_port port3: Failed to add decoder
> cxl_port port3: Failed to add decoder3.0
> cxl_bus_probe: cxl_port port3: probe: -6
>
> This bug also highlights that when the decoder's ->targets[] array is
> written in cxl_port_setup_targets() it is missing a hold of the
> targets_lock to synchronize against sysfs readers of the target list. A
> fix for that is saved for a later patch.
>
> Fixes: a5c258021689 ("cxl/bus: Populate the target list at decoder create")
> Cc: <stable@vger.kernel.org>
> Signed-off-by: "Huang, Ying" <ying.huang@intel.com>
> [djbw: rewrite the changelog, find the Fixes: tag]
> Co-developed-by: Dan Williams <dan.j.williams@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Reviewed-by: Dave Jiang <dave.jiang@intel.com>
> ---
> drivers/cxl/core/port.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
> index b7c93bb18f6e..57495cdc181f 100644
> --- a/drivers/cxl/core/port.c
> +++ b/drivers/cxl/core/port.c
> @@ -1644,7 +1644,7 @@ static int decoder_populate_targets(struct cxl_switch_decoder *cxlsd,
> return -EINVAL;
>
> write_seqlock(&cxlsd->target_lock);
> - for (i = 0; i < cxlsd->nr_targets; i++) {
> + for (i = 0; i < cxlsd->cxld.interleave_ways; i++) {
> struct cxl_dport *dport = find_dport(port, target_map[i]);
>
> if (!dport) {
>
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways
2023-12-22 6:12 [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways Dan Williams
2023-12-22 15:57 ` Dave Jiang
@ 2023-12-22 20:12 ` Alison Schofield
2023-12-22 21:10 ` Dan Williams
1 sibling, 1 reply; 5+ messages in thread
From: Alison Schofield @ 2023-12-22 20:12 UTC (permalink / raw)
To: Dan Williams; +Cc: linux-cxl, stable, Huang, Ying
On Thu, Dec 21, 2023 at 10:12:12PM -0800, Dan Williams wrote:
> From: Huang Ying <ying.huang@intel.com>
>
> The decoder_populate_targets() helper walks all of the targets in a port
> and makes sure they can be looked up in @target_map. Where @target_map
> is a lookup table from target position to target id (corresponding to a
> cxl_dport instance). However @target_map is only responsible for
> conveying the active dport instances as conveyed by interleave_ways.
>
> When nr_targets > interleave_ways it results in
> decoder_populate_targets() walking off the end of the valid entries in
> @target_map. Given target_map is initialized to 0 it results in the
> dport lookup failing if position 0 is not mapped to a dport with an id
> of 0:
>
> cxl_port port3: Failed to populate active decoder targets
> cxl_port port3: Failed to add decoder
> cxl_port port3: Failed to add decoder3.0
> cxl_bus_probe: cxl_port port3: probe: -6
>
> This bug also highlights that when the decoder's ->targets[] array is
> written in cxl_port_setup_targets() it is missing a hold of the
> targets_lock to synchronize against sysfs readers of the target list. A
> fix for that is saved for a later patch.
>
> Fixes: a5c258021689 ("cxl/bus: Populate the target list at decoder create")
> Cc: <stable@vger.kernel.org>
> Signed-off-by: "Huang, Ying" <ying.huang@intel.com>
> [djbw: rewrite the changelog, find the Fixes: tag]
> Co-developed-by: Dan Williams <dan.j.williams@intel.com>
> Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> ---
> drivers/cxl/core/port.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
> index b7c93bb18f6e..57495cdc181f 100644
> --- a/drivers/cxl/core/port.c
> +++ b/drivers/cxl/core/port.c
> @@ -1644,7 +1644,7 @@ static int decoder_populate_targets(struct cxl_switch_decoder *cxlsd,
> return -EINVAL;
>
> write_seqlock(&cxlsd->target_lock);
> - for (i = 0; i < cxlsd->nr_targets; i++) {
> + for (i = 0; i < cxlsd->cxld.interleave_ways; i++) {
> struct cxl_dport *dport = find_dport(port, target_map[i]);
>
Does this loop need to protect against interleave_ways > nr_targets?
ie protect from walking off the target_map[nr_targets].
There is a check for that in cxl_port_setup_targets()
>> if (iw > 8 || iw > cxlsd->nr_targets) {
>> dev_dbg(&cxlr->dev,
>> "%s:%s:%s: ways: %d overflows targets: %d\n",
Wondering if a check at the time of walking is clearer, esp since
nr_targets is explicitly defined w the target list in cxlsd.
>> struct cxl_switch_decoder {
>> struct cxl_decoder cxld;
>> int nr_targets;
>> struct cxl_dport *target[];
>> };
> if (!dport) {
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways
2023-12-22 20:12 ` Alison Schofield
@ 2023-12-22 21:10 ` Dan Williams
2023-12-22 22:01 ` Alison Schofield
0 siblings, 1 reply; 5+ messages in thread
From: Dan Williams @ 2023-12-22 21:10 UTC (permalink / raw)
To: Alison Schofield, Dan Williams; +Cc: linux-cxl, stable, Huang, Ying
Alison Schofield wrote:
> On Thu, Dec 21, 2023 at 10:12:12PM -0800, Dan Williams wrote:
> > From: Huang Ying <ying.huang@intel.com>
> >
> > The decoder_populate_targets() helper walks all of the targets in a port
> > and makes sure they can be looked up in @target_map. Where @target_map
> > is a lookup table from target position to target id (corresponding to a
> > cxl_dport instance). However @target_map is only responsible for
> > conveying the active dport instances as conveyed by interleave_ways.
> >
> > When nr_targets > interleave_ways it results in
> > decoder_populate_targets() walking off the end of the valid entries in
> > @target_map. Given target_map is initialized to 0 it results in the
> > dport lookup failing if position 0 is not mapped to a dport with an id
> > of 0:
> >
> > cxl_port port3: Failed to populate active decoder targets
> > cxl_port port3: Failed to add decoder
> > cxl_port port3: Failed to add decoder3.0
> > cxl_bus_probe: cxl_port port3: probe: -6
> >
> > This bug also highlights that when the decoder's ->targets[] array is
> > written in cxl_port_setup_targets() it is missing a hold of the
> > targets_lock to synchronize against sysfs readers of the target list. A
> > fix for that is saved for a later patch.
> >
> > Fixes: a5c258021689 ("cxl/bus: Populate the target list at decoder create")
> > Cc: <stable@vger.kernel.org>
> > Signed-off-by: "Huang, Ying" <ying.huang@intel.com>
> > [djbw: rewrite the changelog, find the Fixes: tag]
> > Co-developed-by: Dan Williams <dan.j.williams@intel.com>
> > Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> > ---
> > drivers/cxl/core/port.c | 2 +-
> > 1 file changed, 1 insertion(+), 1 deletion(-)
> >
> > diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
> > index b7c93bb18f6e..57495cdc181f 100644
> > --- a/drivers/cxl/core/port.c
> > +++ b/drivers/cxl/core/port.c
> > @@ -1644,7 +1644,7 @@ static int decoder_populate_targets(struct cxl_switch_decoder *cxlsd,
> > return -EINVAL;
> >
> > write_seqlock(&cxlsd->target_lock);
> > - for (i = 0; i < cxlsd->nr_targets; i++) {
> > + for (i = 0; i < cxlsd->cxld.interleave_ways; i++) {
> > struct cxl_dport *dport = find_dport(port, target_map[i]);
> >
>
> Does this loop need to protect against interleave_ways > nr_targets?
> ie protect from walking off the target_map[nr_targets].
It's a good review question, but I think target_map[] is safe from those
shenanigans. For the CFMWS case interleave_ways == nr_targets, see the
@nr_tagets argument to cxl_root_decoder_alloc(). For the mid-level
switch decoder case it is protected by the fact that the decoder's
interleave_ways setting is sanity checked by the eiw_to_ways() call in
init_hdm_decoder(). So there's never any danger of walking off the end
of the target_map[] because that is allocated to support the
spec-defined hardware-max of CXL_DECODER_MAX_INTERLEAVE.
> There is a check for that in cxl_port_setup_targets()
> >> if (iw > 8 || iw > cxlsd->nr_targets) {
> >> dev_dbg(&cxlr->dev,
> >> "%s:%s:%s: ways: %d overflows targets: %d\n",
That check is for programming mid-level decoders where we find out at
run time that the interleave_ways of the region can not be satisfied by
one of the decoders in the chain, so that one is not about walking past
the end of a target list, that one is about detecting impossible region
configurations.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways
2023-12-22 21:10 ` Dan Williams
@ 2023-12-22 22:01 ` Alison Schofield
0 siblings, 0 replies; 5+ messages in thread
From: Alison Schofield @ 2023-12-22 22:01 UTC (permalink / raw)
To: Dan Williams; +Cc: linux-cxl, stable, Huang, Ying
On Fri, Dec 22, 2023 at 01:10:52PM -0800, Dan Williams wrote:
> Alison Schofield wrote:
> > On Thu, Dec 21, 2023 at 10:12:12PM -0800, Dan Williams wrote:
> > > From: Huang Ying <ying.huang@intel.com>
> > >
> > > The decoder_populate_targets() helper walks all of the targets in a port
> > > and makes sure they can be looked up in @target_map. Where @target_map
> > > is a lookup table from target position to target id (corresponding to a
> > > cxl_dport instance). However @target_map is only responsible for
> > > conveying the active dport instances as conveyed by interleave_ways.
> > >
> > > When nr_targets > interleave_ways it results in
> > > decoder_populate_targets() walking off the end of the valid entries in
> > > @target_map. Given target_map is initialized to 0 it results in the
> > > dport lookup failing if position 0 is not mapped to a dport with an id
> > > of 0:
> > >
> > > cxl_port port3: Failed to populate active decoder targets
> > > cxl_port port3: Failed to add decoder
> > > cxl_port port3: Failed to add decoder3.0
> > > cxl_bus_probe: cxl_port port3: probe: -6
> > >
> > > This bug also highlights that when the decoder's ->targets[] array is
> > > written in cxl_port_setup_targets() it is missing a hold of the
> > > targets_lock to synchronize against sysfs readers of the target list. A
> > > fix for that is saved for a later patch.
> > >
> > > Fixes: a5c258021689 ("cxl/bus: Populate the target list at decoder create")
> > > Cc: <stable@vger.kernel.org>
> > > Signed-off-by: "Huang, Ying" <ying.huang@intel.com>
> > > [djbw: rewrite the changelog, find the Fixes: tag]
> > > Co-developed-by: Dan Williams <dan.j.williams@intel.com>
> > > Signed-off-by: Dan Williams <dan.j.williams@intel.com>
> > > ---
Thanks for answering my questions -
Reviewed-by: Alison Schofield <alison.schofield@intel.com>
> > > drivers/cxl/core/port.c | 2 +-
> > > 1 file changed, 1 insertion(+), 1 deletion(-)
> > >
> > > diff --git a/drivers/cxl/core/port.c b/drivers/cxl/core/port.c
> > > index b7c93bb18f6e..57495cdc181f 100644
> > > --- a/drivers/cxl/core/port.c
> > > +++ b/drivers/cxl/core/port.c
> > > @@ -1644,7 +1644,7 @@ static int decoder_populate_targets(struct cxl_switch_decoder *cxlsd,
> > > return -EINVAL;
> > >
> > > write_seqlock(&cxlsd->target_lock);
> > > - for (i = 0; i < cxlsd->nr_targets; i++) {
> > > + for (i = 0; i < cxlsd->cxld.interleave_ways; i++) {
> > > struct cxl_dport *dport = find_dport(port, target_map[i]);
> > >
> >
> > Does this loop need to protect against interleave_ways > nr_targets?
> > ie protect from walking off the target_map[nr_targets].
>
> It's a good review question, but I think target_map[] is safe from those
> shenanigans. For the CFMWS case interleave_ways == nr_targets, see the
> @nr_tagets argument to cxl_root_decoder_alloc(). For the mid-level
> switch decoder case it is protected by the fact that the decoder's
> interleave_ways setting is sanity checked by the eiw_to_ways() call in
> init_hdm_decoder(). So there's never any danger of walking off the end
> of the target_map[] because that is allocated to support the
> spec-defined hardware-max of CXL_DECODER_MAX_INTERLEAVE.
>
> > There is a check for that in cxl_port_setup_targets()
> > >> if (iw > 8 || iw > cxlsd->nr_targets) {
> > >> dev_dbg(&cxlr->dev,
> > >> "%s:%s:%s: ways: %d overflows targets: %d\n",
>
> That check is for programming mid-level decoders where we find out at
> run time that the interleave_ways of the region can not be satisfied by
> one of the decoders in the chain, so that one is not about walking past
> the end of a target list, that one is about detecting impossible region
> configurations.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2023-12-22 22:02 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2023-12-22 6:12 [PATCH] cxl/port: Fix decoder initialization when nr_targets > interleave_ways Dan Williams
2023-12-22 15:57 ` Dave Jiang
2023-12-22 20:12 ` Alison Schofield
2023-12-22 21:10 ` Dan Williams
2023-12-22 22:01 ` Alison Schofield
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox