public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
From: John Garry <john.g.garry@oracle.com>
To: "Li, Eric (Honggang)" <Eric.H.Li@Dell.com>,
	Jason Yan <yanaijie@huawei.com>,
	"james.bottomley@hansenpartnership.com"
	<James.Bottomley@HansenPartnership.com>,
	"Martin K . Petersen" <martin.petersen@oracle.com>
Cc: "linux-scsi@vger.kernel.org" <linux-scsi@vger.kernel.org>
Subject: Re: Issue in sas_ex_discover_dev() for multiple level of SAS expanders in a domain
Date: Wed, 1 May 2024 15:23:57 +0100	[thread overview]
Message-ID: <c004da1f-b9fe-4641-9d0f-162eabde0101@oracle.com> (raw)
In-Reply-To: <SJ0PR19MB54152CB3D611259510902505C41A2@SJ0PR19MB5415.namprd19.prod.outlook.com>

On 30/04/2024 15:22, Li, Eric (Honggang) wrote:
> I suppose you have got the log file I attached.
> If not, please let me know.
> Any update about this?
> 
> Eric LI

So if you revert a1b6fb947f923, but then remove the call to 
sas_ex_join_wide_port() re-added in that revert, is it ok? I am just 
wondering are we just missing the call to set phy_state = 
PHY_DEVICE_DISCOVERED after v5.3?

Thanks,
John

> 
> 
> Internal Use - Confidential
>> -----Original Message-----
>> From: Li, Eric (Honggang)
>> Sent: Thursday, April 25, 2024 1:04 PM
>> To: Jason Yan <yanaijie@huawei.com>; John Garry <john.g.garry@oracle.com>;
>> james.bottomley@hansenpartnership.com; Martin K . Petersen
>> <martin.petersen@oracle.com>
>> Cc: linux-scsi@vger.kernel.org
>> Subject: RE: Issue in sas_ex_discover_dev() for multiple level of SAS expanders in a
>> domain
>>
>>> -----Original Message-----
>>> From: Jason Yan <yanaijie@huawei.com>
>>> Sent: Thursday, April 25, 2024 10:58 AM
>>> To: John Garry <john.g.garry@oracle.com>; Li, Eric (Honggang)
>>> <Eric.H.Li@Dell.com>; james.bottomley@hansenpartnership.com; Martin K .
>>> Petersen <martin.petersen@oracle.com>
>>> Cc: linux-scsi@vger.kernel.org
>>> Subject: Re: Issue in sas_ex_discover_dev() for multiple level of SAS
>>> expanders in a domain
>>>
>>>
>>> [EXTERNAL EMAIL]
>>>
>>> On 2024/4/24 18:46, John Garry wrote:
>>>> On 24/04/2024 09:59, Li, Eric (Honggang) wrote:
>>>>> Hi,
>>>>>
>>>>> There is an issue in the function sas_ex_discover_dev() when I have
>>>>> multiple SAS expanders chained under one SAS port on SAS controller.
>>>>
>>>> I think typically we can't and so don't test such a setup.
>>>
>>> Eric,
>>>
>>> I also don't understand why you need such a setup. Can you explain more
>>> details of your topology?
>>
>> I believe this is common setup if you want to support large number of drives under
>> one SAS port of SAS controller.
>>
>>>
>>>>
>>>>>
>>>>> In this function, we first check whether the PHY’s
>>>>> attached_sas_address is already present in the SAS domain, and then
>>>>> check if this PHY belongs to an existing port on this SAS expander.
>>>>> I think this has an issue if this SAS expander use a wide port
>>>>> connecting a downstream SAS expander.
>>>>> This is because if the PHY belongs to an existing port on this SAS
>>>>> expander, the attached SAS address of this port must already be
>>>>> present in the domain and it results in disabling that port.
>>>>> I don’t think that is what we expect.
>>>>>
>>>>> In old release (4.x), at the end of this function, it would make
>>>>> addition sas_ex_join_wide_port() call for any possibly PHYs that
>>>>> could be added into the SAS port.
>>>>> This will make subsequent PHYs (other than the first PHY of that
>>>>> port) being marked to DISCOVERED so that this function would not be
>>>>> invoked on those subsequent PHYs (in that port).
>>>>> But potential question here is we didn’t configure the per-PHY
>>>>> routing table for those PHYs.
>>>>> As I don’t have such SAS expander on hand, I am not sure what’s
>>>>> impact (maybe just performance/bandwidth impact).
>>>>> But at least, it didn’t impact the functionality of that port.
>>>>>
>>>>> But in v5.3 or later release, that part of code was removed (in the
>>>>> commit a1b6fb947f923).
>>>>
>>>> Jason, can you please check this?
>>>
>>> The removed code is only for races before we serialize the event
>>> processing. All PHYs will still be scanned one by one and add to the
>>> wide port if they have the same address. So are you encountering a real issue? If
>> so, can you share the full log?
>>
>> Yes. We did hit this issue when we upgrade Linux kernel from 4.19.236 to 5.14.21.
>> Full logs attached.
>>
>>>
>>> Thanks,
>>> Jason
>>>
>>> 祝一切顺利!
>>>
>>>>
>>>> Thanks!
>>>>
>>>>> And this caused this problem occurred (downstream port of that SAS
>>>>> expander was disabled and all downstream SAS devices were removed
>>>>> from the domain).
>>>>>
>>>>> Regards.
>>>>> Eric Li
>>>>>
>>>>> SPE, DellEMC
>>>>> 3/F KIC 1, 252# Songhu Road, YangPu District, SHANGHAI
>>>>> +86-21-6036-4384
>>>>>
>>>>>
>>>>> Internal Use - Confidential
>>>>
>>>> .


  reply	other threads:[~2024-05-01 14:24 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-04-24  8:59 Issue in sas_ex_discover_dev() for multiple level of SAS expanders in a domain Li, Eric (Honggang)
2024-04-24 10:46 ` John Garry
2024-04-25  2:57   ` Jason Yan
2024-04-25  5:03     ` Li, Eric (Honggang)
2024-04-30 14:22       ` Li, Eric (Honggang)
2024-05-01 14:23         ` John Garry [this message]
2024-05-03  3:15           ` Li, Eric (Honggang)
2024-05-03  8:33             ` John Garry
2024-05-06  1:49               ` Li, Eric (Honggang)
2024-05-07  8:03                 ` John Garry
2024-05-07  8:44                 ` Li, Eric (Honggang)
2024-05-07  9:17                   ` John Garry
2024-05-07 11:17                     ` Li, Eric (Honggang)
2024-05-07 15:14                       ` John Garry
2024-05-08  0:59                         ` Li, Eric (Honggang)
2024-05-08  7:48                           ` John Garry
2024-05-08  8:29                             ` Li, Eric (Honggang)
2024-05-09  3:52                               ` Jason Yan
2024-05-11  3:41                               ` Jason Yan
2024-05-14  9:23                                 ` Li, Eric (Honggang)
2025-06-10 13:05                                   ` Li, Eric (Honggang)
2025-06-10 13:33                                     ` Jason Yan

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=c004da1f-b9fe-4641-9d0f-162eabde0101@oracle.com \
    --to=john.g.garry@oracle.com \
    --cc=Eric.H.Li@Dell.com \
    --cc=James.Bottomley@HansenPartnership.com \
    --cc=linux-scsi@vger.kernel.org \
    --cc=martin.petersen@oracle.com \
    --cc=yanaijie@huawei.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox