* [PATCH 0/2] Regression in s2idle for family 19h model 78h @ 2023-04-27 5:33 Mario Limonciello 2023-04-27 5:33 ` [PATCH 1/2] amd_nb: Add PCI ID " Mario Limonciello 0 siblings, 1 reply; 16+ messages in thread From: Mario Limonciello @ 2023-04-27 5:33 UTC (permalink / raw) To: Hans de Goede, Sanket Goswami, Shyam Sundar S K, linux-hwmon, linux-pci Cc: Richard gong, Mario Limonciello, H. Peter Anvin, linux-kernel A regression occurred in 6.4 due to a missing ID in `amd_nb` and the `amd_pmc` driver now being dependent on amd_smn_read(). This series adds the missing ID which fixes s2idle on this system. The ID also enables k10temp to work. Mario Limonciello (2): amd_nb: Add PCI ID for family 19h model 78h k10temp: Add pci ID for family 19, model 78h arch/x86/kernel/amd_nb.c | 2 ++ drivers/hwmon/k10temp.c | 1 + include/linux/pci_ids.h | 1 + 3 files changed, 4 insertions(+) -- 2.34.1 ^ permalink raw reply [flat|nested] 16+ messages in thread
* [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 5:33 [PATCH 0/2] Regression in s2idle for family 19h model 78h Mario Limonciello @ 2023-04-27 5:33 ` Mario Limonciello 2023-04-27 16:46 ` Bjorn Helgaas ` (2 more replies) 0 siblings, 3 replies; 16+ messages in thread From: Mario Limonciello @ 2023-04-27 5:33 UTC (permalink / raw) To: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami Cc: Richard gong, Mario Limonciello, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci s2idle previously worked on this system, but it regressed in kernel 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for driver probe"). The reason for the regression is that before this commit the SMN communication was hardcoded, but after amd_smn_read() is used which relies upon the misc PCI ID used by DF function 3 being included in a table. The ID was missing for model 78h, so this meant that the amd_smn_read() wouldn't work. Add the missing ID into amd_nb, restoring s2idle on this system. Fixes: 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for driver probe") Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> --- arch/x86/kernel/amd_nb.c | 2 ++ include/linux/pci_ids.h | 1 + 2 files changed, 3 insertions(+) diff --git a/arch/x86/kernel/amd_nb.c b/arch/x86/kernel/amd_nb.c index 4266b64631a4..7e331e8f3692 100644 --- a/arch/x86/kernel/amd_nb.c +++ b/arch/x86/kernel/amd_nb.c @@ -36,6 +36,7 @@ #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F4 0x166e #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F4 0x14e4 #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F4 0x14f4 +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F4 0x12fc /* Protect the PCI config register pairs used for SMN. */ static DEFINE_MUTEX(smn_mutex); @@ -79,6 +80,7 @@ static const struct pci_device_id amd_nb_misc_ids[] = { { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M50H_DF_F3) }, { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M60H_DF_F3) }, { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M70H_DF_F3) }, + { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M78H_DF_F3) }, {} }; diff --git a/include/linux/pci_ids.h b/include/linux/pci_ids.h index 45c3d62e616d..95f33dadb2be 100644 --- a/include/linux/pci_ids.h +++ b/include/linux/pci_ids.h @@ -567,6 +567,7 @@ #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F3 0x166d #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F3 0x14e3 #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F3 0x14f3 +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F3 0x12fb #define PCI_DEVICE_ID_AMD_CNB17H_F3 0x1703 #define PCI_DEVICE_ID_AMD_LANCE 0x2000 #define PCI_DEVICE_ID_AMD_LANCE_HOME 0x2001 -- 2.34.1 ^ permalink raw reply related [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 5:33 ` [PATCH 1/2] amd_nb: Add PCI ID " Mario Limonciello @ 2023-04-27 16:46 ` Bjorn Helgaas 2023-04-27 16:50 ` Limonciello, Mario 2023-04-27 16:48 ` Bjorn Helgaas 2023-05-06 14:05 ` Guenter Roeck 2 siblings, 1 reply; 16+ messages in thread From: Bjorn Helgaas @ 2023-04-27 16:46 UTC (permalink / raw) To: Mario Limonciello Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: > s2idle previously worked on this system, but it regressed in kernel > 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN > index 0 for driver probe"). > > The reason for the regression is that before this commit the SMN > communication was hardcoded, but after amd_smn_read() is used which > relies upon the misc PCI ID used by DF function 3 being included in > a table. The ID was missing for model 78h, so this meant that the > amd_smn_read() wouldn't work. > > Add the missing ID into amd_nb, restoring s2idle on this system. Is there a long-term solution for this that will not require adding new IDs every time new hardware comes out? drivers/platform/x86/amd/pmc.c already matches ACPI IDs; maybe there's some way for the platform to provide the information you need via ACPI or something? Bjorn ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 16:46 ` Bjorn Helgaas @ 2023-04-27 16:50 ` Limonciello, Mario 0 siblings, 0 replies; 16+ messages in thread From: Limonciello, Mario @ 2023-04-27 16:50 UTC (permalink / raw) To: Bjorn Helgaas Cc: Bjorn Helgaas, Hans de Goede, S-k, Shyam-sundar, Goswami, Sanket, Gong, Richard, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86@kernel.org, H. Peter Anvin, linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org [Public] > On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: > > s2idle previously worked on this system, but it regressed in kernel > > 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN > > index 0 for driver probe"). > > > > The reason for the regression is that before this commit the SMN > > communication was hardcoded, but after amd_smn_read() is used which > > relies upon the misc PCI ID used by DF function 3 being included in > > a table. The ID was missing for model 78h, so this meant that the > > amd_smn_read() wouldn't work. > > > > Add the missing ID into amd_nb, restoring s2idle on this system. > > Is there a long-term solution for this that will not require adding > new IDs every time new hardware comes out? > > drivers/platform/x86/amd/pmc.c already matches ACPI IDs; maybe there's > some way for the platform to provide the information you need via > ACPI or something? > > Bjorn I had the same question when I came up with this and found out the ACPI tables don't encode the information needed right now. But, yes there are discussions ongoing to make this more scalable in a future generation. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 5:33 ` [PATCH 1/2] amd_nb: Add PCI ID " Mario Limonciello 2023-04-27 16:46 ` Bjorn Helgaas @ 2023-04-27 16:48 ` Bjorn Helgaas 2023-05-08 8:52 ` Borislav Petkov 2023-05-06 14:05 ` Guenter Roeck 2 siblings, 1 reply; 16+ messages in thread From: Bjorn Helgaas @ 2023-04-27 16:48 UTC (permalink / raw) To: Mario Limonciello Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: > s2idle previously worked on this system, but it regressed in kernel > 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN > index 0 for driver probe"). > > The reason for the regression is that before this commit the SMN > communication was hardcoded, but after amd_smn_read() is used which > relies upon the misc PCI ID used by DF function 3 being included in > a table. The ID was missing for model 78h, so this meant that the > amd_smn_read() wouldn't work. > > Add the missing ID into amd_nb, restoring s2idle on this system. > > Fixes: 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for driver probe") > Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> Grudgingly, because this really isn't a maintainable strategy: Acked-by: Bjorn Helgaas <bhelgaas@google.com> # pci_ids.h > --- > arch/x86/kernel/amd_nb.c | 2 ++ > include/linux/pci_ids.h | 1 + > 2 files changed, 3 insertions(+) > > diff --git a/arch/x86/kernel/amd_nb.c b/arch/x86/kernel/amd_nb.c > index 4266b64631a4..7e331e8f3692 100644 > --- a/arch/x86/kernel/amd_nb.c > +++ b/arch/x86/kernel/amd_nb.c > @@ -36,6 +36,7 @@ > #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F4 0x166e > #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F4 0x14e4 > #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F4 0x14f4 > +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F4 0x12fc > > /* Protect the PCI config register pairs used for SMN. */ > static DEFINE_MUTEX(smn_mutex); > @@ -79,6 +80,7 @@ static const struct pci_device_id amd_nb_misc_ids[] = { > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M50H_DF_F3) }, > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M60H_DF_F3) }, > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M70H_DF_F3) }, > + { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M78H_DF_F3) }, > {} > }; > > diff --git a/include/linux/pci_ids.h b/include/linux/pci_ids.h > index 45c3d62e616d..95f33dadb2be 100644 > --- a/include/linux/pci_ids.h > +++ b/include/linux/pci_ids.h > @@ -567,6 +567,7 @@ > #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F3 0x166d > #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F3 0x14e3 > #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F3 0x14f3 > +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F3 0x12fb > #define PCI_DEVICE_ID_AMD_CNB17H_F3 0x1703 > #define PCI_DEVICE_ID_AMD_LANCE 0x2000 > #define PCI_DEVICE_ID_AMD_LANCE_HOME 0x2001 > -- > 2.34.1 > ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 16:48 ` Bjorn Helgaas @ 2023-05-08 8:52 ` Borislav Petkov 0 siblings, 0 replies; 16+ messages in thread From: Borislav Petkov @ 2023-05-08 8:52 UTC (permalink / raw) To: Bjorn Helgaas, Mario Limonciello Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Thu, Apr 27, 2023 at 11:48:16AM -0500, Bjorn Helgaas wrote: > Grudgingly, because this really isn't a maintainable strategy: > > Acked-by: Bjorn Helgaas <bhelgaas@google.com> # pci_ids.h I was wondering whether that define should even go into pci_ids.h but there's a patch 2 which uses it and which is *not* in my mbox. Mario, please CC everybody on all patches in the future. I'll take the k10temp one too so that there's no cross-tree deps. Thx. -- Regards/Gruss, Boris. https://people.kernel.org/tglx/notes-about-netiquette ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-04-27 5:33 ` [PATCH 1/2] amd_nb: Add PCI ID " Mario Limonciello 2023-04-27 16:46 ` Bjorn Helgaas 2023-04-27 16:48 ` Bjorn Helgaas @ 2023-05-06 14:05 ` Guenter Roeck 2023-05-07 12:51 ` Mario Limonciello 2 siblings, 1 reply; 16+ messages in thread From: Guenter Roeck @ 2023-05-06 14:05 UTC (permalink / raw) To: Mario Limonciello Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: > s2idle previously worked on this system, but it regressed in kernel > 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN > index 0 for driver probe"). > > The reason for the regression is that before this commit the SMN > communication was hardcoded, but after amd_smn_read() is used which > relies upon the misc PCI ID used by DF function 3 being included in > a table. The ID was missing for model 78h, so this meant that the > amd_smn_read() wouldn't work. > > Add the missing ID into amd_nb, restoring s2idle on this system. > > Fixes: 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for driver probe") > Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> FWIW: Acked-by: Guenter Roeck <linux@roeck-us.net> Note that this patch is not upstream, meaning the second patch in the series can not be applied either. I am not sure if that is because of "regressed in kernel 6.4" - after all, that kernel does not exist yet. The offending patch _is_ in the upstream kernel, though. It might make sense to inform the regression bot if the problem is not fixed when v6.4-rc1 is made available. Guenter > --- > arch/x86/kernel/amd_nb.c | 2 ++ > include/linux/pci_ids.h | 1 + > 2 files changed, 3 insertions(+) > > diff --git a/arch/x86/kernel/amd_nb.c b/arch/x86/kernel/amd_nb.c > index 4266b64631a4..7e331e8f3692 100644 > --- a/arch/x86/kernel/amd_nb.c > +++ b/arch/x86/kernel/amd_nb.c > @@ -36,6 +36,7 @@ > #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F4 0x166e > #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F4 0x14e4 > #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F4 0x14f4 > +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F4 0x12fc > > /* Protect the PCI config register pairs used for SMN. */ > static DEFINE_MUTEX(smn_mutex); > @@ -79,6 +80,7 @@ static const struct pci_device_id amd_nb_misc_ids[] = { > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M50H_DF_F3) }, > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M60H_DF_F3) }, > { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M70H_DF_F3) }, > + { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M78H_DF_F3) }, > {} > }; > > diff --git a/include/linux/pci_ids.h b/include/linux/pci_ids.h > index 45c3d62e616d..95f33dadb2be 100644 > --- a/include/linux/pci_ids.h > +++ b/include/linux/pci_ids.h > @@ -567,6 +567,7 @@ > #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F3 0x166d > #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F3 0x14e3 > #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F3 0x14f3 > +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F3 0x12fb > #define PCI_DEVICE_ID_AMD_CNB17H_F3 0x1703 > #define PCI_DEVICE_ID_AMD_LANCE 0x2000 > #define PCI_DEVICE_ID_AMD_LANCE_HOME 0x2001 > -- > 2.34.1 > ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-06 14:05 ` Guenter Roeck @ 2023-05-07 12:51 ` Mario Limonciello 2023-05-08 11:13 ` Thorsten Leemhuis 0 siblings, 1 reply; 16+ messages in thread From: Mario Limonciello @ 2023-05-07 12:51 UTC (permalink / raw) To: Guenter Roeck, Linux regressions mailing list Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On 5/6/23 09:05, Guenter Roeck wrote: > On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: >> s2idle previously worked on this system, but it regressed in kernel >> 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN >> index 0 for driver probe"). >> >> The reason for the regression is that before this commit the SMN >> communication was hardcoded, but after amd_smn_read() is used which >> relies upon the misc PCI ID used by DF function 3 being included in >> a table. The ID was missing for model 78h, so this meant that the >> amd_smn_read() wouldn't work. >> >> Add the missing ID into amd_nb, restoring s2idle on this system. >> >> Fixes: 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for driver probe") >> Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> > FWIW: > > Acked-by: Guenter Roeck <linux@roeck-us.net> > > Note that this patch is not upstream, meaning the second patch > in the series can not be applied either. I am not sure if that is > because of "regressed in kernel 6.4" - after all, that kernel does not > exist yet. The offending patch _is_ in the upstream kernel, though. > It might make sense to inform the regression bot if the problem is > not fixed when v6.4-rc1 is made available. You're right; the commit message should: s,but it regressed in kernel 6.4 due,but it regressed in, Boris told me that he's waiting for 6.4-rc1 to pick this series up. #regzbot ^introduced 310e782a99c7 > > Guenter > >> --- >> arch/x86/kernel/amd_nb.c | 2 ++ >> include/linux/pci_ids.h | 1 + >> 2 files changed, 3 insertions(+) >> >> diff --git a/arch/x86/kernel/amd_nb.c b/arch/x86/kernel/amd_nb.c >> index 4266b64631a4..7e331e8f3692 100644 >> --- a/arch/x86/kernel/amd_nb.c >> +++ b/arch/x86/kernel/amd_nb.c >> @@ -36,6 +36,7 @@ >> #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F4 0x166e >> #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F4 0x14e4 >> #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F4 0x14f4 >> +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F4 0x12fc >> >> /* Protect the PCI config register pairs used for SMN. */ >> static DEFINE_MUTEX(smn_mutex); >> @@ -79,6 +80,7 @@ static const struct pci_device_id amd_nb_misc_ids[] = { >> { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M50H_DF_F3) }, >> { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M60H_DF_F3) }, >> { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M70H_DF_F3) }, >> + { PCI_DEVICE(PCI_VENDOR_ID_AMD, PCI_DEVICE_ID_AMD_19H_M78H_DF_F3) }, >> {} >> }; >> >> diff --git a/include/linux/pci_ids.h b/include/linux/pci_ids.h >> index 45c3d62e616d..95f33dadb2be 100644 >> --- a/include/linux/pci_ids.h >> +++ b/include/linux/pci_ids.h >> @@ -567,6 +567,7 @@ >> #define PCI_DEVICE_ID_AMD_19H_M50H_DF_F3 0x166d >> #define PCI_DEVICE_ID_AMD_19H_M60H_DF_F3 0x14e3 >> #define PCI_DEVICE_ID_AMD_19H_M70H_DF_F3 0x14f3 >> +#define PCI_DEVICE_ID_AMD_19H_M78H_DF_F3 0x12fb >> #define PCI_DEVICE_ID_AMD_CNB17H_F3 0x1703 >> #define PCI_DEVICE_ID_AMD_LANCE 0x2000 >> #define PCI_DEVICE_ID_AMD_LANCE_HOME 0x2001 >> -- >> 2.34.1 >> ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-07 12:51 ` Mario Limonciello @ 2023-05-08 11:13 ` Thorsten Leemhuis 2023-05-08 11:25 ` Borislav Petkov 0 siblings, 1 reply; 16+ messages in thread From: Thorsten Leemhuis @ 2023-05-08 11:13 UTC (permalink / raw) To: Mario Limonciello, Guenter Roeck, Linux regressions mailing list Cc: Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Borislav Petkov, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On 07.05.23 14:51, Mario Limonciello wrote: > On 5/6/23 09:05, Guenter Roeck wrote: >> On Thu, Apr 27, 2023 at 12:33:36AM -0500, Mario Limonciello wrote: >>> s2idle previously worked on this system, but it regressed in kernel >>> 6.4 due to commit 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN >>> index 0 for driver probe"). >>> >>> The reason for the regression is that before this commit the SMN >>> communication was hardcoded, but after amd_smn_read() is used which >>> relies upon the misc PCI ID used by DF function 3 being included in >>> a table. The ID was missing for model 78h, so this meant that the >>> amd_smn_read() wouldn't work. >>> >>> Add the missing ID into amd_nb, restoring s2idle on this system. >>> >>> Fixes: 310e782a99c7 ("platform/x86/amd: pmc: Utilize SMN index 0 for >>> driver probe") >>> Signed-off-by: Mario Limonciello <mario.limonciello@amd.com> >> FWIW: >> >> Acked-by: Guenter Roeck <linux@roeck-us.net> >> >> Note that this patch is not upstream, meaning the second patch >> in the series can not be applied either. I am not sure if that is >> because of "regressed in kernel 6.4" - after all, that kernel does not >> exist yet. The offending patch _is_ in the upstream kernel, though. >> It might make sense to inform the regression bot if the problem is >> not fixed when v6.4-rc1 is made available. > > You're right; the commit message should: > > s,but it regressed in kernel 6.4 due,but it regressed in, > > Boris told me that he's waiting for 6.4-rc1 to pick this series up. Which afaics means that users of -rc1 are now affected by this and might waste time bisecting a known issue that could easily have been fixed already. :-/ That doesn't feel right. Or am I missing something? /me wonders I he should start tracking regressions more closely during the merge window to catch and prevent situations like this... > #regzbot ^introduced 310e782a99c7 Thx for adding it. #regzbot fix: 7d8accfaa0ab65e42 Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat) -- Everything you wanna know about Linux kernel regression tracking: https://linux-regtracking.leemhuis.info/about/#tldr If I did something stupid, please tell me, as explained on that page. ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-08 11:13 ` Thorsten Leemhuis @ 2023-05-08 11:25 ` Borislav Petkov 2023-05-08 13:28 ` Guenter Roeck 0 siblings, 1 reply; 16+ messages in thread From: Borislav Petkov @ 2023-05-08 11:25 UTC (permalink / raw) To: Thorsten Leemhuis Cc: Mario Limonciello, Guenter Roeck, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Mon, May 08, 2023 at 01:13:14PM +0200, Thorsten Leemhuis wrote: > Which afaics means that users of -rc1 are now affected by this and might > waste time bisecting a known issue that could easily have been fixed > already. :-/ That doesn't feel right. Or am I missing something? -rc1 is pretty much the most broken tree there is. And it is not an officially released tree but a, well, the first release candidate. So fixes are trickling in, there's lag between what gets found, when the maintainers pick it up and when it ends up upstream and so on and so on. Some fixes need longer testing because there have been cases where a fix breaks something else. And yes, maintainers can always expedite a fix or Linus will pick it up directly if it breaks a lot of boxes in a nasty way. So, long story short, I don't think you should track -rcs. You are tracking the reports already and you're tracking where their fixes land so I guess that's good enough. > /me wonders I he should start tracking regressions more closely during > the merge window to catch and prevent situations like this... I don't see a "situation" here. rcs can be broken for some use cases and that is fine as long as that breakage doesn't get released. Thx. -- Regards/Gruss, Boris. https://people.kernel.org/tglx/notes-about-netiquette ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-08 11:25 ` Borislav Petkov @ 2023-05-08 13:28 ` Guenter Roeck 2023-05-08 13:44 ` Borislav Petkov 0 siblings, 1 reply; 16+ messages in thread From: Guenter Roeck @ 2023-05-08 13:28 UTC (permalink / raw) To: Borislav Petkov Cc: Thorsten Leemhuis, Mario Limonciello, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Mon, May 08, 2023 at 01:25:43PM +0200, Borislav Petkov wrote: > On Mon, May 08, 2023 at 01:13:14PM +0200, Thorsten Leemhuis wrote: > > Which afaics means that users of -rc1 are now affected by this and might > > waste time bisecting a known issue that could easily have been fixed > > already. :-/ That doesn't feel right. Or am I missing something? > > -rc1 is pretty much the most broken tree there is. And it is not an > officially released tree but a, well, the first release candidate. So > fixes are trickling in, there's lag between what gets found, when the > maintainers pick it up and when it ends up upstream and so on and so on. > > Some fixes need longer testing because there have been cases where a fix > breaks something else. > > And yes, maintainers can always expedite a fix or Linus will pick it up > directly if it breaks a lot of boxes in a nasty way. > > So, long story short, I don't think you should track -rcs. You are > tracking the reports already and you're tracking where their fixes land > so I guess that's good enough. > I absolutely disagree. Without Thorsten's tracking, Linus would have no idea what the status of the kernel is. > > /me wonders I he should start tracking regressions more closely during > > the merge window to catch and prevent situations like this... > > I don't see a "situation" here. rcs can be broken for some use cases and > that is fine as long as that breakage doesn't get released. > Again, I disagree. The whole point of testing release candidates is to get problems fixed. If issues are not fixed timely, they just pile up on top of each other and make it difficult to identify new issues (and, in many cases, to test the kernel in the first place). I find it quite annoying that problems are identfied, often even in -next, the patch intoducing them is applied to mainline anyway, and then it sometimes takes until -rc5 or even later to get the fix applied (even if the fix has been known for weeks or even months). It sometimes even takes Linus' intervention to get fixes applied to the upstream kernel. That really should not be necessary. Telling those who track regressions to stay away from release candidates is absolutely the wrong thing to do and would only serve to make release candidates quite pointless. v6.4-rc1 is a good example. Fixes for all build breakages were published before the commit window opened, yet at least one of them did not make it into -rc1. Together with this patch there are now at least two regressions if -rc1 whch could have been avoided and may impact testability on affected systems. Guenter ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-08 13:28 ` Guenter Roeck @ 2023-05-08 13:44 ` Borislav Petkov 2023-05-11 19:51 ` Limonciello, Mario 0 siblings, 1 reply; 16+ messages in thread From: Borislav Petkov @ 2023-05-08 13:44 UTC (permalink / raw) To: Guenter Roeck Cc: Thorsten Leemhuis, Mario Limonciello, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, Shyam Sundar S K, Sanket Goswami, Richard gong, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86, H. Peter Anvin, linux-kernel, linux-pci On Mon, May 08, 2023 at 06:28:03AM -0700, Guenter Roeck wrote: > Together with this patch there are now at least two regressions if > -rc1 whch could have been avoided and may impact testability on > affected systems. Are you saying that this patch which fixes s2idle on some random box should've gone to Linus *immediately*? And read my mail again: "Some fixes need longer testing because there have been cases where a fix breaks something else." So yes, I disagree with rushing fixes immediately. If they're obvious - whatever that means - then sure but not all of them are such. -- Regards/Gruss, Boris. https://people.kernel.org/tglx/notes-about-netiquette ^ permalink raw reply [flat|nested] 16+ messages in thread
* RE: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-08 13:44 ` Borislav Petkov @ 2023-05-11 19:51 ` Limonciello, Mario 2023-05-11 20:09 ` Thorsten Leemhuis ` (2 more replies) 0 siblings, 3 replies; 16+ messages in thread From: Limonciello, Mario @ 2023-05-11 19:51 UTC (permalink / raw) To: Sasha Levin Cc: Thorsten Leemhuis, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, S-k, Shyam-sundar, Goswami, Sanket, Gong, Richard, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86@kernel.org, H. Peter Anvin, linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org, Borislav Petkov, Guenter Roeck, stable@vger.kernel.org [AMD Official Use Only - General] +stable, Sasha > > Together with this patch there are now at least two regressions if > > -rc1 whch could have been avoided and may impact testability on > > affected systems. > > Are you saying that this patch which fixes s2idle on some random box > should've gone to Linus *immediately*? > > And read my mail again: > > "Some fixes need longer testing because there have been cases where > a fix breaks something else." > > So yes, I disagree with rushing fixes immediately. If they're obvious > - whatever that means - then sure but not all of them are such. > > -- Unfortunately, it looks like the broken commit got backported into 6.1.28, but the fix still isn't in Linus' tree. Sasha, Can you please pick up https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?h=x86/urgent&id=23a5b8bb022c1e071ca91b1a9c10f0ad6a0966e9 for 6.1.29 to fix the regression? Thanks, ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-11 19:51 ` Limonciello, Mario @ 2023-05-11 20:09 ` Thorsten Leemhuis 2023-05-11 21:50 ` Sasha Levin 2023-05-15 12:26 ` Greg KH 2 siblings, 0 replies; 16+ messages in thread From: Thorsten Leemhuis @ 2023-05-11 20:09 UTC (permalink / raw) To: Limonciello, Mario, Sasha Levin Cc: Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, S-k, Shyam-sundar, Goswami, Sanket, Gong, Richard, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86@kernel.org, H. Peter Anvin, linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org, Borislav Petkov, Guenter Roeck, stable@vger.kernel.org On 11.05.23 21:51, Limonciello, Mario wrote: > [AMD Official Use Only - General] > > +stable, Sasha > >>> Together with this patch there are now at least two regressions if >>> -rc1 whch could have been avoided and may impact testability on >>> affected systems. >> >> Are you saying that this patch which fixes s2idle on some random box >> should've gone to Linus *immediately*? >> >> And read my mail again: >> >> "Some fixes need longer testing because there have been cases where >> a fix breaks something else." >> >> So yes, I disagree with rushing fixes immediately. If they're obvious >> - whatever that means - then sure but not all of them are such. >> >> -- > > Unfortunately, it looks like the broken commit got backported into 6.1.28, > but the fix still isn't in Linus' tree. > > Sasha, > > Can you please pick up > https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?h=x86/urgent&id=23a5b8bb022c1e071ca91b1a9c10f0ad6a0966e9 > for 6.1.29 to fix the regression? FWIW, the stable team afaics usually does not fix anything in stable trees before it's fixed in mainline. IOW: that fix now quickly needs to go to Linus to get it quickly fixed in 6.1.y. Side note: I'll soon post a rewritten section of 'Prioritize work on fixing regressions' which is part of Documentation/process/handling-regressions.rst. It among others will cover this case: ``` * Whenever you want to swiftly resolve a regression that recently also made it into a proper mainline, stable, or longterm release, fix it quickly in mainline; when appropriate thus involve Linus to fast-track the fix. That's because the stable team normally does neither revert nor fix any changes that cause a regression in mainline, too. ``` Ciao, Thorsten ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-11 19:51 ` Limonciello, Mario 2023-05-11 20:09 ` Thorsten Leemhuis @ 2023-05-11 21:50 ` Sasha Levin 2023-05-15 12:26 ` Greg KH 2 siblings, 0 replies; 16+ messages in thread From: Sasha Levin @ 2023-05-11 21:50 UTC (permalink / raw) To: Limonciello, Mario Cc: Thorsten Leemhuis, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, S-k, Shyam-sundar, Goswami, Sanket, Gong, Richard, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86@kernel.org, H. Peter Anvin, linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org, Borislav Petkov, Guenter Roeck, stable@vger.kernel.org On Thu, May 11, 2023 at 07:51:42PM +0000, Limonciello, Mario wrote: >[AMD Official Use Only - General] > >+stable, Sasha > >> > Together with this patch there are now at least two regressions if >> > -rc1 whch could have been avoided and may impact testability on >> > affected systems. >> >> Are you saying that this patch which fixes s2idle on some random box >> should've gone to Linus *immediately*? >> >> And read my mail again: >> >> "Some fixes need longer testing because there have been cases where >> a fix breaks something else." >> >> So yes, I disagree with rushing fixes immediately. If they're obvious >> - whatever that means - then sure but not all of them are such. >> >> -- > >Unfortunately, it looks like the broken commit got backported into 6.1.28, >but the fix still isn't in Linus' tree. > >Sasha, > >Can you please pick up >https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?h=x86/urgent&id=23a5b8bb022c1e071ca91b1a9c10f0ad6a0966e9 >for 6.1.29 to fix the regression? Happily, once it lands upstream :) -- Thanks, Sasha ^ permalink raw reply [flat|nested] 16+ messages in thread
* Re: [PATCH 1/2] amd_nb: Add PCI ID for family 19h model 78h 2023-05-11 19:51 ` Limonciello, Mario 2023-05-11 20:09 ` Thorsten Leemhuis 2023-05-11 21:50 ` Sasha Levin @ 2023-05-15 12:26 ` Greg KH 2 siblings, 0 replies; 16+ messages in thread From: Greg KH @ 2023-05-15 12:26 UTC (permalink / raw) To: Limonciello, Mario Cc: Sasha Levin, Thorsten Leemhuis, Linux regressions mailing list, Bjorn Helgaas, Hans de Goede, S-k, Shyam-sundar, Goswami, Sanket, Gong, Richard, Thomas Gleixner, Ingo Molnar, Dave Hansen, x86@kernel.org, H. Peter Anvin, linux-kernel@vger.kernel.org, linux-pci@vger.kernel.org, Borislav Petkov, Guenter Roeck, stable@vger.kernel.org On Thu, May 11, 2023 at 07:51:42PM +0000, Limonciello, Mario wrote: > [AMD Official Use Only - General] > > +stable, Sasha > > > > Together with this patch there are now at least two regressions if > > > -rc1 whch could have been avoided and may impact testability on > > > affected systems. > > > > Are you saying that this patch which fixes s2idle on some random box > > should've gone to Linus *immediately*? > > > > And read my mail again: > > > > "Some fixes need longer testing because there have been cases where > > a fix breaks something else." > > > > So yes, I disagree with rushing fixes immediately. If they're obvious > > - whatever that means - then sure but not all of them are such. > > > > -- > > Unfortunately, it looks like the broken commit got backported into 6.1.28, > but the fix still isn't in Linus' tree. > > Sasha, > > Can you please pick up > https://git.kernel.org/pub/scm/linux/kernel/git/tip/tip.git/commit/?h=x86/urgent&id=23a5b8bb022c1e071ca91b1a9c10f0ad6a0966e9 > for 6.1.29 to fix the regression? Now that this is in Linus's tree, it's queued up, thanks. greg k-h ^ permalink raw reply [flat|nested] 16+ messages in thread
end of thread, other threads:[~2023-05-15 12:26 UTC | newest] Thread overview: 16+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2023-04-27 5:33 [PATCH 0/2] Regression in s2idle for family 19h model 78h Mario Limonciello 2023-04-27 5:33 ` [PATCH 1/2] amd_nb: Add PCI ID " Mario Limonciello 2023-04-27 16:46 ` Bjorn Helgaas 2023-04-27 16:50 ` Limonciello, Mario 2023-04-27 16:48 ` Bjorn Helgaas 2023-05-08 8:52 ` Borislav Petkov 2023-05-06 14:05 ` Guenter Roeck 2023-05-07 12:51 ` Mario Limonciello 2023-05-08 11:13 ` Thorsten Leemhuis 2023-05-08 11:25 ` Borislav Petkov 2023-05-08 13:28 ` Guenter Roeck 2023-05-08 13:44 ` Borislav Petkov 2023-05-11 19:51 ` Limonciello, Mario 2023-05-11 20:09 ` Thorsten Leemhuis 2023-05-11 21:50 ` Sasha Levin 2023-05-15 12:26 ` Greg KH
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox