From: Hans de Goede <hdegoede@redhat.com>
To: Pierre-Louis Bossart <pierre-louis.bossart@linux.intel.com>,
ACPI Devel Mailing List <linux-acpi@vger.kernel.org>
Cc: Mika Westerberg <mika.westerberg@linux.intel.com>,
"Rafael J. Wysocki" <rafael.j.wysocki@intel.com>,
Andy Shevchenko <andriy.shevchenko@linux.intel.com>,
"Rafael J. Wysocki" <rafael@kernel.org>
Subject: Re: ACPI scan regression -> Boot fail on Cherrytrail w/ 5.11-rc3
Date: Fri, 15 Jan 2021 00:34:46 +0100 [thread overview]
Message-ID: <f5dae2aa-ffcb-1af1-726f-2362cb4fe470@redhat.com> (raw)
In-Reply-To: <792c89fd-88f2-b243-50df-21f3be1cc20c@linux.intel.com>
Hi,
On 1/14/21 10:55 PM, Pierre-Louis Bossart wrote:
> Hi,
> My primary test device for SOF on Cherrytrail no longer boots with v5.11-rc3 and the sof-dev branch, nothing happens after the 'loading initial ramdisk'. It's a 'Zotac' headless device derived from the Cherrytrail FFD design, so likely there are other devices hit by this problem.
>
> A long bisect points to the commit 71da201f38dfb ('ACPI: scan: Defer enumeration of devices with _DEP lists').
>
> Reverting the two commits below solves the boot issue.
>
> I have absolutely no idea what these two patches do, but they sure have a large impact. Please let me know what sort of information or tests might help root-cause this problem.
Heh, I was just about to answer your other (off-list) email about your
CHT test device booting with a suggestion that you should try reverting that
exact commit as it is the only commit that I'm aware of which went into 5.11
which might cause this...
So I just boot 5.11-rc3 on a Acer Aspire Switch 10E SW3-016 (x5-Z8300 CHT
based) myself and that booted fine.`
Next I tried a MINIX NEO Z83-4 (x5-Z8300) which is a Mini PC and as such
probably the closest to the Zotac box which you are using which I have
at hand to test on, and I can somewhat reproduce it there.
It seems that the new code somehow causes us to hit a race somewhere, so
the NEO Z83-4 will boot most of the times but not always, it get past
the loading initrd phase for me and then it threw the following error
and after that the boot hung (waiting for the rootfs to show up)
platform device 80860F14: Resources present before probing
As I already told Rafael in a previous email, I did see something
similar when my personal tree was still 5.10 based, with the ACPI
scan rework patches cherry-picked for testing. In that case I got
a backtrace (followed by a hang) during boot about a kernel NULL
pointer deref triggered by sysfs_seq_file_read or some such. But
this problem went away with 5.11-rc1, so I stopped looking into
it. I do have a tag of my broken 5.10 + cherry-picks tree, so
I should be able to reproduce that issue.
So I see 2 possible theories here:
1. We have 2 probes of the same device racing somehow
2. The struct device memory is getting corrupted somehow.
Pierre-Louis, can you see if the following hack helps? :
--- a/drivers/acpi/scan.c
+++ b/drivers/acpi/scan.c
@@ -1939,7 +1939,6 @@ static acpi_status acpi_bus_check_add(acpi_handle handle, bool check_dep,
/* Bail out if the number of recorded dependencies is not 0. */
if (count > 0) {
acpi_bus_scan_second_pass = true;
- return AE_CTRL_DEPTH;
}
}
@@ -1948,8 +1947,7 @@ static acpi_status acpi_bus_check_add(acpi_handle handle, bool check_dep,
return AE_CTRL_DEPTH;
acpi_scan_init_hotplug(device);
- if (!check_dep)
- acpi_scan_dep_init(device);
+ acpi_scan_dep_init(device);
out:
if (!*adev_p)
And can you collect an acpidump from the device and either send it to me and Rafael
offlist, or upload it somewhere and send us a link ?
Regards,
Hans
next prev parent reply other threads:[~2021-01-14 23:36 UTC|newest]
Thread overview: 20+ messages / expand[flat|nested] mbox.gz Atom feed top
2021-01-14 21:55 ACPI scan regression -> Boot fail on Cherrytrail w/ 5.11-rc3 Pierre-Louis Bossart
2021-01-14 23:34 ` Hans de Goede [this message]
2021-01-15 0:49 ` Pierre-Louis Bossart
2021-01-15 8:54 ` Hans de Goede
2021-01-15 13:38 ` Rafael J. Wysocki
2021-01-15 13:54 ` Rafael J. Wysocki
2021-01-15 14:52 ` Pierre-Louis Bossart
2021-01-15 14:55 ` Rafael J. Wysocki
2021-01-15 15:09 ` Pierre-Louis Bossart
2021-01-15 15:22 ` Rafael J. Wysocki
2021-01-15 15:38 ` Pierre-Louis Bossart
2021-01-15 16:05 ` Hans de Goede
2021-01-15 16:22 ` Rafael J. Wysocki
2021-01-15 16:41 ` Pierre-Louis Bossart
2021-01-15 19:01 ` Rafael J. Wysocki
2021-01-15 19:06 ` Rafael J. Wysocki
2021-01-15 20:48 ` Hans de Goede
2021-01-15 21:57 ` Hans de Goede
[not found] ` <56b732f4-9a24-688e-7cc7-6c2522d173c9@linux.intel.com>
2021-01-16 11:18 ` Hans de Goede
2021-01-16 12:26 ` Hans de Goede
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=f5dae2aa-ffcb-1af1-726f-2362cb4fe470@redhat.com \
--to=hdegoede@redhat.com \
--cc=andriy.shevchenko@linux.intel.com \
--cc=linux-acpi@vger.kernel.org \
--cc=mika.westerberg@linux.intel.com \
--cc=pierre-louis.bossart@linux.intel.com \
--cc=rafael.j.wysocki@intel.com \
--cc=rafael@kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox