From: Guixin Liu <kanie@linux.alibaba.com>
To: Bjorn Helgaas <helgaas@kernel.org>
Cc: "Bjorn Helgaas" <bhelgaas@google.com>,
"Andy Shevchenko" <andriy.shevchenko@intel.com>,
"Ilpo Järvinen" <ilpo.jarvinen@linux.intel.com>,
linux-pci@vger.kernel.org,
"Xunlei Pang" <xlpang@linux.alibaba.com>,
oliver.yang@linux.alibaba.com
Subject: Re: [PATCH v11 2/2] PCI: Check ROM header and data structure addr before accessing
Date: Wed, 6 May 2026 12:39:36 +0800 [thread overview]
Message-ID: <d843329a-85c3-4b1c-b35b-e506d247d0ab@linux.alibaba.com> (raw)
In-Reply-To: <20260430214615.GA441190@bhelgaas>
在 2026/5/1 05:46, Bjorn Helgaas 写道:
> On Fri, Jan 30, 2026 at 04:07:29PM +0800, Guixin Liu wrote:
>> We meet a crash when running stress-ng on x86_64 machine:
>>
>> BUG: unable to handle page fault for address: ffa0000007f40000
>> RIP: 0010:pci_get_rom_size+0x52/0x220
>> Call Trace:
>> <TASK>
>> pci_map_rom+0x80/0x130
>> pci_read_rom+0x4b/0xe0
>> kernfs_file_read_iter+0x96/0x180
>> vfs_read+0x1b1/0x300
>>
>> Our analysis reveals that the ROM space's start address is
>> 0xffa0000007f30000, and size is 0x10000. Because of broken ROM
>> space, before calling readl(pds), the pds's value is
>> 0xffa0000007f3ffff, which is already pointed to the ROM space
>> end, invoking readl() would read 4 bytes therefore cause an
>> out-of-bounds access and trigger a crash.
>> Fix this by adding image header and data structure checking.
>>
>> We also found another crash on arm64 machine:
>>
>> Unable to handle kernel paging request at virtual address
>> ffff8000dd1393ff
>> Mem abort info:
>> ESR = 0x0000000096000021
>> EC = 0x25: DABT (current EL), IL = 32 bits
>> SET = 0, FnV = 0
>> EA = 0, S1PTW = 0
>> FSC = 0x21: alignment fault
>>
>> The call trace is the same with x86_64, but the crash reason is
>> that the data structure addr is not aligned with 4, and arm64
>> machine report "alignment fault". Fix this by adding alignment
>> checking.
>>
>> Fixes: 47b975d234ea ("PCI: Avoid iterating through memory outside the resource window")
>> Suggested-by: Guanghui Feng <guanghuifeng@linux.alibaba.com>
>> Signed-off-by: Guixin Liu <kanie@linux.alibaba.com>
>> Reviewed-by: Andy Shevchenko <andriy.shevchenko@intel.com>
>> ---
>> drivers/pci/rom.c | 113 ++++++++++++++++++++++++++++++++++++++--------
>> 1 file changed, 95 insertions(+), 18 deletions(-)
>>
>> diff --git a/drivers/pci/rom.c b/drivers/pci/rom.c
>> index 4f7641b93b4b..d8abed669fac 100644
>> --- a/drivers/pci/rom.c
>> +++ b/drivers/pci/rom.c
>> @@ -6,9 +6,12 @@
>> * (C) Copyright 2004 Silicon Graphics, Inc. Jesse Barnes <jbarnes@sgi.com>
>> */
>>
>> +#include <linux/align.h>
>> #include <linux/bits.h>
>> #include <linux/kernel.h>
>> #include <linux/export.h>
>> +#include <linux/io.h>
>> +#include <linux/overflow.h>
>> #include <linux/pci.h>
>> #include <linux/sizes.h>
>> #include <linux/slab.h>
>> @@ -84,6 +87,91 @@ void pci_disable_rom(struct pci_dev *pdev)
>> }
>> EXPORT_SYMBOL_GPL(pci_disable_rom);
>>
>> +static bool pci_rom_is_header_valid(struct pci_dev *pdev,
>> + void __iomem *image,
>> + void __iomem *rom,
>> + size_t size,
>> + bool last_image)
>> +{
>> + unsigned long rom_end = (unsigned long)rom + size - 1;
>> + unsigned long header_end;
>> + u16 signature;
>> +
>> + /*
>> + * Some CPU architectures require IOMEM access addresses to
>> + * be aligned, for example arm64, so since we're about to
>> + * call readw(), we check here for 2-byte alignment.
>> + */
> I think PCI Firmware r3.3, sec 5.1, actually requires 512-byte
> alignment, but I guess we haven't enforced that before. Worth
> mentioning the spec requirement to show that this isn't just an
> arbitrary thing to accommodate a weird CPU architecture.
Yes, it says "Each image must start on a 512-byte boundary and must
contain the PCI Expansion ROM header".
OK, I'll add a 512-byte alignment check in the next version, along
with a comment citing the spec requirement.
>> + if (!IS_ALIGNED((unsigned long)image, 2))
>> + return false;
>> +
>> + if (check_add_overflow((unsigned long)image, PCI_ROM_HEADER_SIZE,
>> + &header_end))
>> + return false;
>> +
>> + if (image < rom || header_end > rom_end)
>> + return false;
> From Sashiko:
>
> Does this correctly handle a ROM structure that fits exactly at the
> end of the window?
>
> Since header_end is calculated exclusively and rom_end inclusively,
> a perfectly sized structure will have header_end equal to rom_end +
> 1, causing the header_end > rom_end check to incorrectly evaluate to
> true and reject the ROM. This same inclusive versus exclusive
> boundary mismatch also happens in pci_rom_is_data_struct_valid()
> when checking the end pointer.
That's right, chagend in v12, thanks.
>> +
>> + /* Standard PCI ROMs start out with these bytes 55 AA */
>> + signature = readw(image);
>> + if (signature == PCI_ROM_IMAGE_SIGNATURE)
>> + return true;
> I think this and pci_rom_is_data_struct_valid() would read better if
> every test was a check for failure instead of having a bunch of
> failure returns, followed by a success return, followed by another
> failure return. E.g.,
>
> if (signature != PCI_ROM_IMAGE_SIGNATURE) {
> if (last_image) {
> ...
> }
> return false;
> }
>
> return true;
I think so, changed in v12, thanks.
>> + if (last_image) {
>> + pci_info(pdev, "Invalid PCI ROM header signature: expecting %#06x, got %#06x\n",
>> + PCI_ROM_IMAGE_SIGNATURE, signature);
>> + } else {
>> + pci_info(pdev, "No more image in the PCI ROM\n");
>> + }
> I'm not completely convinced that it's worth passing in last_image. I
> suppose the reason was to make the messages exactly the same as
> before?
>
> Even in the "!last_image" case, I think it might be worth printing the
> signature we got. The "No more image" message means that the ROM
> format isn't strictly conforming, doesn't it? Maybe the same message
> would suffice for both "last_image" and "!last_image"?
Commit beced88e6af43 ("PCI: Add check code for last image indicator not
set")
added this printing to avoid print "Invalid image",
I thinks it's worth keeping as-is.
>> + return false;
>> +}
>> +
>> +static bool pci_rom_is_data_struct_valid(struct pci_dev *pdev,
>> + void __iomem *pds,
>> + void __iomem *rom,
>> + size_t size)
>> +{
>> + unsigned long rom_end = (unsigned long)rom + size - 1;
>> + unsigned long end;
>> + u32 signature;
>> + u16 data_len;
>> +
>> + /*
>> + * Some CPU architectures require IOMEM access addresses to
>> + * be aligned, for example arm64, so since we're about to
>> + * call readl(), we check here for 4-byte alignment.
>> + */
>> + if (!IS_ALIGNED((unsigned long)pds, 4))
>> + return false;
>> +
>> + /* Before reading length, check addr range. */
>> + if (check_add_overflow((unsigned long)pds, PCI_ROM_DATA_STRUCT_LEN + 1,
>> + &end))
>> + return false;
>> +
>> + if (pds < rom || end > rom_end)
>> + return false;
>> +
>> + data_len = readw(pds + PCI_ROM_DATA_STRUCT_LEN);
>> + if (!data_len || data_len == U16_MAX)
>> + return false;
>> +
>> + if (check_add_overflow((unsigned long)pds, data_len, &end))
>> + return false;
>> +
>> + if (end > rom_end)
>> + return false;
> More from Sashiko:
>
> Does pci_rom_is_data_struct_valid() need to enforce a minimum safe
> size for data_len?
>
> If a malformed device advertises a small data_len (e.g., 12 bytes),
> validation passes here, but the subsequent reads in
> pci_get_rom_size() for PCI_ROM_IMAGE_LEN and
> PCI_ROM_LAST_IMAGE_INDICATOR could access unmapped memory past the
> ROM boundary.
OK, what would be a suitable min value here?
PCI_ROM_LAST_IMAGE_INDICATOR? I think this value needs to work for both
new and legacy devices.
>
>> + signature = readl(pds);
>> + if (signature == PCI_ROM_DATA_STRUCT_SIGNATURE)
>> + return true;
> Seems like it would be nicer to check the signature first, before
> checking the data_len. If the signature is bad, we log a hint about
> what went wrong, but we don't log anything if data_len is bad.
OK, changed in v12.
>> + pci_info(pdev, "Invalid PCI ROM data signature: expecting %#010x, got %#010x\n",
>> + PCI_ROM_DATA_STRUCT_SIGNATURE, signature);
>> + return false;
>> +}
>> +
>> /**
>> * pci_get_rom_size - obtain the actual size of the ROM image
>> * @pdev: target PCI device
>> @@ -99,38 +187,27 @@ static size_t pci_get_rom_size(struct pci_dev *pdev, void __iomem *rom,
>> size_t size)
>> {
>> void __iomem *image;
>> - int last_image;
>> unsigned int length;
>> + bool last_image;
>>
>> image = rom;
>> do {
>> void __iomem *pds;
>> - /* Standard PCI ROMs start out with these bytes 55 AA */
>> - if (readw(image) != PCI_ROM_IMAGE_SIGNATURE) {
>> - pci_info(pdev, "Invalid PCI ROM header signature: expecting %#06x, got %#06x\n",
>> - PCI_ROM_IMAGE_SIGNATURE, readw(image));
>> + if (!pci_rom_is_header_valid(pdev, image, rom, size, true))
>> break;
>> - }
>> +
>> /* get the PCI data structure and check its "PCIR" signature */
>> pds = image + readw(image + PCI_ROM_POINTER_TO_DATA_STRUCT);
>> - if (readl(pds) != PCI_ROM_DATA_STRUCT_SIGNATURE) {
>> - pci_info(pdev, "Invalid PCI ROM data signature: expecting %#010x, got %#010x\n",
>> - PCI_ROM_DATA_STRUCT_SIGNATURE, readl(pds));
>> + if (!pci_rom_is_data_struct_valid(pdev, pds, rom, size))
>> break;
>> - }
>> +
>> last_image = readb(pds + PCI_ROM_LAST_IMAGE_INDICATOR) &
>> PCI_ROM_LAST_IMAGE_INDICATOR_BIT;
>> length = readw(pds + PCI_ROM_IMAGE_LEN);
>> image += length * PCI_ROM_IMAGE_SECTOR_SIZE;
>> - /* Avoid iterating through memory outside the resource window */
>> - if (image >= rom + size)
>> +
>> + if (!pci_rom_is_header_valid(pdev, image, rom, size, last_image))
>> break;
> More from Sashiko. I'm not sure about this one.
>
> Does this log a false-positive warning when processing the final
> image?
>
> When last_image is true, the image pointer is advanced to the end of
> the ROM and passed into pci_rom_is_header_valid().
>
> Because last_image is passed as true to the helper, the signature
> check will fail and log an invalid header signature error for a
> perfectly valid device instead of gracefully finishing the loop.
I rename the "last_image" param of pci_rom_is_header_valid() to
"expect_valid", and add a !last_image check here to print "No more
image" instead of "Invalid sig". Best Regards, Guixin Liu
>> - if (!last_image) {
>> - if (readw(image) != PCI_ROM_IMAGE_SIGNATURE) {
>> - pci_info(pdev, "No more image in the PCI ROM\n");
>> - break;
>> - }
>> - }
>> } while (length && !last_image);
>>
>> /* never return a size larger than the PCI resource window */
>> --
>> 2.32.0.3.g01195cf9f
>>
next prev parent reply other threads:[~2026-05-06 4:39 UTC|newest]
Thread overview: 12+ messages / expand[flat|nested] mbox.gz Atom feed top
2026-01-30 8:07 [PATCH v11 0/2] PCI: Fix crash when access broken ROM Guixin Liu
2026-01-30 8:07 ` [PATCH v11 1/2] PCI: Introduce named defines for PCI ROM Guixin Liu
2026-05-02 16:55 ` Krzysztof Wilczyński
2026-05-06 4:40 ` Guixin Liu
2026-01-30 8:07 ` [PATCH v11 2/2] PCI: Check ROM header and data structure addr before accessing Guixin Liu
2026-04-30 21:46 ` Bjorn Helgaas
2026-05-06 4:39 ` Guixin Liu [this message]
2026-02-09 6:43 ` [PATCH v11 0/2] PCI: Fix crash when access broken ROM Guixin Liu
2026-02-09 17:54 ` Bjorn Helgaas
2026-04-24 6:32 ` Guixin Liu
2026-04-24 8:38 ` Andy Shevchenko
2026-04-30 2:01 ` Guixin Liu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=d843329a-85c3-4b1c-b35b-e506d247d0ab@linux.alibaba.com \
--to=kanie@linux.alibaba.com \
--cc=andriy.shevchenko@intel.com \
--cc=bhelgaas@google.com \
--cc=helgaas@kernel.org \
--cc=ilpo.jarvinen@linux.intel.com \
--cc=linux-pci@vger.kernel.org \
--cc=oliver.yang@linux.alibaba.com \
--cc=xlpang@linux.alibaba.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox