dri-devel Archive on lore.kernel.org
From: sashiko-bot@kernel.org
To: "Juergen Gross" <jgross@suse.com>
Cc: dri-devel@lists.freedesktop.org
Subject: Re: [PATCH 21/32] agp/nvidia: Stop using 32-bit MSR interfaces
Date: Mon, 29 Jun 2026 06:25:23 +0000
Message-ID: <20260629062527.B615C1F000E9@smtp.kernel.org>
In-Reply-To: <20260629060526.3638272-22-jgross@suse.com>

Thank you for your contribution! Sashiko AI review found 3 potential issues to consider:

Pre-existing issues:
- [High] Memory leak of the ioremap virtual memory area on suspend/resume.
- [High] An ignored initialization error on resume can lead to a NULL pointer dereference in the hardware TLB flush path.
- [High] A missing break in a timeout loop causes a soft lockup if the hardware fails to complete a TLB flush.
--

commit 809ee345f7a00dcaa6e844219a8b666a3f9ffadb
Author: Juergen Gross <jgross@suse.com>

agp/nvidia: Stop using 32-bit MSR interfaces

This commit replaces the deprecated 32-bit MSR interfaces rdmsr() and wrmsr()
with their 64-bit equivalents in the NVIDIA AGP driver. This is part of a
planned removal of the 32-bit variants.
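
For context, the difference between the two interface styles is only in how
the 64-bit MSR value is passed: the 32-bit style carries it as separate low
and high halves, the 64-bit style as one value. The user-space sketch below
shows that arithmetic only. The helper names are hypothetical and none of
this is the kernel's actual implementation.

```c
#include <stdint.h>

/* Hypothetical helper: build one 64-bit value from low/high 32-bit halves. */
static uint64_t sketch_combine_halves(uint32_t lo, uint32_t hi)
{
	return ((uint64_t)hi << 32) | lo;
}

/* Hypothetical helper: split a 64-bit value back into its two halves. */
static void sketch_split_value(uint64_t val, uint32_t *lo, uint32_t *hi)
{
	*lo = (uint32_t)val;		/* bits 31..0 */
	*hi = (uint32_t)(val >> 32);	/* bits 63..32 */
}
```

Code that only ever touched the low half under the old style needs to mask
or cast the 64-bit value explicitly after such a conversion.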

> diff --git a/drivers/char/agp/nvidia-agp.c b/drivers/char/agp/nvidia-agp.c
> index 4787391bb6b4f..3e760bc00afaa 100644
> --- a/drivers/char/agp/nvidia-agp.c
> +++ b/drivers/char/agp/nvidia-agp.c

[ ... ]

[Severity: High]
This isn't a bug introduced by this patch, but I noticed a potential memory
leak on suspend/resume.

Does this code leak the vmap area leading to vmalloc space exhaustion after
repeated suspend/resume cycles?

drivers/char/agp/nvidia-agp.c:nvidia_configure() {
...
	/* map aperture */
	apbase_phys = pci_resource_start(agp_bridge->dev, AGP_APERTURE_BAR);
	nvidia_private.aperture =
		(volatile u32 __iomem *) ioremap(apbase_phys, 33 * PAGE_SIZE);
...
}

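The driver defines no suspend hook:

drivers/char/agp/nvidia-agp.c:
...
static DEFINE_SIMPLE_DEV_PM_OPS(agp_nvidia_pm_ops, NULL, agp_nvidia_resume);
...

so the existing ioremap() mapping is never released with iounmap() when the
system suspends. Each resume then calls nvidia_configure() again, which
overwrites nvidia_private.aperture with a fresh mapping and leaves the
previous one unreachable.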
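
One possible shape of a fix is to release the mapping before it is recreated.
The following is a user-space model only, not a tested driver patch:
sketch_ioremap() and sketch_iounmap() are hypothetical stand-ins that just
count live mappings, and sketch_suspend() is an assumed hook that the driver
does not currently have.

```c
#include <stddef.h>

static char fake_window[1];	/* dummy target for the fake mapping */
static int live_mappings;	/* mappings created and not yet released */
static void *aperture;		/* models nvidia_private.aperture */

/* Hypothetical stand-in for ioremap(): only counts the mapping. */
static void *sketch_ioremap(size_t size)
{
	(void)size;
	live_mappings++;
	return fake_window;
}

/* Hypothetical stand-in for iounmap(). */
static void sketch_iounmap(void *addr)
{
	if (addr)
		live_mappings--;
}

/* Models nvidia_configure(): maps the aperture on every call. */
static int sketch_configure(void)
{
	aperture = sketch_ioremap(33 * 4096);
	return aperture ? 0 : -1;
}

/* Assumed suspend hook: drop the mapping so resume can recreate it. */
static int sketch_suspend(void)
{
	sketch_iounmap(aperture);
	aperture = NULL;
	return 0;
}

/* Count live mappings after n suspend/resume cycles. */
static int sketch_mappings_after_cycles(int n, int with_suspend_hook)
{
	int i;

	live_mappings = 0;
	aperture = NULL;
	if (sketch_configure())		/* initial, probe-time mapping */
		return -1;
	for (i = 0; i < n; i++) {
		if (with_suspend_hook)
			sketch_suspend();
		if (sketch_configure())	/* what resume does today */
			return -1;
	}
	return live_mappings;
}
```

Without the hook the count grows by one per cycle; with it the count stays
at one. Unmapping at the start of the resume path instead would give the same
result in this model.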

[Severity: High]
This is a pre-existing issue, but I spotted a potential NULL pointer
dereference in the hardware TLB flush path.

Could a failure in nvidia_configure() during resume lead to a kernel crash?

drivers/char/agp/nvidia-agp.c:agp_nvidia_resume() {
...
	/* reconfigure AGP hardware again */
	nvidia_configure();

	return 0;
}

The return value of nvidia_configure() is ignored. If ioremap() fails during
resume, nvidia_configure() returns -ENOMEM and nvidia_private.aperture is
left NULL.

Later, when nvidia_tlbflush() is called, it unconditionally dereferences
nvidia_private.aperture:

drivers/char/agp/nvidia-agp.c:nvidia_tlbflush() {
...
	/* flush TLB entries */
	for (i = 0; i < 32 + 1; i++)
		temp = readl(nvidia_private.aperture+(i * PAGE_SIZE / sizeof(u32)));
...
}
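
A minimal user-space model of two possible mitigations: propagate the error
from the resume path, and refuse to touch an unmapped aperture in the flush
path. All names here are hypothetical. In particular, the int return value of
sketch_tlbflush() and the fail_next_map test knob are assumptions made for
the sketch and do not reflect the driver's actual signatures.

```c
#include <errno.h>
#include <stddef.h>

static unsigned int fake_window[4];		/* dummy aperture memory */
static volatile unsigned int *aperture;	/* models nvidia_private.aperture */
static int fail_next_map;			/* test knob: simulate ioremap() failure */

/* Models nvidia_configure(): fails with -ENOMEM when the mapping fails. */
static int sketch_configure(void)
{
	aperture = fail_next_map ? NULL : fake_window;
	if (!aperture)
		return -ENOMEM;
	return 0;
}

/* Resume variant that returns the error instead of an unconditional 0. */
static int sketch_resume(void)
{
	return sketch_configure();
}

/* Flush variant that checks the mapping before reading through it. */
static int sketch_tlbflush(void)
{
	unsigned int temp;

	if (!aperture)
		return -ENODEV;
	temp = aperture[0];	/* models the readl() on the aperture */
	(void)temp;
	return 0;
}
```

Propagating the error lets the PM core see that resume failed; the guard in
the flush path is a second line of defence in case a caller still reaches it.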

[Severity: High]
This isn't a bug introduced by this patch, but the timeout loop in
nvidia_tlbflush() appears to be missing a break statement.

Can this cause a soft lockup if the hardware fails to complete a TLB flush?

drivers/char/agp/nvidia-agp.c:nvidia_tlbflush() {
...
		end = jiffies + 3*HZ;
		do {
			pci_read_config_dword(nvidia_private.dev_1,
					NVIDIA_1_WBC, &wbc_reg);
			if (time_before_eq(end, jiffies)) {
				printk(KERN_ERR PFX
				    "TLB flush took more than 3 seconds.\n");
			}
		} while (wbc_reg & nvidia_private.wbc_mask);
...
}

If the hardware enters an unexpected state and the timeout expires, the code
prints the error message on each further iteration but has no break statement
to exit the loop, which could trap the kernel in an infinite loop.
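
The loop with the break added can be modelled in user space as follows. This
is a sketch under stated assumptions: fake_jiffies, SKETCH_HZ and
sketch_read_wbc() are hypothetical stand-ins for jiffies, HZ and the
pci_read_config_dword() poll, and the -1 return value is an invention of the
sketch.

```c
#include <stdbool.h>

#define SKETCH_HZ 100UL			/* assumed tick rate, stands in for HZ */

static unsigned long fake_jiffies;	/* stands in for jiffies */

/* Same comparison as time_before_eq(a, b): true once b has reached a. */
static bool sketch_time_before_eq(unsigned long a, unsigned long b)
{
	return (long)(b - a) >= 0;
}

/*
 * Stand-in for reading NVIDIA_1_WBC: every poll advances the fake clock by
 * one tick, and the busy bit stays set until tick busy_until.
 */
static unsigned int sketch_read_wbc(unsigned long busy_until)
{
	fake_jiffies++;
	return fake_jiffies < busy_until ? 1u : 0u;
}

/* Returns 0 if the flush completed, -1 if the 3 second budget ran out. */
static int sketch_wait_for_flush(unsigned long busy_until, unsigned int wbc_mask)
{
	unsigned long end;
	unsigned int wbc_reg;
	bool timed_out = false;

	fake_jiffies = 0;
	end = fake_jiffies + 3 * SKETCH_HZ;
	do {
		wbc_reg = sketch_read_wbc(busy_until);
		if (sketch_time_before_eq(end, fake_jiffies)) {
			timed_out = true;
			break;	/* the statement the original loop lacks */
		}
	} while (wbc_reg & wbc_mask);

	return timed_out ? -1 : 0;
}
```

With the break in place the loop ends after at most 3 * SKETCH_HZ polls in
this model, whether or not the busy bit ever clears.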

-- 
Sashiko AI review · https://sashiko.dev/#/patchset/20260629060526.3638272-1-jgross@suse.com?part=21

Thread overview: 17+ messages
2026-06-29  6:04 [PATCH 00/32] x86/msr: Drop 32-bit MSR interfaces Juergen Gross
2026-06-29  6:05 ` [PATCH 21/32] agp/nvidia: Stop using " Juergen Gross
2026-06-29  6:25   ` sashiko-bot [this message]
2026-06-29  6:05 ` [PATCH 22/32] fbdev/geode: " Juergen Gross
2026-06-29  6:05 ` [PATCH 31/32] treewide: convert rdmsrq() from a macro to an inline function Juergen Gross
2026-06-29  6:24   ` sashiko-bot
2026-06-29  6:52 ` [PATCH 00/32] x86/msr: Drop 32-bit MSR interfaces Arnd Bergmann
2026-06-29  7:01   ` Jürgen Groß
2026-06-29  8:06     ` Arnd Bergmann
2026-06-29  8:15       ` Jürgen Groß
2026-06-29  8:38         ` Arnd Bergmann
2026-06-30 20:06           ` H. Peter Anvin
2026-06-29 11:19       ` Ingo Molnar
2026-06-30 18:59         ` Sean Christopherson
2026-07-01  8:33           ` Jürgen Groß
2026-07-02 10:07           ` Ingo Molnar
2026-07-02 11:03             ` Juergen Gross
