From: Hannes Reinecke <hare@suse.de>
To: John Meneghini <jmeneghi@redhat.com>,
"Martin K. Petersen" <martin.petersen@oracle.com>,
pheidologeton@protonmail.com, kernel@roadkil.net,
maokaman@gmail.com
Cc: Sagar.Biradar@microchip.com,
"James E.J. Bottomley" <James.Bottomley@HansenPartnership.com>,
linux-scsi@vger.kernel.org, thenzl@redhat.com,
mpatalan@redhat.com, Scott.Benesh@microchip.com,
Don.Brace@microchip.com, Tom.White@microchip.com,
Abhinav.Kuchibhotla@microchip.com
Subject: Re: [PATCH] [v2]aacraid: Reply queue mapping to CPUs based on IRQ affinity
Date: Mon, 10 Mar 2025 17:44:14 +0100 [thread overview]
Message-ID: <84a87c16-0738-460d-b83f-55f8181d536e@suse.de> (raw)
In-Reply-To: <2eca14e0-3978-440f-a4a4-32c9c61baad4@redhat.com>
On 2/24/25 22:15, John Meneghini wrote:
> On 2/20/25 9:38 PM, Martin K. Petersen wrote:
>>
>> John,
>>
>>> However, I agree it would be better to just fix the driver,
>>> performance impact notwithstanding, and ship it. For my part I'd
>>> rather have a correctly functioning driver, that's slower, but doesn't
>>> panic.
>>
>> I prefer to have a driver that doesn't panic when the user performs a
>> reasonably normal administrative action.
>
> Agreed. The only clarification I want to make is that users will
> not see a panic, they will see IO timeouts and Host bus resets.
> It was my mistake to report earlier that the host would panic.
>
> When aac_cpu_offline_feature is disabled users will see higher performance
> but if they start off-lining CPUS they may see IO timeouts. This is the
> state of the current driver and this is the problem which the original
> patch:
> commit 9dc704dcc09e ("scsi: aacraid: Reply queue mapping to CPUs based
> on IRQ affinity")
> was supposed to have fixed. The problem was the original patch didn't
> fix the
> problem correctly and it resulted in the regression reported in Bugzilla
> 217599[1].
>
> This patch circles back and fixes the original problem correctly. The extra
> 'aac_cpu_offline_feature' modparam was added to disable the new code path
> because of concerns raised during our testing at Red Hat about reduced
> performance with this patch.
>
>> If go-faster stripes are desired in specific configurations, then make
>> the performance mode an opt-in. Based on your benchmarks, however, I'm
>> not entirely convinced it's worth it...
>
> I agree. So how about if we can just take out the
> aac_cpu_offline_feature modparam...?
>
> Alternatively we can replace the modparam with a kConfig option. The
> default setting for the new Kconfig option will be offline_cpu_support_on and
> performance_mode_off. That way we can ship a default kernel configuration that
> provides a working aacraid driver which safely supports off-lining
> CPUS. If people are really unhappy with the performance, and they>
don't care about offline cpu support, they can re-config their kernel.
>
> Personally I prefer option 1, but we the thoughts of the upstream users.
>
> I've added the original authors of Bugzilla 217599[1] to the cc list to
> get their attention and review.
>
Do we have an idea what these 'specific use-cases' are?
And how much performance impact we have?
I could imagine a single-threaded workload driving just one blk-mq queue
would benefit from spreading out onto several interrupts.
But then, this would be true for most of the multiqueue drivers; and
indeed quite some drivers (eg megaraid_sas & mpt3sas
'smp_affinity_enable') have the very same module option.
Wouldn't it be an idea to check if we can make this a generic / blk-mq
queue option instead of having each driver to implement the same
functionality on it's own?
Topic for LSF?
Cheers,
Hannes
--
Dr. Hannes Reinecke Kernel Storage Architect
hare@suse.de +49 911 74053 688
SUSE Software Solutions GmbH, Frankenstr. 146, 90461 Nürnberg
HRB 36809 (AG Nürnberg), GF: I. Totev, A. McDonald, W. Knoblich
next prev parent reply other threads:[~2025-03-10 16:44 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2025-01-30 17:33 [PATCH] [v2]aacraid: Reply queue mapping to CPUs based on IRQ affinity Sagar Biradar
2025-02-10 17:20 ` John Meneghini
2025-02-10 20:24 ` John Meneghini
2025-02-13 2:56 ` Martin K. Petersen
2025-02-13 21:26 ` Sagar.Biradar
[not found] ` <PH7PR11MB7570E9E65153C48BA7C5679EFAFF2@PH7PR11MB7570.namprd11.prod.outlook.com>
2025-02-13 21:31 ` Sagar.Biradar
2025-02-13 22:03 ` John Meneghini
2025-02-13 22:21 ` John Meneghini
2025-02-21 2:38 ` Martin K. Petersen
2025-02-24 21:15 ` John Meneghini
2025-03-10 16:44 ` Hannes Reinecke [this message]
2025-03-11 1:16 ` Martin K. Petersen
2025-03-12 1:52 ` John Meneghini
2025-03-25 0:16 ` Sagar.Biradar
2025-03-25 1:54 ` John Garry
2025-04-17 16:02 ` Sagar.Biradar
2025-04-22 6:42 ` Hannes Reinecke
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=84a87c16-0738-460d-b83f-55f8181d536e@suse.de \
--to=hare@suse.de \
--cc=Abhinav.Kuchibhotla@microchip.com \
--cc=Don.Brace@microchip.com \
--cc=James.Bottomley@HansenPartnership.com \
--cc=Sagar.Biradar@microchip.com \
--cc=Scott.Benesh@microchip.com \
--cc=Tom.White@microchip.com \
--cc=jmeneghi@redhat.com \
--cc=kernel@roadkil.net \
--cc=linux-scsi@vger.kernel.org \
--cc=maokaman@gmail.com \
--cc=martin.petersen@oracle.com \
--cc=mpatalan@redhat.com \
--cc=pheidologeton@protonmail.com \
--cc=thenzl@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox