public inbox for linux-scsi@vger.kernel.org
 help / color / mirror / Atom feed
* megaraid_sas driver bug
@ 2016-06-25 17:13 Geza Lucz
  2016-07-01 11:06 ` Kashyap Desai
  0 siblings, 1 reply; 2+ messages in thread
From: Geza Lucz @ 2016-06-25 17:13 UTC (permalink / raw)
  To: linux-scsi

I don’t know if this is the right place to report this and whether this is
more like a Centos kernel issue.

Since Centos 6.8, the megaraid_sas driver seems to be somewhat broken –
which as I understand is a backport of the up-to-date driver to their kernel
version. It is still partially usable, but as soon as I need to rebuild an
array, instead of the scan message at the end I get a bunch of kernel error
messages. There are so many of these that the kernel logger uses too much
CPU and eventually the log fills up the partition, so the server needs to be
rebooted. So technically I lose all hotplug functionality.

Probably the same thing happens when the array degrades. 

I’m sending this report here, because there is absolutely no mention of
similar problems on the internet, yet the problem is fully reproducible and
real.

I’m using Dell Perc 6i at the moment.

Thanks


Jun 25 08:49:29 ns8 kernel: [  757.358015] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.379016] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.400018] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.421021] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.442017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.463015] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.484021] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.505017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:29 ns8 kernel: [  757.526021] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.547017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.568021] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.589017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.610017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.631017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.652017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.673016] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.694017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.715016] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.736017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.757017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.778017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.799017] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.820018] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115
Jun 25 08:49:30 ns8 kernel: [  757.841018] megaraid_sas 0000:02:00.0: DCMD
failed/not supported by firmware: megasas_get_pd_list 4115

^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: megaraid_sas driver bug
  2016-06-25 17:13 megaraid_sas driver bug Geza Lucz
@ 2016-07-01 11:06 ` Kashyap Desai
  0 siblings, 0 replies; 2+ messages in thread
From: Kashyap Desai @ 2016-07-01 11:06 UTC (permalink / raw)
  To: Geza Lucz; +Cc: linux-scsi

On Sat, Jun 25, 2016 at 10:43 PM, Geza Lucz <geza@lucz.com> wrote:
>
> I don’t know if this is the right place to report this and whether this is
> more like a Centos kernel issue.
>
> Since Centos 6.8, the megaraid_sas driver seems to be somewhat broken –
> which as I understand is a backport of the up-to-date driver to their kernel
> version. It is still partially usable, but as soon as I need to rebuild an
> array, instead of the scan message at the end I get a bunch of kernel error
> messages. There are so many of these that the kernel logger uses too much
> CPU and eventually the log fills up the partition, so the server needs to be
> rebooted. So technically I lose all hotplug functionality.
>
> Probably the same thing happens when the array degrades.
>
> I’m sending this report here, because there is absolutely no mention of
> similar problems on the internet, yet the problem is fully reproducible and
> real.
>
> I’m using Dell Perc 6i at the moment.
>
> Thanks


As we discussed, this is regression and found the proposed fix. Can
you try below patch. ?


---
 megaraid_sas_base.c | 8 +++++++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/megaraid_sas_base.c b/megaraid_sas_base.c
index d15e252..bee0895 100755
--- a/megaraid_sas_base.c
+++ b/megaraid_sas_base.c
@@ -5309,6 +5309,12 @@ megasas_get_pd_list(struct megasas_instance *instance)
     struct MR_PD_ADDRESS *pd_addr;
     dma_addr_t ci_h = 0;

+    if (instance->pd_list_not_supported) {
+        dev_info(&instance->pdev->dev, "MR_DCMD_PD_LIST_QUERY "
+            "not supported by firmware\n");
+        return ret;
+    }
+
     cmd = megasas_get_cmd(instance);

     if (!cmd) {
-- 


We have verified on our local setup, but waiting for more testing by
Test team to make sure it is good to post.

Thanks, Kashyap

>
>
> Jun 25 08:49:29 ns8 kernel: [  757.358015] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.379016] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.400018] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.421021] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.442017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.463015] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.484021] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.505017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:29 ns8 kernel: [  757.526021] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.547017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.568021] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.589017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.610017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.631017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.652017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.673016] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.694017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.715016] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.736017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.757017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.778017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.799017] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.820018] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
> Jun 25 08:49:30 ns8 kernel: [  757.841018] megaraid_sas 0000:02:00.0: DCMD
> failed/not supported by firmware: megasas_get_pd_list 4115
--
To unsubscribe from this list: send the line "unsubscribe linux-scsi" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply related	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2016-07-01 11:07 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2016-06-25 17:13 megaraid_sas driver bug Geza Lucz
2016-07-01 11:06 ` Kashyap Desai

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox