* aacraid: kernel: AAC: Host adapter dead -1 (bisected)
@ 2017-01-15 11:05 Arkadiusz Miskiewicz
0 siblings, 0 replies; 5+ messages in thread
From: Arkadiusz Miskiewicz @ 2017-01-15 11:05 UTC (permalink / raw)
To: linux-kernel
Cc: Raghava Aditya Renukunta, Johannes Thumshirn, Martin K. Petersen,
Adaptec OEM Raid Solutions
Hi.
There is a bug with handling of adaptec raid cards (in my case it is Adaptec
3405) where kernel logs hundreds of "AAC: Host adapter dead -1" messages.
Bug was reported previously on lkml but there was no progres in solving it.
There is also bugzilla entry:
https://bugzilla.kernel.org/show_bug.cgi?id=151661
I've bisected that to commit bellow and indeed, reverting it from kernel 4.9.3
makes messages go away.
Could anyone at microsemi look at this regression?
Thanks
commit 78cbccd3bd683c295a44af8050797dc4a41376ff
Author: Raghava Aditya Renukunta <RaghavaAditya.Renukunta@microsemi.com>
Date: Mon Apr 25 23:32:37 2016 -0700
aacraid: Fix for KDUMP driver hang
When KDUMP is triggered the driver first talks to the firmware in INTX
mode, but the adapter firmware is still in MSIX mode. Therefore the first
driver command hangs since the driver is waiting for an INTX response and
firmware gives a MSIX response. If when the OS is installed on a RAID
drive created by the adapter KDUMP will hang since the driver does not
receive a response in sync mode.
Fixed by: Change the firmware to INTX mode if it is in MSIX mode before
sending the first sync command.
Cc: stable@vger.kernel.org
Signed-off-by: Raghava Aditya Renukunta
<RaghavaAditya.Renukunta@microsemi.com>
Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de>
Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
my hardware:
02:0e.0 RAID bus controller [0104]: Adaptec AAC-RAID [9005:0285]
Subsystem: Adaptec 3405 [9005:02bb]
Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping+ SERR+ FastB2B- DisINTx-
Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 32 (250ns min, 250ns max), Cache Line Size: 4 bytes
Interrupt: pin A routed to IRQ 18
Region 0: Memory at fba00000 (64-bit, non-prefetchable) [size=2M]
[virtual] Expansion ROM at fbc00000 [disabled] [size=256K]
Capabilities: [c0] Power Management version 2
Flags: PMEClk- DSI- D1+ D2- AuxCurrent=0mA
PME(D0-,D1-,D2-,D3hot-,D3cold-)
Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
Capabilities: [d0] MSI: Enable- Count=1/2 Maskable- 64bit+
Address: 0000000000000000 Data: 0000
Capabilities: [e0] PCI-X non-bridge device
Command: DPERE- ERO- RBC=512 OST=4
Status: Dev=02:0e.0 64bit+ 133MHz+ SCD- USC- DC=bridge
DMMRBC=1024 DMOST=4 DMCRS=16 RSCEM- 266MHz- 533MHz-
Kernel driver in use: aacraid
Kernel modules: aacraid
[ 1.956009] Adaptec aacraid driver 1.2-1[41066]-ms
[ 2.164584] AAC0: kernel 5.2-0[17342] Aug 4 2010
[ 2.164633] AAC0: monitor 5.2-0[17342]
[ 2.164676] AAC0: bios 5.2-0[17342]
[ 2.164719] AAC0: serial 7C46114103A
[ 2.164761] AAC0: Non-DASD support enabled.
[ 2.164804] AAC0: 64bit support enabled.
[ 2.164846] AAC0: 64 Bit DAC enabled
[ 2.177929] scsi host6: aacraid
--
Arkadiusz Miśkiewicz, arekm / ( maven.pl | pld-linux.org )
^ permalink raw reply [flat|nested] 5+ messages in thread
* RE: aacraid: kernel: AAC: Host adapter dead -1 (bisected)
[not found] <201701151205.37563.a.miskiewicz@gmail.com>
@ 2017-01-17 18:23 ` Dave Carroll
2017-01-17 18:31 ` Arkadiusz Miskiewicz
0 siblings, 1 reply; 5+ messages in thread
From: Dave Carroll @ 2017-01-17 18:23 UTC (permalink / raw)
To: arekm@maven.pl, linux-kernel@vger.kerne.org
Cc: Raghava Aditya Renukunta, Johannes Thumshirn, Martin K. Petersen,
dl-esc-Aacraid Linux Driver, linux-scsi@vger.kernel.org
>
>
> Hi.
>
> There is a bug with handling of adaptec raid cards (in my case it is Adaptec
> 3405) where kernel logs hundreds of "AAC: Host adapter dead -1"
> messages.
>
> Bug was reported previously on lkml but there was no progres in solving it.
>
> There is also bugzilla entry:
> https://bugzilla.kernel.org/show_bug.cgi?id=151661
>
> I've bisected that to commit bellow and indeed, reverting it from kernel
> 4.9.3 makes messages go away.
>
> Could anyone at microsemi look at this regression?
>
> Thanks
Hi Arkadiusz,
Thanks for your effort in determining the cause of the issue. It makes sense now that the patch should have been included in controller specific code, rather than common code.
I will prepare a patch for this, and if you are willing to test it, that would be great!
Thanks, -Dave
>
> commit 78cbccd3bd683c295a44af8050797dc4a41376ff
> Author: Raghava Aditya Renukunta
> <RaghavaAditya.Renukunta@microsemi.com>
> Date: Mon Apr 25 23:32:37 2016 -0700
>
> aacraid: Fix for KDUMP driver hang
>
> When KDUMP is triggered the driver first talks to the firmware in INTX
> mode, but the adapter firmware is still in MSIX mode. Therefore the
> first
> driver command hangs since the driver is waiting for an INTX response
> and
> firmware gives a MSIX response. If when the OS is installed on a RAID
> drive created by the adapter KDUMP will hang since the driver does not
> receive a response in sync mode.
>
> Fixed by: Change the firmware to INTX mode if it is in MSIX mode before
> sending the first sync command.
>
> Cc: stable@vger.kernel.org
> Signed-off-by: Raghava Aditya Renukunta
> <RaghavaAditya.Renukunta@microsemi.com>
> Reviewed-by: Johannes Thumshirn <jthumshirn@suse.de>
> Signed-off-by: Martin K. Petersen <martin.petersen@oracle.com>
>
> my hardware:
> 02:0e.0 RAID bus controller [0104]: Adaptec AAC-RAID [9005:0285]
> Subsystem: Adaptec 3405 [9005:02bb]
> Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
> ParErr-
> Stepping+ SERR+ FastB2B- DisINTx-
> Status: Cap+ 66MHz+ UDF- FastB2B- ParErr- DEVSEL=medium >TAbort-
> <TAbort- <MAbort- >SERR- <PERR- INTx-
> Latency: 32 (250ns min, 250ns max), Cache Line Size: 4 bytes
> Interrupt: pin A routed to IRQ 18
> Region 0: Memory at fba00000 (64-bit, non-prefetchable) [size=2M]
> [virtual] Expansion ROM at fbc00000 [disabled] [size=256K]
> Capabilities: [c0] Power Management version 2
> Flags: PMEClk- DSI- D1+ D2- AuxCurrent=0mA
> PME(D0-,D1-,D2-,D3hot-,D3cold-)
> Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME-
> Capabilities: [d0] MSI: Enable- Count=1/2 Maskable- 64bit+
> Address: 0000000000000000 Data: 0000
> Capabilities: [e0] PCI-X non-bridge device
> Command: DPERE- ERO- RBC=512 OST=4
> Status: Dev=02:0e.0 64bit+ 133MHz+ SCD- USC- DC=bridge
> DMMRBC=1024 DMOST=4 DMCRS=16 RSCEM- 266MHz- 533MHz-
> Kernel driver in use: aacraid
> Kernel modules: aacraid
>
> [ 1.956009] Adaptec aacraid driver 1.2-1[41066]-ms
> [ 2.164584] AAC0: kernel 5.2-0[17342] Aug 4 2010
> [ 2.164633] AAC0: monitor 5.2-0[17342]
> [ 2.164676] AAC0: bios 5.2-0[17342]
> [ 2.164719] AAC0: serial 7C46114103A
> [ 2.164761] AAC0: Non-DASD support enabled.
> [ 2.164804] AAC0: 64bit support enabled.
> [ 2.164846] AAC0: 64 Bit DAC enabled
> [ 2.177929] scsi host6: aacraid
>
>
> --
> Arkadiusz Miśkiewicz, arekm / ( maven.pl | pld-linux.org )
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: aacraid: kernel: AAC: Host adapter dead -1 (bisected)
2017-01-17 18:23 ` aacraid: kernel: AAC: Host adapter dead -1 (bisected) Dave Carroll
@ 2017-01-17 18:31 ` Arkadiusz Miskiewicz
0 siblings, 0 replies; 5+ messages in thread
From: Arkadiusz Miskiewicz @ 2017-01-17 18:31 UTC (permalink / raw)
To: Dave Carroll
Cc: linux-kernel@vger.kernel.org, Raghava Aditya Renukunta,
Johannes Thumshirn, Martin K. Petersen,
dl-esc-Aacraid Linux Driver, linux-scsi@vger.kernel.org
On Tuesday 17 of January 2017, Dave Carroll wrote:
> > Hi.
> >
> > There is a bug with handling of adaptec raid cards (in my case it is
> > Adaptec 3405) where kernel logs hundreds of "AAC: Host adapter dead -1"
> > messages.
> >
> > Bug was reported previously on lkml but there was no progres in solving
> > it.
> >
> > There is also bugzilla entry:
> > https://bugzilla.kernel.org/show_bug.cgi?id=151661
> >
> > I've bisected that to commit bellow and indeed, reverting it from kernel
> > 4.9.3 makes messages go away.
> >
> > Could anyone at microsemi look at this regression?
> >
> > Thanks
>
> Hi Arkadiusz,
>
> Thanks for your effort in determining the cause of the issue. It makes
> sense now that the patch should have been included in controller specific
> code, rather than common code.
>
> I will prepare a patch for this, and if you are willing to test it, that
> would be great!
Great!
I have dedicated machine for testing this, so yes - I'll test.
--
Arkadiusz Miśkiewicz, arekm / ( maven.pl | pld-linux.org )
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: aacraid: kernel: AAC: Host adapter dead -1 (bisected)
[not found] ` <20170210102447.GA15183@kroah.com>
@ 2017-02-10 10:45 ` Andrey Melnikov
2017-02-10 10:47 ` Greg Kroah-Hartman
0 siblings, 1 reply; 5+ messages in thread
From: Andrey Melnikov @ 2017-02-10 10:45 UTC (permalink / raw)
To: Greg Kroah-Hartman
Cc: stable, linux-kernel, arekm, linux-scsi, Raghava Aditya Renukunta
Cc: linux-scsi@vger.kernel.org
2017-02-10 13:24 GMT+03:00 Greg Kroah-Hartman <gregkh@linuxfoundation.org>:
> On Fri, Feb 10, 2017 at 02:25:26AM +0300, Andrey Jr. Melnikov wrote:
>> In article <201701151205.37563.a.miskiewicz@gmail.com> you wrote:
>> > Newsgroups: gmane.linux.kernel
>>
>>
>> > Hi.
>>
>> > There is a bug with handling of adaptec raid cards (in my case it is Adaptec
>> > 3405) where kernel logs hundreds of "AAC: Host adapter dead -1" messages.
>>
>> > Bug was reported previously on lkml but there was no progres in solving it.
>>
>> > There is also bugzilla entry:
>> > https://bugzilla.kernel.org/show_bug.cgi?id=151661
>>
>> > I've bisected that to commit bellow and indeed, reverting it from kernel 4.9.3
>> > makes messages go away.
>>
>>
>> Don't try to switch Adaptec 3405/3805 RAID cards to MSI-X interrupt mode.
>> Fix https://bugzilla.kernel.org/show_bug.cgi?id=151661
>>
>> Signed-off-by: Andrey Jr. Melnikov <temnota.am@gmail.com>
>>
>> ---
>>
>> diff --git a/drivers/scsi/aacraid/aacraid.h b/drivers/scsi/aacraid/aacraid.h
>> index 969c312de1be..2ad8403dea40 100644
>> --- a/drivers/scsi/aacraid/aacraid.h
>> +++ b/drivers/scsi/aacraid/aacraid.h
>
> <snip>
>
> Why are you sending this to me and not the scsi developers who can
> actually do something with this patch?
Bug in bugzilla open half year ago, microsemi maintainer slowly read
his fine docs about his hardware, broken driver fills our log with
useless messages every 10 seconds.
So, make decision - apply this patch to stable 4.9.x/4.4.x tree or
revert commit 78cbccd3bd683c295a44af8050797dc4a41376ff from it.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: aacraid: kernel: AAC: Host adapter dead -1 (bisected)
2017-02-10 10:45 ` Andrey Melnikov
@ 2017-02-10 10:47 ` Greg Kroah-Hartman
0 siblings, 0 replies; 5+ messages in thread
From: Greg Kroah-Hartman @ 2017-02-10 10:47 UTC (permalink / raw)
To: Andrey Melnikov
Cc: stable, linux-kernel, arekm, linux-scsi, Raghava Aditya Renukunta
On Fri, Feb 10, 2017 at 01:45:06PM +0300, Andrey Melnikov wrote:
> Cc: linux-scsi@vger.kernel.org
>
> 2017-02-10 13:24 GMT+03:00 Greg Kroah-Hartman <gregkh@linuxfoundation.org>:
> > On Fri, Feb 10, 2017 at 02:25:26AM +0300, Andrey Jr. Melnikov wrote:
> >> In article <201701151205.37563.a.miskiewicz@gmail.com> you wrote:
> >> > Newsgroups: gmane.linux.kernel
> >>
> >>
> >> > Hi.
> >>
> >> > There is a bug with handling of adaptec raid cards (in my case it is Adaptec
> >> > 3405) where kernel logs hundreds of "AAC: Host adapter dead -1" messages.
> >>
> >> > Bug was reported previously on lkml but there was no progres in solving it.
> >>
> >> > There is also bugzilla entry:
> >> > https://bugzilla.kernel.org/show_bug.cgi?id=151661
> >>
> >> > I've bisected that to commit bellow and indeed, reverting it from kernel 4.9.3
> >> > makes messages go away.
> >>
> >>
> >> Don't try to switch Adaptec 3405/3805 RAID cards to MSI-X interrupt mode.
> >> Fix https://bugzilla.kernel.org/show_bug.cgi?id=151661
> >>
> >> Signed-off-by: Andrey Jr. Melnikov <temnota.am@gmail.com>
> >>
> >> ---
> >>
> >> diff --git a/drivers/scsi/aacraid/aacraid.h b/drivers/scsi/aacraid/aacraid.h
> >> index 969c312de1be..2ad8403dea40 100644
> >> --- a/drivers/scsi/aacraid/aacraid.h
> >> +++ b/drivers/scsi/aacraid/aacraid.h
> >
> > <snip>
> >
> > Why are you sending this to me and not the scsi developers who can
> > actually do something with this patch?
>
> Bug in bugzilla open half year ago, microsemi maintainer slowly read
> his fine docs about his hardware, broken driver fills our log with
> useless messages every 10 seconds.
> So, make decision - apply this patch to stable 4.9.x/4.4.x tree or
> revert commit 78cbccd3bd683c295a44af8050797dc4a41376ff from it.
I don't understand, that's not how the stable kernels work, please read
Documentation/stable_kernel_rules.txt for how the process works. Please
get a patch accepted into Linus's tree and then we will be glad to apply
it to the stable kernel trees.
thanks,
greg k-h
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-02-10 10:47 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <201701151205.37563.a.miskiewicz@gmail.com>
2017-01-17 18:23 ` aacraid: kernel: AAC: Host adapter dead -1 (bisected) Dave Carroll
2017-01-17 18:31 ` Arkadiusz Miskiewicz
[not found] <589cfa66.rU2EDFCW1W2Xv421%temnota.am@gmail.com>
[not found] ` <20170210102447.GA15183@kroah.com>
2017-02-10 10:45 ` Andrey Melnikov
2017-02-10 10:47 ` Greg Kroah-Hartman
2017-01-15 11:05 Arkadiusz Miskiewicz
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).