All of lore.kernel.org
 help / color / mirror / Atom feed
From: Vivek Goyal <vgoyal@in.ibm.com>
To: "Salyzyn, Mark" <mark_salyzyn@adaptec.com>
Cc: James Bottomley <James.Bottomley@SteelEye.com>,
	Kexec Mailing List <kexec@lists.infradead.org>,
	Judith Lebzelter <judith@linux-foundation.org>,
	linux-scsi@vger.kernel.org, "Darrick J. Wong" <djwong@us.ibm.com>
Subject: Re: [PATCH] aacraid: fails to initialize after a  kexec operation
Date: Mon, 30 Apr 2007 15:23:34 +0530	[thread overview]
Message-ID: <20070430095334.GA17186@in.ibm.com> (raw)
In-Reply-To: <AE4F746F2AECFC4DA4AADD66A1DFEF0188AF97@otce2k301.adaptec.com>

On Tue, Apr 24, 2007 at 09:21:35AM -0400, Salyzyn, Mark wrote:
> The system BIOS sets up the card's PCI configuration and there is code
> in the kernel that is capable of picking up some of the BIOS'
> information from the BIOS Data Space (not sure if it is actively
> collected in your configuration, you need a kernel flag to pick this
> up). On kexec this BIOS Data Space information is missing (?) and if
> there was any reconfiguration of the PCI space going on (I think only
> the Linux BIOS project does this), kexec will inherit it. This issue
> strikes me as a corrupted PCI configuration inherited in the kexec case,
> such corrupted PCI configurations could be a motherboard specific issue
> and can be related to the BIOS' initial setup for the initial kernel. At
> least that is my thought process in questioning the motherboard BIOS or
> hardware.
> 
> Another possibility is that after you have patched over the interrupt
> routing issues (a PCI configuration problem), the card has a foreign
> array, and the reset and reconfiguration is taking arrays offline. Add
> 'aacraid.commit=1' to force the foreign arrays to be accepted by the
> card.
> 

Hi Mark,

So aacraid.commit=1 and irqpoll combination has done the trick. I can
kexec/kdump into second kernel. I am using an IBM x366 series machine.
There is one array and three disks behind it.

Now few queries.

- What is the concept of foreign arrays? 
- Should we pass aacraid.commit=1 all the time or this is only for
  some special cases? What's the point in resetting an adapter if it
  does not online the array it is managing?
- For kexec, it calls the device shutdown routine (aac_shutdown) in this
  case. If this is the case for normal kexec (not kdump) adapter should
  not be reset?
- Still needs to be found out why PCI configuration is getting corrupted
  and why irq routing is not proper and irqpoll is required.
 
Thanks
Vivek

_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec

WARNING: multiple messages have this Message-ID (diff)
From: Vivek Goyal <vgoyal@in.ibm.com>
To: "Salyzyn, Mark" <mark_salyzyn@adaptec.com>
Cc: James Bottomley <James.Bottomley@SteelEye.com>,
	Kexec Mailing List <kexec@lists.infradead.org>,
	Judith Lebzelter <judith@linux-foundation.org>,
	linux-scsi@vger.kernel.org, "Darrick J. Wong" <djwong@us.ibm.com>
Subject: Re: [PATCH] aacraid: fails to initialize after a  kexec operation
Date: Mon, 30 Apr 2007 15:23:34 +0530	[thread overview]
Message-ID: <20070430095334.GA17186@in.ibm.com> (raw)
In-Reply-To: <AE4F746F2AECFC4DA4AADD66A1DFEF0188AF97@otce2k301.adaptec.com>

On Tue, Apr 24, 2007 at 09:21:35AM -0400, Salyzyn, Mark wrote:
> The system BIOS sets up the card's PCI configuration and there is code
> in the kernel that is capable of picking up some of the BIOS'
> information from the BIOS Data Space (not sure if it is actively
> collected in your configuration, you need a kernel flag to pick this
> up). On kexec this BIOS Data Space information is missing (?) and if
> there was any reconfiguration of the PCI space going on (I think only
> the Linux BIOS project does this), kexec will inherit it. This issue
> strikes me as a corrupted PCI configuration inherited in the kexec case,
> such corrupted PCI configurations could be a motherboard specific issue
> and can be related to the BIOS' initial setup for the initial kernel. At
> least that is my thought process in questioning the motherboard BIOS or
> hardware.
> 
> Another possibility is that after you have patched over the interrupt
> routing issues (a PCI configuration problem), the card has a foreign
> array, and the reset and reconfiguration is taking arrays offline. Add
> 'aacraid.commit=1' to force the foreign arrays to be accepted by the
> card.
> 

Hi Mark,

So aacraid.commit=1 and irqpoll combination has done the trick. I can
kexec/kdump into second kernel. I am using an IBM x366 series machine.
There is one array and three disks behind it.

Now few queries.

- What is the concept of foreign arrays? 
- Should we pass aacraid.commit=1 all the time or this is only for
  some special cases? What's the point in resetting an adapter if it
  does not online the array it is managing?
- For kexec, it calls the device shutdown routine (aac_shutdown) in this
  case. If this is the case for normal kexec (not kdump) adapter should
  not be reset?
- Still needs to be found out why PCI configuration is getting corrupted
  and why irq routing is not proper and irqpoll is required.
 
Thanks
Vivek

  reply	other threads:[~2007-04-30  9:54 UTC|newest]

Thread overview: 22+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-04-23  7:49 AACRAID fails to initialize after an kexec operation Vivek Goyal
2007-04-23  7:49 ` Vivek Goyal
2007-04-23 13:01 ` Salyzyn, Mark
2007-04-23 13:01   ` Salyzyn, Mark
2007-04-23 13:38   ` [PATCH] aacraid: fails to initialize after a " Salyzyn, Mark
2007-04-23 13:38     ` Salyzyn, Mark
2007-04-23 16:12     ` Vivek Goyal
2007-04-23 16:12       ` Vivek Goyal
2007-04-23 17:20       ` Salyzyn, Mark
2007-04-23 17:20         ` Salyzyn, Mark
2007-04-24  8:44         ` Vivek Goyal
2007-04-24  8:44           ` Vivek Goyal
2007-04-24  9:01           ` Vivek Goyal
2007-04-24  9:01             ` Vivek Goyal
2007-04-24 13:21           ` Salyzyn, Mark
2007-04-24 13:21             ` Salyzyn, Mark
2007-04-30  9:53             ` Vivek Goyal [this message]
2007-04-30  9:53               ` Vivek Goyal
2007-04-30 14:11               ` Salyzyn, Mark
2007-04-30 14:11                 ` Salyzyn, Mark
2007-05-02  4:21                 ` Vivek Goyal
2007-05-02  4:21                   ` Vivek Goyal

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20070430095334.GA17186@in.ibm.com \
    --to=vgoyal@in.ibm.com \
    --cc=James.Bottomley@SteelEye.com \
    --cc=djwong@us.ibm.com \
    --cc=judith@linux-foundation.org \
    --cc=kexec@lists.infradead.org \
    --cc=linux-scsi@vger.kernel.org \
    --cc=mark_salyzyn@adaptec.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.