public inbox for linux-ia64@vger.kernel.org
 help / color / mirror / Atom feed
From: Keith Owens <kaos@sgi.com>
To: linux-ia64@vger.kernel.org
Subject: Re: testing mca/init patch
Date: Thu, 01 Sep 2005 03:20:46 +0000	[thread overview]
Message-ID: <12725.1125544846@kao2.melbourne.sgi.com> (raw)
In-Reply-To: <200508312343.j7VNhFOZ012157@agluck-lia64.sc.intel.com>

On Wed, 31 Aug 2005 16:43:15 -0700, 
tony.luck@intel.com wrote:
>To make life easier for testers, I've applied Keith's patches
>and put them into my test tree.
>
>I locally applied one insignificant change in ia64_init_handler()
>to print the cpu number and the state of sos->monarch in the
>initial printk in that routine (because there was once a bug where
>Tiger SAL sent all cpus to the registered "master" INIT entry point.
>The output below shows that I have a good SAL that didn't do this).
>I haven't applied this change to my GIT tree.
>
>
>Here's a trimmed down version of the output after I hit the INIT
>button with some comments by me about timing inside {} brackets:
>
>{nothing for a few seconds ... felt like more than 5}

That delay is coming from your SAL, nothing I can do about it.

>CPU1: Entered OS INIT handler. PSPÿe301a0 monarch=1
>Delaying for 5 seconds...
>{another delay, perhaps this one was 5 seconds}

Probably 10 seconds.  5 in ia64_init_handler(), then another 5 in
ia64_wait_for_slaves().  See below.

>Processes interrupted by INIT - 0 (cpu 1 task 0xe0000001ffe90000)

Only one cpu was entered for INIT, not good.

>INIT dump complete.  Monarch on cpu 1 returning to normal service.
>{another several second delay}

This is wrong.  The slave INIT handler was not invoked when the monarch
was delivered, instead the slave events were delivered _after_ the
monarch returned to the interrupted context.  It works for me on SGI's
SAL, all the cpus enter INIT at the same time, without any noticeable
delay.  There is no delay nor lockout in the INIT handler code before
it gets to the first printk, so all the delay and out of order
execution has to be coming from your SAL.

>CPU3: Entered OS INIT handler. PSPÿe301a0 monarch=0
>{another several second delay}
>CPU2: Entered OS INIT handler. PSPÿe301a0 monarch=0
>{another several second delay}
>CPU0: Entered OS INIT handler. PSPÿe301a0 monarch=0
>{another several second delay}
>CPU1: Entered OS INIT handler. PSPÿe301a0 monarch=0
>cpu 1, INIT inconsistent r12 and r13, original stack not modified

And why was cpu 1 entered again, this time as a slave and with wrong
registers?  Looks like another SAL error.

>{system hung}

Because all 4 cpus are driven as slaves.  All the slaves are waiting
for the monarch to arrive.  All of the above tells me that the OS code
is working fine, SAL is not.


  parent reply	other threads:[~2005-09-01  3:20 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2005-08-31 23:43 testing mca/init patch tony.luck
2005-09-01  1:38 ` david mosberger
2005-09-01  3:20 ` Keith Owens [this message]
2005-09-01  4:58 ` david mosberger
2005-09-01  5:30 ` Keith Owens
2005-09-01 16:43 ` Luck, Tony
2005-09-01 19:35 ` david mosberger

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=12725.1125544846@kao2.melbourne.sgi.com \
    --to=kaos@sgi.com \
    --cc=linux-ia64@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox