From: Madhavan Srinivasan <maddy@linux.vnet.ibm.com>
To: Michael Ellerman <mpe@ellerman.id.au>,
Sachin Sant <sachinp@linux.vnet.ibm.com>,
linuxppc-dev@ozlabs.org
Subject: Re: kselftest:lost_exception_test failure with 4.11.0-rc5
Date: Mon, 10 Apr 2017 09:24:28 +0530 [thread overview]
Message-ID: <6ffc8fd9-63f9-b1b0-864c-3ab546cb1d5f@linux.vnet.ibm.com> (raw)
In-Reply-To: <87r314gzt1.fsf@concordia.ellerman.id.au>
On Friday 07 April 2017 06:06 PM, Michael Ellerman wrote:
> Sachin Sant <sachinp@linux.vnet.ibm.com> writes:
>
>> I have run into few instances where the lost_exception_test from
>> powerpc kselftest fails with SIGABRT. Following o/p is against
>> 4.11.0-rc5. The failure is intermittent.
> What hardware are you on?
>
> How long does it take to run when it fails? I assume ~2 minutes?
Started a run in power8 host (habanero) and it is more than 24hrs and
havent failed yet. So this should be guest/VM scenario then?
>
>> When the test fails it is killed due to SIGABRT.
>> # ./lost_exception_test
>> test: lost_exception
>> tags: git_version:unknown
>> Binding to cpu 8
>> main test running as pid 9208
>> EBB Handler is at 0x10003dcc
>> !! killing lost_exception
> This is the parent (test harness saying) it's about to kill the child,
> because it took too long.
>
> It sends SIGTERM, but the child catches that, prints all this info, and
> then aborts() - so that's why you're seeing SIGABRT.
>
>> ebb_state):
>> ebb_count = 191529
> The test usually runs until it's taken 1,000,000 EBBs, so it looks like
> we got stuck.
>
>> spurious = 0
>> negative = 0
>> no_overflow = 0
>> pmc[1] count = 0x0
>> pmc[2] count = 0x0
>> pmc[3] count = 0x0
>> pmc[4] count = 0x4c1b707
> We use a varying sample period of between 400 and 600, and from above
> we've taken 191,529 EBBs.
>
> 0x4c1b707 / 191,529 ~= 416
>
> So that looks reasonable.
>
>> pmc[5] count = 0x0
>> pmc[6] count = 0x0
>> HW state:
>> MMCR0 0x0000000080000080 FC PMAO
> But this says we're stopped with counters frozen and an event pending.
>
>> MMCR2 0x0000000000000000
>> EBBHR 0x0000000010003dcc
>> BESCR 0x8000000100000000 GE PMAE
> And that says we have global enable set and events enabled.
>
>
> So I think there is a bug here somewhere. I don't really have time to
> dig into it now, neither does Maddy I think. But we should try and get
> to it at some point.
>
> cheers
>
next prev parent reply other threads:[~2017-04-10 3:55 UTC|newest]
Thread overview: 5+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-04-07 8:05 kselftest:lost_exception_test failure with 4.11.0-rc5 Sachin Sant
2017-04-07 12:36 ` Michael Ellerman
2017-04-10 3:54 ` Madhavan Srinivasan [this message]
2017-04-11 10:05 ` Michael Ellerman
2017-04-10 8:00 ` Sachin Sant
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=6ffc8fd9-63f9-b1b0-864c-3ab546cb1d5f@linux.vnet.ibm.com \
--to=maddy@linux.vnet.ibm.com \
--cc=linuxppc-dev@ozlabs.org \
--cc=mpe@ellerman.id.au \
--cc=sachinp@linux.vnet.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).