From: Don Zickus <dzickus@redhat.com>
To: Seiji Aguchi <seiji.aguchi@hds.com>
Cc: "Luck, Tony" <tony.luck@intel.com>,
"linux-kernel@vger.kernel.org" <linux-kernel@vger.kernel.org>,
"cbouatmailru@gmail.com" <cbouatmailru@gmail.com>,
"ccross@android.com" <ccross@android.com>,
"keescook@chromium.org" <keescook@chromium.org>,
"dle-develop@lists.sourceforge.net"
<dle-develop@lists.sourceforge.net>,
Satoru Moriya <satoru.moriya@hds.com>
Subject: Re: [RFC][PATCH] pstore: Skip spinlock when just one cpu is online
Date: Mon, 10 Dec 2012 11:48:13 -0500
Message-ID: <20121210164813.GA53431@redhat.com>
In-Reply-To: <A5ED84D3BB3A384992CBB9C77DEDA4D414A1B936@USINDEM103.corp.hds.com>
On Fri, Dec 07, 2012 at 11:43:03PM +0000, Seiji Aguchi wrote:
> > Can all these things really happen (did you run into this problem on a real system?). Or is this just a theoretical problem. Ugly (but
> > practical) hacks might be OK to solve real problems.
>
> It is a theoretical problem right now.
> But it is a timing issue and there is a possibility to happen actually.
>
> > But do we really want them to fix problems that actually never happen?
>
> If we find a problem (even if it is theoretical), we can't say "It actually never happen.".
>
> I have some reasons to submit this patch before reproducing actually.
>
> 1)
> It is too late if we fix a problem after it actually happened in case where we apply Linux, including pstore,
> to mission critical systems, because the failure of those systems has a great impact on a whole society.
> Customers in this area ask us to fix a problem as soon as possible.
> On the other hand, this kind of timing issue is hard to reproduce.
> So, our support service engineers often work all night to reproduce it.
> It is a nightmare for us.
>
> If we can fix it with a small patch in advance, it is really helpful for us.
As I said in the email I just sent, it may not help you without testing,
because there are probably other problems in that untested theoretical
scenario.
>
> 2)
> In the long term, I plan to add a kmsg_dump to a kexec path because kdump may fail in the real world.
> In that case, we need another troubleshooting material like pstore to detect a root cause of failure.
But you are assuming that kmsg_dump is perfect, and it isn't. By putting
kmsg_dump in the kdump path, you may actually be blocking kdump from
working.
I believe that is the biggest holdup keeping those guys from including it.
Cheers,
Don