From: Andrew Morton <akpm@linux-foundation.org>
To: Neil Horman <nhorman@redhat.com>
Cc: vgoyal@redhat.com, nhorman@redhat.com, nickpiggin@yahoo.com.au,
k-miyoshi@cb.jp.nec.com, greg@kroah.com, bwalle@suse.de,
kdb@oss.sgi.com, kexec@lists.infradead.org,
t-nagano@ah.jp.nec.com, linux-kernel@vger.kernel.org,
rdunlap@xenotime.net, ebiederm@xmission.com, kaos@ocs.com.au
Subject: Re: [PATCH 0/2] add new notifier function ,take3
Date: Mon, 14 Apr 2008 12:33:11 -0700 [thread overview]
Message-ID: <20080414123311.01d537b4.akpm@linux-foundation.org> (raw)
In-Reply-To: <20080414160146.GE1193@hmsendeavour.rdu.redhat.com>
On Mon, 14 Apr 2008 12:01:46 -0400
Neil Horman <nhorman@redhat.com> wrote:
> On Mon, Apr 14, 2008 at 10:53:23AM -0400, Vivek Goyal wrote:
> > On Mon, Apr 14, 2008 at 10:42:28AM -0400, Neil Horman wrote:
> > > On Mon, Apr 14, 2008 at 09:46:22AM -0400, Vivek Goyal wrote:
> > > > On Fri, Apr 11, 2008 at 09:07:51PM -0700, Andrew Morton wrote:
> > > >
> > > > [..]
> > > > > > Kernel panic - not syncing: Panic by panic_module.
> > > > > > __tunable_atomic_notifier_call_chain enter
> > > > > > msg_handler:panic_event was called.
> > > > > > ipmi_wdog:wdog_panic_handler was called.
> > > > > > notifier_test: notifier_test_panic() is called.
> > > > > > notifier_test: notifier_test_panic2() is called.
> > > > >
> > > > > OK. But I don't see anywhere in here the most important piece of
> > > > > information: why do we need this feature in Linux?
> > > > >
> > > > > What are the use-cases? What is the value? etc.
> > > > >
> > > > > Often I can guess (but I like the originator to remove the guesswork). In
> > > > > this case I'm stumped - I can't see any reason why anyone would want this.
> > > > >
> > > >
> > > > Hi Andrew,
> > > >
> > > > To begin with, he wants kdb, kgdb etc to co-exist with kdump. He wants
> > > > to put all the RAS tools (who are interested in panic event) on a list
> > > > and export it to user space and let user decide in what order do the tool get
> > > > executed at panic time (based on priority).
> > > >
> > > > This brings in little bit reliability concerns for kdump due to notifier
> > > > code being run after panic.
> > > >
> > > > I think people want to use this infrastrutucure beyond RAS tools. I
> > > > remember somebody wanting to send a message to remote node after a
> > > > panic (before kdump kicks in) so that remote node can initiate failover
> > > > etc.
> > > >
> > > I know it doesn't particularly relate to this patch, but FWIW, for cases like
> > > failover, I've inserted infrastrucutre in the userspace part of kdump for
> > > Fedora/RHEL to support this sort of thing. We can run arbitrary scripts righte
> > > before and after a capture so that notifications can be sent to remote nodes in
> > > a much safer fashion than using the notifier chain after a panic.
> > > Neil
> > >
> >
> > That's great. I did not know about these. So user can write custom
> > scripts/binaries which can be packed into kdump initrd and executed either
> > before or after dump capture? Any idea, if somebody has started using it
> > already?
> >
> Thats exactly right. I'm not sure if there is any serious use as of yet, but
> I've had some interrogatories about it. Specific cases that I recall include:
>
> 1) A set of users in japan that are using the pre-dump script to block execution
> until a scsi controller detects all its drives (it apparently takes up to three
> minues to scan its bus)
>
> 2) I think some people using clustering services were using the pre-script to
> notify cluster peers of the failure to avoid power fencing while a node
> completed the crash dump
>
> 3) A national lab had an interest in using the pre script to send an email to an
> administrative address to log the failure in a cluster
>
OK, thanks.
I think I'll duck the patch for now as it seems that a littlee more thought
and coordination is neeed.
Plus it appears that the only users of this infrastructure are provided via
presently-out-of-tree patches, so people who are already patching and
building their own kernels can easily add this other patch as well, for now.
next prev parent reply other threads:[~2008-04-14 19:34 UTC|newest]
Thread overview: 10+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-04-11 7:53 [PATCH 0/2] add new notifier function ,take3 Takenori Nagano
2008-04-12 4:07 ` Andrew Morton
2008-04-14 13:46 ` Vivek Goyal
2008-04-14 14:42 ` Neil Horman
2008-04-14 14:46 ` Bernhard Walle
2008-04-14 14:53 ` Vivek Goyal
2008-04-14 16:01 ` Neil Horman
2008-04-14 19:33 ` Andrew Morton [this message]
2008-04-17 5:31 ` Takenori Nagano
2008-04-23 12:31 ` Eric W. Biederman
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20080414123311.01d537b4.akpm@linux-foundation.org \
--to=akpm@linux-foundation.org \
--cc=bwalle@suse.de \
--cc=ebiederm@xmission.com \
--cc=greg@kroah.com \
--cc=k-miyoshi@cb.jp.nec.com \
--cc=kaos@ocs.com.au \
--cc=kdb@oss.sgi.com \
--cc=kexec@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=nhorman@redhat.com \
--cc=nickpiggin@yahoo.com.au \
--cc=rdunlap@xenotime.net \
--cc=t-nagano@ah.jp.nec.com \
--cc=vgoyal@redhat.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox