* /sbin/kexec became zero size... Anyone else?
@ 2008-09-17 21:07 Jay Lan
2008-09-23 7:52 ` Simon Horman
0 siblings, 1 reply; 7+ messages in thread
From: Jay Lan @ 2008-09-17 21:07 UTC (permalink / raw)
To: kexec
Hi,
My root disk was populated with sles10sp2, but the kernel was
2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
Many times when kdump kernel failed early i found after reboot that
/sbin/kexec became zero size. There was no warning on executing
'/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
no idea when and how that happened.
Has anyone else seen this problem?
Thanks,
jay
a4700rac:/sbin # ll kexec*
-rwxr-xr-x 1 root root 0 2008-09-17 12:41 kexec
-rwxr-xr-x 1 root root 387635 2008-07-23 07:25 kexec-2.0.0
-r-xr-xr-x 1 root root 320632 2008-04-21 17:31 kexec-sles10sp2
a4700rac:/sbin # /etc/init.d/kdump start
Loading kdump Loading kdump kernel: /sbin/kexec -p
/boot/vmlinuz-2.6.27-rc5-default --append="CRASH=1 root=/dev/sda8
shub2 console=ttySG0,115200 thash_entries=2097152 ro elevator=deadline
sysrq=1 reset_devices irqpoll maxcpus=1 3"
--initrd=/boot/initrd-2.6.27-rc5-default --noio
done
a4700rac:/sbin #
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-17 21:07 /sbin/kexec became zero size... Anyone else? Jay Lan
@ 2008-09-23 7:52 ` Simon Horman
2008-09-23 16:41 ` Jay Lan
0 siblings, 1 reply; 7+ messages in thread
From: Simon Horman @ 2008-09-23 7:52 UTC (permalink / raw)
To: Jay Lan; +Cc: kexec
On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
> Hi,
>
> My root disk was populated with sles10sp2, but the kernel was
> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
>
> Many times when kdump kernel failed early i found after reboot that
> /sbin/kexec became zero size. There was no warning on executing
> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
> no idea when and how that happened.
>
> Has anyone else seen this problem?
Is it possible that this is caused by filesystem corruption?
--
Simon Horman
VA Linux Systems Japan K.K., Sydney, Australia Satellite Office
H: www.vergenet.net/~horms/ W: www.valinux.co.jp/en
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-23 7:52 ` Simon Horman
@ 2008-09-23 16:41 ` Jay Lan
2008-09-23 17:41 ` Neil Horman
0 siblings, 1 reply; 7+ messages in thread
From: Jay Lan @ 2008-09-23 16:41 UTC (permalink / raw)
To: Simon Horman; +Cc: kexec
Simon Horman wrote:
> On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
>> Hi,
>>
>> My root disk was populated with sles10sp2, but the kernel was
>> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
>>
>> Many times when kdump kernel failed early i found after reboot that
>> /sbin/kexec became zero size. There was no warning on executing
>> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
>> no idea when and how that happened.
>>
>> Has anyone else seen this problem?
>
> Is it possible that this is caused by filesystem corruption?
I found the problems, which i can not reliably reproduce, happened
when i ran into kdump kernel boot failure (yet not every time.)
What puzzled me was why only /sbin/kexec was affected.
I can not explain what caused it (otherwise i would be able to
reproduce it) and do not know where my finger should point to. ;)
The good news is since i fixed the kernel boot problem it has not
happened any more.
The distros probably can add a check against zero-sized kexec
in /etc/init.d/kdump script though.
Thanks,
- jay
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-23 16:41 ` Jay Lan
@ 2008-09-23 17:41 ` Neil Horman
2008-09-23 20:32 ` Jay Lan
0 siblings, 1 reply; 7+ messages in thread
From: Neil Horman @ 2008-09-23 17:41 UTC (permalink / raw)
To: Jay Lan; +Cc: Simon Horman, kexec
On Tue, Sep 23, 2008 at 09:41:50AM -0700, Jay Lan wrote:
> Simon Horman wrote:
> > On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
> >> Hi,
> >>
> >> My root disk was populated with sles10sp2, but the kernel was
> >> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
> >>
> >> Many times when kdump kernel failed early i found after reboot that
> >> /sbin/kexec became zero size. There was no warning on executing
> >> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
> >> no idea when and how that happened.
> >>
> >> Has anyone else seen this problem?
> >
> > Is it possible that this is caused by filesystem corruption?
>
> I found the problems, which i can not reliably reproduce, happened
> when i ran into kdump kernel boot failure (yet not every time.)
> What puzzled me was why only /sbin/kexec was affected.
>
This is why most kdump implementations capture vmcore from an initramfs, since
a crash means that you may not be able to trust the integrity of the filesystems
that were mounted before the panic.
> I can not explain what caused it (otherwise i would be able to
> reproduce it) and do not know where my finger should point to. ;)
> The good news is since i fixed the kernel boot problem it has not
> happened any more.
>
> The distros probably can add a check against zero-sized kexec
> in /etc/init.d/kdump script though.
>
Theres really not a need to. Filesystems can be corrupted by crashes, thats why
we capture cores from the initrd. We need to be able to trust the rest of the
system to scan filesystems and do their best to recover when filsystems loose
data.
Regards
Neil
> Thanks,
> - jay
>
>
> _______________________________________________
> kexec mailing list
> kexec@lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/kexec
--
/***************************************************
*Neil Horman
*Senior Software Engineer
*Red Hat, Inc.
*nhorman@redhat.com
*gpg keyid: 1024D / 0x92A74FA1
*http://pgp.mit.edu
***************************************************/
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-23 17:41 ` Neil Horman
@ 2008-09-23 20:32 ` Jay Lan
2008-09-23 20:41 ` Neil Horman
2010-04-08 16:54 ` Neil Horman
0 siblings, 2 replies; 7+ messages in thread
From: Jay Lan @ 2008-09-23 20:32 UTC (permalink / raw)
To: Neil Horman; +Cc: Simon Horman, kexec
Neil Horman wrote:
> On Tue, Sep 23, 2008 at 09:41:50AM -0700, Jay Lan wrote:
>> Simon Horman wrote:
>>> On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
>>>> Hi,
>>>>
>>>> My root disk was populated with sles10sp2, but the kernel was
>>>> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
>>>>
>>>> Many times when kdump kernel failed early i found after reboot that
>>>> /sbin/kexec became zero size. There was no warning on executing
>>>> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
>>>> no idea when and how that happened.
>>>>
>>>> Has anyone else seen this problem?
>>> Is it possible that this is caused by filesystem corruption?
>> I found the problems, which i can not reliably reproduce, happened
>> when i ran into kdump kernel boot failure (yet not every time.)
>> What puzzled me was why only /sbin/kexec was affected.
>>
> This is why most kdump implementations capture vmcore from an initramfs, since
> a crash means that you may not be able to trust the integrity of the filesystems
> that were mounted before the panic.
>
>
>> I can not explain what caused it (otherwise i would be able to
>> reproduce it) and do not know where my finger should point to. ;)
>> The good news is since i fixed the kernel boot problem it has not
>> happened any more.
>>
>> The distros probably can add a check against zero-sized kexec
>> in /etc/init.d/kdump script though.
>>
> Theres really not a need to. Filesystems can be corrupted by crashes, thats why
> we capture cores from the initrd. We need to be able to trust the rest of the
> system to scan filesystems and do their best to recover when filsystems loose
> data.
Hi Neil,
Which rhel release starts doing this?
Thanks,
- jay
>
> Regards
> Neil
>
>> Thanks,
>> - jay
>>
>>
>> _______________________________________________
>> kexec mailing list
>> kexec@lists.infradead.org
>> http://lists.infradead.org/mailman/listinfo/kexec
>
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-23 20:32 ` Jay Lan
@ 2008-09-23 20:41 ` Neil Horman
2010-04-08 16:54 ` Neil Horman
1 sibling, 0 replies; 7+ messages in thread
From: Neil Horman @ 2008-09-23 20:41 UTC (permalink / raw)
To: Jay Lan; +Cc: Neil Horman, Simon Horman, kexec
On Tue, Sep 23, 2008 at 01:32:19PM -0700, Jay Lan wrote:
> Neil Horman wrote:
> > On Tue, Sep 23, 2008 at 09:41:50AM -0700, Jay Lan wrote:
> >> Simon Horman wrote:
> >>> On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
> >>>> Hi,
> >>>>
> >>>> My root disk was populated with sles10sp2, but the kernel was
> >>>> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
> >>>>
> >>>> Many times when kdump kernel failed early i found after reboot that
> >>>> /sbin/kexec became zero size. There was no warning on executing
> >>>> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
> >>>> no idea when and how that happened.
> >>>>
> >>>> Has anyone else seen this problem?
> >>> Is it possible that this is caused by filesystem corruption?
> >> I found the problems, which i can not reliably reproduce, happened
> >> when i ran into kdump kernel boot failure (yet not every time.)
> >> What puzzled me was why only /sbin/kexec was affected.
> >>
> > This is why most kdump implementations capture vmcore from an initramfs, since
> > a crash means that you may not be able to trust the integrity of the filesystems
> > that were mounted before the panic.
> >
> >
> >> I can not explain what caused it (otherwise i would be able to
> >> reproduce it) and do not know where my finger should point to. ;)
> >> The good news is since i fixed the kernel boot problem it has not
> >> happened any more.
> >>
> >> The distros probably can add a check against zero-sized kexec
> >> in /etc/init.d/kdump script though.
> >>
> > Theres really not a need to. Filesystems can be corrupted by crashes, thats why
> > we capture cores from the initrd. We need to be able to trust the rest of the
> > system to scan filesystems and do their best to recover when filsystems loose
> > data.
>
> Hi Neil,
>
> Which rhel release starts doing this?
>
Not sure what you mean. Which rhel releases attempt to recover filesystems on
boot? Linux has done that with fsck for years. Which RHEL releases have
allowed vmcore capture from an initrd with kdump? Read the man page, its been
available since GA.
Neil
> Thanks,
> - jay
>
>
> >
> > Regards
> > Neil
> >
> >> Thanks,
> >> - jay
> >>
> >>
> >> _______________________________________________
> >> kexec mailing list
> >> kexec@lists.infradead.org
> >> http://lists.infradead.org/mailman/listinfo/kexec
> >
>
--
/***************************************************
*Neil Horman
*Senior Software Engineer
*Red Hat, Inc.
*nhorman@redhat.com
*gpg keyid: 1024D / 0x92A74FA1
*http://pgp.mit.edu
***************************************************/
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: /sbin/kexec became zero size... Anyone else?
2008-09-23 20:32 ` Jay Lan
2008-09-23 20:41 ` Neil Horman
@ 2010-04-08 16:54 ` Neil Horman
1 sibling, 0 replies; 7+ messages in thread
From: Neil Horman @ 2010-04-08 16:54 UTC (permalink / raw)
To: Jay Lan; +Cc: Neil Horman, Simon Horman, kexec
On Tue, Sep 23, 2008 at 01:32:19PM -0700, Jay Lan wrote:
> Neil Horman wrote:
> > On Tue, Sep 23, 2008 at 09:41:50AM -0700, Jay Lan wrote:
> >> Simon Horman wrote:
> >>> On Wed, Sep 17, 2008 at 02:07:15PM -0700, Jay Lan wrote:
> >>>> Hi,
> >>>>
> >>>> My root disk was populated with sles10sp2, but the kernel was
> >>>> 2.6.27-rc5 and /sbin/kexec was built from 2.0.0 version.
> >>>>
> >>>> Many times when kdump kernel failed early i found after reboot that
> >>>> /sbin/kexec became zero size. There was no warning on executing
> >>>> '/etc/init.d/kdump start' when /sbin/kexec is zero size. I have
> >>>> no idea when and how that happened.
> >>>>
> >>>> Has anyone else seen this problem?
> >>> Is it possible that this is caused by filesystem corruption?
> >> I found the problems, which i can not reliably reproduce, happened
> >> when i ran into kdump kernel boot failure (yet not every time.)
> >> What puzzled me was why only /sbin/kexec was affected.
> >>
> > This is why most kdump implementations capture vmcore from an initramfs, since
> > a crash means that you may not be able to trust the integrity of the filesystems
> > that were mounted before the panic.
> >
> >
> >> I can not explain what caused it (otherwise i would be able to
> >> reproduce it) and do not know where my finger should point to. ;)
> >> The good news is since i fixed the kernel boot problem it has not
> >> happened any more.
> >>
> >> The distros probably can add a check against zero-sized kexec
> >> in /etc/init.d/kdump script though.
> >>
> > Theres really not a need to. Filesystems can be corrupted by crashes, thats why
> > we capture cores from the initrd. We need to be able to trust the rest of the
> > system to scan filesystems and do their best to recover when filsystems loose
> > data.
>
> Hi Neil,
>
> Which rhel release starts doing this?
>
> Thanks,
> - jay
>
Its always been that way.
Neil
>
> >
> > Regards
> > Neil
> >
> >> Thanks,
> >> - jay
> >>
> >>
> >> _______________________________________________
> >> kexec mailing list
> >> kexec@lists.infradead.org
> >> http://lists.infradead.org/mailman/listinfo/kexec
> >
>
>
> _______________________________________________
> kexec mailing list
> kexec@lists.infradead.org
> http://lists.infradead.org/mailman/listinfo/kexec
>
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2010-04-08 16:54 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2008-09-17 21:07 /sbin/kexec became zero size... Anyone else? Jay Lan
2008-09-23 7:52 ` Simon Horman
2008-09-23 16:41 ` Jay Lan
2008-09-23 17:41 ` Neil Horman
2008-09-23 20:32 ` Jay Lan
2008-09-23 20:41 ` Neil Horman
2010-04-08 16:54 ` Neil Horman
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox