From: Yoshihiro YUNOMAE <yoshihiro.yunomae.ez@hitachi.com>
To: "Eric W. Biederman" <ebiederm@xmission.com>,
Don Zickus <dzickus@redhat.com>
Cc: Ingo Molnar <mingo@kernel.org>,
linux-kernel@vger.kernel.org, Andi Kleen <ak@linux.intel.com>,
"H. Peter Anvin" <hpa@zytor.com>, Gleb Natapov <gleb@redhat.com>,
Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>,
Joerg Roedel <joro@8bytes.org>,
x86@kernel.org, stable@vger.kernel.org,
Marcelo Tosatti <mtosatti@redhat.com>,
Hidehiro Kawai <hidehiro.kawai.ez@hitachi.com>,
Sebastian Andrzej Siewior <sebastian@breakpoint.cc>,
Ingo Molnar <mingo@redhat.com>,
Zhang Yanfei <zhangyanfei@cn.fujitsu.com>,
yrl.pp-manager.tt@hitachi.com,
Masami Hiramatsu <masami.hiramatsu.pt@hitachi.com>,
Thomas Gleixner <tglx@linutronix.de>,
Seiji Aguchi <seiji.aguchi@hds.com>,
Andrew Morton <akpm@linux-foundation.org>
Subject: Re: Re: [PATCH] [BUGFIX] crash/ioapic: Prevent crash_kexec() from deadlocking of ioapic_lock
Date: Mon, 02 Sep 2013 12:09:00 +0900 [thread overview]
Message-ID: <5224014C.2070801@hitachi.com> (raw)
In-Reply-To: <878uzir80g.fsf@xmission.com>
Hi Eric and Don,
Sorry for the late reply.
(2013/08/31 9:58), Eric W. Biederman wrote:
> Don Zickus <dzickus@redhat.com> writes:
>
>> On Tue, Aug 27, 2013 at 12:41:51PM +0900, Yoshihiro YUNOMAE wrote:
>>> Hi Don,
>>>
>>> Sorry for the late reply.
>>>
>>> (2013/08/22 22:11), Don Zickus wrote:
>>>> On Thu, Aug 22, 2013 at 05:38:07PM +0900, Yoshihiro YUNOMAE wrote:
>>>>>> So, I agree with Eric, let's remove the disable_IO_APIC() stuff and keep
>>>>>> the code simpler.
>>>>>
>>>>> Thank you for commenting about my patch.
>>>>> I didn't know you already have submitted the patches for this deadlock
>>>>> problem.
>>>>>
>>>>> I can't answer definitively right now that no problems are induced by
>>>>> removing disable_IO_APIC(). However, my patch should be work well (and
>>>>> has already been merged to -tip tree). So how about taking my patch at
>>>>> first, and then discussing the removal of disabled_IO_APIC()?
>>>>
>>>> It doesn't matter to me. My orignal patch last year was similar to yours
>>>> until it was suggested that we were working around a problem which was we
>>>> shouldn't touch the IO_APIC code on panic. Then I wrote the removal of
>>>> disable_IO_APIC patch and did lots of testing on it. I don't think I have
>>>> seen any issues with it (just the removal of disabling the lapic stuff).
>>>
>>> Yes, you really did a lot of testing about this problem according to
>>> your patch(https://lkml.org/lkml/2012/1/31/391). Although you
>>> said jiffies calibration code does not need the PIT in
>>> http://lists.infradead.org/pipermail/kexec/2012-February/006017.html,
>>> I don't understand yet why we can remove disable_IO_APIC.
>>> Would you please explain about the calibration codes?
>>
>> I forgot a lot of this, Eric B. might remember more (as he was the one that
>> pointed this out initially). I believe initially the io_apic had to be in
>> a pre-configured state in order to do some early calibration of the timing
>> code. Later on, it was my understanding, that the calibration of various
>> time keeping stuff did not need the io_apic in a correct state. The code
>> might have switched to tsc instead of PIT, I forget.
>
> Yes. Alan Coxe's initial SMP port had a few cases where it still
> exepected the system to be in PIT mode during boot and it took us a
> decade or so before those assumptions were finally expunged.
Would you please tell me the commit ID or the hint like files,
functions, or when?
>> Then again looking at the output of the latest dmesg, it seems the IO APIC
>> is initialized way before the tsc is calibrated. So I am not sure what
>> needed to get done or what interrupts are needed before the IO APIC gets
>> initialized.
>
> The practical issue is that jiffies was calibrated off of the PIT timer
> if I recall. But that is all old news.
Are the jiffies calibration codes calibrate_delay()?
It seems that the jiffies calibration have not used PIT in 2005
according to 8a9e1b0.
>>> By the way, can we remove disable_IO_APIC even if an old dump capture
>>> kernel is used?
>>
>> Good question. I did a bunch of testing with RHEL-6 too, which is 2.6.32
>> based. But I think we added some IRR fixes (commit 1e75b31d638), which
>> may or may not have helped in this case. So I don't know when a kernel
>> started worked correctly during init (with the right changes). I believe
>> 2.6.32 had everything.
>
> A sufficient old and buggy dump capture kernel will fail because of bugs
> in it's startup path, but I don't think anyone cares.
OK, if the jiffies calibration problem has been fixed in the old days,
we don't need to care for the old kernel.
> The kernel startup path has been fixed for years, and disable_IO_APIC in
> crash_kexec has always been a bug work-around for deficiencies in the
> kernel's start up path (not part of the guaranteed interface).
> Furthermore every real system configuration I have encountered used the
> same kernel version for the crashdump kernel and the production kernel.
> So we should be good.
We also will be use the kdump(crashdump) kernel as the production
kernel. Should I only care for the current kernel?
Thanks,
Yoshihiro YUNOMAE
--
Yoshihiro YUNOMAE
Software Platform Research Dept. Linux Technology Center
Hitachi, Ltd., Yokohama Research Laboratory
E-mail: yoshihiro.yunomae.ez@hitachi.com
next prev parent reply other threads:[~2013-09-02 3:09 UTC|newest]
Thread overview: 15+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-08-19 8:12 [PATCH] [BUGFIX] crash/ioapic: Prevent crash_kexec() from deadlocking of ioapic_lock Yoshihiro YUNOMAE
2013-08-19 9:46 ` Ingo Molnar
2013-08-20 0:06 ` Yoshihiro YUNOMAE
2013-08-20 10:12 ` Eric W. Biederman
2013-08-20 14:27 ` Don Zickus
2013-08-22 8:38 ` Yoshihiro YUNOMAE
2013-08-22 13:11 ` Don Zickus
2013-08-27 3:41 ` Yoshihiro YUNOMAE
2013-08-27 13:33 ` Don Zickus
2013-08-31 0:58 ` Eric W. Biederman
2013-09-02 3:09 ` Yoshihiro YUNOMAE [this message]
2013-09-03 0:12 ` Eric W. Biederman
2013-09-03 11:02 ` Yoshihiro YUNOMAE
2013-09-03 12:44 ` Eric W. Biederman
2013-09-04 9:40 ` Yoshihiro YUNOMAE
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=5224014C.2070801@hitachi.com \
--to=yoshihiro.yunomae.ez@hitachi.com \
--cc=ak@linux.intel.com \
--cc=akpm@linux-foundation.org \
--cc=dzickus@redhat.com \
--cc=ebiederm@xmission.com \
--cc=gleb@redhat.com \
--cc=hidehiro.kawai.ez@hitachi.com \
--cc=hpa@zytor.com \
--cc=joro@8bytes.org \
--cc=konrad.wilk@oracle.com \
--cc=linux-kernel@vger.kernel.org \
--cc=masami.hiramatsu.pt@hitachi.com \
--cc=mingo@kernel.org \
--cc=mingo@redhat.com \
--cc=mtosatti@redhat.com \
--cc=sebastian@breakpoint.cc \
--cc=seiji.aguchi@hds.com \
--cc=stable@vger.kernel.org \
--cc=tglx@linutronix.de \
--cc=x86@kernel.org \
--cc=yrl.pp-manager.tt@hitachi.com \
--cc=zhangyanfei@cn.fujitsu.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).