From: HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com>
To: Michael Holzheu <holzheu@linux.vnet.ibm.com>
Cc: kexec@lists.infradead.org,
Heiko Carstens <heiko.carstens@de.ibm.com>,
Jan Willeke <willeke@de.ibm.com>,
linux-kernel@vger.kernel.org,
Martin Schwidefsky <schwidefsky@de.ibm.com>,
Vivek Goyal <vgoyal@redhat.com>
Subject: Re: [PATCH v6 3/5] vmcore: Introduce remap_oldmem_pfn_range()
Date: Wed, 10 Jul 2013 18:50:18 +0900 [thread overview]
Message-ID: <51DD2E5A.1030200@jp.fujitsu.com> (raw)
In-Reply-To: <20130710104252.479a0f92@holzheu>
(2013/07/10 17:42), Michael Holzheu wrote:
> Hello Hatayama,
>
> On Tue, 09 Jul 2013 14:49:48 +0900
> HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com> wrote:
>
>> (2013/07/08 23:28), Vivek Goyal wrote:
>>> On Mon, Jul 08, 2013 at 11:28:39AM +0200, Michael Holzheu wrote:
>>>> On Mon, 08 Jul 2013 14:32:09 +0900
>>>> HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com> wrote:
>
> [snip]
>
>>> I personally perfer not to special case it for s390 only and let the
>>> handler be generic.
>>>
>>> If there is a bug in remap_old_pfn_range(), only side affect is that
>>> we will fault in the page when it is accessed and that will be slow. BUG()
>>> sounds excessive. At max it could be WARN_ONCE().
>>>
>>> In regular cases for x86, this path should not even hit. So special casing
>>> it to detect issues with remap_old_pfn_range() does not sound very good
>>> to me. I would rather leave it as it is and if there are bugs and mmap()
>>> slows down, then somebody needs to debug it.
>>>
>>
>> I agree to WARN_ONCE(). Then, we can notice bug at least if it occurs.
>>
>> Interface is like this?
>>
>> [generic]
>>
>> bool __weak in_valid_fault_range(pgoff_t pgoff)
>> {
>> return false;
>> }
>>
>> [s390]
>>
>> bool in_valid_fault_range(pgoff_t pgoff)
>> {
>> loff_t offset = pgoff << PAGE_CACHE_SHIFT;
>> u64 paddr = vmcore_offset_to_paddr(offset);
>>
>> return paddr < ZFCPDUMP_HSA_SIZE;
>> }
>>
>> assuming vmcore_offset_to_paddr() that looks up vmcore_list and returns physical
>> address corresponding to given offset of vmcore. I guess this could return error
>> value if there's no entry corresponding to given offset in vmcore_list.
>
> I think this is too much code (and overhead) just for checking the correctness the
> kdump mmap implementation.
>
> My suggestion is to add the WARN_ONCE() for #ifndef CONFIG_S390. This has the same
> effect as your suggestion for all architectures besides of s390. And for s390 we
> take the risk that a programming error would result in poor /proc/vmcore
> performance.
>
If you want to avoid looking up vmcore_list that takes linear time w.r.t. the number
of the elements, you can still calculate the range of offsets in /proc/vmcore
corresponding to HSA during /proc/vmcore initialization.
Also, could you tell me how often and how much the HSA region is during crash dumping?
I guess the read to HSA is done mainly during early part of crash dumping process only.
According to the code, it appears at most 64MiB only. Then, I feel performance is not
a big issue.
Also, cost of WARN_ONCE() is one memory access only in the 2nd and later calls. I don't
think it too much overhead...
> So, at least for this patch series I would implement the fault handler as follows:
>
> static int mmap_vmcore_fault(struct vm_area_struct *vma, struct vm_fault *vmf)
> {
> ...
> char *buf;
> int rc;
>
> #ifndef CONFIG_S390
> WARN_ONCE(1, "vmcore: Unexpected call of mmap_vmcore_fault()");
> #endif
> page = find_or_create_page(mapping, index, GFP_KERNEL);
>
> At this point I have to tell you that we plan another vmcore patch series where
> the fault handler might be called also for other architectures. But I think we
> should *then* discuss your issue again.
>
Could you explain the plan in more detail? Or I cannot review correctly since I don't
know whether there's really usecase of this generic fault handler for other
architectures. This is the issue for architectures other than s390, not mine; now we
don't need it at all.
--
Thanks.
HATAYAMA, Daisuke
_______________________________________________
kexec mailing list
kexec@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/kexec
WARNING: multiple messages have this Message-ID (diff)
From: HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com>
To: Michael Holzheu <holzheu@linux.vnet.ibm.com>
Cc: Vivek Goyal <vgoyal@redhat.com>,
Martin Schwidefsky <schwidefsky@de.ibm.com>,
kexec@lists.infradead.org,
Heiko Carstens <heiko.carstens@de.ibm.com>,
Jan Willeke <willeke@de.ibm.com>,
linux-kernel@vger.kernel.org
Subject: Re: [PATCH v6 3/5] vmcore: Introduce remap_oldmem_pfn_range()
Date: Wed, 10 Jul 2013 18:50:18 +0900 [thread overview]
Message-ID: <51DD2E5A.1030200@jp.fujitsu.com> (raw)
In-Reply-To: <20130710104252.479a0f92@holzheu>
(2013/07/10 17:42), Michael Holzheu wrote:
> Hello Hatayama,
>
> On Tue, 09 Jul 2013 14:49:48 +0900
> HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com> wrote:
>
>> (2013/07/08 23:28), Vivek Goyal wrote:
>>> On Mon, Jul 08, 2013 at 11:28:39AM +0200, Michael Holzheu wrote:
>>>> On Mon, 08 Jul 2013 14:32:09 +0900
>>>> HATAYAMA Daisuke <d.hatayama@jp.fujitsu.com> wrote:
>
> [snip]
>
>>> I personally perfer not to special case it for s390 only and let the
>>> handler be generic.
>>>
>>> If there is a bug in remap_old_pfn_range(), only side affect is that
>>> we will fault in the page when it is accessed and that will be slow. BUG()
>>> sounds excessive. At max it could be WARN_ONCE().
>>>
>>> In regular cases for x86, this path should not even hit. So special casing
>>> it to detect issues with remap_old_pfn_range() does not sound very good
>>> to me. I would rather leave it as it is and if there are bugs and mmap()
>>> slows down, then somebody needs to debug it.
>>>
>>
>> I agree to WARN_ONCE(). Then, we can notice bug at least if it occurs.
>>
>> Interface is like this?
>>
>> [generic]
>>
>> bool __weak in_valid_fault_range(pgoff_t pgoff)
>> {
>> return false;
>> }
>>
>> [s390]
>>
>> bool in_valid_fault_range(pgoff_t pgoff)
>> {
>> loff_t offset = pgoff << PAGE_CACHE_SHIFT;
>> u64 paddr = vmcore_offset_to_paddr(offset);
>>
>> return paddr < ZFCPDUMP_HSA_SIZE;
>> }
>>
>> assuming vmcore_offset_to_paddr() that looks up vmcore_list and returns physical
>> address corresponding to given offset of vmcore. I guess this could return error
>> value if there's no entry corresponding to given offset in vmcore_list.
>
> I think this is too much code (and overhead) just for checking the correctness the
> kdump mmap implementation.
>
> My suggestion is to add the WARN_ONCE() for #ifndef CONFIG_S390. This has the same
> effect as your suggestion for all architectures besides of s390. And for s390 we
> take the risk that a programming error would result in poor /proc/vmcore
> performance.
>
If you want to avoid looking up vmcore_list that takes linear time w.r.t. the number
of the elements, you can still calculate the range of offsets in /proc/vmcore
corresponding to HSA during /proc/vmcore initialization.
Also, could you tell me how often and how much the HSA region is during crash dumping?
I guess the read to HSA is done mainly during early part of crash dumping process only.
According to the code, it appears at most 64MiB only. Then, I feel performance is not
a big issue.
Also, cost of WARN_ONCE() is one memory access only in the 2nd and later calls. I don't
think it too much overhead...
> So, at least for this patch series I would implement the fault handler as follows:
>
> static int mmap_vmcore_fault(struct vm_area_struct *vma, struct vm_fault *vmf)
> {
> ...
> char *buf;
> int rc;
>
> #ifndef CONFIG_S390
> WARN_ONCE(1, "vmcore: Unexpected call of mmap_vmcore_fault()");
> #endif
> page = find_or_create_page(mapping, index, GFP_KERNEL);
>
> At this point I have to tell you that we plan another vmcore patch series where
> the fault handler might be called also for other architectures. But I think we
> should *then* discuss your issue again.
>
Could you explain the plan in more detail? Or I cannot review correctly since I don't
know whether there's really usecase of this generic fault handler for other
architectures. This is the issue for architectures other than s390, not mine; now we
don't need it at all.
--
Thanks.
HATAYAMA, Daisuke
next prev parent reply other threads:[~2013-07-10 9:51 UTC|newest]
Thread overview: 74+ messages / expand[flat|nested] mbox.gz Atom feed top
2013-07-01 19:32 [PATCH v6 0/5] kdump: Allow ELF header creation in new kernel Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
2013-07-01 19:32 ` [PATCH v6 1/5] vmcore: Introduce ELF header in new memory feature Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
2013-07-02 15:27 ` Vivek Goyal
2013-07-02 15:27 ` Vivek Goyal
2013-07-01 19:32 ` [PATCH v6 2/5] s390/vmcore: Use " Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
2013-07-02 16:23 ` Vivek Goyal
2013-07-02 16:23 ` Vivek Goyal
2013-07-03 7:59 ` Michael Holzheu
2013-07-03 7:59 ` Michael Holzheu
2013-07-03 14:15 ` Vivek Goyal
2013-07-03 14:15 ` Vivek Goyal
2013-07-03 14:39 ` Michael Holzheu
2013-07-03 14:39 ` Michael Holzheu
2013-07-03 14:50 ` Vivek Goyal
2013-07-03 14:50 ` Vivek Goyal
2013-07-01 19:32 ` [PATCH v6 3/5] vmcore: Introduce remap_oldmem_pfn_range() Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
2013-07-02 15:42 ` Vivek Goyal
2013-07-02 15:42 ` Vivek Goyal
2013-07-03 13:59 ` Michael Holzheu
2013-07-03 13:59 ` Michael Holzheu
2013-07-03 14:16 ` Vivek Goyal
2013-07-03 14:16 ` Vivek Goyal
2013-07-15 13:44 ` Michael Holzheu
2013-07-15 13:44 ` Michael Holzheu
2013-07-15 14:27 ` Vivek Goyal
2013-07-15 14:27 ` Vivek Goyal
2013-07-16 9:25 ` Michael Holzheu
2013-07-16 9:25 ` Michael Holzheu
2013-07-16 14:04 ` Vivek Goyal
2013-07-16 14:04 ` Vivek Goyal
2013-07-16 15:37 ` Michael Holzheu
2013-07-16 15:37 ` Michael Holzheu
2013-07-16 15:55 ` Vivek Goyal
2013-07-16 15:55 ` Vivek Goyal
2013-07-08 5:32 ` HATAYAMA Daisuke
2013-07-08 5:32 ` HATAYAMA Daisuke
2013-07-08 9:28 ` Michael Holzheu
2013-07-08 9:28 ` Michael Holzheu
2013-07-08 14:28 ` Vivek Goyal
2013-07-08 14:28 ` Vivek Goyal
2013-07-09 5:49 ` HATAYAMA Daisuke
2013-07-09 5:49 ` HATAYAMA Daisuke
2013-07-10 8:42 ` Michael Holzheu
2013-07-10 8:42 ` Michael Holzheu
2013-07-10 9:50 ` HATAYAMA Daisuke [this message]
2013-07-10 9:50 ` HATAYAMA Daisuke
2013-07-10 11:00 ` Michael Holzheu
2013-07-10 11:00 ` Michael Holzheu
2013-07-12 16:02 ` HATAYAMA Daisuke
2013-07-12 16:02 ` HATAYAMA Daisuke
2013-07-15 9:21 ` Martin Schwidefsky
2013-07-15 9:21 ` Martin Schwidefsky
2013-07-16 0:51 ` HATAYAMA Daisuke
2013-07-16 0:51 ` HATAYAMA Daisuke
2013-07-10 14:33 ` Vivek Goyal
2013-07-10 14:33 ` Vivek Goyal
2013-07-12 11:05 ` HATAYAMA Daisuke
2013-07-12 11:05 ` HATAYAMA Daisuke
2013-07-15 14:20 ` Vivek Goyal
2013-07-15 14:20 ` Vivek Goyal
2013-07-16 0:27 ` HATAYAMA Daisuke
2013-07-16 0:27 ` HATAYAMA Daisuke
2013-07-16 9:40 ` HATAYAMA Daisuke
2013-07-16 9:40 ` HATAYAMA Daisuke
2013-07-09 5:31 ` HATAYAMA Daisuke
2013-07-09 5:31 ` HATAYAMA Daisuke
2013-07-01 19:32 ` [PATCH v6 4/5] s390/vmcore: Implement remap_oldmem_pfn_range for s390 Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
2013-07-01 19:32 ` [PATCH v6 5/5] s390/vmcore: Use vmcore for zfcpdump Michael Holzheu
2013-07-01 19:32 ` Michael Holzheu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=51DD2E5A.1030200@jp.fujitsu.com \
--to=d.hatayama@jp.fujitsu.com \
--cc=heiko.carstens@de.ibm.com \
--cc=holzheu@linux.vnet.ibm.com \
--cc=kexec@lists.infradead.org \
--cc=linux-kernel@vger.kernel.org \
--cc=schwidefsky@de.ibm.com \
--cc=vgoyal@redhat.com \
--cc=willeke@de.ibm.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.