From: Greg Ungerer <gerg@snapgear.com>
To: Ralf Baechle <ralf@linux-mips.org>
Cc: linux-mips@linux-mips.org
Subject: Re: system lockup with 2.6.29 on Cavium/Octeon
Date: Thu, 21 May 2009 15:29:05 +1000 [thread overview]
Message-ID: <4A14E6A1.4030700@snapgear.com> (raw)
In-Reply-To: <20090520142604.GA29677@linux-mips.org>
Hi Ralf,
Ralf Baechle wrote:
> On Wed, May 20, 2009 at 04:12:32PM +1000, Greg Ungerer wrote:
>
>> I have a system lockup problem that I have been looking at on a custom
>> Cavium/Octeon 5010 based design. I am running on linux-2.6.29 with
>> David Daney's latest round of PCI and ethernet patches (posted here
>> on this list).
>>
>> I have tracked the problem back to local_flush_tlb_kernel_range() in
>> arch/mips/mm/tlb-r4k.c. At the top of this function is:
>>
>> void local_flush_tlb_kernel_range(unsigned long start, unsigned long
>> end)
>> {
>> unsigned long flags;
>> int size;
>>
>> ENTER_CRITICAL(flags);
>> size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
>> size = (size + 1) >> 1;
>> if (size <= current_cpu_data.tlbsize / 2) {
>>
>> The problem is that typical example values I see passed in for start
>> and end are:
>>
>> start = c000000000006000
>> end = ffffffffc01d8000
>>
>> Now the vmalloc area starts at 0xc000000000000000 and the kernel code
>> and data is all at 0xffffffff80000000 and above. I don't know if the
>> start and end are reasonable values, but I can see some logic as to
>> where they come from. The code path that leads to this is via
>> __vunmap() and __purge_vmap_area_lazy(). So it is not too difficult
>> to see how we end up with values like this.
>
> Either start or end address is sensible but not the combination - both
> addresses should be in the same segment. Start is in XKSEG, end in CKSEG2
> and in between there are vast wastelands of unused address space exabytes
> in size.
Yes, exactly, that looked odd to me too.
So I tracked it back to see how these both ended up being in there.
It turns out that MODULE_START, as defined in
arch/mips/include/asm/pgtable-64.h, is CKSSEG, so it is
0xffffffffc0000000 in my case. And VMALLOC_START/MAP_BASE is
defined to be 0xc000000000000000.
In module_alloc() there is a call to __get_vm_area() with MODULE_START
as the start address, and this is how the 0xfff... addresses end up in
the vmap_area table. The usual vmalloc() calls use VMALLOC_START and
that is how the 0xc000... addresses get into the vmap_area table.
Interestingly the definition of MODULE_START is like this:
#if defined(CONFIG_MODULES) && defined(KBUILD_64BIT_SYM32) && \
VMALLOC_START != CKSSEG
/* Load modules into 32bit-compatible segment. */
#define MODULE_START CKSSEG
If MODULE_START wasn't defined then the module_alloc() code would
have just called vmalloc() directly - and we wouldn't be in this
mess :-)
>> But the size calculation above with these types of values will result
>> in still a large number. Larger than the 32bit "int" that is "size".
>> I see large negative values fall out as size, and so the following
>> tlbsize check becomes true, and the code spins inside the loop inside
>> that if statement for a _very_ long time trying to flush tlb entries.
>>
>> This is of course easily fixed, by making that size "unsigned long".
>> The patch below trivially does this.
>>
>> But is this analysis correct?
>
> Yes - but I think we have two issues here. The one is the calculation
> overflowing int for the arguments you're seeing. The other being that
> the arguments simply are looking wrong.
>
> There are a few more instances of the same overflow issue which the patch
> below is fixing.
Indeed, looks good.
Regards
Greg
> Ralf
>
>
> arch/mips/mm/tlb-r3k.c | 6 ++----
> arch/mips/mm/tlb-r4k.c | 6 ++----
> arch/mips/mm/tlb-r8k.c | 3 +--
> 3 files changed, 5 insertions(+), 10 deletions(-)
>
> diff --git a/arch/mips/mm/tlb-r3k.c b/arch/mips/mm/tlb-r3k.c
> index f0cf46a..1c0048a 100644
> --- a/arch/mips/mm/tlb-r3k.c
> +++ b/arch/mips/mm/tlb-r3k.c
> @@ -82,8 +82,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
> int cpu = smp_processor_id();
>
> if (cpu_context(cpu, mm) != 0) {
> - unsigned long flags;
> - int size;
> + unsigned long size, flags;
>
> #ifdef DEBUG_TLB
> printk("[tlbrange<%lu,0x%08lx,0x%08lx>]",
> @@ -121,8 +120,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>
> void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
> {
> - unsigned long flags;
> - int size;
> + unsigned long size, flags;
>
> #ifdef DEBUG_TLB
> printk("[tlbrange<%lu,0x%08lx,0x%08lx>]", start, end);
> diff --git a/arch/mips/mm/tlb-r4k.c b/arch/mips/mm/tlb-r4k.c
> index 9619f66..892be42 100644
> --- a/arch/mips/mm/tlb-r4k.c
> +++ b/arch/mips/mm/tlb-r4k.c
> @@ -117,8 +117,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
> int cpu = smp_processor_id();
>
> if (cpu_context(cpu, mm) != 0) {
> - unsigned long flags;
> - int size;
> + unsigned long size, flags;
>
> ENTER_CRITICAL(flags);
> size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
> @@ -160,8 +159,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>
> void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
> {
> - unsigned long flags;
> - int size;
> + unsigned long size, flags;
>
> ENTER_CRITICAL(flags);
> size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
> diff --git a/arch/mips/mm/tlb-r8k.c b/arch/mips/mm/tlb-r8k.c
> index 4f01a3b..4ec95cc 100644
> --- a/arch/mips/mm/tlb-r8k.c
> +++ b/arch/mips/mm/tlb-r8k.c
> @@ -111,8 +111,7 @@ out_restore:
> /* Usable for KV1 addresses only! */
> void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
> {
> - unsigned long flags;
> - int size;
> + unsigned long size, flags;
>
> size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
> size = (size + 1) >> 1;
>
--
------------------------------------------------------------------------
Greg Ungerer -- Principal Engineer EMAIL: gerg@snapgear.com
SnapGear Group, McAfee PHONE: +61 7 3435 2888
825 Stanley St, FAX: +61 7 3891 3630
Woolloongabba, QLD, 4102, Australia WEB: http://www.SnapGear.com
next prev parent reply other threads:[~2009-05-21 5:29 UTC|newest]
Thread overview: 8+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-05-20 6:12 system lockup with 2.6.29 on Cavium/Octeon Greg Ungerer
2009-05-20 14:26 ` Ralf Baechle
2009-05-21 5:29 ` Greg Ungerer [this message]
2009-05-21 6:28 ` Ralf Baechle
2009-05-21 14:50 ` Atsushi Nemoto
2009-05-22 1:19 ` Greg Ungerer
2009-05-22 9:23 ` Ralf Baechle
2009-05-22 11:53 ` Atsushi Nemoto
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=4A14E6A1.4030700@snapgear.com \
--to=gerg@snapgear.com \
--cc=linux-mips@linux-mips.org \
--cc=ralf@linux-mips.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox