Linux MIPS Architecture development
 help / color / mirror / Atom feed
From: Greg Ungerer <gerg@snapgear.com>
To: Ralf Baechle <ralf@linux-mips.org>
Cc: linux-mips@linux-mips.org
Subject: Re: system lockup with 2.6.29 on Cavium/Octeon
Date: Thu, 21 May 2009 15:29:05 +1000	[thread overview]
Message-ID: <4A14E6A1.4030700@snapgear.com> (raw)
In-Reply-To: <20090520142604.GA29677@linux-mips.org>

Hi Ralf,

Ralf Baechle wrote:
> On Wed, May 20, 2009 at 04:12:32PM +1000, Greg Ungerer wrote:
> 
>> I have a system lockup problem that I have been looking at on a custom
>> Cavium/Octeon 5010 based design. I am running on linux-2.6.29 with
>> David Daney's latest round of PCI and ethernet patches (posted here
>> on this list).
>>
>> I have tracked the problem back to local_flush_tlb_kernel_range() in
>> arch/mips/mm/tlb-r4k.c. At the top of this function is:
>>
>>     void local_flush_tlb_kernel_range(unsigned long start, unsigned long 
>> end)
>>     {
>>         unsigned long flags;
>>         int size;
>>
>>         ENTER_CRITICAL(flags);
>>         size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
>>         size = (size + 1) >> 1;
>>         if (size <= current_cpu_data.tlbsize / 2) {
>>
>> The problem is that typical example values I see passed in for start
>> and end are:
>>
>>     start = c000000000006000
>>     end   = ffffffffc01d8000
>>
>> Now the vmalloc area starts at 0xc000000000000000 and the kernel code
>> and data is all at 0xffffffff80000000 and above. I don't know if the
>> start and end are reasonable values, but I can see some logic as to
>> where they come from. The code path that leads to this is via
>> __vunmap() and __purge_vmap_area_lazy(). So it is not too difficult
>> to see how we end up with values like this.
> 
> Either start or end address is sensible but not the combination - both
> addresses should be in the same segment.  Start is in XKSEG, end in CKSEG2
> and in between there are vast wastelands of unused address space exabytes
> in size.

Yes, exactly, that looked odd to me too.

So I tracked it back to see how these both ended up being in there.
It turns out that MODULE_START, as defined in
arch/mips/include/asm/pgtable-64.h, is CKSSEG, so it is
0xffffffffc0000000 in my case. And VMALLOC_START/MAP_BASE is
defined to be 0xc000000000000000.

In module_alloc() there is a call to __get_vm_area() with MODULE_START
as the start address, and this is how the 0xfff... addresses end up in
the vmap_area table. The usual vmalloc() calls use VMALLOC_START and
that is how the 0xc000... addresses get into the vmap_area table.

Interestingly the definition of MODULE_START is like this:

#if defined(CONFIG_MODULES) && defined(KBUILD_64BIT_SYM32) && \
         VMALLOC_START != CKSSEG
/* Load modules into 32bit-compatible segment. */
#define MODULE_START    CKSSEG


If MODULE_START wasn't defined then the module_alloc() code would
have just called vmalloc() directly - and we wouldn't be in this
mess :-)


>> But the size calculation above with these types of values will result
>> in still a large number. Larger than the 32bit "int" that is "size".
>> I see large negative values fall out as size, and so the following
>> tlbsize check becomes true, and the code spins inside the loop inside
>> that if statement for a _very_ long time trying to flush tlb entries.
>>
>> This is of course easily fixed, by making that size "unsigned long".
>> The patch below trivially does this.
>>
>> But is this analysis correct?
> 
> Yes - but I think we have two issues here.  The one is the calculation
> overflowing int for the arguments you're seeing.  The other being that
> the arguments simply are looking wrong.
> 
> There are a few more instances of the same overflow issue which the patch
> below is fixing.

Indeed, looks good.

Regards
Greg



>   Ralf
> 
> 
>  arch/mips/mm/tlb-r3k.c |    6 ++----
>  arch/mips/mm/tlb-r4k.c |    6 ++----
>  arch/mips/mm/tlb-r8k.c |    3 +--
>  3 files changed, 5 insertions(+), 10 deletions(-)
> 
> diff --git a/arch/mips/mm/tlb-r3k.c b/arch/mips/mm/tlb-r3k.c
> index f0cf46a..1c0048a 100644
> --- a/arch/mips/mm/tlb-r3k.c
> +++ b/arch/mips/mm/tlb-r3k.c
> @@ -82,8 +82,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>  	int cpu = smp_processor_id();
>  
>  	if (cpu_context(cpu, mm) != 0) {
> -		unsigned long flags;
> -		int size;
> +		unsigned long size, flags;
>  
>  #ifdef DEBUG_TLB
>  		printk("[tlbrange<%lu,0x%08lx,0x%08lx>]",
> @@ -121,8 +120,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>  
>  void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
>  {
> -	unsigned long flags;
> -	int size;
> +	unsigned long size, flags;
>  
>  #ifdef DEBUG_TLB
>  	printk("[tlbrange<%lu,0x%08lx,0x%08lx>]", start, end);
> diff --git a/arch/mips/mm/tlb-r4k.c b/arch/mips/mm/tlb-r4k.c
> index 9619f66..892be42 100644
> --- a/arch/mips/mm/tlb-r4k.c
> +++ b/arch/mips/mm/tlb-r4k.c
> @@ -117,8 +117,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>  	int cpu = smp_processor_id();
>  
>  	if (cpu_context(cpu, mm) != 0) {
> -		unsigned long flags;
> -		int size;
> +		unsigned long size, flags;
>  
>  		ENTER_CRITICAL(flags);
>  		size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
> @@ -160,8 +159,7 @@ void local_flush_tlb_range(struct vm_area_struct *vma, unsigned long start,
>  
>  void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
>  {
> -	unsigned long flags;
> -	int size;
> +	unsigned long size, flags;
>  
>  	ENTER_CRITICAL(flags);
>  	size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
> diff --git a/arch/mips/mm/tlb-r8k.c b/arch/mips/mm/tlb-r8k.c
> index 4f01a3b..4ec95cc 100644
> --- a/arch/mips/mm/tlb-r8k.c
> +++ b/arch/mips/mm/tlb-r8k.c
> @@ -111,8 +111,7 @@ out_restore:
>  /* Usable for KV1 addresses only! */
>  void local_flush_tlb_kernel_range(unsigned long start, unsigned long end)
>  {
> -	unsigned long flags;
> -	int size;
> +	unsigned long size, flags;
>  
>  	size = (end - start + (PAGE_SIZE - 1)) >> PAGE_SHIFT;
>  	size = (size + 1) >> 1;
> 

-- 
------------------------------------------------------------------------
Greg Ungerer  --  Principal Engineer        EMAIL:     gerg@snapgear.com
SnapGear Group, McAfee                      PHONE:       +61 7 3435 2888
825 Stanley St,                             FAX:         +61 7 3891 3630
Woolloongabba, QLD, 4102, Australia         WEB: http://www.SnapGear.com

  reply	other threads:[~2009-05-21  5:29 UTC|newest]

Thread overview: 8+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-05-20  6:12 system lockup with 2.6.29 on Cavium/Octeon Greg Ungerer
2009-05-20 14:26 ` Ralf Baechle
2009-05-21  5:29   ` Greg Ungerer [this message]
2009-05-21  6:28     ` Ralf Baechle
2009-05-21 14:50   ` Atsushi Nemoto
2009-05-22  1:19     ` Greg Ungerer
2009-05-22  9:23       ` Ralf Baechle
2009-05-22 11:53       ` Atsushi Nemoto

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=4A14E6A1.4030700@snapgear.com \
    --to=gerg@snapgear.com \
    --cc=linux-mips@linux-mips.org \
    --cc=ralf@linux-mips.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox