All of lore.kernel.org
 help / color / mirror / Atom feed
From: Aurelien Jarno <aurelien@aurel32.net>
To: qemu-devel@nongnu.org
Cc: Paul Brook <paul@codesourcery.com>
Subject: Re: [Qemu-devel] [PATCH 02/10] Rewrite mmap_find_vma() to work fine on 64-bit hosts with 32-bit targets
Date: Wed, 15 Apr 2009 18:21:26 +0200	[thread overview]
Message-ID: <20090415162126.GI26206@hall.aurel32.net> (raw)
In-Reply-To: <cb86432024a1a6e1743abb3122b4348101a23d1b.1238963514.git.riku.voipio@iki.fi>

On Sun, Apr 05, 2009 at 11:59:18PM +0300, riku.voipio@iki.fi wrote:
> From: Kirill A. Shutemov <kirill@shutemov.name>
> 
> qemu's page table can be incomple if /proc/self/maps is unavailable or
> host allocating a memory with mmap(), so we can't use it to find free
> memory area.
> 
> New version mmap_find_vma() uses mmap() without MAP_FIXED to find free
> memory.
> 
> From: Kirill A. Shutemov <kirill@shutemov.name>
> 
> Signed-off-by: Kirill A. Shutemov <kirill@shutemov.name>
> Signed-off-by: Riku Voipio <riku.voipio@iki.fi>
> ---
>  linux-user/mmap.c |   81 ++++++++++++++++++++++++++++------------------------
>  linux-user/qemu.h |    1 +
>  2 files changed, 45 insertions(+), 37 deletions(-)

Last time this patch has been posted, Paul Brook has emitted some
concerns about this patch. Paul do you have any more comments about it?

> diff --git a/linux-user/mmap.c b/linux-user/mmap.c
> index 6f300a0..8cec230 100644
> --- a/linux-user/mmap.c
> +++ b/linux-user/mmap.c
> @@ -275,52 +275,59 @@ static abi_ulong mmap_next_start = 0x40000000;
>  
>  unsigned long last_brk;
>  
> -/* find a free memory area of size 'size'. The search starts at
> -   'start'. If 'start' == 0, then a default start address is used.
> -   Return -1 if error.
> -*/
> -/* page_init() marks pages used by the host as reserved to be sure not
> -   to use them. */
> -static abi_ulong mmap_find_vma(abi_ulong start, abi_ulong size)
> +/*
> + * Find and reserve a free memory area of size 'size'. The search
> + * starts at 'start'.
> + * It must be called with mmap_lock() held.
> + * Return -1 if error.
> + */
> +abi_ulong mmap_find_vma(abi_ulong start, abi_ulong size)
>  {
> -    abi_ulong addr, addr1, addr_start;
> -    int prot;
> -    unsigned long new_brk;
> -
> -    new_brk = (unsigned long)sbrk(0);
> -    if (last_brk && last_brk < new_brk && last_brk == (target_ulong)last_brk) {
> -        /* This is a hack to catch the host allocating memory with brk().
> -           If it uses mmap then we loose.
> -           FIXME: We really want to avoid the host allocating memory in
> -           the first place, and maybe leave some slack to avoid switching
> -           to mmap.  */
> -        page_set_flags(last_brk & TARGET_PAGE_MASK,
> -                       TARGET_PAGE_ALIGN(new_brk),
> -                       PAGE_RESERVED); 
> -    }
> -    last_brk = new_brk;
> +    void *ptr;
> +    abi_ulong addr;
>  
>      size = HOST_PAGE_ALIGN(size);
> -    start = start & qemu_host_page_mask;
> +    start &= qemu_host_page_mask;
> +
> +    /* If 'start' == 0, then a default start address is used. */
> +    if (start == 0)
> +        start = mmap_next_start;
> +
>      addr = start;
> -    if (addr == 0)
> -        addr = mmap_next_start;
> -    addr_start = addr;
> +
>      for(;;) {
> -        prot = 0;
> -        for(addr1 = addr; addr1 < (addr + size); addr1 += TARGET_PAGE_SIZE) {
> -            prot |= page_get_flags(addr1);
> -        }
> -        if (prot == 0)
> +        /*
> +         * Reserve needed memory area to avoid a race.
> +         * It should be discarded using:
> +         *  - mmap() with MAP_FIXED flag
> +         *  - mremap() with MREMAP_FIXED flag
> +         *  - shmat() with SHM_REMAP flag
> +         */
> +        ptr = mmap((void *)(unsigned long)addr, size, PROT_NONE,
> +                   MAP_ANONYMOUS|MAP_PRIVATE|MAP_NORESERVE, -1, 0);
> +
> +        /* ENOMEM, if host address space has no memory */
> +        if (ptr == MAP_FAILED)
> +            return (abi_ulong)-1;
> +
> +        /* If address fits target address space we've found what we need */
> +        if ((unsigned long)ptr + size - 1 <= (abi_ulong)-1)
>              break;
> +
> +        /* Unmap and try again with new page */
> +        munmap(ptr, size);
>          addr += qemu_host_page_size;
> -        /* we found nothing */
> -        if (addr == addr_start)
> +
> +        /* ENOMEM if we check whole of target address space */
> +        if (addr == start)
>              return (abi_ulong)-1;
>      }
> -    if (start == 0)
> -        mmap_next_start = addr + size;
> -    return addr;
> +
> +    /* Update default start address */
> +    if (start == mmap_next_start)
> +        mmap_next_start = (unsigned long)ptr + size;
> +
> +    return h2g(ptr);
>  }
>  
>  /* NOTE: all the constants are the HOST ones */
> diff --git a/linux-user/qemu.h b/linux-user/qemu.h
> index 94ae333..2bef14a 100644
> --- a/linux-user/qemu.h
> +++ b/linux-user/qemu.h
> @@ -228,6 +228,7 @@ int target_msync(abi_ulong start, abi_ulong len, int flags);
>  extern unsigned long last_brk;
>  void mmap_lock(void);
>  void mmap_unlock(void);
> +abi_ulong mmap_find_vma(abi_ulong, abi_ulong);
>  void cpu_list_lock(void);
>  void cpu_list_unlock(void);
>  #if defined(USE_NPTL)
> -- 
> 1.6.2.1
> 
> 
> 
> 

-- 
Aurelien Jarno	                        GPG: 1024D/F1BCDB73
aurelien@aurel32.net                 http://www.aurel32.net

  parent reply	other threads:[~2009-04-15 16:21 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-04-05 20:59 [Qemu-devel] [PATCH 00/10] misc userland patches [v2] riku.voipio
2009-04-05 20:59 ` [Qemu-devel] [PATCH 01/10] Fix fstatat64()/newfstatat() syscall implementation riku.voipio
2009-04-05 20:59 ` [Qemu-devel] [PATCH 02/10] Rewrite mmap_find_vma() to work fine on 64-bit hosts with 32-bit targets riku.voipio
2009-04-05 20:59   ` [Qemu-devel] [PATCH 03/10] Implement shm* syscalls and fix 64/32bit errors riku.voipio
2009-04-15 16:21   ` Aurelien Jarno [this message]
2009-04-05 20:59 ` [Qemu-devel] [PATCH 04/10] fix IPCOP_sem* and implement sem* riku.voipio
2009-04-06  9:06   ` Arnaud Patard
2009-04-06 15:17     ` Riku Voipio
2009-04-06 16:22       ` Arnaud Patard
2009-04-15 15:28   ` Aurelien Jarno
2009-04-16 14:23     ` Riku Voipio
2009-04-18 16:16       ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 05/10] Added posix message queue syscalls except mq_notify riku.voipio
2009-04-15 16:13   ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 06/10] Add support for passing contents of argv0 riku.voipio
2009-04-15 16:13   ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 07/10] linux-user: unix sockets - fix running dbus riku.voipio
2009-04-15 16:14   ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 08/10] linux-user: removed unnecessary MAX_SOCK_ADDR checks for socket syscalls riku.voipio
2009-04-15 16:14   ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 09/10] Prefer glibc over direct syscalls riku.voipio
2009-04-15 16:14   ` Aurelien Jarno
2009-04-05 20:59 ` [Qemu-devel] [PATCH 10/10] linux-user: Proper exit code for uncaught signals riku.voipio
2009-04-15 16:19   ` Aurelien Jarno

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20090415162126.GI26206@hall.aurel32.net \
    --to=aurelien@aurel32.net \
    --cc=paul@codesourcery.com \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.