From mboxrd@z Thu Jan 1 00:00:00 1970 From: Avi Kivity Subject: Re: [PATCH 2/6] qemu-kvm: Modify and introduce wrapper functions to access phys_ram_dirty. Date: Tue, 16 Mar 2010 14:45:44 +0200 Message-ID: <4B9F7D78.5090201@redhat.com> References: <1268736839-27371-1-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> <1268736839-27371-3-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: kvm@vger.kernel.org, qemu-devel@nongnu.org, anthony@codemonkey.ws, ohmura.kei@lab.ntt.co.jp To: Yoshiaki Tamura Return-path: Received: from mx1.redhat.com ([209.132.183.28]:50539 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751704Ab0CPMpy (ORCPT ); Tue, 16 Mar 2010 08:45:54 -0400 In-Reply-To: <1268736839-27371-3-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> Sender: kvm-owner@vger.kernel.org List-ID: On 03/16/2010 12:53 PM, Yoshiaki Tamura wrote: > Modifies wrapper functions for byte-based phys_ram_dirty bitmap to > bit-based phys_ram_dirty bitmap, and adds more wrapper functions to prevent > direct access to the phys_ram_dirty bitmap. > > + > +static inline int cpu_physical_memory_get_dirty_flags(ram_addr_t addr) > +{ > + unsigned long mask; > + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; > + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); > + int ret = 0; > + > + mask = 1UL<< offset; > + if (phys_ram_vga_dirty[index]& mask) > + ret |= VGA_DIRTY_FLAG; > + if (phys_ram_code_dirty[index]& mask) > + ret |= CODE_DIRTY_FLAG; > + if (phys_ram_migration_dirty[index]& mask) > + ret |= MIGRATION_DIRTY_FLAG; > + > + return ret; > } > > static inline int cpu_physical_memory_get_dirty(ram_addr_t addr, > int dirty_flags) > { > - return phys_ram_dirty[addr>> TARGET_PAGE_BITS]& dirty_flags; > + return cpu_physical_memory_get_dirty_flags(addr)& dirty_flags; > } > This turns one cacheline access into three. If the dirty bitmaps were in an array, you could do return dirty_bitmaps[dirty_index][addr >> (TARGET_PAGE_BITS + BITS_IN_LONG)] & mask; with one cacheline access. > > static inline void cpu_physical_memory_set_dirty(ram_addr_t addr) > { > - phys_ram_dirty[addr>> TARGET_PAGE_BITS] = 0xff; > + unsigned long mask; > + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; > + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); > + > + mask = 1UL<< offset; > + phys_ram_vga_dirty[index] |= mask; > + phys_ram_code_dirty[index] |= mask; > + phys_ram_migration_dirty[index] |= mask; > +} > This is also three cacheline accesses. I think we should have a master bitmap which is updated by set_dirty(), and which is or'ed into the other bitmaps when they are accessed. At least the vga and migration bitmaps are only read periodically, not randomly, so this would be very fast. In a way, this is similar to how the qemu bitmap is updated from the kvm bitmap today. I am not sure about the code bitmap though. -- error compiling committee.c: too many arguments to function From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mailman by lists.gnu.org with tmda-scanned (Exim 4.43) id 1NrW9k-0007Wg-0z for qemu-devel@nongnu.org; Tue, 16 Mar 2010 08:45:56 -0400 Received: from [199.232.76.173] (port=51161 helo=monty-python.gnu.org) by lists.gnu.org with esmtp (Exim 4.43) id 1NrW9j-0007WL-Eo for qemu-devel@nongnu.org; Tue, 16 Mar 2010 08:45:55 -0400 Received: from Debian-exim by monty-python.gnu.org with spam-scanned (Exim 4.60) (envelope-from ) id 1NrW9i-00059E-Bg for qemu-devel@nongnu.org; Tue, 16 Mar 2010 08:45:55 -0400 Received: from mx1.redhat.com ([209.132.183.28]:41675) by monty-python.gnu.org with esmtp (Exim 4.60) (envelope-from ) id 1NrW9h-00058p-NB for qemu-devel@nongnu.org; Tue, 16 Mar 2010 08:45:54 -0400 Message-ID: <4B9F7D78.5090201@redhat.com> Date: Tue, 16 Mar 2010 14:45:44 +0200 From: Avi Kivity MIME-Version: 1.0 References: <1268736839-27371-1-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> <1268736839-27371-3-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> In-Reply-To: <1268736839-27371-3-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Subject: [Qemu-devel] Re: [PATCH 2/6] qemu-kvm: Modify and introduce wrapper functions to access phys_ram_dirty. List-Id: qemu-devel.nongnu.org List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Yoshiaki Tamura Cc: ohmura.kei@lab.ntt.co.jp, qemu-devel@nongnu.org, kvm@vger.kernel.org On 03/16/2010 12:53 PM, Yoshiaki Tamura wrote: > Modifies wrapper functions for byte-based phys_ram_dirty bitmap to > bit-based phys_ram_dirty bitmap, and adds more wrapper functions to prevent > direct access to the phys_ram_dirty bitmap. > > + > +static inline int cpu_physical_memory_get_dirty_flags(ram_addr_t addr) > +{ > + unsigned long mask; > + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; > + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); > + int ret = 0; > + > + mask = 1UL<< offset; > + if (phys_ram_vga_dirty[index]& mask) > + ret |= VGA_DIRTY_FLAG; > + if (phys_ram_code_dirty[index]& mask) > + ret |= CODE_DIRTY_FLAG; > + if (phys_ram_migration_dirty[index]& mask) > + ret |= MIGRATION_DIRTY_FLAG; > + > + return ret; > } > > static inline int cpu_physical_memory_get_dirty(ram_addr_t addr, > int dirty_flags) > { > - return phys_ram_dirty[addr>> TARGET_PAGE_BITS]& dirty_flags; > + return cpu_physical_memory_get_dirty_flags(addr)& dirty_flags; > } > This turns one cacheline access into three. If the dirty bitmaps were in an array, you could do return dirty_bitmaps[dirty_index][addr >> (TARGET_PAGE_BITS + BITS_IN_LONG)] & mask; with one cacheline access. > > static inline void cpu_physical_memory_set_dirty(ram_addr_t addr) > { > - phys_ram_dirty[addr>> TARGET_PAGE_BITS] = 0xff; > + unsigned long mask; > + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; > + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); > + > + mask = 1UL<< offset; > + phys_ram_vga_dirty[index] |= mask; > + phys_ram_code_dirty[index] |= mask; > + phys_ram_migration_dirty[index] |= mask; > +} > This is also three cacheline accesses. I think we should have a master bitmap which is updated by set_dirty(), and which is or'ed into the other bitmaps when they are accessed. At least the vga and migration bitmaps are only read periodically, not randomly, so this would be very fast. In a way, this is similar to how the qemu bitmap is updated from the kvm bitmap today. I am not sure about the code bitmap though. -- error compiling committee.c: too many arguments to function