From mboxrd@z Thu Jan 1 00:00:00 1970 From: Yoshiaki Tamura Subject: Re: [PATCH 2/6] qemu-kvm: Modify and introduce wrapper functions to access phys_ram_dirty. Date: Tue, 16 Mar 2010 22:17:54 +0900 Message-ID: <4B9F8502.3070108@lab.ntt.co.jp> References: <1268736839-27371-1-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> <1268736839-27371-3-git-send-email-tamura.yoshiaki@lab.ntt.co.jp> <4B9F7D78.5090201@redhat.com> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: kvm@vger.kernel.org, qemu-devel@nongnu.org, anthony@codemonkey.ws, ohmura.kei@lab.ntt.co.jp To: Avi Kivity Return-path: Received: from tama50.ecl.ntt.co.jp ([129.60.39.147]:60261 "EHLO tama50.ecl.ntt.co.jp" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S937687Ab0CPNSO (ORCPT ); Tue, 16 Mar 2010 09:18:14 -0400 In-Reply-To: <4B9F7D78.5090201@redhat.com> Sender: kvm-owner@vger.kernel.org List-ID: Avi Kivity wrote: > On 03/16/2010 12:53 PM, Yoshiaki Tamura wrote: >> Modifies wrapper functions for byte-based phys_ram_dirty bitmap to >> bit-based phys_ram_dirty bitmap, and adds more wrapper functions to >> prevent >> direct access to the phys_ram_dirty bitmap. > >> + >> +static inline int cpu_physical_memory_get_dirty_flags(ram_addr_t addr) >> +{ >> + unsigned long mask; >> + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; >> + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); >> + int ret = 0; >> + >> + mask = 1UL<< offset; >> + if (phys_ram_vga_dirty[index]& mask) >> + ret |= VGA_DIRTY_FLAG; >> + if (phys_ram_code_dirty[index]& mask) >> + ret |= CODE_DIRTY_FLAG; >> + if (phys_ram_migration_dirty[index]& mask) >> + ret |= MIGRATION_DIRTY_FLAG; >> + >> + return ret; >> } >> >> static inline int cpu_physical_memory_get_dirty(ram_addr_t addr, >> int dirty_flags) >> { >> - return phys_ram_dirty[addr>> TARGET_PAGE_BITS]& dirty_flags; >> + return cpu_physical_memory_get_dirty_flags(addr)& dirty_flags; >> } > > This turns one cacheline access into three. If the dirty bitmaps were in > an array, you could do > > return dirty_bitmaps[dirty_index][addr >> (TARGET_PAGE_BITS + > BITS_IN_LONG)] & mask; > > with one cacheline access. If I'm understanding the existing code correctly, int dirty_flags can be combined, like VGA + MIGRATION. If we only have to worry about a single dirty flag, I agree with your idea. On the other hand, qemu seems to require getting combined dirty flags. If we introduce dirty bitmaps for each type, we need to access each bitmap to get combined flags. I wasn't sure how to make this more efficient... >> static inline void cpu_physical_memory_set_dirty(ram_addr_t addr) >> { >> - phys_ram_dirty[addr>> TARGET_PAGE_BITS] = 0xff; >> + unsigned long mask; >> + int index = (addr>> TARGET_PAGE_BITS) / HOST_LONG_BITS; >> + int offset = (addr>> TARGET_PAGE_BITS)& (HOST_LONG_BITS - 1); >> + >> + mask = 1UL<< offset; >> + phys_ram_vga_dirty[index] |= mask; >> + phys_ram_code_dirty[index] |= mask; >> + phys_ram_migration_dirty[index] |= mask; >> +} > > This is also three cacheline accesses. I think we should have a master > bitmap which is updated by set_dirty(), and which is or'ed into the > other bitmaps when they are accessed. At least the vga and migration > bitmaps are only read periodically, not randomly, so this would be very > fast. In a way, this is similar to how the qemu bitmap is updated from > the kvm bitmap today. Sounds good to me. So we're going to introduce 4 (VGA, CODE, MIGRATION, master) bit-based bitmaps in total.