From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from eggs.gnu.org ([208.118.235.92]:35956) by lists.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UKRJh-0003Rj-34 for qemu-devel@nongnu.org; Tue, 26 Mar 2013 06:41:22 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1UKRJd-0003qx-Nh for qemu-devel@nongnu.org; Tue, 26 Mar 2013 06:41:21 -0400 Received: from mx1.redhat.com ([209.132.183.28]:18231) by eggs.gnu.org with esmtp (Exim 4.71) (envelope-from ) id 1UKRJd-0003qh-FF for qemu-devel@nongnu.org; Tue, 26 Mar 2013 06:41:17 -0400 From: Juan Quintela In-Reply-To: <1364291919-19563-4-git-send-email-pl@kamp.de> (Peter Lieven's message of "Tue, 26 Mar 2013 10:58:32 +0100") References: <1364291919-19563-1-git-send-email-pl@kamp.de> <1364291919-19563-4-git-send-email-pl@kamp.de> Date: Tue, 26 Mar 2013 11:41:21 +0100 Message-ID: <87fvzio33i.fsf@elfo.elfo> MIME-Version: 1.0 Content-Type: text/plain Subject: Re: [Qemu-devel] [PATCHv5 03/10] cutils: add a function to find non-zero content in a buffer Reply-To: quintela@redhat.com List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , To: Peter Lieven Cc: Orit Wasserman , Paolo Bonzini , qemu-devel@nongnu.org, Stefan Hajnoczi Peter Lieven wrote: > this adds buffer_find_nonzero_offset() which is a SSE2/Altivec > optimized function that searches for non-zero content in a > buffer. > > the function starts full unrolling only after the first few chunks have > been checked one by one. analyzing real memory page data has revealed > that non-zero pages are non-zero within the first 256-512 bits in > most cases. as this function is also heavily used to check for zero memory > pages this tweak has been made to avoid the high setup costs of the fully > unrolled check for non-zero pages. > > due to the optimizations used in the function there are restrictions > on buffer address and search length. the function > can_use_buffer_find_nonzero_content() can be used to check if > the function can be used safely. > > Signed-off-by: Peter Lieven > --- > include/qemu-common.h | 13 ++++++++++++ > util/cutils.c | 55 +++++++++++++++++++++++++++++++++++++++++++++++++ > 2 files changed, 68 insertions(+) > > diff --git a/include/qemu-common.h b/include/qemu-common.h > index 9022646..7c7c244 100644 > --- a/include/qemu-common.h > +++ b/include/qemu-common.h > @@ -472,4 +472,17 @@ void hexdump(const char *buf, FILE *fp, const char *prefix, size_t size); > #define ALL_EQ(v1, v2) ((v1) == (v2)) > #endif > > +#define BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR 8 > +static inline bool > +can_use_buffer_find_nonzero_offset(const void *buf, size_t len) > +{ > + if (len % (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR > + * sizeof(VECTYPE)) == 0 > + && ((uintptr_t) buf) % sizeof(VECTYPE) == 0) { > + return true; > + } > + return false; > +} This can be spelled as: return (len % (BUFFER_FIND_NONZERO_OFFSET_UNROLL_FACTOR * sizeof(VECTYPE)) == 0 && ((uintptr_t) buf) % sizeof(VECTYPE) == 0);; But I don't care too much.