From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.lore.kernel.org (Postfix) with ESMTPS id 2DD29C77B7E for ; Thu, 1 Jun 2023 15:56:09 +0000 (UTC) Received: from localhost ([::1] helo=lists1p.gnu.org) by lists.gnu.org with esmtp (Exim 4.90_1) (envelope-from ) id 1q4keH-0001fJ-Dr; Thu, 01 Jun 2023 11:55:37 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]) by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1q4keD-0001e8-HN for qemu-devel@nongnu.org; Thu, 01 Jun 2023 11:55:33 -0400 Received: from us-smtp-delivery-124.mimecast.com ([170.10.133.124]) by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256) (Exim 4.90_1) (envelope-from ) id 1q4keB-0000Xl-Ps for qemu-devel@nongnu.org; Thu, 01 Jun 2023 11:55:33 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1685634931; h=from:from:reply-to:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=zlnljDgRSgpMBRiTUG2waKjSQN9j6BdY8Rjp6yESPH8=; b=gY3wRmd4oCSRI/xCSbLScfr+KTQUKwv+P254Ge49PKxW0eyyUj5jpmIXqZ9RVjkTMWNCak UluTKaaf9HewvOQ+9p/eUSpZRa3W6wIyTcJxcytv9wrnWkzPTJvrq0CJMiS65RcVrdnugY 4++6PNkGjisPBbiRHSnRRp8MZeIVfpg= Received: from mimecast-mx02.redhat.com (mx3-rdu2.redhat.com [66.187.233.73]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-92-ZICrnAN7OriKLrtuGL5B0w-1; Thu, 01 Jun 2023 11:55:30 -0400 X-MC-Unique: ZICrnAN7OriKLrtuGL5B0w-1 Received: from smtp.corp.redhat.com (int-mx10.intmail.prod.int.rdu2.redhat.com [10.11.54.10]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id ABE173C13A17 for ; Thu, 1 Jun 2023 15:55:29 +0000 (UTC) Received: from redhat.com (unknown [10.42.28.153]) by smtp.corp.redhat.com (Postfix) with ESMTPS id CC894492B0A; Thu, 1 Jun 2023 15:55:27 +0000 (UTC) Date: Thu, 1 Jun 2023 16:55:25 +0100 From: Daniel =?utf-8?B?UC4gQmVycmFuZ8Op?= To: Peter Xu Cc: qemu-devel@nongnu.org, Laurent Vivier , Juan Quintela , Leonardo Bras , Thomas Huth , Paolo Bonzini Subject: Re: [PATCH v3 8/9] tests/qtest: make more migration pre-copy scenarios run non-live Message-ID: References: <20230531132400.1129576-1-berrange@redhat.com> <20230531132400.1129576-9-berrange@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/2.2.9 (2022-11-12) X-Scanned-By: MIMEDefang 3.1 on 10.11.54.10 Received-SPF: pass client-ip=170.10.133.124; envelope-from=berrange@redhat.com; helo=us-smtp-delivery-124.mimecast.com X-Spam_score_int: -22 X-Spam_score: -2.3 X-Spam_bar: -- X-Spam_report: (-2.3 / 5.0 requ) BAYES_00=-1.9, DKIMWL_WL_HIGH=-0.166, DKIM_SIGNED=0.1, DKIM_VALID=-0.1, DKIM_VALID_AU=-0.1, DKIM_VALID_EF=-0.1, RCVD_IN_DNSWL_NONE=-0.0001, SPF_HELO_NONE=0.001, SPF_PASS=-0.001, T_SCC_BODY_TEXT_LINE=-0.01 autolearn=ham autolearn_force=no X-Spam_action: no action X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.29 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Reply-To: Daniel =?utf-8?B?UC4gQmVycmFuZ8Op?= Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org On Thu, Jun 01, 2023 at 11:53:17AM -0400, Peter Xu wrote: > On Thu, Jun 01, 2023 at 04:39:48PM +0100, Daniel P. Berrangé wrote: > > On Thu, Jun 01, 2023 at 11:30:10AM -0400, Peter Xu wrote: > > > Thanks for looking into this.. definitely worthwhile. > > > > > > On Wed, May 31, 2023 at 02:23:59PM +0100, Daniel P. Berrangé wrote: > > > > There are 27 pre-copy live migration scenarios being tested. In all of > > > > these we force non-convergance and run for one iteration, then let it > > > > converge and wait for completion during the second (or following) > > > > iterations. At 3 mbps bandwidth limit the first iteration takes a very > > > > long time (~30 seconds). > > > > > > > > While it is important to test the migration passes and convergance > > > > logic, it is overkill to do this for all 27 pre-copy scenarios. The > > > > TLS migration scenarios in particular are merely exercising different > > > > code paths during connection establishment. > > > > > > > > To optimize time taken, switch most of the test scenarios to run > > > > non-live (ie guest CPUs paused) with no bandwidth limits. This gives > > > > a massive speed up for most of the test scenarios. > > > > > > > > For test coverage the following scenarios are unchanged > > > > > > Curious how are below chosen? I assume.. > > > > Chosen based on whether they exercise code paths that are unique > > and interesting during the RAM transfer phase. > > > > Essentially the goal is that if we have N% code coverage before this > > patch, then we should still have the same N% code coverage after this > > patch. > > > > The TLS tests exercise code paths that are unique during the migration > > establishment phase. Once establishd they don't exercise anything > > "interesting" during RAM transfer phase. Thus we don't loose code coverage > > by runing TLS tests non-live. > > > > > > > > > > > > > * Precopy with UNIX sockets > > > > > > this one verifies dirty log. > > > > > > > * Precopy with UNIX sockets and dirty ring tracking > > > > > > ... dirty ring... > > > > > > > * Precopy with XBZRLE > > > > > > ... xbzrle I think needs a diff on old/new, makes sense. > > > > > > > * Precopy with UNIX compress > > > > * Precopy with UNIX compress (nowait) > > > > * Precopy with multifd > > > > > > What about the rest three? Especially for two compression tests. > > > > The compress thread logic is unique/interesting during RAM transfer > > so benefits from running live. The wait vs non-wait scenario tests > > a distinct codepath/logic. > > I assume you mean e.g. when compressing with guest page being modified and > we should survive that rather than crashing the compressor? No, i mean the compression code has a significant behaviour difference between its two tests, because they toggle: @compress-wait-thread: Controls behavior when all compression threads are currently busy. If true (default), wait for a free compression thread to become available; otherwise, send the page uncompressed. (Since 3.1) so we need to exercise the code path that falls back to sending uncompressed, as well as the code path that waits for free threads. With regards, Daniel -- |: https://berrange.com -o- https://www.flickr.com/photos/dberrange :| |: https://libvirt.org -o- https://fstop138.berrange.com :| |: https://entangle-photo.org -o- https://www.instagram.com/dberrange :|