From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-0.3 required=3.0 tests=DKIM_ADSP_CUSTOM_MED, DKIM_INVALID,DKIM_SIGNED,FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id CF05DC48BD4 for ; Tue, 25 Jun 2019 11:42:41 +0000 (UTC) Received: from lists.gnu.org (lists.gnu.org [209.51.188.17]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 8EA21213F2 for ; Tue, 25 Jun 2019 11:42:41 +0000 (UTC) Authentication-Results: mail.kernel.org; dkim=fail reason="signature verification failed" (2048-bit key) header.d=gmail.com header.i=@gmail.com header.b="DAIKwuFb" DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 8EA21213F2 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=gmail.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Received: from localhost ([::1]:59080 helo=lists.gnu.org) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hfjqS-0002Sx-JZ for qemu-devel@archiver.kernel.org; Tue, 25 Jun 2019 07:42:40 -0400 Received: from eggs.gnu.org ([2001:470:142:3::10]:56213) by lists.gnu.org with esmtp (Exim 4.86_2) (envelope-from ) id 1hfjnC-0000yE-8L for qemu-devel@nongnu.org; Tue, 25 Jun 2019 07:39:20 -0400 Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71) (envelope-from ) id 1hfjn8-0007YC-Gf for qemu-devel@nongnu.org; Tue, 25 Jun 2019 07:39:16 -0400 Received: from mail-wm1-x344.google.com ([2a00:1450:4864:20::344]:53003) by eggs.gnu.org with esmtps (TLS1.0:RSA_AES_128_CBC_SHA1:16) (Exim 4.71) (envelope-from ) id 1hfjn6-0007U0-Q4 for qemu-devel@nongnu.org; Tue, 25 Jun 2019 07:39:14 -0400 Received: by mail-wm1-x344.google.com with SMTP id s3so2450834wms.2 for ; Tue, 25 Jun 2019 04:39:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=subject:to:cc:references:from:message-id:date:user-agent :mime-version:in-reply-to:content-transfer-encoding:content-language; bh=XihMBel6+9VHt+N5UDgW7rphHGe8fdoDAO/nmllXomA=; b=DAIKwuFb7CKqqAJxbq7VObcKInrhAmoAKSrGFA919qCQpxd2mkj3LsifYY38J+vI+2 uA4zJM3IIwGAMiNNUJdTbNKm8mrGhnuxSqQV5JOu4qVad7BdkXFyEG+PCJUTCq/dvyae daXUqBjI2NLHNARgV2OJ9R58WzXow7BiLjcZeG7QTDnUgqy8ur9qXkC2ZyW9nx8OVtLE RSbF2DLf2AakMSG+8RSm7pJ9Ti7wM/R/ivdSPnmj9ryyaWPxvuG4s9lark5sPf9Wzkvi EV/lPZdAipMjb7NKgS30OVDNOyVkxC9MY9JlPdxABQRRFiUhKcQlpaVKNjKGazAPazJ5 2AXQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-transfer-encoding :content-language; bh=XihMBel6+9VHt+N5UDgW7rphHGe8fdoDAO/nmllXomA=; b=dWUl1siwsuwbS2sE2N2CeiX3MJwpDHjk06lXQo3DjKYXy7HDE6H7UnNwaTUdOa9gFH 5HDlE9qyxPT6ay4/C6fzqjvHmUhzVmTIyuH2a5k7Pv7HWPbMMcOaMSfZwrr0jIo44jiv 4ZAj+WGAwqlzw5nQczk0ej/IMw2w4bEk95YUeK4tmXM0xYUumFhfuapkhSu4dnAykFZG WDePvvNXyA9lDzYWvbBBknu/ZTErhe9q5uGy9s2JBL/Bhk/E/uJ0/F/cnUGFMgFbeODn 1HHhJ8LfrUdlT7FHWr0mh53Wpdya0c2irVvI8ISSG8ENQANCEWJOglxVa9j+6F3yHDKk ewcg== X-Gm-Message-State: APjAAAVOKCFckUCIhN4e4zQUZevDhLVuT7HnpBR3IrpVipMLa9LDhnqP swKO4XwXl/WhOY1ING8AGnI= X-Google-Smtp-Source: APXvYqyFB6HWqoCMTBbJv3Jtii11jtgePaQ4s11XQ18f0SYVIp93wmgZEqEWQXwJgmeuFQZeUfmaew== X-Received: by 2002:a1c:6a11:: with SMTP id f17mr18178363wmc.110.1561462750299; Tue, 25 Jun 2019 04:39:10 -0700 (PDT) Received: from localhost.localdomain ([37.142.144.12]) by smtp.gmail.com with ESMTPSA id o6sm2735422wmc.15.2019.06.25.04.39.08 (version=TLS1_3 cipher=AEAD-AES128-GCM-SHA256 bits=128/128); Tue, 25 Jun 2019 04:39:09 -0700 (PDT) To: "Dr. David Alan Gilbert" References: <20190621144541.13770-1-skrtbhtngr@gmail.com> <1AD565D7-F99E-4879-BFEF-0E2C7474A09A@gmail.com> <91874483-215c-1ccc-6f33-5ceeb8f6ec47@gmail.com> <20190625091116.GE3226@work-vm> From: Marcel Apfelbaum Message-ID: Date: Tue, 25 Jun 2019 14:39:21 +0300 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.6.1 MIME-Version: 1.0 In-Reply-To: <20190625091116.GE3226@work-vm> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US X-detected-operating-system: by eggs.gnu.org: Genre and OS details not recognized. X-Received-From: 2a00:1450:4864:20::344 Subject: Re: [Qemu-devel] [RFC 0/1] Add live migration support to the PVRDMA device X-BeenThere: qemu-devel@nongnu.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Cc: Sukrit Bhatnagar , Dmitry Fleytman , qemu-devel@nongnu.org, Yuval Shaia Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org Sender: "Qemu-devel" On 6/25/19 12:11 PM, Dr. David Alan Gilbert wrote: > * Marcel Apfelbaum (marcel.apfelbaum@gmail.com) wrote: >> Hi Dmitry, >> >> On 6/25/19 11:39 AM, Dmitry Fleytman wrote: >>>> On 25 Jun 2019, at 11:14, Marcel Apfelbaum wrote: >>>> >>>> Hi Sukrit >>>> >>>> On 6/21/19 5:45 PM, Sukrit Bhatnagar wrote: >>>>> Hi, >>>> [...] >>>>> This RFC is meant to request suggestions on the things which are >>>>> working and for help on the things which are not. >>>>> >>>> [...] >>>>> What is not working: >>>>> >>>> [...] >>>>> * It seems that vmxnet3 migration itself is not working properly, at least >>>>> for me. The pvrdma device depends on it, vmxnet3 is function 0 and pvrdma >>>>> is function 1. This is happening even for a build of unmodified code from >>>>> the master branch. >>>>> After migration, the network connectivity is lost at destination. >>>>> Things are fine at the source before migration. >>>>> This is the command I am using at src: >>>>> >>>>> sudo /home/skrtbhtngr/qemu/build/x86_64-softmmu/qemu-system-x86_64 \ >>>>> -enable-kvm \ >>>>> -m 2G -smp cpus=2 \ >>>>> -hda /home/skrtbhtngr/fedora.img \ >>>>> -netdev tap,id=hostnet0 \ >>>>> -device vmxnet3,netdev=hostnet0,id=net0,mac=52:54:00:99:ff:bc \ >>>>> -monitor telnet:127.0.0.1:4444,server,nowait \ >>>>> -trace events=/home/skrtbhtngr/trace-events \ >>>>> -vnc 0.0.0.0:0 >>>>> >>>>> Similar command is used for the dest. Currently, I am trying >>>>> same-host migration for testing purpose, without the pvrdma device. >>>>> Two tap interfaces, for src and dest were created successfully at >>>>> the host. Kernel logs: >>>>> ... >>>>> br0: port 2(tap0) entered forwarding state >>>>> ... >>>>> br0: port 3(tap1) entered forwarding state >>>>> >>>>> tcpdump at the dest reports only outgoing ARP packets, which ask >>>>> for gateway: "ARP, Request who-has _gateway tell guest1". >>>>> >>>>> Tried using user (slirp) as the network backend, but no luck. >>>>> Also tried git bisect to find the issue using a working commit (given >>>>> by Marcel), but it turns out that it is very old and I faced build >>>>> errors one after another. >>>>> >>>>> Please note that e1000 live migration is working fine in the same setup. >>>>> >>>> I tried to git bisect , but I couldn't find a working version of vmxnet supporting live migration .... >>>> I tried even a commit from December 2014 and it didn't work. >>>> >>>> What is strange (to me) is that the networking packets can't be sent from the guest (after migration) >>>> even after rebooting the guest. >>> This makes me think that some network offload configuration wasn’t properly migrated or applied. >>> What network backend are you using? >> Suktrit tried with tap device, I tried with slirp. >> If you can point me to the property that disables all the offloads it will >> really help. >> >>> Do you see any outgoing packets in the sniffer? >> I didn't use the sniffer, I checked dmesg in guest, there was a line >> complaining that it can't send packets. > What exactly was the error? I'll try to reproduce the error Thanks, Marcel > I don't know much about vmxnet3; but if the guest driver is seeing the > problem then I guess that's the best pointer we have. > > > Dave > >> Thanks, >> Marcel >> >>>> Any help or pointer would be greatly appreciated. >>>> Thanks, >>>> Marcel >>>> >>>> >>>> [...] > -- > Dr. David Alan Gilbert / dgilbert@redhat.com / Manchester, UK