From mboxrd@z Thu Jan 1 00:00:00 1970
Date: Mon, 7 Oct 2024 21:15:13 +0300
From: Leon Romanovsky
To: Michael Galaxy
Cc: Yu Zhang, Sean Hefty, "Gonglei (Arei)", "Michael S. Tsirkin",
	"qemu-devel@nongnu.org", "elmar.gerdes@ionos.com", zhengchuan,
	"berrange@redhat.com", "armbru@redhat.com", "lizhijian@fujitsu.com",
	"pbonzini@redhat.com", Xiexiangyou, "linux-rdma@vger.kernel.org",
	"lixiao (H)", "jinpu.wang@ionos.com", Wangjialin
Subject: Re: [PATCH 0/6] refactor RDMA live migration based on rsocket API
Message-ID: <20241007181513.GC25819@unreal>
References: <0730fa9b-49cd-46e4-9264-afabe2486154@akamai.com>
	<6211c525-0b9b-4eba-ac3c-2ac796c8ec83@akamai.com>
	<856d4f0e-8742-4848-acc5-dbaa5d21c9fd@akamai.com>
Precedence: bulk
X-Mailing-List: linux-rdma@vger.kernel.org
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Disposition: inline
Content-Transfer-Encoding: 8bit
In-Reply-To: <856d4f0e-8742-4848-acc5-dbaa5d21c9fd@akamai.com>

On Mon, Oct 07, 2024 at 08:45:07AM -0500, Michael Galaxy wrote:
> Hi,
>
> On 10/7/24 03:47, Yu Zhang wrote:
> > Sure, as we talked at the KVM Forum, a possible approach is to set up
> > two VMs on a physical host, configure the SoftRoCE, and run the
> > migration test in two nested VMs to ensure that the migration data
> > traffic goes through the emulated RDMA hardware. I will continue with
> > this and let you know.
> >
> Acknowledged. Do share if you have any problems with it, like if it has
> compatibility issues
> or if we need a different solution. We're open to change.
>
> I'm not familiar with the "current state" of this or how well it would even
> work.

Any compatibility issue between versions of RXE (SoftRoCE) or between
RXE and real devices is a bug in RXE, which should be fixed.

RXE is expected to be compatible with the rest of the RoCE devices, both
virtual and physical.

Thanks

>
> - Michael
> >
> > On Fri, Oct 4, 2024 at 4:06 PM Michael Galaxy wrote:
> > >
> > > On 10/3/24 16:43, Peter Xu wrote:
> > > > On Thu, Oct 03, 2024 at 04:26:27PM -0500, Michael Galaxy wrote:
> > > > > What about the testing solution that I mentioned?
> > > > >
> > > > > Does that satisfy your concerns? Or is there still a gap here that needs to
> > > > > be met?
> > > > I think such testing framework would be helpful, especially if we can kick
> > > > it off in CI when preparing pull requests, then we can make sure nothing
> > > > will break RDMA easily.
> > > >
> > > > Meanwhile, we still need people committed to this and actively maintain it,
> > > > who knows the rdma code well.
> > > >
> > > > Thanks,
> > > >
> > > OK, so comments from Yu Zhang and Gonglei? Can we work up a CI test
> > > along these lines that would ensure that future RDMA breakages are
> > > detected more easily?
> > >
> > > What do you think?
> > >
> > > - Michael
> > >
>