From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-qk1-f173.google.com (mail-qk1-f173.google.com [209.85.222.173]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id AADD41FA156 for ; Fri, 3 Jan 2025 15:05:49 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=209.85.222.173 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1735916751; cv=none; b=Df6X7n7d2QRHAkRVWzm/7zQjgR3hV9psKL6FSspowoiuasQQta+9Gpocih6EJlD7VbMKHJsTuGUtAX2N8NHmXj6V1nMEY5edXeD4l6t1LUK5hg5nlwpG7RRr9HOt4HsuR15JjODVaR+RGgQ7Q4GZFLffhjExpHsA/yky6ghqFWQ= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1735916751; c=relaxed/simple; bh=FS5d3aYzriRjnHm6y73ZVKZVJeP8MQMVEaTCyE3JELs=; h=Date:From:To:Cc:Subject:Message-ID:References:MIME-Version: Content-Type:Content-Disposition:In-Reply-To; b=Isttqte0aSkq3bnaImqALXcQthw1Ihkyaj3u0OdKqQhFPMBnYe2pVk/hyRrtbpwahT3wAf8jmNz5pNCPuVsVVaQ8ZRR8qcq+lrB1BYX2QDf2KejWYhSUpTW+MpaPm0eN+J7EjwgQ8B6Gc4a+qvCjXnLNhKFh8ziAlgJ2Cl6migM= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=ziepe.ca; spf=pass smtp.mailfrom=ziepe.ca; dkim=pass (2048-bit key) header.d=ziepe.ca header.i=@ziepe.ca header.b=RTaotfxE; arc=none smtp.client-ip=209.85.222.173 Authentication-Results: smtp.subspace.kernel.org; dmarc=none (p=none dis=none) header.from=ziepe.ca Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=ziepe.ca Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=ziepe.ca header.i=@ziepe.ca header.b="RTaotfxE" Received: by mail-qk1-f173.google.com with SMTP id af79cd13be357-7ba0fa25f07so393783785a.2 for ; Fri, 03 Jan 2025 07:05:49 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ziepe.ca; s=google; t=1735916748; x=1736521548; darn=vger.kernel.org; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:from:to:cc:subject:date:message-id:reply-to; bh=n6BKjuq/x7JutnNCU83CQjlr2jbXNUAPtCF2NWZHrUM=; b=RTaotfxEr7oX0wShcZaO8T80H1S6dV3/TxwX5jMJlthcQu9SLSg5PwxgXcGs94lQxq KeX+8L7iSDgywyEvvpHggLZwhYtpeZkgw1wYiFEjF6KeSrX2cDnuftdskQHK+F1ura96 6cfzdiKOZctLqDWuKfluFjn7rjG8iDCuzBu+8Z+f09jtW62F2ojBxZCB7eRxJTNR2/AU 8/+xbOa9OarOGoO0Dtjsc7kh3TstOs4EVViyjDpKDQAjWSl1Qou2AzUqKueiWbGbE0J2 iCSMTujBU87DOKQl7+SCN7XhelwdITjQmRaUnTvcUYlmMcaJOpoNKH2pAjsdJ7PWuI04 ovaQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1735916748; x=1736521548; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=n6BKjuq/x7JutnNCU83CQjlr2jbXNUAPtCF2NWZHrUM=; b=gOMEfKiGZQuLlceQIx30dqjoepT4f6gW86fweQRZfnHbl4XKCQNWHGubUrYYhVJGOk Cyzx/ozs9+na6zYOFj5eroq2Qn9KYK03kYHR4WGfI3sLXru+GEppioL7ANRIge4S+5ws x1Z6gYk0kAvxAT0Y1UP1JjKEPYzx0aIKDXbqkbfzrRVDXjernp38NVYIcDGgkv6MCKTr OTRALsmOUCwCsV4ul8xUI968c+gZA6HtTh6x+z3fpIFMesVA3zM23GTNwsITwv9D8tE/ jFAezv68Y0yYrFgh9zWHWlVS74mAyhQ9SmnWwXpEm3DfrR/8RN/jiUSl50mEjTAt5kao zuPA== X-Forwarded-Encrypted: i=1; AJvYcCWtIBWxgkZpAybLzDyc2KbV/3CpQ2nOtGaMk+fuZMtmmlMdEkA05nxEk13ojM7uX7YzRLnurjm428Q5cLk=@vger.kernel.org X-Gm-Message-State: AOJu0YyeCj4DUkcoEpGMvWhypgmxBIpKl3losGXUMFbGSrqsFEFUxhY2 rJo+STNemnW9XmeyRQObreS/T/Uc94PGt3Ed4kSX0QWpT7td++TIW2gDNN0CdGA= X-Gm-Gg: ASbGnctsWHJ5ddyEV8l0YxE07MRBbOj6LPEcdlD3uu+uV24AslBff6CeeprSmW4Gb4L lx3u7VhQF7c2YFHeaWC6JrWzCyeDYrebI9p15GXtFoR4XQydeGgpgFmNzECcKrkViY29h6eMzIn o0LwUkyMetaLBL0UHZTeNMW/0Ir6NywMgZM1Xu7xGJPMwz7cAp0RObmH3oBQ6r1m4U1xy3gLPZi cax09yhTXkO5jvbCdM6vsnUyOjRmr1VNTlBEORBiigLP/8IUYvW66K4poGP+cIH2+H6yHKC/g5+ yE4AjBWswNucbtW9pt61x2sMSKrcdg== X-Google-Smtp-Source: AGHT+IE7tSnjtJxc238CW0Nv+0m7Du5Z0ntR7k8rJ/hiKDcj6QGkysJlx7+kpT6z+7a01T7NF0vlVg== X-Received: by 2002:a05:620a:394a:b0:7b6:d710:2282 with SMTP id af79cd13be357-7b9ba80ebc6mr8676475285a.49.1735916748505; Fri, 03 Jan 2025 07:05:48 -0800 (PST) Received: from ziepe.ca (hlfxns017vw-142-68-128-5.dhcp-dynamic.fibreop.ns.bellaliant.net. [142.68.128.5]) by smtp.gmail.com with ESMTPSA id af79cd13be357-7b9ac4cd53fsm1277146885a.112.2025.01.03.07.05.46 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 03 Jan 2025 07:05:47 -0800 (PST) Received: from jgg by wakko with local (Exim 4.97) (envelope-from ) id 1tTjFC-00000000irj-27xw; Fri, 03 Jan 2025 11:05:46 -0400 Date: Fri, 3 Jan 2025 11:05:46 -0400 From: Jason Gunthorpe To: "Daisuke Matsuda (Fujitsu)" , 'Joe Klein' Cc: "linux-rdma@vger.kernel.org" , "leon@kernel.org" , "zyjzyj2000@gmail.com" , "linux-kernel@vger.kernel.org" , "rpearsonhpe@gmail.com" , "Zhijian Li (Fujitsu)" Subject: Re: [PATCH for-next v9 0/5] On-Demand Paging on SoftRoCE Message-ID: <20250103150546.GD26854@ziepe.ca> References: <20241220100936.2193541-1-matsuda-daisuke@fujitsu.com> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: On Tue, Dec 24, 2024 at 08:52:24AM +0000, Daisuke Matsuda (Fujitsu) wrote: > On Mon, Dec 23, 2024 10:55 AM Daisuke Matsuda (Fujitsu) wrote: > > On Mon, Dec 23, 2024 2:25 AM Joe Klein wrote: > > > We have tested this patcheset and had a lot of problems, even without using the ODP option in softroce. I don't know if > > others have done similar tests. If we have to merge this patchset into upstream, is it > possible to add a kernel option to > > enable/disable this patchset? > > > > Hi Joe, > > > > Can you clarify the test and the problems you observed? > > I wonder if you tried the test with the latest tree WITHOUT my patches. > > > > As far as I know, there is something wrong with the upstream right now. > > It does not complete the rdma-core testcases, and 'segmentation fault' is observed > > in the middle of the full test run, which did not happen before October 2024. > > It appears that the root cause of this issue lies within the userspace components. > My report yesterday was based on experiments conducted on Ubuntu 24.04.1 LTS (x86_64). > It seems to me that rxe is somehow broken regardless of kernel version > as long as userspace components are provided by Ubuntu 24.04.1 LTS. > I built and tried linux-6.11, linux-6.10, and linux-6.8, and they all failed as I reported. > > I switched to Ubuntu 22.04.5 LTS (aarch64) to test with the older libraries. > All tests available passed using the rdma for-next tree without any problem. > Then, I applied my ODP patches onto it, and everything is still fine. > #################### > ubuntu@rdma-aarch64:~/rdma-core$ git branch -v > * master fb965e2d0 Merge pull request #1531 from selvintxavier/pbuf_optimization > ubuntu@rdma-aarch64:~/rdma-core$ ./build/bin/run_tests.py > ..........ss..........ssssssssss..............ssssssssssssssssssssssssss.sssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssssss........ssssss..ss....s.sssssss....ss....ss..............s......................ss.............sss...ssss > ---------------------------------------------------------------------- > Ran 321 tests in 3.599s > > OK (skipped=211) > ubuntu@rdma-aarch64:~/rdma-core$ ./build/bin/run_tests.py -k odp > sssssssss..ss....s.s > ---------------------------------------------------------------------- > Ran 20 tests in 0.269s > > OK (skipped=13) > #################### > > Possibly, there was a regression in libibverbs between v39.0-1 and v50.0-2build2. > We need to take a closer look to resolve the malfunction of rxe on Ubuntu 24.04. That's distressing. > In conclusion, I believe there is nothing in my ODP patches that could cause > the rxe driver to fail. I would appreciate any feedback on potential improvements. What am I supposed to do with this though? Joe, can you please answer Daisuke's questions on what problems you observed and if you observe them without the ODP patches? Jason