From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <netdev-owner@vger.kernel.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
Received: from vger.kernel.org (vger.kernel.org [23.128.96.18])
	by smtp.lore.kernel.org (Postfix) with ESMTP id C75FEC761AF
	for <netdev@archiver.kernel.org>; Tue, 28 Mar 2023 15:16:04 +0000 (UTC)
Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand
        id S233962AbjC1PQD (ORCPT <rfc822;netdev@archiver.kernel.org>);
        Tue, 28 Mar 2023 11:16:03 -0400
Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55080 "EHLO
        lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org
        with ESMTP id S233936AbjC1PPm (ORCPT
        <rfc822;netdev@vger.kernel.org>); Tue, 28 Mar 2023 11:15:42 -0400
Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124])
        by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 209A411644
        for <netdev@vger.kernel.org>; Tue, 28 Mar 2023 08:14:09 -0700 (PDT)
DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com;
        s=mimecast20190719; t=1680016405;
        h=from:from:reply-to:subject:subject:date:date:message-id:message-id:
         to:to:cc:cc:mime-version:mime-version:content-type:content-type:
         content-transfer-encoding:content-transfer-encoding:
         in-reply-to:in-reply-to:references:references;
        bh=3DsgcTYFWxhX9PLfaIyKWk2AVWDq7BUElXbxQWwuPGo=;
        b=cfV+3DAxWSPaNOmjFFKvq9xKmFFejX+5WCXI1gyOD3Cjh+NYhBw3hT2jZL+t4yJR8HrkW1
        CTZLNakqatLcZ1mx7KrCCnIBLPdjKzwRHDyt7fFcoCoDewGWWICkp6m1iwW7k15Xl2eGBi
        WrMQKrmQMljYCsxOKB85H5ra3uKd+Qg=
Received: from mail-qt1-f197.google.com (mail-qt1-f197.google.com
 [209.85.160.197]) by relay.mimecast.com with ESMTP with STARTTLS
 (version=TLSv1.3, cipher=TLS_AES_256_GCM_SHA384) id
 us-mta-196-eEI3YIqtOXqwyeICqC76Lg-1; Tue, 28 Mar 2023 11:13:24 -0400
X-MC-Unique: eEI3YIqtOXqwyeICqC76Lg-1
Received: by mail-qt1-f197.google.com with SMTP id r4-20020ac867c4000000b003bfefb6dd58so8422963qtp.2
        for <netdev@vger.kernel.org>; Tue, 28 Mar 2023 08:13:24 -0700 (PDT)
X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed;
        d=1e100.net; s=20210112; t=1680016403; x=1682608403;
        h=mime-version:user-agent:content-transfer-encoding:references
         :in-reply-to:date:cc:to:from:subject:message-id:x-gm-message-state
         :from:to:cc:subject:date:message-id:reply-to;
        bh=3DsgcTYFWxhX9PLfaIyKWk2AVWDq7BUElXbxQWwuPGo=;
        b=kHyY9j1SEyQvvKmPyfq8rkRnxpfML9JpbRummHG7uARjJueZ0hogmKyTmaGzDeChn3
         LzrBlF0y7nqvdZJkwpFdSM4pRMghKLpKB/l5WFfCBn1+1HH088nR2xsODzVSwX0YNSuA
         UxBZbQFcMC8zu4nnwImYM9hF9ENPwGCMeXjHD/lu+8pBjG+D9jdxlc8ybseyJVla3adA
         CXW3hjjHKgFYkHEc97/SqDxUZw/Tj/Axmz+yOMFwN1JtluPQkKpC91zWgvmWfLRxE2YM
         y0M8mQw+KK43HAh4cyri5INj5BGcJVNWGnacib5hPaIsi4pK0D1pMFvCP9s4Krrcsg2J
         pARA==
X-Gm-Message-State: AO0yUKVahzijkfGOGD5n4ORno/qnWy1Xyf6dZhyQqKyp/Td++MCTzr0D
        44bM49QHIFN5hcPzGOPNQ2ZCzyYZ7/fjHBeM3o5fklJqZfDq4vt1zgbBRUlsWDpVVew482+oMMW
        0qYBaA3bWLLo6skY/
X-Received: by 2002:ac8:5905:0:b0:3d9:56ce:a8cd with SMTP id 5-20020ac85905000000b003d956cea8cdmr27300505qty.6.1680016403644;
        Tue, 28 Mar 2023 08:13:23 -0700 (PDT)
X-Google-Smtp-Source: AKy350YwwGHDreK24ccOM1fRksnfMcjyV+HDpQvkvYlDeAaCKhWxUqK0D0aDc17Jr9LYaSTFZIprTg==
X-Received: by 2002:ac8:5905:0:b0:3d9:56ce:a8cd with SMTP id 5-20020ac85905000000b003d956cea8cdmr27300446qty.6.1680016403274;
        Tue, 28 Mar 2023 08:13:23 -0700 (PDT)
Received: from gerbillo.redhat.com (146-241-232-148.dyn.eolo.it. [146.241.232.148])
        by smtp.gmail.com with ESMTPSA id y16-20020a376410000000b00746aa4465b4sm8103982qkb.43.2023.03.28.08.13.21
        (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256);
        Tue, 28 Mar 2023 08:13:22 -0700 (PDT)
Message-ID: <9331f1358cf7c24442d705d840812e9cd490e018.camel@redhat.com>
Subject: Re: [PATCH net-next] net/core: add optional threading for backlog
 processing
From:   Paolo Abeni <pabeni@redhat.com>
To:     Felix Fietkau <nbd@nbd.name>, Jakub Kicinski <kuba@kernel.org>
Cc:     netdev@vger.kernel.org, Jonathan Corbet <corbet@lwn.net>,
        "David S. Miller" <davem@davemloft.net>,
        Eric Dumazet <edumazet@google.com>, linux-doc@vger.kernel.org,
        linux-kernel@vger.kernel.org
Date:   Tue, 28 Mar 2023 17:13:20 +0200
In-Reply-To: <b001c8ed-214f-94e6-2d4f-0ee13e3d8760@nbd.name>
References: <20230324171314.73537-1-nbd@nbd.name>
         <20230324102038.7d91355c@kernel.org>
         <2d251879-1cf4-237d-8e62-c42bb4feb047@nbd.name>
         <20230324104733.571466bc@kernel.org>
         <f59ee83f-7267-04df-7286-f7ea147b5b49@nbd.name>
         <751fd5bb13a49583b1593fa209bfabc4917290ae.camel@redhat.com>
         <b001c8ed-214f-94e6-2d4f-0ee13e3d8760@nbd.name>
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable
User-Agent: Evolution 3.46.4 (3.46.4-1.fc37) 
MIME-Version: 1.0
Precedence: bulk
List-ID: <netdev.vger.kernel.org>
X-Mailing-List: netdev@vger.kernel.org

On Tue, 2023-03-28 at 11:45 +0200, Felix Fietkau wrote:
> On 28.03.23 11:29, Paolo Abeni wrote:
> > On Fri, 2023-03-24 at 18:57 +0100, Felix Fietkau wrote:
> > > On 24.03.23 18:47, Jakub Kicinski wrote:
> > > > On Fri, 24 Mar 2023 18:35:00 +0100 Felix Fietkau wrote:
> > > > > I'm primarily testing this on routers with 2 or 4 CPUs and limite=
d=20
> > > > > processing power, handling routing/NAT. RPS is typically needed t=
o=20
> > > > > properly distribute the load across all available CPUs. When ther=
e is=20
> > > > > only a small number of flows that are pushing a lot of traffic, a=
 static=20
> > > > > RPS assignment often leaves some CPUs idle, whereas others become=
 a=20
> > > > > bottleneck by being fully loaded. Threaded NAPI reduces this a bi=
t, but=20
> > > > > CPUs can become bottlenecked and fully loaded by a NAPI thread al=
one.
> > > >=20
> > > > The NAPI thread becomes a bottleneck with RPS enabled?
> > >=20
> > > The devices that I work with often only have a single rx queue. That =
can
> > > easily become a bottleneck.
> > >=20
> > > > > Making backlog processing threaded helps split up the processing =
work=20
> > > > > even more and distribute it onto remaining idle CPUs.
> > > >=20
> > > > You'd want to have both threaded NAPI and threaded backlog enabled?
> > >=20
> > > Yes
> > >=20
> > > > > It can basically be used to make RPS a bit more dynamic and=20
> > > > > configurable, because you can assign multiple backlog threads to =
a set=20
> > > > > of CPUs and selectively steer packets from specific devices / rx =
queues=20
> > > >=20
> > > > Can you give an example?
> > > >=20
> > > > With the 4 CPU example, in case 2 queues are very busy - you're try=
ing
> > > > to make sure that the RPS does not end up landing on the same CPU a=
s
> > > > the other busy queue?
> > >=20
> > > In this part I'm thinking about bigger systems where you want to have=
 a
> > > group of CPUs dedicated to dealing with network traffic without
> > > assigning a fixed function (e.g. NAPI processing or RPS target) to ea=
ch
> > > one, allowing for more dynamic processing.
> > >=20
> > > > > to them and allow the scheduler to take care of the rest.
> > > >=20
> > > > You trust the scheduler much more than I do, I think :)
> > >=20
> > > In my tests it brings down latency (both avg and p99) considerably in
> > > some cases. I posted some numbers here:
> > > https://lore.kernel.org/netdev/e317d5bc-cc26-8b1b-ca4b-66b5328683c4@n=
bd.name/
> >=20
> > It's still not 110% clear to me why/how this additional thread could
> > reduce latency. What/which threads are competing for the busy CPU[s]? I
> > suspect it could be easier/cleaner move away the others (non RPS)
> > threads.
> In the tests that I'm doing, network processing load from routing/NAT is=
=20
> enough to occupy all available CPUs.
> If I dedicate the NAPI thread to one core and use RPS to steer packet=20
> processing to the other cores, the core taking care of NAPI has some=20
> idle cycles that go to waste, while the other cores are busy.
> If I include the core in the RPS mask, it can take too much away from=20
> the NAPI thread.

I feel like I'm missing some relevant points.

If RPS keeps the target CPU fully busy, moving RPS processing in a
separate thread still will not allow using more CPU time.

Which NIC driver are you using?

thanks!

Paolo