From: Paolo Abeni <pabeni@redhat.com>
To: Eric Dumazet <edumazet@google.com>
Cc: Eric Dumazet <eric.dumazet@gmail.com>,
netdev <netdev@vger.kernel.org>,
David Miller <davem@davemloft.net>,
Florian Westphal <fw@strlen.de>,
neilb@suse.com
Subject: Re: [RFC PATCH] ip: re-introduce fragments cache worker
Date: Fri, 20 Jul 2018 19:31:59 +0200
Message-ID: <0419d5f8f551e481603b34f162299cc9961694fc.camel@redhat.com>
In-Reply-To: <CANn89iLgdmR_=SVeuf1wD79jUrgaRo8P3xXYcFLJESzGL4VVUg@mail.gmail.com>
On Fri, 2018-07-20 at 08:58 -0700, Eric Dumazet wrote:
> On Fri, Jul 20, 2018 at 7:48 AM Paolo Abeni <pabeni@redhat.com> wrote:
> >
> > Hi,
> >
> > On Mon, 2018-07-09 at 05:50 -0700, Eric Dumazet wrote:
> > > On 07/09/2018 04:39 AM, Eric Dumazet wrote:
> > >
> > > > Alternatively, you could try to patch fq_codel to drop all frags of one UDP datagram
> > > > instead of few of them.
> > >
> > > A first step would be to make sure fq_codel_hash() (using skb_get_hash(skb)) selects
> > > the same bucket for all frags of a datagram :/
> >
> > I gave the above a shot and I have some non upstream ready but somewhat
> > working code. Anyway it has some issues I'm unable to solve:
> > * it's very invasive for fq_codel, because I need to parse each packet
> > looking for the fragment id
> > * the parsing overhead can't be easily avoided for non fragments
>
> Have you tried using ip_defrag(net, skb, IP_DEFRAG_QDISC) from fq_codel ?
> (adding a new value in ip_defrag_users enum)
>
> if (skb->protocol == htons(ETH_P_IP)) {
>         if (ip_is_fragment(ip_hdr(skb))) {
>                 if (ip_defrag(net, skb, IP_DEFRAG_QDISC))
>                         return 0;
>                 ...
Thank you for the feedback. I must admit this goes in quite the opposite
direction from what I have attempted so far. I'll try that.
Thanks.
Still, for IPv6 it will require a little more work inside fq_codel.
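
For reference, the bucket-selection idea quoted earlier (all fragments of
one datagram landing in the same fq_codel bucket) amounts to hashing only
on the fields that every fragment carries. Below is a minimal userspace
sketch of that idea for IPv4, not kernel code: struct frag_key, mix32()
and frag_bucket() are hypothetical names, and the mixer merely stands in
for whatever hash the qdisc really uses. Ports are deliberately left out,
since only the first fragment carries the UDP header.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical key: the fields that identify the fragments of one IPv4
 * datagram (source, destination, protocol, identification). */
struct frag_key {
    uint32_t saddr;
    uint32_t daddr;
    uint16_t id;
    uint8_t  protocol;
};

/* Generic 32-bit integer mixer; an assumption for illustration only. */
static uint32_t mix32(uint32_t h)
{
    h ^= h >> 16;
    h *= 0x85ebca6bu;
    h ^= h >> 13;
    h *= 0xc2b2ae35u;
    h ^= h >> 16;
    return h;
}

/* Map a fragment key to a bucket index in [0, nbuckets). Every fragment
 * of the same datagram has the same key, hence the same bucket. */
static uint32_t frag_bucket(const struct frag_key *k, uint32_t nbuckets)
{
    uint32_t h;

    assert(k != 0 && nbuckets > 0);
    h = mix32(k->saddr);
    h = mix32(h ^ k->daddr);
    h = mix32(h ^ (((uint32_t)k->id << 8) | k->protocol));
    return h % nbuckets;
}
```

The parsing cost mentioned above is still there: the key has to be
extracted from each packet before this function can be called.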
> > I also tried something hopefully along the same lines as your other
> > suggestion (drop the fragment queues earlier when above the low threshold):
> > when allocating a new frag queue and the ipfrag mem is above the low
> > th, another frag queue is selected in a pseudorandom way and dropped.
>
> The problem with any strategy like that, is that forthcoming fragments
> for this frag queue
> will create another frag queue, that will never have a chance to complete.
>
> Some workloads might benefit, others might not.
Yes, of course: it is a heuristic, but it is cheap code-wise, and it can be
disabled by setting low th >= high th, so that the kernel behaves
exactly as it does now. The range of workloads we can cope with
would increase without adding new knobs.
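
To make the heuristic concrete, here is a rough userspace sketch of the
decision taken when a new frag queue is allocated. It is not the code I
have: struct frag_mem, xorshift32() and pick_victim() are hypothetical
names, and the PRNG is just a placeholder for any cheap pseudorandom
source.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for the fragment memory accounting. */
struct frag_mem {
    size_t used;        /* memory currently held by frag queues */
    size_t low_thresh;  /* "low th" */
    size_t high_thresh; /* "high th" */
};

/* One xorshift32 step; the state must be nonzero. */
static uint32_t xorshift32(uint32_t *state)
{
    uint32_t x = *state;

    x ^= x << 13;
    x ^= x >> 17;
    x ^= x << 5;
    *state = x;
    return x;
}

/* Called when allocating a new frag queue. Returns the index of an
 * existing queue to drop, or -1 when nothing should be dropped. */
static long pick_victim(const struct frag_mem *m, size_t nqueues,
                        uint32_t *rng)
{
    assert(m != 0 && rng != 0 && *rng != 0);

    /* low th >= high th disables the heuristic entirely. */
    if (m->low_thresh >= m->high_thresh)
        return -1;

    /* Only act when memory use is above the low threshold. */
    if (m->used <= m->low_thresh || nqueues == 0)
        return -1;

    return (long)(xorshift32(rng) % nqueues);
}
```

With low th >= high th the function always returns -1, which is the
"behave exactly as it does now" case.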
Cheers,
Paolo