netdev.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: David Miller <davem@davemloft.net>
To: ilpo.jarvinen@helsinki.fi
Cc: netdev@vger.kernel.org
Subject: Re: [PATCH 3/10] [TCP]: non-FACK SACK follows conservative SACK loss recovery
Date: Thu, 15 Nov 2007 19:41:15 -0800 (PST)	[thread overview]
Message-ID: <20071115.194115.255442199.davem@davemloft.net> (raw)
In-Reply-To: <11951338542528-git-send-email-ilpo.jarvinen@helsinki.fi>

From: "Ilpo_Järvinen" <ilpo.jarvinen@helsinki.fi>
Date: Thu, 15 Nov 2007 15:37:27 +0200

> Many assumptions that are true when no reordering or other
> strange events happen are not a part of the RFC3517. FACK
> implementation is based on such assumptions. Previously (before
> the rewrite) the non-FACK SACK was basically doing fast rexmit
> and then it times out all skbs when first cumulative ACK arrives,
> which cannot really be called SACK based recovery :-).
> 
> RFC3517 SACK disables these things:
> - Per SKB timeouts & head timeout entry to recovery
> - Marking at least one skb while in recovery (RFC3517 does this
>   only for the fast retransmission but not for the other skbs
>   when cumulative ACKs arrive in the recovery)
> - Sacktag's loss detection flavors B and C (see comment before
>   tcp_sacktag_write_queue)
> 
> This does not implement the "last resort" rule 3 of NextSeg, which
> allows retransmissions also when not enough SACK blocks have yet
> arrived above a segment for IsLost to return true [RFC3517].
> 
> The implementation differs from RFC3517 in these points:
> - Rate-halving is used instead of FlightSize / 2
> - Instead of using dupACKs to trigger the recovery, the number
>   of SACK blocks is used as FACK does with SACK blocks+holes
>   (which provides more accurate number). It seems that the
>   difference can affect negatively only if the receiver does not
>   generate SACK blocks at all even though it claimed to be
>   SACK-capable.
> - Dupthresh is not a constant one. Dynamical adjustments include
>   both holes and sacked segments (equal to what FACK has) due to
>   complexity involved in determining the number sacked blocks
>   between highest_sack and the reordered segment. Thus it's will
>   be an over-estimate.
> 
> Implementation note:
> 
> tcp_clean_rtx_queue doesn't need a lost_cnt tweak because head
> skb at that point cannot be SACKED_ACKED (nor would such
> situation last for long enough to cause problems).
> 
> Signed-off-by: Ilpo Järvinen <ilpo.jarvinen@helsinki.fi>

Thanks a lot for doing this work, these changes look fine to
me.

It occurs to me that the loss engine basically runs in about
2 or 3 modes, and instead of making the same tests multiple
times through the ACK processing paths we might want to move
to some kind of 'tcp_loss_ops' scheme.

It is just an idea.

Patch applied to net-2.6.25, thanks!

  parent reply	other threads:[~2007-11-16  3:41 UTC|newest]

Thread overview: 23+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-11-15 13:37 [RFC PATCH net-2.6.25 0/10] [TCP]: Cleanups, tweaks & sacktag recode Ilpo Järvinen
2007-11-15 13:37 ` [PATCH 01/10] [TCP]: Move !in_sack test earlier in sacktag & reorganize if()s Ilpo Järvinen
2007-11-15 13:37   ` [PATCH 02/10] [TCP]: Extend reordering detection to cover CA_Loss partially Ilpo Järvinen
2007-11-15 13:37     ` [PATCH 3/10] [TCP]: non-FACK SACK follows conservative SACK loss recovery Ilpo Järvinen
2007-11-15 13:37       ` [PATCH 04/10] [TCP]: Convert highest_sack to sk_buff to allow direct access Ilpo Järvinen
2007-11-15 13:37         ` [PATCH 05/10] [TCP]: Make lost retrans detection more self-contained Ilpo Järvinen
2007-11-15 13:37           ` [PATCH 06/10] [TCP]: Prior_fackets can be replaced by highest_sack seq Ilpo Järvinen
2007-11-15 13:37             ` [PATCH 07/10] [TCP]: Create tcp_sacktag_one() Ilpo Järvinen
2007-11-15 13:37               ` [PATCH 08/10] [TCP]: Earlier SACK block verification & simplify access to them Ilpo Järvinen
2007-11-15 13:37                 ` [RFC PATCH 09/10] [TCP]: Rewrite SACK block processing & sack_recv_cache use Ilpo Järvinen
2007-11-15 13:37                   ` [PATCH 10/10] [TCP]: Track sacktag (DEVEL PATCH) Ilpo Järvinen
2007-11-16  4:00                   ` [RFC PATCH 09/10] [TCP]: Rewrite SACK block processing & sack_recv_cache use David Miller
2007-11-16 13:44                     ` Ilpo Järvinen
2007-11-20  6:08                       ` David Miller
2007-11-20  9:22                         ` Ilpo Järvinen
2007-11-16  3:50                 ` [PATCH 08/10] [TCP]: Earlier SACK block verification & simplify access to them David Miller
2007-11-16  3:45               ` [PATCH 07/10] [TCP]: Create tcp_sacktag_one() David Miller
2007-11-16  3:44             ` [PATCH 06/10] [TCP]: Prior_fackets can be replaced by highest_sack seq David Miller
2007-11-16  3:43           ` [PATCH 05/10] [TCP]: Make lost retrans detection more self-contained David Miller
2007-11-16  3:42         ` [PATCH 04/10] [TCP]: Convert highest_sack to sk_buff to allow direct access David Miller
2007-11-16  3:41       ` David Miller [this message]
2007-11-16  3:35     ` [PATCH 02/10] [TCP]: Extend reordering detection to cover CA_Loss partially David Miller
2007-11-16  3:33   ` [PATCH 01/10] [TCP]: Move !in_sack test earlier in sacktag & reorganize if()s David Miller

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20071115.194115.255442199.davem@davemloft.net \
    --to=davem@davemloft.net \
    --cc=ilpo.jarvinen@helsinki.fi \
    --cc=netdev@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).