From mboxrd@z Thu Jan  1 00:00:00 1970
From: Joe Cao <caoco2002@yahoo.com>
Subject: Re: TCP stack bug related to F-RTO?
Date: Sat, 26 Sep 2009 13:48:28 -0700 (PDT)
Message-ID: <222796.87049.qm@web63403.mail.re1.yahoo.com>
References: <Pine.LNX.4.64.0909262034130.12882@melkinkari.cs.Helsinki.FI>
Mime-Version: 1.0
Content-Type: text/plain; charset=iso-8859-1
Content-Transfer-Encoding: QUOTED-PRINTABLE
Cc: Ray Lee <ray-lk@madrabbit.org>, Netdev <netdev@vger.kernel.org>,
	LKML <linux-kernel@vger.kernel.org>
To: =?iso-8859-1?Q?Ilpo_J=E4rvinen?= <ilpo.jarvinen@helsinki.fi>
Return-path: <netdev-owner@vger.kernel.org>
Received: from n1a.bullet.mail.ac4.yahoo.com ([76.13.13.64]:29745 "HELO
	n1a.bullet.mail.ac4.yahoo.com" rhost-flags-OK-OK-OK-OK)
	by vger.kernel.org with SMTP id S1752511AbZIZUsZ convert rfc822-to-8bit
	(ORCPT <rfc822;netdev@vger.kernel.org>);
	Sat, 26 Sep 2009 16:48:25 -0400
In-Reply-To: <Pine.LNX.4.64.0909262034130.12882@melkinkari.cs.Helsinki.FI>
Sender: netdev-owner@vger.kernel.org
List-ID: <netdev.vger.kernel.org>

Hi Ilpo,

Thanks for the replay.  We noticed the problem while we were debugging =
a connection failure case reported by one of our customers (we are a ne=
twork device vendor).  Actually we have suggested our customer to upgra=
de their server software to fix the problem, and we are still waiting f=
or the feedback from them.  Meanwhile, I asked all those questions just=
 because I want to understand the issue and the fixes.  We also has to =
convince the customer to move to a right kernel and don't want them to =
come back with the same problem again.

Again, thanks for the help!

Joe

--- On Sat, 9/26/09, Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi> wrote:

> From: Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi>
> Subject: Re: TCP stack bug related to F-RTO?
> To: "Joe Cao" <caoco2002@yahoo.com>
> Cc: "Ray Lee" <ray-lk@madrabbit.org>, "Netdev" <netdev@vger.kernel.or=
g>, "LKML" <linux-kernel@vger.kernel.org>
> Date: Saturday, September 26, 2009, 10:51 AM
> On Sat, 26 Sep 2009, Joe Cao wrote:
>=20
> > Can you elaborate on "Some retransmission would happen
> here as step 3"?=A0=20
> > When the second timeout happens, it will again go into
> FRTO and then=20
> > retransmit the write queue head.
>=20
> Why do you think that the second RTO will happen with
> anything else than=20
> with 2.6.24. And it's perfectly ok to go into FRTO for the
> second time.
>=20
> > I looked at the patch (debian Bug#478062) that's
> probably what you=20
> > mentioned as the fix. All it does was to exclude the
> SACK case when=20
> > considering FRTO.=A0 But in my case, SACK was
> enabled, as seen in the=20
> > trace..
>=20
> You should be looking from where I said rather than picking
> up your own=20
> sources and assuming that they'll tell you all the story
> :-). In fact,=20
> there are two fixes that were made in a row and one
> workaround in the
> same timeframe. ...And you managed to pick the wrong one of
> the fixes, so=20
> I kind of understand why you got confused :-).
>=20
> > In other words, do we still have a problem with FRTO
> when SACK is=20
> > enabled in the latest kernel?
>=20
> For sure we might have all kinds of problems no one has yet
>=20
> noticed/reported :-). ....However, it seems that this
> particular problem=20
> your trace is showing is solved. Can you please test with a
> fixed kernel=20
> before coming back here with these claims.
>=20
>=20
> --=20
>  i.
>=20
> --- On Fri, 9/25/09, Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi>
> wrote:
>=20
> > From: Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi>
> > Subject: Re: TCP stack bug related to F-RTO?
> > To: "Joe Cao" <caoco2002@yahoo.com>
> > Cc: "Ray Lee" <ray-lk@madrabbit.org>,
> "Netdev" <netdev@vger.kernel.org>,
> "LKML" <linux-kernel@vger.kernel.org>
> > Date: Friday, September 25, 2009, 11:03 AM
> > On Fri, 25 Sep 2009, Joe Cao wrote:
> >=20
> > > Thanks for the reply!=A0 Do you happen to know
> > which patch fixed the=20
> > > problem?
> >=20
> > You can find those patches from the stable queue git
> tree.
> > I gave you hint=20
> > from what release to look from in the last mail.
> However,
> > as 2.6.24 is=20
> > anyway obsolete my recommendation is that you should
> > probably consider=20
> > upgrading to fix all the other bugs that have been
> found
> > since 2.6.24 was=20
> > obsoleted.
> >=20
> > > Is there a bug tracking system for linux kernel?
> >=20
> > Nothing that knows everything about everything.
> >=20
> > > I studied the FRTO code in latest kernel
> 2.6.31..=A0
> > It seems the problem=20
> > > is still there:=A0=20
> > >
> > > 1. Every time a RTO fires, because
> tcp_is_sackfrto(tp)
> > returns 1,=20
> > > tcp_use_frto() returns true.=A0 And the server
> tcp
> > enters FRTO.
> > > 2. After the head of write queue is
> retransmitted, two
> > new data packets=20
> > > are transmitted, the server receives two
> > dup-ACKs.=A0 That will make the=20
> > > TCP enter tcp_enter_frto_loss(), however, that
> only
> > rests ssthresh and=20
> > > some other fields.
> >=20
> > Perhaps those other fields are far more important than
> you
> > think... :-)
> > ...Some retransmission would happen here as step 3.
> >=20
> > > 3. After another longer RTO fires, because
> > tcp_is_sackfrto(tp) returns=20
> > > 1, tcp_use_frto() again returns true.=A0 The
> stack
> > enters FRTO again.
> > > 4. The above repeats and the stack couldn't
> > retransmits the lost packets=20
> > > faster.
> > >=20
> > > Is my understanding above correct?
> >=20
> > ...No. All magic that happens in tcp_enter_frto_loss
> should
> > be enough to=20
> > really do more than a single retransmission (that is,
> in
> > any other than=20
> > 2.6.24 series kernel). There was an unfortunate bug in
> this
> > area in 2.6.24=20
> > which basically undoed the effect of correct actions
> > tcp_enter_frto_loss=20
> > did which effectively prevented
> tcp_xmit_retransmit_queue
> > from doing its=20
> > part.
> >=20
> > --=20
> >=A0 i.
> >=20
> > --- On Fri, 9/25/09, Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi>
> > wrote:
> >=20
> > > From: Ilpo J=E4rvinen <ilpo.jarvinen@helsinki.fi>
> > > Subject: Re: TCP stack bug related to F-RTO?
> > > To: "Ray Lee" <ray-lk@madrabbit.org>
> > > Cc: "Joe Cao" <caoco2002@yahoo.com>,
> > "Netdev" <netdev@vger.kernel.org>,
> > "LKML" <linux-kernel@vger.kernel.org>,
> > jcaoco2002@yahoo.com
> > > Date: Friday, September 25, 2009, 6:09 AM
> > > On Thu, 24 Sep 2009, Ray Lee wrote:
> > >=20
> > > > [adding netdev cc:]
> > > >=20
> > > > On Thu, Sep 24, 2009 at 10:43 AM, Joe Cao
> <caoco2002@yahoo.com>
> > > wrote:
> > > > >
> > > > > Hello,
> > > > >
> > > > > I have found the following behavior
> with
> > > different versions of linux=20
> > > > > kernel. The attached pcap trace is
> collected
> > with
> > > server=20
> > > > > (192.168.0.13) running 2.6.24 and shows
> the
> > > problem. Basically the=20
> > > > > behavior is like this:=20
> > > > >
> > > > > 1. The client opens up a big window,
> > > > > 2. the server sends 19 packets in a row
> (pkt
> > #14-
> > > #32 in the trace), but all of them are dropped
> due to
> > some
> > > congestion.
> > > > > 3. The server hits RTO and retransmits
> pkt
> > #14 in
> > > #33
> > > > > 4. The client immediately acks #33
> (=3D#14),
> > and
> > > the server (seems like to enter F-RTO) expends
> the
> > window
> > > and sends *NEW* pkt #35 & #36.=3DA0 Timeoute
> is
> > doubled to
> > > 2*RTO; The client immediately sends two Dup-ack
> to #35
> > and
> > > #36.
> > > > > 5. after 2*RTO, pkt #15 is
> retransmitted in
> > #39.
> > > > > 6. The client immediately acks #39
> (=3D#15) in
> > #40,
> > > and the server continues to expand the window
> and
> > sends two
> > > *NEW* pkt #41 & #42. Now the timeoute is
> doubled
> > to 4
> > > *RTO.
> > > > > 8. After 4*RTO timeout, #16 is
> > retransmitted.
> > > > > 9....
> > > > > 10. The above steps repeats for
> > retransmitting
> > > pkt #16-#32 and each time the timeout is
> doubled.
> > > > > 11. It takes a long long time to
> retransmit
> > all
> > > the lost packets and before that is done, the
> client
> > sends a
> > > RST because of timeout.
> > > > >
> > > > > The above behavior looks like F-RTO is
> in
> > effect.
> > > =A0And there seems to=20
> > > > > be a bug in the TCP's congestion
> control
> > and
> > > retransmission algorithm.=20
> > > > > Why doesn't the TCP on server (running
> > 2.6.24)
> > > enter the slow start?=20
> > > > > Why should the server take that long
> to
> > recover
> > > from a short period=20
> > > > > of packet loss?
> > > > >
> > > > > Has anyone else noticed similar
> problem
> > before?
> > > =A0If my analysis was=20
> > > > > wrong, can anyone gives me some
> pointers to
> > > what's really wrong and=20
> > > > > how to fix it?
> > >=20
> > > Yes, 2.6.24 is an obsoleted version with known
> wrongs
> > in
> > > FRTO=20
> > > implementation. Fixes never when to 2.6.24
> stable
> > series as
> > > it was=20
> > > _already_ obsoleted when the problems where
> reported
> > and
> > > found. The=20
> > > correct fixes may be found from 2.6.25.7 (.7
> iirc) and
> > are
> > > included from=20
> > > 2.6.26 onward too.
> > >=20
> > > Just in case you happen to run ubuntu based
> kernel
> > from
> > > that era (of=20
> > > course you should be reporting the bug here
> then...),
> > a
> > > word of warning:=20
> > > it seemed nearly impossible for them to get a
> simple
> > thing
> > > like that=20
> > > fixed, I haven't been looking if they'd
> eventually
> > come to
> > > some sensible=20
> > > conclusion in that matter or is it still
> unresolved
> > (or
> > > e.g., closed=20
> > > without real resolution).
>=20


     =20