* analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-02 1:32 ` Arnaldo Carvalho de Melo
@ 2004-07-06 9:35 ` bert hubert
2004-07-06 20:19 ` David S. Miller
0 siblings, 1 reply; 7+ messages in thread
From: bert hubert @ 2004-07-06 9:35 UTC (permalink / raw)
To: Arnaldo Carvalho de Melo
Cc: David S. Miller,
linux-kernel @ vger. kernel. org Stephen Hemminger, netdev,
alessandro.suardi, phyprabab
On Thu, Jul 01, 2004 at 10:32:25PM -0300, Arnaldo Carvalho de Melo wrote:
> > > Rather than using *tcp_prot.memory_pressure, just go back to looking at
> > > tcp_memory_pressure.
> >
> > Hehe, applied thanks Stephen.
People,
There are still persistent reports of TCP problems, even after patching away
the memory_pressure pointer problem. From one trace I've seen, by Alessandro
Suardi, but I lack the SACK knowledge to fully interpret these traces:
22:42:40.890025 192.168.1.6.32843 > 204.152.189.116.http: S 1994994484:1994994484(0) win 5840 <mss 1460,sackOK,timestamp 4294940315 0,nop,wscale 7> (DF)
22:42:41.143063 204.152.189.116.http > 192.168.1.6.32843: S 1404108869:1404108869(0) ack 1994994485 win 5792 <mss 1452,sackOK,timestamp 3383469176 4294940315,nop,wscale 0> (DF)
22:42:41.143123 192.168.1.6.32843 > 204.152.189.116.http: . ack 1 win 45 <nop,nop,timestamp 4294940568 3383469176> (DF)
Alessandro's machine does perform window scaling, tcpdump however does not
understand that and neglects to multiply 45 by 2^7 (=5760). Kernel.org does do
wscale, but defaults to 2^0.
22:42:41.143362 192.168.1.6.32843 > 204.152.189.116.http: P 1:421(420) ack 1 win 45 <nop,nop,timestamp 4294940568 3383469176> (DF)
Alessandro's machine sends a GET request.
22:42:41.147669 204.152.189.116.http > 192.168.1.6.32843: S 1404108869:1404108869(0) ack 1994994485 win 5792 <mss 1452,sackOK,timestamp 3383469180 4294940315,nop,wscale 0> (DF)
www.kernel.org acts like it did not see our ACK.
22:42:41.147723 192.168.1.6.32843 > 204.152.189.116.http: . ack 1 win 45 <nop,nop,timestamp 4294940572 3383469180,nop,nop,sack sack 1 {0:1} > (DF)
Allessandro's machine sends a selective ACK - could have gotten away with a
regular one I'd think?
22:42:41.408763 204.152.189.116.http > 192.168.1.6.32843: . ack 421 win 6432 <nop,nop,timestamp 3383469440 4294940568> (DF)
www.kernel.org acks the GET, but from then on does not send anything.
After a minute, Alessandro gets bored and presses STOP in Mozilla:
22:43:41.051537 192.168.1.6.32843 > 204.152.189.116.http: F 421:421(0) ack 1 win 45 <nop,nop,timestamp 33189 3383469440> (DF)
k.o acks this FIN, but also sends a selective ack:
22:43:41.304371 204.152.189.116.http > 192.168.1.6.32843: . ack 422 win 6432 <nop,nop,timestamp 3383529343 33189,nop,nop,sack sack 1 {421:422} > (DF)
Here are the underlying reports:
http://lkml.org/lkml/2004/7/4/116 (Alessandro Suardi)
The _only_ site I found I can browse without disabling TCP
window scaling is http://www.google.it.
tcpdump at: http://xoomer.virgilio.it/incident/tcpdump.out
http://lkml.org/lkml/2004/7/5/105 (Phy Prabab)
Concerning my issue with ftp'ing from remote sites,
using these sysctl's I was able to get the performance
back:
net.ipv4.tcp_default_win_scale=0
net.ipv4.tcp_moderate_rcvbuf=0
2.6.7-bk18 w/out sysctls:
2573621 bytes received in 2e+02 seconds (12 Kbytes/s)
2.6.7-bk18 w/sysctls:
2573621 bytes received in 0.69 seconds (3.6e+03
Kbytes/s)
These both refer to bk after the *tcp_prot.memory_pressure patch was
applied.
Thanks for your attention.
--
http://www.PowerDNS.com Open source, database driven DNS Software
http://lartc.org Linux Advanced Routing & Traffic Control HOWTO
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
@ 2004-07-06 12:30 ALESSANDRO.SUARDI
2004-07-06 12:42 ` bert hubert
0 siblings, 1 reply; 7+ messages in thread
From: ALESSANDRO.SUARDI @ 2004-07-06 12:30 UTC (permalink / raw)
To: bert hubert, Arnaldo Carvalho de Melo
Cc: David S. Miller,
linux-kernel @ vger. kernel. org Stephen Hemminger, netdev,
phyprabab
Hi all,
sorry about the silly formatting, but the webmail
client isn't really nice in this respect :(
Further data points: -bk18 built with GCC 3.4.1
still hangs talking to most of the world except
google.it (redirected from google.com) from
my home DSL (US Robotics USR9003 router).
It _works_ without changing anything when in
my office (unsure about hardware there).
It _also_ works from my home _if_ I fire up the
Cisco VPN client (4.0.4.B) to an Oracle VPN
server and access the 'net (kernel.org and
all) via our proxies through the VPN.
Hope that helps, and sorry for top-posting.
Bert Hubert wrote:
> On Thu, Jul 01, 2004 at 10:32:25PM -0300,
> Arnaldo Carvalho de Melo wrote:
>
> > > > Rather than using
> *tcp_prot.memory_pressure, just go back to
> looking at
> > > > tcp_memory_pressure.
> > >
> > > Hehe, applied thanks Stephen.
>
> People,
>
> There are still persistent reports of TCP
> problems, even after patching away
> the memory_pressure pointer problem. From one
> trace I've seen, by Alessandro
> Suardi, but I lack the SACK knowledge to fully
> interpret these traces:
>
> 22:42:40.890025 192.168.1.6.32843 >
> 204.152.189.116.http: S 1994994484:1994994484(0)
> win 5840 <mss 1460,sackOK,timestamp 4294940315
> 0,nop,wscale 7> (DF)
> 22:42:41.143063 204.152.189.116.http >
> 192.168.1.6.32843: S 1404108869:1404108869(0)
> ack 1994994485 win 5792 <mss
> 1452,sackOK,timestamp 3383469176
> 4294940315,nop,wscale 0> (DF)
> 22:42:41.143123 192.168.1.6.32843 >
> 204.152.189.116.http: . ack 1 win 45
> <nop,nop,timestamp 4294940568 3383469176> (DF)
>
> Alessandro's machine does perform window
> scaling, tcpdump however does not
> understand that and neglects to multiply 45 by
> 2^7 (=5760). Kernel.org does do
> wscale, but defaults to 2^0.
>
> 22:42:41.143362 192.168.1.6.32843 >
> 204.152.189.116.http: P 1:421(420) ack 1 win 45
> <nop,nop,timestamp 4294940568 3383469176> (DF)
>
> Alessandro's machine sends a GET request.
>
> 22:42:41.147669 204.152.189.116.http >
> 192.168.1.6.32843: S 1404108869:1404108869(0)
> ack 1994994485 win 5792 <mss
> 1452,sackOK,timestamp 3383469180
> 4294940315,nop,wscale 0> (DF)
>
> www.kernel.org acts like it did not see our ACK.
>
>
> 22:42:41.147723 192.168.1.6.32843 >
> 204.152.189.116.http: . ack 1 win 45
> <nop,nop,timestamp 4294940572
> 3383469180,nop,nop,sack sack 1 {0:1} > (DF)
>
> Allessandro's machine sends a selective ACK -
> could have gotten away with a
> regular one I'd think?
>
> 22:42:41.408763 204.152.189.116.http >
> 192.168.1.6.32843: . ack 421 win 6432
> <nop,nop,timestamp 3383469440 4294940568> (DF)
>
> www.kernel.org acks the GET, but from then on
> does not send anything.
>
> After a minute, Alessandro gets bored and
> presses STOP in Mozilla:
>
> 22:43:41.051537 192.168.1.6.32843 >
> 204.152.189.116.http: F 421:421(0) ack 1 win 45
> <nop,nop,timestamp 33189 3383469440> (DF)
>
> k.o acks this FIN, but also sends a selective
> ack:
>
> 22:43:41.304371 204.152.189.116.http >
> 192.168.1.6.32843: . ack 422 win 6432
> <nop,nop,timestamp 3383529343 33189,nop,nop,sack
> sack 1 {421:422} > (DF)
>
> Here are the underlying reports:
>
> http://lkml.org/lkml/2004/7/4/116 (Alessandro
> Suardi)
> The _only_ site I found I can browse without
> disabling TCP
> window scaling is http://www.google.it.
>
> tcpdump at:
> http://xoomer.virgilio.it/incident/tcpdump.out
>
> http://lkml.org/lkml/2004/7/5/105 (Phy Prabab)
> Concerning my issue with ftp'ing from remote
> sites,
> using these sysctl's I was able to get the
> performance
> back:
>
> net.ipv4.tcp_default_win_scale=0
> net.ipv4.tcp_moderate_rcvbuf=0
>
> 2.6.7-bk18 w/out sysctls:
> 2573621 bytes received in 2e+02 seconds (12
> Kbytes/s)
>
> 2.6.7-bk18 w/sysctls:
> 2573621 bytes received in 0.69 seconds (3.6e+03
>
> Kbytes/s)
>
> These both refer to bk after the
> *tcp_prot.memory_pressure patch was
> applied.
>
> Thanks for your attention.
>
> --
> http://www.PowerDNS.com Open source,
> database driven DNS Software
> http://lartc.org Linux Advanced
> Routing & Traffic Control HOWTO
>
--alessandro
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-06 12:30 analysis of TCP window size issues still around - several reports / SACK involved? ALESSANDRO.SUARDI
@ 2004-07-06 12:42 ` bert hubert
0 siblings, 0 replies; 7+ messages in thread
From: bert hubert @ 2004-07-06 12:42 UTC (permalink / raw)
To: ALESSANDRO.SUARDI
Cc: Arnaldo Carvalho de Melo, David S. Miller,
linux-kernel @ vger. kernel. org Stephen Hemminger, netdev,
phyprabab
On Tue, Jul 06, 2004 at 06:30:51AM -0600, ALESSANDRO.SUARDI@ORACLE.COM wrote:
> Further data points: -bk18 built with GCC 3.4.1 still hangs talking to
> most of the world except google.it (redirected from google.com) from my
> home DSL (US Robotics USR9003 router).
I juist built 2.6.7-mm6 in an attempt to reproduce your problems but I can't
see them, can you check with that version too?
> It _also_ works from my home _if_ I fire up the Cisco VPN client (4.0.4.B)
> to an Oracle VPN server and access the 'net (kernel.org and all) via our
> proxies through the VPN.
Is this the 5mb binary kernel module blob version? Can you reproduce the bug
on a system that has never touched this module?
Furthermore, can you email me your IP address and try to connect to
http://ds9a.nl/hi-ahu afterwards I confirm receipt? I want to see the server
side of the story.
Thanks.
--
http://www.PowerDNS.com Open source, database driven DNS Software
http://lartc.org Linux Advanced Routing & Traffic Control HOWTO
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-06 9:35 ` analysis of TCP window size issues still around - several reports / SACK involved? bert hubert
@ 2004-07-06 20:19 ` David S. Miller
2004-07-06 20:27 ` bert hubert
0 siblings, 1 reply; 7+ messages in thread
From: David S. Miller @ 2004-07-06 20:19 UTC (permalink / raw)
To: bert hubert; +Cc: acme, shemminger, netdev, alessandro.suardi, phyprabab
On Tue, 6 Jul 2004 11:35:03 +0200
bert hubert <ahu@ds9a.nl> wrote:
> 22:42:40.890025 192.168.1.6.32843 > 204.152.189.116.http: S 1994994484:1994994484(0) win 5840 <mss 1460,sackOK,timestamp 4294940315 0,nop,wscale 7> (DF)
> 22:42:41.143063 204.152.189.116.http > 192.168.1.6.32843: S 1404108869:1404108869(0) ack 1994994485 win 5792 <mss 1452,sackOK,timestamp 3383469176 4294940315,nop,wscale 0> (DF)
> 22:42:41.143123 192.168.1.6.32843 > 204.152.189.116.http: . ack 1 win 45 <nop,nop,timestamp 4294940568 3383469176> (DF)
>
> Alessandro's machine does perform window scaling, tcpdump however does not
> understand that and neglects to multiply 45 by 2^7 (=5760). Kernel.org does do
> wscale, but defaults to 2^0.
tcpdump's behavior is correct, it's just reporting the raw window
field in the TCP header, unscaled, and that is fine. In fact I'd
rather it do this, so that diagnosing dumps are easier. If tcpdump
tries to be too clever, scaling the window, then I might end up
chasing down a tcpdump bug rather than a TCP one :-)
What would be more interesting is to get the tcpdump trace from the
other side of this connection. This is crucial, as it will show how
and in what way exactly the window scale options and/or window fields
are being edited by a firewall or other device and thus causing
the problems.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-06 20:19 ` David S. Miller
@ 2004-07-06 20:27 ` bert hubert
2004-07-06 20:31 ` David S. Miller
2004-07-07 21:25 ` Alessandro Suardi
0 siblings, 2 replies; 7+ messages in thread
From: bert hubert @ 2004-07-06 20:27 UTC (permalink / raw)
To: David S. Miller; +Cc: acme, shemminger, netdev, alessandro.suardi, phyprabab
On Tue, Jul 06, 2004 at 01:19:55PM -0700, David S. Miller wrote:
> rather it do this, so that diagnosing dumps are easier. If tcpdump
> tries to be too clever, scaling the window, then I might end up
> chasing down a tcpdump bug rather than a TCP one :-)
True - it might want to print '43 (*128=5706)' or something like that.
> What would be more interesting is to get the tcpdump trace from the
> other side of this connection. This is crucial, as it will show how
> and in what way exactly the window scale options and/or window fields
> are being edited by a firewall or other device and thus causing
> the problems.
I have an appointment with Alessandro tomorrow evening at 11PM CEST to do
just that.
It sounds like window scaling may become yet another ECN...
--
http://www.PowerDNS.com Open source, database driven DNS Software
http://lartc.org Linux Advanced Routing & Traffic Control HOWTO
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-06 20:27 ` bert hubert
@ 2004-07-06 20:31 ` David S. Miller
2004-07-07 21:25 ` Alessandro Suardi
1 sibling, 0 replies; 7+ messages in thread
From: David S. Miller @ 2004-07-06 20:31 UTC (permalink / raw)
To: bert hubert; +Cc: acme, shemminger, netdev, alessandro.suardi, phyprabab
On Tue, 6 Jul 2004 22:27:08 +0200
bert hubert <ahu@ds9a.nl> wrote:
> On Tue, Jul 06, 2004 at 01:19:55PM -0700, David S. Miller wrote:
>
> > What would be more interesting is to get the tcpdump trace from the
> > other side of this connection. This is crucial, as it will show how
> > and in what way exactly the window scale options and/or window fields
> > are being edited by a firewall or other device and thus causing
> > the problems.
>
> I have an appointment with Alessandro tomorrow evening at 11PM CEST to do
> just that.
That's great, thanks a lot.
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: analysis of TCP window size issues still around - several reports / SACK involved?
2004-07-06 20:27 ` bert hubert
2004-07-06 20:31 ` David S. Miller
@ 2004-07-07 21:25 ` Alessandro Suardi
1 sibling, 0 replies; 7+ messages in thread
From: Alessandro Suardi @ 2004-07-07 21:25 UTC (permalink / raw)
To: bert hubert; +Cc: David S. Miller, acme, shemminger, netdev, phyprabab
bert hubert wrote:
> On Tue, Jul 06, 2004 at 01:19:55PM -0700, David S. Miller wrote:
>
>
>>rather it do this, so that diagnosing dumps are easier. If tcpdump
>>tries to be too clever, scaling the window, then I might end up
>>chasing down a tcpdump bug rather than a TCP one :-)
>
>
> True - it might want to print '43 (*128=5706)' or something like that.
>
>
>>What would be more interesting is to get the tcpdump trace from the
>>other side of this connection. This is crucial, as it will show how
>>and in what way exactly the window scale options and/or window fields
>>are being edited by a firewall or other device and thus causing
>>the problems.
>
>
> I have an appointment with Alessandro tomorrow evening at 11PM CEST to do
> just that.
Sorry about being slightly late - I read the thread from my VPN
link (which does work), now I'm turning it off and will email
Bert with my actual IP address and my connection.
Thanks,
--alessandro
"Practice is more important than theory. A _lot_ more important."
(Linus Torvalds on lkml, 1 June 2004)
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2004-07-07 21:25 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2004-07-06 12:30 analysis of TCP window size issues still around - several reports / SACK involved? ALESSANDRO.SUARDI
2004-07-06 12:42 ` bert hubert
[not found] <32886.63.170.215.71.1088564087.squirrel@www.osdl.org>
[not found] ` <20040629222751.392f0a82.davem@redhat.com>
[not found] ` <20040630152750.2d01ca51@dell_ss3.pdx.osdl.net>
[not found] ` <20040630153049.3ca25b76.davem@redhat.com>
2004-07-01 20:37 ` [PATCH] TCP acts like it is always out of memory Stephen Hemminger
2004-07-01 21:04 ` David S. Miller
2004-07-02 1:32 ` Arnaldo Carvalho de Melo
2004-07-06 9:35 ` analysis of TCP window size issues still around - several reports / SACK involved? bert hubert
2004-07-06 20:19 ` David S. Miller
2004-07-06 20:27 ` bert hubert
2004-07-06 20:31 ` David S. Miller
2004-07-07 21:25 ` Alessandro Suardi
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).