* Connection reset by peer - need a patch
@ 2009-02-27 15:07 Pascal GREGIS
2009-02-27 15:28 ` David Miller
2009-02-28 10:03 ` Ilpo Järvinen
0 siblings, 2 replies; 4+ messages in thread
From: Pascal GREGIS @ 2009-02-27 15:07 UTC (permalink / raw)
To: netdev
Hi people,
I have a very annoying bug that seems to be well known today.
It happens on a backup server that issue a "Connection reset by peer" while the other side does not reset or stop the connection.
I have found a report that seems very similar on this mailing list :
http://kerneltrap.org/index.php?q=mailarchive/linux-netdev/2008/4/28/1628834
this sends to the commit 7951f0b03a63d657c72c7d54d306ef3357e7e604
Author: Daniel Lezcano <...
Date: Thu Apr 10 20:53:10 2008 -0700
[NETNS][IPV6] tcp - assign the netns for timewait sockets
and gives a simple patch that adds the line
tw->tw_net = sk->sk_net;
somewhere in the function inet_twsk_alloc (in the file net/ipv4/inet_timewait_sock.c).
Right, but the problem is that I use a kernel 2.6.21.1 and cannot upgrade my whole kernel easily.
And in the kernel 2.6.21.1, network namespaces don't seem to exist, so I cannot apply this simple patch.
I am in a very uncomfortable situation because this bug is causing harmfull problems on all the backup servers of my company and as I said above, upgrading the kernel is not really possible at this time.
Does anyone know what I could do to solve this ?
Thanks a lot
Pascal
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Connection reset by peer - need a patch
2009-02-27 15:07 Connection reset by peer - need a patch Pascal GREGIS
@ 2009-02-27 15:28 ` David Miller
2009-02-27 16:06 ` Pascal GREGIS
2009-02-28 10:03 ` Ilpo Järvinen
1 sibling, 1 reply; 4+ messages in thread
From: David Miller @ 2009-02-27 15:28 UTC (permalink / raw)
To: pgs; +Cc: netdev
From: Pascal GREGIS <pgs@synerway.com>
Date: Fri, 27 Feb 2009 16:07:55 +0100
> Right, but the problem is that I use a kernel 2.6.21.1 and cannot
> upgrade my whole kernel easily. And in the kernel 2.6.21.1, network
> namespaces don't seem to exist, so I cannot apply this simple patch.
You're chasing a different bug then, the bug was introduced
by a network namespace patch so 2.6.21 can't possibly have
that bug.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Connection reset by peer - need a patch
2009-02-27 15:28 ` David Miller
@ 2009-02-27 16:06 ` Pascal GREGIS
0 siblings, 0 replies; 4+ messages in thread
From: Pascal GREGIS @ 2009-02-27 16:06 UTC (permalink / raw)
To: David Miller; +Cc: netdev
Ok, thank you for your answer. Would you have a hint to help me track this bug ?
Pascal
David Miller a écrit, le ven 27 fév 2009 à 07:28:08 :
> From: Pascal GREGIS <pgs@synerway.com>
> Date: Fri, 27 Feb 2009 16:07:55 +0100
>
> > Right, but the problem is that I use a kernel 2.6.21.1 and cannot
> > upgrade my whole kernel easily. And in the kernel 2.6.21.1, network
> > namespaces don't seem to exist, so I cannot apply this simple patch.
>
> You're chasing a different bug then, the bug was introduced
> by a network namespace patch so 2.6.21 can't possibly have
> that bug.
^ permalink raw reply [flat|nested] 4+ messages in thread
* Re: Connection reset by peer - need a patch
2009-02-27 15:07 Connection reset by peer - need a patch Pascal GREGIS
2009-02-27 15:28 ` David Miller
@ 2009-02-28 10:03 ` Ilpo Järvinen
1 sibling, 0 replies; 4+ messages in thread
From: Ilpo Järvinen @ 2009-02-28 10:03 UTC (permalink / raw)
To: Pascal GREGIS; +Cc: Netdev
On Fri, 27 Feb 2009, Pascal GREGIS wrote:
> I have a very annoying bug that seems to be well known today.
> It happens on a backup server that issue a "Connection reset by peer"
> while the other side does not reset or stop the connection.
If I understand you correct you lost synchronization between hosts...
If so I'd suggest you start tracking what's getting dropped/discarded and
where (it might affect only a single direction). Any middlebox is outright
a suspect :-). Tcpdump (on both host, and possibly on intermediate nodes
ifaces if losses in between are found from the end host tcpdumps), mibs
(/proc/net/netstat, for in-host discards) and strace are there to help
you onward. Even if it's not lost syncronization you basically use
the same tools.
> I have found a report that seems very similar on this mailing list :
> http://kerneltrap.org/index.php?q=mailarchive/linux-netdev/2008/4/28/1628834
>
> this sends to the commit 7951f0b03a63d657c72c7d54d306ef3357e7e604
> Author: Daniel Lezcano <...
> Date: Thu Apr 10 20:53:10 2008 -0700
> [NETNS][IPV6] tcp - assign the netns for timewait sockets
>
> and gives a simple patch that adds the line
> tw->tw_net = sk->sk_net;
> somewhere in the function inet_twsk_alloc (in the file net/ipv4/inet_timewait_sock.c).
I don't think you're on a right track with that lead...
> Right, but the problem is that I use a kernel 2.6.21.1 and cannot
> upgrade my whole kernel easily. And in the kernel 2.6.21.1, network
> namespaces don't seem to exist, so I cannot apply this simple patch.
...as that bug was introduced along with network namespaces, so for sure
you won't need that fix for anything that doesn't have them. <update>Ah,
DaveM already told that</update>.
> I am in a very uncomfortable situation because this bug is causing
> harmfull problems on all the backup servers of my company and as I said
> above, upgrading the kernel is not really possible at this time. >
> Does anyone know what I could do to solve this ?
Unfortunately it's a bit same for us as we don't act as a support for
random, ancient kernels (if it's a distro kernel you can probably ask
them but basically you'll need more information that what's available in
this mail to actually solve the problem)... I gave you some directions how
these problems are located in general regardless of kernel version.
On the other hand, please don't hesitate to report/ask from us if you have
a recent enough kernel and encounter some problems.
--
i.
^ permalink raw reply [flat|nested] 4+ messages in thread
end of thread, other threads:[~2009-02-28 10:03 UTC | newest]
Thread overview: 4+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2009-02-27 15:07 Connection reset by peer - need a patch Pascal GREGIS
2009-02-27 15:28 ` David Miller
2009-02-27 16:06 ` Pascal GREGIS
2009-02-28 10:03 ` Ilpo Järvinen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).