public inbox for linux-kernel@vger.kernel.org
* Re: [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x,
@ 2003-02-04 19:54 jjs
  0 siblings, 0 replies; 23+ messages in thread
From: jjs @ 2003-02-04 19:54 UTC (permalink / raw)
  To: linux kernel

Tony Gale wrote:

>
> The ssh hang on exit "problem" is a policy of the ssh coders. It'll
> happen when you have a background job still running when you exit, which
> is still connected to the terminal.
>
> As I said, it's an ssh policy issue (which many people disagree with)
> and not a bug.
>
So, admin logs in and restarts a process -
a very very common task. oops, can't log out.

Sure sounds like a thinko to me, if not a bug.

Demoronized openssh packages for
suse and redhat are available by
popular request from:

ftp.mainphrame.com/pub/openssh

Joe


^ permalink raw reply	[flat|nested] 23+ messages in thread
[parent not found: <Pine.LNX.3.96.1030203155651.28323A-100000@dstl.gov.uk>]
* Re: [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x,
@ 2003-02-03 14:23 Franz Sirl
  2003-02-03 17:11 ` J Sloan
  2003-02-03 21:04 ` Bill Davidsen
  0 siblings, 2 replies; 23+ messages in thread
From: Franz Sirl @ 2003-02-03 14:23 UTC (permalink / raw)
  To: bill davidsen; +Cc: linux-kernel

On 2003-02-02 15:40:33 Bill Davidsen wrote:
>On Wed, 29 Jan 2003, David C Niemi wrote:
>
> >
> > On Tue, 28 Jan 2003, David S. Miller wrote:
> > >    From: kuznet@ms2.inr.ac.ru
> > >    Date: Wed, 29 Jan 2003 02:56:41 +0300 (MSK)
> > >
> > >    Hey! Interesting thing has just happened, it is the first time when I
> > >    found the bug formulating a sentence while writing e-mail not while
> > >    peering to code. :-)
> > >
> > > Congratulations :-)
> >
> > Just to confirm, this fix works for me as well.
> >
> > ...
> > > Indeed, this bug exists in 2.4 as well of course.
> > >
> > > This bug is 2.4.3 vintage :-)  It got added as part of initial
> > > zerocopy merge in fact.
> >
> > Odd, then, that I was unable to reproduce the SSH hangs under 2.4.18
> > even once, despite heavily using it for several days under the same
> > circumstances.  Is there any reason 2.4.x would be better able to
> > recover?
> > 2.5.59 with the fix seems to feel a bit less balky than 2.4.18 without the
> > fix, so it seemed to me that 2.4.18 had some way of recovering at the cost
> > of a several second pause in the session.
>
>The problem which I have been seeing with some regularity is not the hang
>you describe (I see that infrequently) but rather a hang after I exit an
>ssh connection. I open several dozen windows at a time to a cluster when I
>do admin, and when I close almost always at least one doesn't drop without
>"~." to help. So far in an hour I haven't seen that.

That's some internal problem in OpenSSH, can be seen on Solaris as well. 
Can be easily reproduced in a ssh session:

nohup sleep 60 &
logout

The ssh session will terminate only after the sleep has exited.

Franz.
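
A minimal userspace sketch of the behaviour described above, assuming the
session can only finish once every process that still holds the session's
output descriptor has closed it. A plain pipe stands in for the terminal
here; this is an illustration, not OpenSSH code, and wait_for_eof() and its
parameters are hypothetical names.

```c
#include <sys/types.h>
#include <sys/wait.h>
#include <time.h>
#include <unistd.h>

/*
 * Fork a "background job" that inherits the write end of a pipe, then
 * wait for EOF on the read end, the way a session waits for its output
 * channel to drain.  If detach is nonzero, the job closes its inherited
 * descriptor before sleeping.
 * Returns the whole seconds spent waiting for EOF, or -1 on error.
 */
static long wait_for_eof(unsigned int job_seconds, int detach)
{
    int fds[2];
    char buf[64];
    time_t start;
    long waited;
    pid_t pid;

    if (pipe(fds) < 0)
        return -1;

    pid = fork();
    if (pid < 0) {
        close(fds[0]);
        close(fds[1]);
        return -1;
    }
    if (pid == 0) {
        /* Child: the background job, e.g. "sleep 60 &". */
        close(fds[0]);
        if (detach)
            close(fds[1]);      /* job no longer holds the output open */
        sleep(job_seconds);
        _exit(0);
    }

    /* Parent: the "shell" logs out, dropping its own write end. */
    close(fds[1]);

    start = time(NULL);
    /* read() returns 0 (EOF) only once no process holds the write end. */
    while (read(fds[0], buf, sizeof(buf)) > 0)
        ;
    waited = (long)(time(NULL) - start);

    close(fds[0]);
    waitpid(pid, NULL, 0);      /* reap the job so no zombie is left */
    return waited;
}
```

Under these assumptions, wait_for_eof(2, 0) should block for about as long
as the job runs, while wait_for_eof(2, 1) should return almost at once. The
second case corresponds to starting the background job with its output
redirected away from the terminal.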



^ permalink raw reply	[flat|nested] 23+ messages in thread
* Re: [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX
@ 2003-01-28 23:21 David S. Miller
  2003-01-29  0:02 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
  2003-01-29  0:09 ` kuznet
  0 siblings, 2 replies; 23+ messages in thread
From: David S. Miller @ 2003-01-28 23:21 UTC (permalink / raw)
  To: benoit-lists; +Cc: kuznet, dada1, cgf, andersg, lkernel2003, linux-kernel, tobi

   From: Sebastian Benoit <benoit-lists@fb12.de>
   Date: Tue, 28 Jan 2003 23:12:01 +0100

   David S. Miller(davem@redhat.com)@2003.01.28 12:34:13 +0000:
   > Thanks for testing, how about this new patch at the end of this email?
   > Does it make the problem go away?
   
   this does it!
   
Alexey, my current suspect is skb->csum state on retransmit.

BTW, how come tcp_trim_head() can just set skb->ip_summed
blindly to CHECKSUM_HW and not set up skb->csum?  Even if you
can depend upon net/core/dev.c to do the checksum for
you, you still would need to set up skb->csum properly.
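
To illustrate the general concern, here is a userspace sketch of a 16-bit
ones' complement sum in the style of RFC 1071. It only shows that a sum
computed over a whole segment no longer matches once bytes are trimmed from
the head, so any cached value has to be brought up to date. It does not
model what the kernel actually stores in skb->csum; csum16() and
csum16_after_trim() are hypothetical helper names.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* 16-bit ones' complement sum over a byte buffer (big-endian words). */
static uint16_t csum16(const uint8_t *data, size_t len)
{
    uint32_t sum = 0;
    size_t i;

    for (i = 0; i + 1 < len; i += 2)
        sum += ((uint32_t)data[i] << 8) | data[i + 1];
    if (i < len)                        /* odd trailing byte, zero padded */
        sum += (uint32_t)data[i] << 8;

    while (sum >> 16)                   /* fold the carries back in */
        sum = (sum & 0xffff) + (sum >> 16);
    return (uint16_t)sum;
}

/* Sum over what remains after dropping `trim` bytes from the head. */
static uint16_t csum16_after_trim(const uint8_t *data, size_t len, size_t trim)
{
    assert(trim <= len);
    return csum16(data + trim, len - trim);
}
```

For the six bytes 12 34 56 78 9a bc the full sum is 0x0369, while the sum
over the last four bytes alone is 0xf134, so a value cached before the trim
would be wrong for the trimmed segment.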

^ permalink raw reply	[flat|nested] 23+ messages in thread
* Re: [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX
@ 2003-01-28 20:34 David S. Miller
  2003-01-28 23:56 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
  0 siblings, 1 reply; 23+ messages in thread
From: David S. Miller @ 2003-01-28 20:34 UTC (permalink / raw)
  To: benoit-lists; +Cc: kuznet, dada1, cgf, andersg, lkernel2003, linux-kernel, tobi

   From: Sebastian Benoit <benoit-lists@fb12.de>
   Date: Tue, 28 Jan 2003 20:16:45 +0100

   David S. Miller(davem@redhat.com)@2003.01.28 10:35:34 +0000:
   > Good set of debug checks would be the following:
   
   no output, i did 4 tests, every time i was able to lock the ssh-connection
   within a few seconds. kernel 2.5.59 + your debug-patch.

Thanks for testing, how about this new patch at the end of this email?
Does it make the problem go away?

Alexey, most solid report is that 2.5.43-bk1 makes bug appear.
This is good because it sort of narrows things down.  What is
contained there in networking is:

1) initial stackable dst logic, should not cause problems
2) addition of UDP sendfile and ip_append_*() logic
3) fix to tcp_check_req() "fix" :-)  it only changes behavior
   on connect so should not be a problem

I heavily, therefore, suspect #2 which is why I am poking around
in the tcp.c changes to change checksumming and copying semantics.

--- net/ipv4/tcp.c.~1~	Tue Jan 28 12:40:09 2003
+++ net/ipv4/tcp.c	Tue Jan 28 12:41:48 2003
@@ -1089,11 +1089,13 @@
 				if (!skb)
 					goto wait_for_memory;
 
+#if 0
 				/*
 				 * Check whether we can use HW checksum.
 				 */
 				if (sk->route_caps & (NETIF_F_IP_CSUM|NETIF_F_NO_CSUM|NETIF_F_HW_CSUM))
 					skb->ip_summed = CHECKSUM_HW;
+#endif
 
 				skb_entail(sk, tp, skb);
 				copy = mss_now;

^ permalink raw reply	[flat|nested] 23+ messages in thread
* [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX
@ 2003-01-27 22:36 David S. Miller
  2003-01-28  2:57 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
  0 siblings, 1 reply; 23+ messages in thread
From: David S. Miller @ 2003-01-27 22:36 UTC (permalink / raw)
  To: andersg; +Cc: lkernel2003, linux-kernel, kuznet, tobi


Hey guys, can you all see if this patch makes the problem go away in
2.5.x?  It is merely a guess, but it is worth trying.

Alexey, this piece of code was buggy first time it was coded, and it
may still have some holes. :-)))

--- net/ipv4/tcp_output.c.~1~	Mon Jan 27 14:45:49 2003
+++ net/ipv4/tcp_output.c	Mon Jan 27 14:46:33 2003
@@ -889,7 +889,7 @@
 	if (atomic_read(&sk->wmem_alloc) > min(sk->wmem_queued+(sk->wmem_queued>>2),sk->sndbuf))
 		return -EAGAIN;
 
-	if (before(TCP_SKB_CB(skb)->seq, tp->snd_una)) {
+	if (0 && before(TCP_SKB_CB(skb)->seq, tp->snd_una)) {
 		if (before(TCP_SKB_CB(skb)->end_seq, tp->snd_una))
 			BUG();
 

^ permalink raw reply	[flat|nested] 23+ messages in thread
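
The tcp_output.c hunk above disables a branch guarded by before(), which
compares TCP sequence numbers modulo 2^32. A minimal userspace sketch of
that comparison follows, assuming the usual signed-difference formulation.
seq_before() and acked_head_bytes() are hypothetical names used for
illustration, not the kernel's functions.

```c
#include <assert.h>
#include <stdint.h>

/*
 * True if seq1 comes before seq2 in sequence space.  The subtraction
 * wraps modulo 2^32, so the test still works across the 0xffffffff -> 0
 * boundary (the cast relies on the usual two's complement conversion).
 */
static int seq_before(uint32_t seq1, uint32_t seq2)
{
    return (int32_t)(seq1 - seq2) < 0;
}

/*
 * Number of bytes at the head of the segment [seq, end_seq) that are
 * already acknowledged, given the oldest unacknowledged sequence number
 * snd_una.  A segment that lies entirely before snd_una is treated as a
 * caller bug, mirroring the BUG() in the quoted hunk.
 */
static uint32_t acked_head_bytes(uint32_t seq, uint32_t end_seq,
                                 uint32_t snd_una)
{
    assert(!seq_before(end_seq, snd_una));
    return seq_before(seq, snd_una) ? snd_una - seq : 0;
}
```

With seq = 0xfffffff0 and snd_una = 0x10 the segment starts before snd_una
even though it is numerically larger, and 0x20 bytes at its head have
already been acknowledged.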

end of thread, other threads:[~2003-02-04 19:44 UTC | newest]

Thread overview: 23+ messages
2003-02-04 19:54 [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, jjs
     [not found] <Pine.LNX.3.96.1030203155651.28323A-100000@dstl.gov.uk>
2003-02-04  9:58 ` Tony Gale
2003-02-04 14:12   ` David Ford
2003-02-04 14:40   ` Bill Davidsen
  -- strict thread matches above, loose matches on Subject: below --
2003-02-03 14:23 Franz Sirl
2003-02-03 17:11 ` J Sloan
2003-02-03 18:22   ` Jeff Garzik
2003-02-03 21:04 ` Bill Davidsen
2003-01-28 23:21 [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX David S. Miller
2003-01-29  0:02 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
2003-01-29  0:09 ` kuznet
2003-01-29  0:46   ` Sebastian Benoit
2003-01-29  4:12     ` Christopher Faylor
2003-01-29  6:52   ` David S. Miller
2003-01-28 20:34 [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX David S. Miller
2003-01-28 23:56 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
2003-01-29  0:08   ` David S. Miller
2003-01-29  3:14     ` kuznet
2003-01-29  7:32       ` David S. Miller
2003-01-29 14:12     ` David C Niemi
2003-01-29 14:24       ` kuznet
2003-01-29 15:11         ` dada1
2003-02-02 15:40       ` Bill Davidsen
2003-01-27 22:36 [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, through Cisco PIX David S. Miller
2003-01-28  2:57 ` [TEST FIX] Re: SSH Hangs in 2.5.59 and 2.5.55 but not 2.4.x, kuznet
2003-01-28  3:22   ` Christopher Faylor
