netdev.vger.kernel.org archive mirror
* Hang when testing with AMD64 with Tg3
@ 2004-10-05  0:11 Stephen Hemminger
  2004-10-05  0:15 ` Andi Kleen
  0 siblings, 1 reply; 9+ messages in thread
From: Stephen Hemminger @ 2004-10-05  0:11 UTC
  To: Andi Kleen, davem; +Cc: netdev

Doing simple iperf test on new opteron with tg3 against existing Xeon
system.  I am seeing something weird: the connection hangs right away.
Is this a TSO bug?

Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)

Both machines are directly connected with a netgear 100mbit switch.

16:59:30.726551 IP XX.YY.250.3.32948 > XX.YY.1.73.5001: S 4113740578:4113740578(0) win 5840 <mss 1460,sackOK,timestamp 4233878 0,nop,wscale 2>
16:59:30.726574 IP XX.YY.1.73.5001 > XX.YY.250.3.32948: S 4117129705:4117129705(0) ack 4113740579 win 5792 <mss 1460,sackOK,timestamp 25903995 4233878,nop,wscale 2>
16:59:30.726578 IP XX.YY.250.3.32948 > XX.YY.1.73.5001: . ack 1 win 1460 <nop,nop,timestamp 4233879 25903995>
16:59:30.726604 IP XX.YY.250.3.32948 > XX.YY.1.73.5001: P 1:25(24) ack 1 win 1460 <nop,nop,timestamp 4233879 25903995>
16:59:30.726641 IP XX.YY.1.73.5001 > XX.YY.250.3.32948: . ack 25 win 1448 <nop,nop,timestamp 25903995 4233879>


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:11 Hang when testing with AMD64 with Tg3 Stephen Hemminger
@ 2004-10-05  0:15 ` Andi Kleen
  2004-10-05  0:29   ` David S. Miller
                     ` (2 more replies)
  0 siblings, 3 replies; 9+ messages in thread
From: Andi Kleen @ 2004-10-05  0:15 UTC
  To: Stephen Hemminger; +Cc: Andi Kleen, davem, netdev

On Mon, Oct 04, 2004 at 05:11:11PM -0700, Stephen Hemminger wrote:
> Doing simple iperf test on new opteron with tg3 against existing Xeon
> system.  I am seeing something weird: the connection hangs right away.
> Is this a TSO bug?
> 
> Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
> Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)
> 
> Both machines are directly connected with a netgear 100mbit switch.


I don't see things like this on my boxes with rc3, also with tg3s.

Of what nature is the hang? Does the machine crash?

-Andi


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:15 ` Andi Kleen
@ 2004-10-05  0:29   ` David S. Miller
  2004-10-05  0:41     ` Andi Kleen
  2004-10-05  0:47     ` Stephen Hemminger
  2004-10-05  1:00   ` Stephen Hemminger
  2004-10-05  1:05   ` Stephen Hemminger
  2 siblings, 2 replies; 9+ messages in thread
From: David S. Miller @ 2004-10-05  0:29 UTC
  To: Andi Kleen; +Cc: shemminger, ak, davem, netdev

On Tue, 5 Oct 2004 02:15:27 +0200
Andi Kleen <ak@suse.de> wrote:

> I don't see things like this on my boxes with rc3, also with tg3s.
> 
> Of what nature is the hang? Does the machine crash?

And what tg3 revisions do you have?


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:29   ` David S. Miller
@ 2004-10-05  0:41     ` Andi Kleen
  2004-10-05  0:47     ` Stephen Hemminger
  1 sibling, 0 replies; 9+ messages in thread
From: Andi Kleen @ 2004-10-05  0:41 UTC
  To: David S. Miller; +Cc: Andi Kleen, shemminger, davem, netdev

On Mon, Oct 04, 2004 at 05:29:48PM -0700, David S. Miller wrote:
> On Tue, 5 Oct 2004 02:15:27 +0200
> Andi Kleen <ak@suse.de> wrote:
> 
> > I don't see things like this on my boxes with rc3, also with tg3s.
> > 
> > Of what nature is the hang? Does the machine crash?
> 
> And what tg3 revisions do you have?

Mostly 5701s.

-Andi


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:29   ` David S. Miller
  2004-10-05  0:41     ` Andi Kleen
@ 2004-10-05  0:47     ` Stephen Hemminger
  1 sibling, 0 replies; 9+ messages in thread
From: Stephen Hemminger @ 2004-10-05  0:47 UTC
  To: David S. Miller; +Cc: Andi Kleen, davem, netdev

On Mon, 2004-10-04 at 17:29 -0700, David S. Miller wrote:
> On Tue, 5 Oct 2004 02:15:27 +0200
> Andi Kleen <ak@suse.de> wrote:
> 
> > I don't see things like this on my boxes with rc3, also with tg3s.
> > 
> > Of what nature is the hang? Does the machine crash?

> And what tg3 revisions do you have?

# ethtool -i eth0
driver: tg3
version: 3.10
firmware-version:
bus-info: 0000:03:02.0

# ethtool -k eth0
Offload parameters for eth0:
rx-checksumming: on
tx-checksumming: on
scatter-gather: on
tcp segmentation offload: off


# lspci -v
0000:03:02.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5703 Gigabit Ethernet (rev 02)
	Subsystem: IBM: Unknown device 026f
	Flags: bus master, 66Mhz, medium devsel, latency 64, IRQ 28
	Memory at c0200000 (64-bit, non-prefetchable)
	Capabilities: [40] PCI-X non-bridge device.
	Capabilities: [48] Power Management version 2
	Capabilities: [50] Vital Product Data
	Capabilities: [58] Message Signalled Interrupts: 64bit+ Queue=0/3 Enable-


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:15 ` Andi Kleen
  2004-10-05  0:29   ` David S. Miller
@ 2004-10-05  1:00   ` Stephen Hemminger
  2004-10-05  1:05   ` Stephen Hemminger
  2 siblings, 0 replies; 9+ messages in thread
From: Stephen Hemminger @ 2004-10-05  1:00 UTC
  To: Andi Kleen; +Cc: davem, netdev

On Tue, 2004-10-05 at 02:15 +0200, Andi Kleen wrote:
> On Mon, Oct 04, 2004 at 05:11:11PM -0700, Stephen Hemminger wrote:
> > Doing simple iperf test on new opteron with tg3 against existing Xeon
> > system.  I am seeing something weird: the connection hangs right away.
> > Is this a TSO bug?
> > 
> > Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
> > Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)
> > 
> > Both machines are directly connected with a netgear 100mbit switch.
> 
> 
> I don't see things like this on my boxes with rc3, also with tg3s.
> 
> Of what nature is the hang? Does the machine crash?

The sender just stops sending data. I can reproduce it every time, and
it even happens with the stock SUSE 9.1 kernel (2.6.4-XX).

I will try a different Ethernet card tomorrow to see whether the problem
is still there.


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  0:15 ` Andi Kleen
  2004-10-05  0:29   ` David S. Miller
  2004-10-05  1:00   ` Stephen Hemminger
@ 2004-10-05  1:05   ` Stephen Hemminger
  2004-10-05 14:58     ` Nivedita Singhvi
  2 siblings, 1 reply; 9+ messages in thread
From: Stephen Hemminger @ 2004-10-05  1:05 UTC
  To: Andi Kleen; +Cc: davem, netdev

On Tue, 2004-10-05 at 02:15 +0200, Andi Kleen wrote:
> On Mon, Oct 04, 2004 at 05:11:11PM -0700, Stephen Hemminger wrote:
> > Doing simple iperf test on new opteron with tg3 against existing Xeon
> > system.  I am seeing something weird: the connection hangs right away.
> > Is this a TSO bug?
> > 
> > Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
> > Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)
> > 
> > Both machines are directly connected with a netgear 100mbit switch.
> 

IT ISN'T A NETWORK PROBLEM.  The problem is that iperf uses POSIX
pthread mutexes to synchronize, and it looks like a futex bug.
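
One way to test the futex suspicion separately from iperf is a tiny
stand-alone program that does nothing but pthread mutex and condition
variable handoffs. The sketch below is not taken from iperf; the names
`struct pingpong`, `worker` and `run_pingpong` are made up for
illustration. If the kernel or libc synchronization path were broken,
the expected symptom would be that the call never returns.

```c
#include <pthread.h>
#include <stddef.h>

/* Shared state for a two-thread ping-pong; every field except the
 * synchronization objects themselves is protected by lock. */
struct pingpong {
    pthread_mutex_t lock;
    pthread_cond_t  cond;
    int             turn;    /* 0: caller's turn, 1: worker's turn */
    long            rounds;  /* number of handoffs each side performs */
};

static void *worker(void *arg)
{
    struct pingpong *pp = arg;

    for (long i = 0; i < pp->rounds; i++) {
        pthread_mutex_lock(&pp->lock);
        while (pp->turn != 1)   /* loop guards against spurious wakeups */
            pthread_cond_wait(&pp->cond, &pp->lock);
        pp->turn = 0;           /* hand the turn back to the caller */
        pthread_cond_signal(&pp->cond);
        pthread_mutex_unlock(&pp->lock);
    }
    return NULL;
}

/* Runs `rounds` handoffs in each direction.  Returns 0 on success or
 * the error number reported by pthread_create()/pthread_join(). */
int run_pingpong(long rounds)
{
    struct pingpong pp;
    pthread_t tid;
    int err;

    pthread_mutex_init(&pp.lock, NULL);
    pthread_cond_init(&pp.cond, NULL);
    pp.turn = 0;
    pp.rounds = rounds;

    err = pthread_create(&tid, NULL, worker, &pp);
    if (err == 0) {
        for (long i = 0; i < rounds; i++) {
            pthread_mutex_lock(&pp.lock);
            while (pp.turn != 0)
                pthread_cond_wait(&pp.cond, &pp.lock);
            pp.turn = 1;        /* hand the turn to the worker */
            pthread_cond_signal(&pp.cond);
            pthread_mutex_unlock(&pp.lock);
        }
        err = pthread_join(tid, NULL);
    }

    pthread_cond_destroy(&pp.cond);
    pthread_mutex_destroy(&pp.lock);
    return err;
}
```

Calling `run_pingpong(1000000)` from a small `main()` and building with
`-pthread`, once with and once without `-O2`, would show whether the
basic primitives misbehave on the box. A clean run does not prove the
futex code is correct, it only makes an application bug more likely.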


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05  1:05   ` Stephen Hemminger
@ 2004-10-05 14:58     ` Nivedita Singhvi
  2004-10-05 17:06       ` Stephen Hemminger
  0 siblings, 1 reply; 9+ messages in thread
From: Nivedita Singhvi @ 2004-10-05 14:58 UTC
  To: Stephen Hemminger; +Cc: Andi Kleen, davem, netdev

Stephen Hemminger wrote:
> On Tue, 2004-10-05 at 02:15 +0200, Andi Kleen wrote:
> 
>>On Mon, Oct 04, 2004 at 05:11:11PM -0700, Stephen Hemminger wrote:
>>
>>>Doing simple iperf test on new opteron with tg3 against existing Xeon
>>>system.  I am seeing something weird: the connection hangs right away.
>>>Is this a TSO bug?
>>>
>>>Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
>>>Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)
>>>
>>>Both machines are directly connected with a netgear 100mbit switch.
>>
> 
> IT ISN'T A NETWORK PROBLEM.  The problem is that iperf uses POSIX
> pthread mutexes to synchronize, and it looks like a futex bug.

It might not be a futex bug. Check whether iperf does
any writes, fflushes, etc. in a signal handler, i.e.
whether its signal handling is thread-safe. glibc now
makes that a fatal bug.

thanks,
Nivedita
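
The pattern usually recommended for the concern raised above is to keep
the handler down to setting a flag and to do all stdio work from the
normal flow of the program. The sketch below only illustrates that
pattern; it is not iperf code, and `on_signal`, `install_handler` and
`report_if_signalled` are hypothetical names. stdio functions such as
`printf()` and `fflush()` take internal locks and are not
async-signal-safe, which is why calling them from a handler can
deadlock.

```c
#define _POSIX_C_SOURCE 200809L
#include <signal.h>
#include <stdio.h>
#include <string.h>

/* Written by the handler, read by normal code.  volatile sig_atomic_t
 * is the type the C standard guarantees can be stored to safely from
 * a signal handler. */
static volatile sig_atomic_t got_signal = 0;

static void on_signal(int signo)
{
    (void)signo;
    got_signal = 1;   /* no printf, fflush, malloc or mutex calls here */
}

/* Installs on_signal for signo.  Returns 0 on success, -1 on error. */
int install_handler(int signo)
{
    struct sigaction sa;

    memset(&sa, 0, sizeof(sa));
    sa.sa_handler = on_signal;
    sigemptyset(&sa.sa_mask);
    return sigaction(signo, &sa, NULL);
}

/* Called from the main loop, outside signal context, where stdio is
 * safe to use.  Returns 1 if a signal was pending and a report was
 * written, 0 otherwise. */
int report_if_signalled(FILE *out)
{
    if (!got_signal)
        return 0;
    got_signal = 0;
    fprintf(out, "interrupted, printing final report\n");
    fflush(out);
    return 1;
}
```

A main loop would call `install_handler(SIGINT)` once at startup and
then poll `report_if_signalled(stdout)` between units of work.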


* Re: Hang when testing with AMD64 with Tg3
  2004-10-05 14:58     ` Nivedita Singhvi
@ 2004-10-05 17:06       ` Stephen Hemminger
  0 siblings, 0 replies; 9+ messages in thread
From: Stephen Hemminger @ 2004-10-05 17:06 UTC
  To: Nivedita Singhvi; +Cc: Andi Kleen, davem, netdev

On Tue, 2004-10-05 at 07:58 -0700, Nivedita Singhvi wrote:
> Stephen Hemminger wrote:
> > On Tue, 2004-10-05 at 02:15 +0200, Andi Kleen wrote:
> > 
> >>On Mon, Oct 04, 2004 at 05:11:11PM -0700, Stephen Hemminger wrote:
> >>
> >>>Doing simple iperf test on new opteron with tg3 against existing Xeon
> >>>system.  I am seeing something weird: the connection hangs right away.
> >>>Is this a TSO bug?
> >>>
> >>>Sender: BK latest (2.6.9-rc3) Tg3 (XX.YY.250.3)
> >>>Receiver: BK + netdev(jeffm) + dave's latest e100 (XX.YY.1.73)
> >>>
> >>>Both machines are directly connected with a netgear 100mbit switch.
> >>
> > 
> > IT ISN'T A NETWORK PROBLEM.  The problem is that iperf uses POSIX
> > pthread mutexes to synchronize, and it looks like a futex bug.
> 
> It might not be a futex bug. Check whether iperf does
> any writes, fflushes, etc. in a signal handler, i.e.
> whether its signal handling is thread-safe. glibc now
> makes that a fatal bug.

It doesn't do I/O from a signal handler, but the problem goes away when
compiling without -O2.
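
The thread does not identify the root cause, so the following is only
one common class of bug that shows this symptom, not a diagnosis of
iperf. When one thread polls a plain variable that another thread
sets, the optimizer is allowed to read it once and never again, so the
waiter can spin or sleep forever at -O2 while working at -O0. The
sketch below shows a correctly synchronized version of such a handoff
using C11 atomics, which are a modern substitute for what was
available in 2004; `done`, `result`, `producer` and `wait_for_result`
are hypothetical names.

```c
#include <pthread.h>
#include <stdatomic.h>
#include <stddef.h>

/* Broken pattern, shown only as a comment:
 *
 *     static int done;         plain int, no synchronization
 *     while (!done) { }        at -O2 the load may be hoisted out of
 *                              the loop, so the loop never exits
 */

static atomic_int done;   /* set by the producer when result is ready */
static int result;        /* written before done is set, read after */

static void *producer(void *arg)
{
    result = *(int *)arg * 2;
    /* release: the write to result becomes visible before done == 1 */
    atomic_store_explicit(&done, 1, memory_order_release);
    return NULL;
}

/* Starts a producer thread that doubles `input` and waits for it.
 * Returns the doubled value, or -1 if the thread could not be created. */
int wait_for_result(int input)
{
    pthread_t tid;

    atomic_store(&done, 0);
    if (pthread_create(&tid, NULL, producer, &input) != 0)
        return -1;

    /* acquire: pairs with the release store above, and the atomic
     * load is re-executed on every iteration */
    while (!atomic_load_explicit(&done, memory_order_acquire))
        ;   /* busy-wait; real code would block on a condition variable */

    pthread_join(tid, NULL);
    return result;
}
```

If a pattern like the commented-out one turned up in the iperf
sources, replacing it with a mutex-protected flag or an atomic would
be the first thing to try before blaming the kernel.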
