From: Hans Nieser <hnsr@xs4all.nl>
To: Francois Romieu <romieu@fr.zoreil.com>
Cc: netdev@vger.kernel.org, linux-kernel@vger.kernel.org
Subject: Re: Mass udp flow reboot linux with RealTek RTL-8169 Gigabit
Date: Wed, 23 Feb 2011 19:31:33 +0100 [thread overview]
Message-ID: <1298485893.31256.40.camel@krikkit> (raw)
In-Reply-To: <1298463704.31256.29.camel@krikkit>
On Wed, 2011-02-23 at 13:21 +0100, Hans Nieser wrote:
> On Wed, 2011-02-23 at 10:55 +0100, Francois Romieu wrote:
> > Hans Nieser <hnsr@xs4all.nl> :
> > [...]
> > > With your patches applied to 2.6.38-rc6, I have gathered some of the
> > > info you requested from Seblu as well, I hope it's helpful:
> > >
> > > 1: see attachment
> >
> > Ok.
> >
> > The chipset requires no trivial last minute regression fix (yet).
> >
> > > 2: I'm not sure how to check the size of the packets, but I'm just
> > > fetching a (large) file over http/tcp, so I guess they are mostly of the
> > > size of my MTU which is 1500 looking at ifconfig output
> >
> > Fine.
> >
> > Your testcases are always based on a real download, whence including some
> > disk activity, as opposed to a pure network test, right ?
>
> Yeah, I just had a little script that wgetted a file from a webserver in
> my LAN and saved it to separate (non-root) fs, then removed it - in a
> loop. When testing on the 2.6.35 and 2.6.35.9 kernels it did max out at
> about 107MiB/s, sometimes falling down a little presumably when disk was
> being touched.
>
> > > For the other vmstat/ethtool/interrupts output, I started the following
> > > commands remotely via ssh a second or two before starting the download,
> > > and the machine locked up a few seconds later:
> >
> > SysRq is enabled (/etc/sysctl.conf::kernel.sysrq = 1), the computer was
> > switched back on a no-X console before the test. Then the keyboard leds
> > ignore keypresses and the sysrq keys don't display anything in the
> > console, right ?
>
> Yep I had X shutdown and switched to VT1, after lock up the LEDs can't
> be toggled anymore and sysrq key combo was nonresponsive (it works if I
> do it before it locks up)
>
> > You may enable PCIEASPM_DEBUG, force 'pcie_aspm=off' and switch from
> > SLUB to SLAB but it's a bit cargo-cultish.
>
> I'll give that a try this evening
>
> > A bisection could help. Bisecting 2.6.35 .. 2.6.35.9 may be enough if
> > 2.6.35.9 works well.
>
> Hmm did you mean bisecting 2.6.36 - 2.6.35.9 ? Since with 2.6.36 and
> above I can get the machine to hang within seconds and performance is
> really bad (10-20MiB/s with wget), while with 2.6.35.9 and 2.6.35
> performance was really good (reaching 107MiB/s most of the time) and
> lock up took 5-10 minutes instead of seconds (I guess I didn't mention
> this in my last e-mail but I managed to get both 2.6.35 and 2.6.35.9 to
> lock up eventually) - but I guess something changed between .35 and .36
> that made the issue easier to trigger.
>
> I can also try even older kernels to see if there is one that doesn't
> lock up at all
>
Ok, I just tried 2.6.34, and after over 5 hours of running my script,
the system is still up and running, with only 24 'link up' messages on
dmesg, and having transferred 2.1TiB of data (1428042421 rx_packets, 45
rx_missed). So I'm going to assume the problem isn't present with this
kernel and try a bisect between it and 2.6.35
next prev parent reply other threads:[~2011-02-23 18:31 UTC|newest]
Thread overview: 17+ messages / expand[flat|nested] mbox.gz Atom feed top
2011-02-22 17:33 Mass udp flow reboot linux with RealTek RTL-8169 Gigabit Hans Nieser
2011-02-23 9:55 ` Francois Romieu
2011-02-23 12:21 ` Hans Nieser
2011-02-23 18:31 ` Hans Nieser [this message]
2011-02-25 13:45 ` Hans Nieser
2011-03-03 19:53 ` Hans Nieser
-- strict thread matches above, loose matches on Subject: below --
2011-02-21 11:56 Hans Nieser
2011-02-13 1:35 Seblu
2011-02-13 7:17 ` Eric Dumazet
2011-02-13 13:56 ` Francois Romieu
2011-02-13 17:27 ` Seblu
2011-02-13 18:02 ` Seblu
2011-02-13 20:34 ` Francois Romieu
2011-02-18 2:54 ` Seblu
2011-02-18 9:30 ` Francois Romieu
2011-03-06 0:29 ` Seblu
2011-03-10 12:08 ` Francois Romieu
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=1298485893.31256.40.camel@krikkit \
--to=hnsr@xs4all.nl \
--cc=linux-kernel@vger.kernel.org \
--cc=netdev@vger.kernel.org \
--cc=romieu@fr.zoreil.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.