From: Richard Kennedy <richard@rsk.demon.co.uk>
To: Francois Romieu <romieu@fr.zoreil.com>
Cc: netdev@vger.kernel.org
Subject: Re: v3.0-rc* intermittent network failure: how to debug?
Date: Thu, 21 Jul 2011 16:18:44 +0100
Message-ID: <1311261527.2980.26.camel@castor.rsk>
In-Reply-To: <20110721143218.GA10595@electric-eye.fr.zoreil.com>

On Thu, 2011-07-21 at 16:32 +0200, Francois Romieu wrote:
> Richard Kennedy <richard@rsk.demon.co.uk> :
> > I keep seeing a total network failure on v3.0.0-rc* , it is highly
> > intermittent, anything from 1 hour to 12+, and I don't have a reliable
> > test case.
> > When it fails I lose all network comms, but there are no errors in the
> > system log, no hung tasks reported, nothing. But after it fails the
> > machine hangs during shutdown, it just never turns off. So I guess
> > something is getting stuck but I can't find it.
> 
> Assuming the kernel hangs late enough, you can try the "reboot=" kernel
> parameter and see if a value in arch/x86/include/asm/emergency-restart.h
> makes a difference.
> 
> > Can you suggest how to find out what going on? 
> 
> Switch into text mode before starting the reboot sequence then send a
> magic sysrq T or W ?
> 
> > I'm going to add a serial console and see if that helps.
> 
> It will help, especially with the kilometer long output of sysrq.
> 
> > this is on a x86_64, via_velocity currently running 3.0.0-rc7 latest.
> > 
> > all suggestions gratefully received
> 
> Last via-velocity change in mainline dates back to may 25 (see
> d10358de8d70aaeb965a974d56e9b72f6c6dbb3a). Were you previously fine
> with a recent enough kernel to rule it out ?
> 

Thanks Francois,
I'll try the reboot= parameter tomorrow.

I don't really know when my last known good kernel was. It could be that
via-velocity change, but the problem is so intermittent that it's difficult
to be sure. I've been trying to stress the network to make the problem
happen sooner, but I've had no luck yet.

regards
Richard  
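
For reference, the sysrq T or W dump suggested above can also be
requested from a root shell or script rather than the keyboard, which
helps once a serial console is capturing the output. The following is
only a minimal sketch: it assumes the kernel exposes /proc/sysrq-trigger
and that it runs as root. The helper name send_sysrq and its
trigger_path parameter are hypothetical, introduced here for
illustration.

```python
# Sketch only: send_sysrq and trigger_path are illustrative names,
# not part of any kernel or library API.

SYSRQ_TRIGGER = "/proc/sysrq-trigger"  # procfs file that accepts one sysrq key

# The two keys mentioned in the thread.
SYSRQ_KEYS = {
    "t": "dump the state of all tasks to the kernel log",
    "w": "dump blocked (uninterruptible) tasks to the kernel log",
}


def send_sysrq(key, trigger_path=SYSRQ_TRIGGER):
    """Write a single sysrq key to the trigger file and describe it.

    Writing to the real trigger file requires root. The dump itself
    appears in the kernel log (dmesg or the serial console), not here.
    """
    if key not in SYSRQ_KEYS:
        raise ValueError("unsupported sysrq key: %r" % (key,))
    with open(trigger_path, "w") as trigger:
        trigger.write(key)
    return SYSRQ_KEYS[key]


# Example (as root, once the network has failed):
#   send_sysrq("w")   # then read the output on the serial console
```

The W dump is usually the shorter of the two, since it lists only
blocked tasks, so it may be the better first step when hunting for
whatever is stuck at shutdown.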



Thread overview: 4+ messages
2011-07-21 13:49 v3.0-rc* intermittent network failure: how to debug? Richard Kennedy
2011-07-21 14:32 ` Francois Romieu
2011-07-21 15:18   ` Richard Kennedy [this message]
2011-07-25 12:01     ` v3.0-rc* intermittent network failure: Test case found! Richard Kennedy
