From: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
To: John McMonagle <johnm@advocap.org>
Cc: tinnycloud@hotmail.com, xen-devel@lists.xensource.com
Subject: Re: Network dies and  kernel errors
Date: Fri, 29 Jul 2011 13:31:29 -0400
Message-ID: <20110729173129.GA7637@dumpdata.com>
In-Reply-To: <201107291038.21289.johnm@advocap.org>

> > Did you try that? Did that make any difference?
> 
> Not tested I did install one.
> 
> I think I found a way to keep it running.
> On the new igb driver I built from new intel source added module parameter 
> IntMode=1.
> 
> This puts it in msi mode. It was in msi-x mode.
> It's never died with that setting.
> It's up now over a day.
> No real experience with msi-x. I think it's the first time I have seen a 
> driver use msi-x interrupts.
> Maybe that gives you more ideas?

That was my thought - the MSI-X interrupts somehow aren't being ACKed properly.
But I don't know if the issue is with Dom0 or Xen.
> 
> 
> > 
> > > Any ideas?
> > 
> > There is a Xen parameter called 'noirqbalance' . Try that. Also see if you
> > can limit the CPUs in the dom0 using these two arguments on Xen
> > hypervisor:
> > 
> Should I turn off the irqbalence daemon also?

Sure.

> Just in case you wonder it does with out it.
> 
> > dom0_vcpus=2 dom0_vcpus_pin=1
> > 
> > 
> > It would be interesting to narrow down _when_ you trigger this failure. B/c
> > we can pull Xen to see what the MSI's are 'xl debug-keys M' _before_ and
> > _after_ your failure to see if something is amiss.
> > 
> > Mainly to figure out if the vectors are moving around the CPUs (or not)
> > 
> > (XEN)  MSI    29 vec=21 lowest  edge   assert  log lowest dest=00000001
> > mask=0/0/-1
> > 
> > and also 'xl debug-keys i' to see if the domain has ACK-ed the interrupt:
> > (XEN)    IRQ:  29 affinity:00000000,00000000,00000000,00000001 vec:21
> > type=PCI-MSI         status=00000010 in-flight=0 domain-list=0:275(----),
> > 
> > (the last '----' might have something else in in them - if so that is a
> > sign that dom0 hasn't picked up the event/vector).
> 
> Much of my frustration is that I have not found a way to get it to fail other 
> than waiting a long time :-(

Ah that sucks. Well, just make a nice shell script that will run those continuously
(and also 'xl dmesg') and append the output to a log file.
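
Something along these lines might do as a starting point. It is only a rough
sketch: the log path, the 60 second interval and the LOG, INTERVAL, XL and
RUN_CAPTURE variable names are placeholders I made up, so adjust them to taste.
It runs the two debug-keys dumps, then 'xl dmesg' to read back what they
printed, and appends everything to one file with a timestamp per pass.

```shell
#!/bin/sh
# Sketch: periodically dump Xen MSI and IRQ state plus the hypervisor log.
# LOG, INTERVAL, XL and RUN_CAPTURE are illustrative names, not from the thread.
LOG=${LOG:-/var/log/xen-irq-debug.log}
INTERVAL=${INTERVAL:-60}
XL=${XL:-xl}

capture_once() {
    {
        echo "=== $(date '+%Y-%m-%d %H:%M:%S') ==="
        # 'M' dumps the MSI state and 'i' dumps the IRQ bindings. Both print
        # to the hypervisor console ring, which 'xl dmesg' then reads back.
        "$XL" debug-keys M
        "$XL" debug-keys i
        "$XL" dmesg
    } >> "$LOG" 2>&1
}

capture_loop() {
    # Runs until interrupted or until the machine loses its network.
    while :; do
        capture_once
        sleep "$INTERVAL"
    done
}

# Only start looping when explicitly asked to, so the file can also be sourced.
if [ "${RUN_CAPTURE:-0}" = 1 ]; then
    capture_loop
fi
```

Run it as root in dom0, e.g. 'RUN_CAPTURE=1 sh xen-irq-log.sh &' (the file name
is arbitrary). Note that 'xl dmesg' prints the whole console ring on every pass,
so the log will repeat older entries; 'xl dmesg -c' should clear the ring after
reading if that gets too noisy.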

