From: "Christopher S. Aker" <caker@theshore.net>
To: xen devel <xen-devel@lists.xensource.com>
Cc: Konrad Rzeszutek Wilk <konrad.wilk@oracle.com>
Subject: Re: Xen 4.1 + 3ware 9690SA = rejecting I/O to offline device
Date: Tue, 27 Sep 2011 14:13:21 -0400
Message-ID: <4E821241.6090602@theshore.net>
In-Reply-To: <4CB38558.5060207@theshore.net>

On 10/11/10 5:44 PM, Christopher S. Aker wrote:
> In an effort to fix the problem described in my previous xen-devel post
> ("New CPUS, now get: NETDEV WATCHDOG: eth0: transmit timed out"), we've
> come across another problem. 3ware 9690SA cards do not behave under Xen
> 4.1 (as of cs 22155).
>
> We have a simple Xen thrash test suite which fires up domUs that do
> different workloads (some swap thrash, some kernel build, some spin
> CPUs, some cycle rebooting, etc). Almost immediately after launching the
> suite we can get the 3ware 9690SA card to fail with something like the
> following:
>
> sd 0:0:0:0: WARNING: (0x06:0x002C): Command (0x28) timed out, resetting card.
> sd 0:0:0:0: WARNING: (0x06:0x002C): Command (0x0) timed out, resetting card.
> sd 0:0:0:0: rejecting I/O to offline device
> sd 0:0:0:0: rejecting I/O to offline device
>
> Under a 2.6.32 dom0 it sometimes also triggers Xenwatch like so:
>
> http://theshore.net/~caker/xen/BUGS/9690SA/xenwatch.txt
>
> Results matrix:
>
> +---------------+---------------------+---------+--------+------+
> | Xen           | Dom0                | 9550SXU | 9690SA | 9750 |
> +---------------+---------------------+---------+--------+------+
> | 3.4.1         | 2.6.18.8-931-2      | OK      | OK     | OK   |
> | 3.4.4-rc1-pre | 2.6.18.8-931-2      | OK      | OK     | OK   |
> | 3.4.4-rc1-pre | 2.6.32.23-g41a85de5 | OK      | OK     | OK   |
> | 4.1 @ 22155   | 2.6.18.8-931-2      | OK      | FAIL   | OK   |
> | 4.1 @ 22155   | 2.6.32.23-g41a85de5 | OK      | FAIL   | OK   |
> +---------------+---------------------+---------+--------+------+
>
> The failures were verified on at least 2 machines of identical
> specification.
>
> The same dom0 kernels that produce a stable 9690SA under Xen 3.4, bomb
> under Xen 4.1.

I'm back at this, and the problem still exists with a 4.1.1/3.0.4 stack.

Konrad, in the "offline raid" thread you asked for the following debug
information:

http://www.theshore.net/~caker/xen/BUGS/offline-raid/

The sysrq-t.txt and triple-a-star.txt outputs were captured after I got
the RAID card to hang (but before it timed out and started spewing to
the console).

Oddly, lspci shows three devices assigned IRQ 16, but /proc/interrupts
lists only two of them. Is that a side effect of MSI?

Also, the problem still happens even with MSI disabled (pci=nomsi).
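
One way to cross-check the two views is to parse /proc/interrupts and
list which handlers are actually registered on a given IRQ line, then
compare that against what lspci reports. The following is a minimal
Python sketch, not a tested diagnostic for this bug. It assumes the
older /proc/interrupts layout (IRQ label, one count column per CPU, a
single interrupt-chip column, then comma-separated handler names). The
function name, the chip_fields parameter and the sample text are
illustrative and were not taken from the affected machine.

```python
def actions_on_irq(interrupts_text, irq, chip_fields=1):
    """Return the handler names registered on `irq`.

    `interrupts_text` is the content of /proc/interrupts.
    `chip_fields` is the number of columns between the per-CPU counts
    and the handler names (assumed to be 1; newer kernels print more).
    """
    lines = interrupts_text.splitlines()
    if not lines:
        return []
    # The header row has one "CPUn" token per CPU column.
    ncpus = len(lines[0].split())
    for line in lines[1:]:
        label, sep, rest = line.partition(":")
        if not sep or label.strip() != str(irq):
            continue
        fields = rest.split()
        # Drop the per-CPU counters and the chip column(s).
        names = " ".join(fields[ncpus + chip_fields:])
        return [n.strip() for n in names.split(",") if n.strip()]
    return []


# Illustrative sample only; real output will differ.
SAMPLE = """\
           CPU0       CPU1
  1:          2          0  IO-APIC-edge      i8042
 16:       1200        340  IO-APIC-fasteoi   uhci_hcd:usb3, 3w-9xxx
"""

if __name__ == "__main__":
    print(actions_on_irq(SAMPLE, 16))
```

On a live system the text would come from open("/proc/interrupts").read().
A device that lspci places on IRQ 16 but that is missing from the
returned list has no handler registered on that line, which would be
expected if its driver is not loaded or if it is using MSI instead.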

Thanks,
-Chris

Thread overview: 6+ messages
2010-10-11 21:44 Xen 4.1 + 3ware 9690SA = rejecting I/O to offline device Christopher S. Aker
2010-11-21 16:55 ` gianfi
2010-11-22 16:37 ` Konrad Rzeszutek Wilk
2011-09-27 18:13 ` Christopher S. Aker [this message]
2011-09-27 18:22 ` Andrew Cooper
2011-09-27 19:33 ` Christopher S. Aker