From: Andreas Kinzler <ml-xen-devel@hfp.de>
To: James Harper <james.harper@bendigoit.com.au>
Cc: xen-devel@lists.xensource.com
Subject: Re: State of GPLPV tests - 28.11.11
Date: Tue, 29 Nov 2011 18:05:04 +0100
Message-ID: <4ED510C0.8000202@hfp.de>
In-Reply-To: <AEC6C66638C05B468B556EA548C1A77D01EDFF48@trantor>
On 29.11.2011 00:16, James Harper wrote:
>> I am still running tests 7 days a week on two test systems. Results are quite
>> discouraging though. After experiencing crash after crash I wanted to test if
>> the configuration I called "stable" (Xen 4.0.1, GPLPV 0.11.0.213, dom0 kernel
>> 2.6.32.18-pvops0-ak3) was stable indeed. But even that config crashed when
>> running my torture test. It is stable on our production systems - running
>> other workloads of course.
> What crash are you getting these days? Is it the same one as you used to
> get?
Yes, still exactly the same crashes.
Good news: I think I have found the bug. Since I am not really a
Xen or Windows kernel developer I cannot say for sure, but here is what
I found:
When the domU hung I ran xentop and found that the number of vbd read
requests was a number like 0x7FFFzzzz in hex, which led me to a hypothesis:
GPLPV crashes as soon as the number of disk requests reaches 2^32. On my
hardware with 5000 IOPS this is reached in
2^32 / 5000 IOPS / 3600 sec-per-hour / 24 hours-per-day = 9.94 days
And there we go: there are the 9-10 days I was always seeing.
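The estimate above can be checked with a few lines of C. This is only a sketch of the arithmetic; the helper name days_until_wrap is made up for illustration and is not part of GPLPV or Xen, and the 5000 requests per second figure is the one quoted above.

```c
/* Hypothetical helper (not from GPLPV or Xen): number of days until a
 * counter that advances `requests_per_sec` times per second reaches 2^32. */
static double days_until_wrap(double requests_per_sec)
{
    const double wrap_point = 4294967296.0;        /* 2^32 */
    const double seconds_per_day = 3600.0 * 24.0;  /* sec-per-hour * hours-per-day */

    return wrap_point / requests_per_sec / seconds_per_day;
}
```

Calling days_until_wrap(5000.0) gives a value just under 9.95 days, in line with the 9-10 day interval between hangs.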
I studied the source code of blkback/blktap/aio and found nothing. But
in GPLPV and its use of the ring macros I found suspicious code in every
version of GPLPV I have ever used:
    while (more_to_do)
    {
      rp = xvdd->ring.sring->rsp_prod;
      KeMemoryBarrier();
      for (i = xvdd->ring.rsp_cons; i < rp; i++)
      {
        rep = XenVbd_GetResponse(xvdd, i);
If rp has wrapped around to 10, for example, while xvdd->ring.rsp_cons is
still 0xFFFFFFF7, then "i < rp" is already false on the first check, so the
for loop is skipped, responses are not delivered and we see the hang.
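To illustrate the suspected failure outside the driver, here is a minimal standalone sketch. It assumes the ring indices are free-running unsigned 32-bit counters; the type name ring_idx_t and both function names are made up for this example and are not taken from GPLPV. The second function shows one common wrap-safe idiom (comparing with != instead of <). It is not necessarily the fix GPLPV should adopt.

```c
#include <stdint.h>

/* Assumption: ring indices are free-running unsigned 32-bit counters. */
typedef uint32_t ring_idx_t;

/* Same loop shape as the excerpt above: stops as soon as i >= prod,
 * so it does nothing when prod has wrapped past zero and cons has not. */
static unsigned count_with_less_than(ring_idx_t cons, ring_idx_t prod)
{
    unsigned n = 0;
    for (ring_idx_t i = cons; i < prod; i++)
        n++;
    return n;
}

/* Wrap-safe variant: walks until the index equals the producer index.
 * Unsigned overflow is well defined in C, so i++ wraps from
 * 0xFFFFFFFF to 0 and the loop still terminates at prod. */
static unsigned count_with_not_equal(ring_idx_t cons, ring_idx_t prod)
{
    unsigned n = 0;
    for (ring_idx_t i = cons; i != prod; i++)
        n++;
    return n;
}
```

With cons = 0xFFFFFFF7 and prod = 10, count_with_less_than() processes nothing, while count_with_not_equal() visits all 19 outstanding slots (the same number as the unsigned difference prod - cons).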
Regards Andreas