xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed
From: Boris Ostrovsky <boris.ostrovsky@oracle.com>
To: Jan Beulich <JBeulich@suse.com>
Cc: Igor Druzhinin <igor.druzhinin@citrix.com>,
	osstest-admin@xenproject.org,
	xen-devel <xen-devel@lists.xenproject.org>,
	Roger Pau Monne <roger.pau@citrix.com>
Subject: Re: [xen-unstable test] 112855: regressions - trouble: blocked/broken/fail/pass
Date: Tue, 29 Aug 2017 08:45:53 -0400	[thread overview]
Message-ID: <07d3bd14-4b6b-ff84-e711-11a9fdf4406e@oracle.com> (raw)
In-Reply-To: <59A53CC60200007800174CCA@prv-mh.provo.novell.com>

On 08/29/2017 04:07 AM, Jan Beulich wrote:
>>>> On 28.08.17 at 17:36, <boris.ostrovsky@oracle.com> wrote:
>> On 08/28/2017 10:52 AM, Jan Beulich wrote:
>>>>>> On 28.08.17 at 16:24, <boris.ostrovsky@oracle.com> wrote:
>>>>>> As for periodically testing process_pending_softirqs() we may still want
>>>>>> to do this in alloc_heap_pages(), even without CONFIG_SCRUB_DEBUG.
>>>>> For my taste, alloc_heap_pages() is the wrong place for such
>>>>> calls.
>>>> But the loop is in alloc_heap_pages() --- where else would you be testing?
>>> It can only reasonably be the callers of alloc_heap_pages() imo.
>>> A single call to it should never trigger the watchdog, 
>> check_one_page() is rather slow so for a large order allocation even
>> with clean heap the 'for' loop may take quite some time. Whether it
>> could trip the watchdog -- I don't know.
> If that was a problem, we'd have to think about shortening the
> loop. I stand by my assertion that nowhere down from
> alloc_heap_pages() should be any invocation of
> process_pending_softirqs() - it is simply too risky, as we don't
> know what state we're in. One thing I could imagine to do is not
> check the entire page, but (randomly?) pick a couple of locations
> to check. But first of all we really need to be clear about whether
> it's really a single alloc_heap_pages() invocation that trips the
> watchdog, or whether something can be done about it in the
> caller(s).

At least one of the crashes was from alloc_chunk()->free_heap_pages(),
i.e. not from inside alloc_heap_pages()' loop. My proposal was not
necessarily based on the specific crashes in this flight (this issue
will be addressed by the patches I sent yesterday) but rather as a
general suggestion. But I understand that calling alloc_heap_pages()
from alloc_heap_pages() may not be a great idea.

I am somewhat puzzled though by the fact that I haven't seen this in my
testing --- I was creating/destroying very large guests (> 1TB) in
parallel so there must have been loops over high orders and I never had
a watchdog go off. And my dom0s were quite large too while the one in
this flight is only 512M.

-boris

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

  reply	other threads:[~2017-08-29 12:46 UTC|newest]

Thread overview: 12+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2017-08-25  3:15 [xen-unstable test] 112855: regressions - trouble: blocked/broken/fail/pass osstest service owner
2017-08-25 13:40 ` Jan Beulich
2017-08-25 17:14   ` Boris Ostrovsky
2017-08-28  7:25     ` Jan Beulich
2017-08-28 13:57       ` Boris Ostrovsky
2017-08-28 14:02         ` Jan Beulich
2017-08-28 14:24           ` Boris Ostrovsky
2017-08-28 14:52             ` Jan Beulich
2017-08-28 15:36               ` Boris Ostrovsky
2017-08-29  8:07                 ` Jan Beulich
2017-08-29 12:45                   ` Boris Ostrovsky [this message]
2017-08-29 13:12                     ` Jan Beulich

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=07d3bd14-4b6b-ff84-e711-11a9fdf4406e@oracle.com \
    --to=boris.ostrovsky@oracle.com \
    --cc=JBeulich@suse.com \
    --cc=igor.druzhinin@citrix.com \
    --cc=osstest-admin@xenproject.org \
    --cc=roger.pau@citrix.com \
    --cc=xen-devel@lists.xenproject.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).