From: Bill Burns <bburns@redhat.com>
To: Bill Burns <bburns@redhat.com>
Cc: Ian Pratt <Ian.Pratt@eu.citrix.com>,
xen-devel@lists.xensource.com, "Carb,
Brian A" <Brian.Carb@unisys.com>
Subject: Re: Test results on Unisys ES7000 64x 256gb using unstablec/s 16693 on 3.2.0 Release Candidate
Date: Mon, 28 Jan 2008 09:02:46 -0500 [thread overview]
Message-ID: <479DE086.9020206@redhat.com> (raw)
In-Reply-To: <4799DEEF.4000302@redhat.com>
Bill Burns wrote:
> Bill Burns wrote:
>> Bill Burns wrote:
>>> Ian Pratt wrote:
>>>>>>> No, I have not tried on 3.2.0. Will see if I can at some
>>>>>>> point...
>>>>>> Also, do you have any more info to share on what actually goes wrong
>>>>> when
>>>>>> dom0 has 'too much' memory?
>>>>>>
>>>>> The dom0 kernel spits out messages like the following starting around
>>>>> the
>>>>> init of cpu1 time, and periodically thereafter.
>>>>>
>>>>> Timer ISR/0: Time went backwards: delta=-50206266948
>>>> delta_cpu=13733052
>>>>> shadow=8186343367 off=13649733458 processed=72042343367
>>>>> cpu_processed=21822343367
>>>>>
>>>>> Eventually just hanging (or making such slow progress to be
>>>>> effectively hung).
>>>> How many CPUs does the system have? Does the same large memory issue
>>>> occur if you have fewer physical CPUs?
>>>>
>>> The system has 64 but is only built for 32, so the others are
>>> ignored. Don't know if the problem happens with less CPUs at
>>> this point. Hope to get more data soon...
>>>
>> Interestingly, the symptom seems to disappear with a
>> Hypervisor built to support all 64 CPUs. But I need to
>> get more time on the system to say that for sure.
>>
> Disregard the previous. It still happens. Continuing to debug..
>
>
<snip>
Ok, some progress. Background is that 3.1.2 (and 3.1.3 at least
as it was a wek or two ago) fails to boot on a 64 CPU es7000 with
over 112GB of memory. This is with both HV & dom0 being x86_64.
The symptom is that the dom0 kernel gets time went backwards
error during init.
The patch at which this first fails is 15137, which is the patch
that introduces using the ACPI PM timer as the clock
source. If I include the next patch (that allows for clock
selection) and choose pit as clock source the system boots
fine. Without the arg the ACPI timer is used and I get the hang.
Don't know if this is unique to this platform or a
general issue with large memory. Seems that most folks
smartly limit the dom0 memory well below 112GB.
Note I have not yet tried 3.2. Also note that the
patch determination was not a pure binary search.
There is a later patch (15194) specific to es7000 that
I pulled in and the second patch (15045) breaks things
during the HV init without some later patches, so it
was left out.
Bill
next prev parent reply other threads:[~2008-01-28 14:02 UTC|newest]
Thread overview: 29+ messages / expand[flat|nested] mbox.gz Atom feed top
2008-01-10 2:15 Test results on Unisys ES7000 64x 256gb using unstable c/s 16693 on 3.2.0 Release Candidate Carb, Brian A
2008-01-15 13:50 ` Bill Burns
2008-01-15 14:44 ` Keir Fraser
2008-01-15 16:15 ` Bill Burns
2008-01-15 16:29 ` Keir Fraser
2008-01-16 15:45 ` Bill Burns
2008-01-17 14:10 ` Test results on Unisys ES7000 64x 256gb using unstablec/s " Ian Pratt
2008-01-18 13:03 ` Bill Burns
2008-01-24 17:23 ` Bill Burns
2008-01-25 13:06 ` Bill Burns
2008-01-28 14:02 ` Bill Burns [this message]
2008-01-28 14:08 ` Keir Fraser
2008-01-28 20:38 ` Test results on Unisys ES7000 64x 256gb usingunstablec/s " Carb, Brian A
2008-01-28 21:12 ` Bill Burns
2008-01-29 8:44 ` Test results on Unisys ES7000 64x 256gbusingunstablec/s " Jan Beulich
2008-01-30 16:20 ` Test results on Unisys ES7000 64x 256gb using unstablec/s " Bill Burns
2008-01-30 16:45 ` Keir Fraser
2008-01-31 18:12 ` Bill Burns
2008-02-01 8:36 ` Keir Fraser
2008-02-01 12:40 ` Bill Burns
2008-02-01 20:10 ` Bill Burns
2008-02-08 13:49 ` Large system boot problems Bill Burns
2008-02-08 14:04 ` Keir Fraser
2008-02-08 15:10 ` Bill Burns
2008-02-08 15:14 ` Keir Fraser
2008-02-08 15:22 ` Bill Burns
2008-02-08 15:45 ` Keir Fraser
2008-02-12 16:34 ` Bill Burns
2008-02-12 16:54 ` Keir Fraser
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=479DE086.9020206@redhat.com \
--to=bburns@redhat.com \
--cc=Brian.Carb@unisys.com \
--cc=Ian.Pratt@eu.citrix.com \
--cc=xen-devel@lists.xensource.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.