* RE: Re: [Xen-staging] [xen-3.1-testing] xend: Fix use ofPIFIsPhysical (takes no arguments).
[not found] <B05D2E415E8CC94897BB44233D14EE6805A01ED6@USTR-EXCH5.na.uis.unisys.com>
@ 2007-05-03 19:25 ` Krysan, Susan
2007-05-03 20:29 ` Keir Fraser
0 siblings, 1 reply; 11+ messages in thread
From: Krysan, Susan @ 2007-05-03 19:25 UTC (permalink / raw)
To: xen-devel; +Cc: Pascal Bouchareine, Tom Wilkie, Alex Williamson
I am running on a Unisys ES7000 with changeset 15006, and I still see
this problem. I believe it only happens when I stop the server,
reconfigure the amount of host processors and RAM, and then reboot.
[2007-05-03 13:49:19 4204] INFO (SrvDaemon:331) Xend Daemon started
[2007-05-03 13:49:19 4204] INFO (SrvDaemon:335) Xend changeset: Wed May
02 15:27:10 2007 +0100 15006:b5cfbc8f7d2c.
[2007-05-03 13:49:19 4204] INFO (SrvDaemon:342) Xend version: Unknown.
[2007-05-03 13:49:21 4204] ERROR (SrvDaemon:353) Exception starting xend
(PIF is physical)
Traceback (most recent call last):
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
server/SrvDaemon.py", line 345, in run
servers = SrvServer.create()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
server/SrvServer.py", line 254, in create
root.putChild('xend', SrvRoot())
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
server/SrvRoot.py", line 40, in __init__
self.get(name)
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/web/S
rvDir.py", line 82, in get
val = val.getobj()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/web/S
rvDir.py", line 52, in getobj
self.obj = klassobj()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
server/SrvNode.py", line 30, in __init__
self.xn = XendNode.instance()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
XendNode.py", line 658, in instance
inst = XendNode()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
XendNode.py", line 168, in __init__
XendPIF.recreate(pif, pif_uuid)
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
XendPIF.py", line 208, in recreate
pif.destroy()
File
"/home/unisys/xen-3.1-testing.hg/dist/install/usr/lib64/python/xen/xend/
XendPIF.py", line 309, in destroy
raise PIFIsPhysical()
PIFIsPhysical: PIF is physical
Thanks,
Sue Krysan
Linux Systems Group
Unisys Corporation
-----Original Message-----
From: xen-devel-bounces@lists.xensource.com
[mailto:xen-devel-bounces@lists.xensource.com] On Behalf Of Tom Wilkie
Sent: Wednesday, May 02, 2007 12:16 PM
To: Alex Williamson
Cc: Pascal Bouchareine; Keir Fraser; xen-devel; Tom Wilkie
Subject: Re: [Xen-devel] Re: [Xen-staging] [xen-3.1-testing] xend: Fix
use ofPIFIsPhysical (takes no arguments).
On 2 May 2007, at 17:05, Alex Williamson wrote:
> On Wed, 2007-05-02 at 16:46 +0100, Tom Wilkie wrote:
>> Checked in a fix, and a fix for that fix.
>>
>> cset number 15001.
>
> Confirmed, Thanks!
>
>> ps did you do something weird like turn the box off, remove a network
>> card, and turn it back on? Thats the only way I can think of this
>> bug getting triggered... Or this could be related to the networks
>> scripts not being run when xend is started.
>
> Removing a NIC from a powered off box doesn't seem so weird to me,
> but no, the hardware config did not change at all. The only
> interesting config is that one port of a dual port e1000 NIC is hidden
> from dom0 using "pciback.hide=(0000:01:02.1)". Otherwise it's a
> standard Debian Etch system netbooting xen & xenlinux w/ matching
> tools locally built and installed. Thanks,
Okay, if you weren't changing the cards then it must have been the
previous bug mentioned by Pascal Bouchareine.
I've check in a fix for that too, cset 15002
Cheers
Tom
>
> Alex
>
> --
> Alex Williamson HP Open Source & Linux
> Org.
>
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Re: [Xen-staging] [xen-3.1-testing] xend: Fix use ofPIFIsPhysical (takes no arguments).
2007-05-03 19:25 ` Re: [Xen-staging] [xen-3.1-testing] xend: Fix use ofPIFIsPhysical (takes no arguments) Krysan, Susan
@ 2007-05-03 20:29 ` Keir Fraser
2007-05-03 22:29 ` Numa=on broken? Subrahmanian, Raj
0 siblings, 1 reply; 11+ messages in thread
From: Keir Fraser @ 2007-05-03 20:29 UTC (permalink / raw)
To: Krysan, Susan, xen-devel; +Cc: Pascal Bouchareine, Tom Wilkie, Alex Williamson
On 3/5/07 20:25, "Krysan, Susan" <KRYSANS@unisys.com> wrote:
> I am running on a Unisys ES7000 with changeset 15006, and I still see
> this problem. I believe it only happens when I stop the server,
> reconfigure the amount of host processors and RAM, and then reboot.
Xen-unstable c/s 15006, I assume?
In that case your xend hasn't been properly re-installed: line 208 in your
Python backtrace changed from pif.destroy() to XendBase.destroy() in
changeset 15000. That was the fix for this bug. :-)
-- Keir
^ permalink raw reply [flat|nested] 11+ messages in thread
* Numa=on broken?
2007-05-03 20:29 ` Keir Fraser
@ 2007-05-03 22:29 ` Subrahmanian, Raj
2007-05-03 23:10 ` Ryan Harper
0 siblings, 1 reply; 11+ messages in thread
From: Subrahmanian, Raj @ 2007-05-03 22:29 UTC (permalink / raw)
To: Keir Fraser, Ryan Harper; +Cc: xen-devel
All,
It looks like numa=on is broken.
I have tried with the tip of the tree, with the changeset 14676 (which
fixes the numa=on problem) and the two patches that Ryan sent out.
I am running with boot options acpi=on and numa=on.
I am running on an ES7000 with 2 nodes (2 cells of 8 processors and 32
GB memory each).
Thanks
Raj
PS. We(Unisys) are going to start running our weekly tests with default
numa=on. This weeks' tests brought out this problem.
\ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
\ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
/ \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
/_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Thu May 3 16:58:28 EDT 2007
Latest ChangeSet: Thu May 03 11:22:58 2007 +0100 15007:c857bf38f015
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
-----------------------------------------------------------------------
__ __ _____ ___ _ _ _
\ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
\ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
/ \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
/_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Thu May 3 17:48:50 EDT 2007
Latest ChangeSet: Sat Mar 31 12:20:31 2007 +0100 14676:c1578c694b39
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
------------------------------------------------------------------------
-----
__ __ _____ ___ _ _ _
\ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
\ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
/ \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
/_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Thu May 3 18:07:41 EDT 2007
Latest ChangeSet: Thu May 03 11:22:58 2007 +0100 15007:c857bf38f015
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Numa=on broken?
2007-05-03 22:29 ` Numa=on broken? Subrahmanian, Raj
@ 2007-05-03 23:10 ` Ryan Harper
2007-05-04 2:39 ` Subrahmanian, Raj
2007-05-04 6:23 ` Subrahmanian, Raj
0 siblings, 2 replies; 11+ messages in thread
From: Ryan Harper @ 2007-05-03 23:10 UTC (permalink / raw)
To: Subrahmanian, Raj; +Cc: Ryan Harper, xen-devel
* Subrahmanian, Raj <raj.subrahmanian@unisys.com> [2007-05-03 17:30]:
> All,
> It looks like numa=on is broken.
> I have tried with the tip of the tree, with the changeset 14676 (which
> fixes the numa=on problem) and the two patches that Ryan sent out.
> I am running with boot options acpi=on and numa=on.
> I am running on an ES7000 with 2 nodes (2 cells of 8 processors and 32
> GB memory each).
>
> Thanks
> Raj
> PS. We(Unisys) are going to start running our weekly tests with default
> numa=on. This weeks' tests brought out this problem.
>
> \ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
> \ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
> / \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
> /_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
>
> http://www.cl.cam.ac.uk/netos/xen
> University of Cambridge Computer Laboratory
>
> Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
> (prerelease) (SUSE Linux)) Thu May 3 16:58:28 EDT 2007
> Latest ChangeSet: Thu May 03 11:22:58 2007 +0100 15007:c857bf38f015
>
> (XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
> dom0_mem=512M acpi=on numa=on
> (XEN) 0000000000000000 - 000000000009d000 (usable)
> (XEN) 000000000009d800 - 00000000000a0000 (reserved)
> (XEN) 00000000000ce000 - 0000000000100000 (reserved)
> (XEN) 0000000000100000 - 0000000037e70000 (usable)
> (XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
> (XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
> (XEN) 0000000037f00000 - 00000000f0000000 (usable)
> (XEN) 00000000f8000000 - 00000000fec00000 (reserved)
> (XEN) 00000000fffc0000 - 0000000100000000 (reserved)
> (XEN) 0000000100000000 - 0000001008000000 (usable)
> (XEN) System RAM: 65407MB (66976820kB)
> (XEN) Cannot handle page request order 0!
> (XEN) Cannot handle page request order 2!
> (XEN) Cannot handle page request order 0!
> (XEN) Cannot handle page request order 2!
> (XEN) Cannot handle page request order 0!
> (XEN) Cannot handle page request order 2!
> (XEN) Cannot handle page request order 0!
> (XEN) Cannot handle page request order 2!
> (XEN) Cannot handle page request order 0!
> (XEN) Cannot handle page request order 2!
what repo and what changeset? We fixed this issue a while ago,
I just booted:
changeset: 15020:6f37c763bc0f from http://xenbits.xensource.com/xen-3.1-testing.hg
and
changeset: 15009:3a5722420de7 from http://xenbits.xensource.com/xen-unstable.hg
with no issues on my numa hardware.
--
Ryan Harper
Software Engineer; Linux Technology Center
IBM Corp., Austin, Tx
(512) 838-9253 T/L: 678-9253
ryanh@us.ibm.com
^ permalink raw reply [flat|nested] 11+ messages in thread
* RE: Numa=on broken?
2007-05-03 23:10 ` Ryan Harper
@ 2007-05-04 2:39 ` Subrahmanian, Raj
2007-05-04 6:23 ` Subrahmanian, Raj
1 sibling, 0 replies; 11+ messages in thread
From: Subrahmanian, Raj @ 2007-05-04 2:39 UTC (permalink / raw)
To: Ryan Harper; +Cc: xen-devel
>what repo and what changeset? We fixed this issue a while ago,
>
>I just booted:
>
>changeset: 15020:6f37c763bc0f from
>http://xenbits.xensource.com/xen-3.1-testing.hg
>
>and
>
>changeset: 15009:3a5722420de7 from
>http://xenbits.xensource.com/xen-unstable.hg
>
>with no issues on my numa hardware.
15007 from unstable.
I will try 15009 unstable and 15020 testing.
Raj
^ permalink raw reply [flat|nested] 11+ messages in thread
* RE: Numa=on broken?
2007-05-03 23:10 ` Ryan Harper
2007-05-04 2:39 ` Subrahmanian, Raj
@ 2007-05-04 6:23 ` Subrahmanian, Raj
2007-05-04 13:33 ` Ryan Harper
1 sibling, 1 reply; 11+ messages in thread
From: Subrahmanian, Raj @ 2007-05-04 6:23 UTC (permalink / raw)
To: Ryan Harper; +Cc: xen-devel
Ryan,
I did check out the mail discussion where you encountered and fixed
this.
I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
I have included the debug messages in both runs.
Raj
__ __ _____ ___ _ _ _
\ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
\ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
/ \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
/_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Thu May 3 23:05:43 EDT 2007
Latest ChangeSet: Thu May 03 19:25:47 2007 +0100 15009:3a5722420de7
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
(XEN) Cannot handle page request order 0!
------------------------------------------------------------------------
----------------------------------
__ __ _____ _ ___ _____
\ \/ /___ _ __ |___ / / | / _ \ _ __ __|___ |
\ // _ \ '_ \ |_ \ | || | | |__| '__/ __| / /
/ \ __/ | | | ___) || || |_| |__| | | (__ / /
/_/\_\___|_| |_| |____(_)_(_)___/ |_| \___/_/
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.1.0-rc7 (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Fri May 4 01:52:44 EDT 2007
Latest ChangeSet: Thu May 03 19:28:14 2007 +0100 15021:cc5800ecd71f
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) Cannot handle page request order 0!
(XEN) Cannot handle page request order 2!
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Numa=on broken?
2007-05-04 6:23 ` Subrahmanian, Raj
@ 2007-05-04 13:33 ` Ryan Harper
2007-05-04 13:38 ` Ryan Harper
0 siblings, 1 reply; 11+ messages in thread
From: Ryan Harper @ 2007-05-04 13:33 UTC (permalink / raw)
To: Subrahmanian, Raj; +Cc: xen-devel
[-- Attachment #1: Type: text/plain, Size: 815 bytes --]
* Subrahmanian, Raj <raj.subrahmanian@unisys.com> [2007-05-04 01:23]:
> Ryan,
> I did check out the mail discussion where you encountered and fixed
> this.
> I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
> I have included the debug messages in both runs.
Hrm, well, we need to see which node the request is failing in so we can
figure out how to initialize the heap. I'm attaching a patch that
should hopefully dump out which node the memory request is failing in.
Last time, it was the xmalloc() in xen/common/page_alloc.c in
init_heap_pages().
Give this patch a spin and email me the output.
I'd also be interested in your SRAT table output for this machine.
--
Ryan Harper
Software Engineer; Linux Technology Center
IBM Corp., Austin, Tx
(512) 838-9253 T/L: 678-9253
ryanh@us.ibm.com
[-- Attachment #2: numa_debug.patch --]
[-- Type: text/plain, Size: 910 bytes --]
diff -r 3a5722420de7 xen/common/page_alloc.c
--- a/xen/common/page_alloc.c Thu May 03 19:25:47 2007 +0100
+++ b/xen/common/page_alloc.c Fri May 04 08:29:48 2007 -0500
@@ -353,6 +353,8 @@ static struct page_info *alloc_heap_page
ASSERT(zone_lo <= zone_hi);
ASSERT(zone_hi < NR_ZONES);
+ printk("%s: request for node %d\n", node);
+
if ( unlikely(order > MAX_ORDER) )
return NULL;
@@ -543,8 +545,10 @@ void init_heap_pages(
if ( !avail[nid_curr] )
{
+ printk("%s: attempting to initialize avail[%d]\n", nid_curr);
avail[nid_curr] = xmalloc_array(unsigned long, NR_ZONES);
memset(avail[nid_curr], 0, NR_ZONES * sizeof(long));
+ printk("%s: attempting to initialize _heap[%d]\n", nid_curr);
_heap[nid_curr] = xmalloc(heap_by_zone_and_order_t);
init_heap_block(_heap[nid_curr]);
}
[-- Attachment #3: Type: text/plain, Size: 138 bytes --]
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Numa=on broken?
2007-05-04 13:33 ` Ryan Harper
@ 2007-05-04 13:38 ` Ryan Harper
2007-05-04 17:32 ` Subrahmanian, Raj
2007-05-07 20:24 ` Subrahmanian, Raj
0 siblings, 2 replies; 11+ messages in thread
From: Ryan Harper @ 2007-05-04 13:38 UTC (permalink / raw)
To: Ryan Harper; +Cc: Subrahmanian, Raj, xen-devel
[-- Attachment #1: Type: text/plain, Size: 868 bytes --]
* Ryan Harper <ryanh@us.ibm.com> [2007-05-04 08:34]:
> * Subrahmanian, Raj <raj.subrahmanian@unisys.com> [2007-05-04 01:23]:
> > Ryan,
> > I did check out the mail discussion where you encountered and fixed
> > this.
> > I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
> > I have included the debug messages in both runs.
>
> Hrm, well, we need to see which node the request is failing in so we can
> figure out how to initialize the heap. I'm attaching a patch that
> should hopefully dump out which node the memory request is failing in.
>
> Last time, it was the xmalloc() in xen/common/page_alloc.c in
> init_heap_pages().
>
> Give this patch a spin and email me the output.
Here is one that actually compiles.
--
Ryan Harper
Software Engineer; Linux Technology Center
IBM Corp., Austin, Tx
(512) 838-9253 T/L: 678-9253
ryanh@us.ibm.com
[-- Attachment #2: numa_debug.patch --]
[-- Type: text/plain, Size: 940 bytes --]
diff -r 3a5722420de7 xen/common/page_alloc.c
--- a/xen/common/page_alloc.c Thu May 03 19:25:47 2007 +0100
+++ b/xen/common/page_alloc.c Fri May 04 08:36:42 2007 -0500
@@ -353,6 +353,8 @@ static struct page_info *alloc_heap_page
ASSERT(zone_lo <= zone_hi);
ASSERT(zone_hi < NR_ZONES);
+ printk("%s: request for node %d\n", __func__, node);
+
if ( unlikely(order > MAX_ORDER) )
return NULL;
@@ -543,8 +545,10 @@ void init_heap_pages(
if ( !avail[nid_curr] )
{
+ printk("%s: attempting to initialize avail[%d]\n", __func__, nid_curr);
avail[nid_curr] = xmalloc_array(unsigned long, NR_ZONES);
memset(avail[nid_curr], 0, NR_ZONES * sizeof(long));
+ printk("%s: attempting to initialize _heap[%d]\n", __func__, nid_curr);
_heap[nid_curr] = xmalloc(heap_by_zone_and_order_t);
init_heap_block(_heap[nid_curr]);
}
[-- Attachment #3: Type: text/plain, Size: 138 bytes --]
_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel
^ permalink raw reply [flat|nested] 11+ messages in thread
* RE: Numa=on broken?
2007-05-04 13:38 ` Ryan Harper
@ 2007-05-04 17:32 ` Subrahmanian, Raj
2007-05-07 20:24 ` Subrahmanian, Raj
1 sibling, 0 replies; 11+ messages in thread
From: Subrahmanian, Raj @ 2007-05-04 17:32 UTC (permalink / raw)
To: Ryan Harper; +Cc: xen-devel
Ryan
I will try this out once I get the machine.
Incidentally, it seems to work with numa=on on a single-node machine.
Raj
>-----Original Message-----
>From: Ryan Harper [mailto:ryanh@us.ibm.com]
>Sent: Friday, May 04, 2007 9:38 AM
>To: Ryan Harper
>Cc: Subrahmanian, Raj; xen-devel@lists.xensource.com
>Subject: Re: [Xen-devel] Numa=on broken?
>
>* Ryan Harper <ryanh@us.ibm.com> [2007-05-04 08:34]:
>> * Subrahmanian, Raj <raj.subrahmanian@unisys.com> [2007-05-04 01:23]:
>> > Ryan,
>> > I did check out the mail discussion where you encountered
>and fixed
>> > this.
>> > I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
>> > I have included the debug messages in both runs.
>>
>> Hrm, well, we need to see which node the request is failing in so we
>> can figure out how to initialize the heap. I'm attaching a
>patch that
>> should hopefully dump out which node the memory request is
>failing in.
>>
>> Last time, it was the xmalloc() in xen/common/page_alloc.c in
>> init_heap_pages().
>>
>> Give this patch a spin and email me the output.
>
>Here is one that actually compiles.
>
>
>--
>Ryan Harper
>Software Engineer; Linux Technology Center IBM Corp., Austin, Tx
>(512) 838-9253 T/L: 678-9253
>ryanh@us.ibm.com
>
^ permalink raw reply [flat|nested] 11+ messages in thread
* RE: Numa=on broken?
2007-05-04 13:38 ` Ryan Harper
2007-05-04 17:32 ` Subrahmanian, Raj
@ 2007-05-07 20:24 ` Subrahmanian, Raj
2007-05-07 20:43 ` Ryan Harper
1 sibling, 1 reply; 11+ messages in thread
From: Subrahmanian, Raj @ 2007-05-07 20:24 UTC (permalink / raw)
To: Ryan Harper; +Cc: xen-devel
Ryan,
>> > Ryan,
>> > I did check out the mail discussion where you encountered
>and fixed
>> > this.
>> > I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
>> > I have included the debug messages in both runs.
>>
>> Hrm, well, we need to see which node the request is failing in so we
>> can figure out how to initialize the heap. I'm attaching a
>patch that
>> should hopefully dump out which node the memory request is
>failing in.
>>
>> Last time, it was the xmalloc() in xen/common/page_alloc.c in
>> init_heap_pages().
>>
>> Give this patch a spin and email me the output.
>
>Here is one that actually compiles.
Here's the output..
Unstable tree. Changeset 15009.
Thanks
Raj
__ __ _____ ___ _ _ _
\ \/ /___ _ __ |___ / / _ \ _ _ _ __ ___| |_ __ _| |__ | | ___
\ // _ \ '_ \ |_ \| | | |__| | | | '_ \/ __| __/ _` | '_ \| |/ _ \
/ \ __/ | | | ___) | |_| |__| |_| | | | \__ \ || (_| | |_) | | __/
/_/\_\___|_| |_| |____(_)___/ \__,_|_| |_|___/\__\__,_|_.__/|_|\___|
http://www.cl.cam.ac.uk/netos/xen
University of Cambridge Computer Laboratory
Xen version 3.0-unstable (root@site) (gcc version 4.1.2 20070115
(prerelease) (SUSE Linux)) Sat May 5 00:04:27 EDT 2007
Latest ChangeSet: Thu May 03 19:25:47 2007 +0100 15009:3a5722420de7
(XEN) Command line: /boot/xen.gz com1=115200,8n1 apic_verbosity=debug
dom0_mem=512M acpi=on numa=on
(XEN) 0000000000000000 - 000000000009d000 (usable)
(XEN) 000000000009d800 - 00000000000a0000 (reserved)
(XEN) 00000000000ce000 - 0000000000100000 (reserved)
(XEN) 0000000000100000 - 0000000037e70000 (usable)
(XEN) 0000000037e70000 - 0000000037ed8000 (ACPI data)
(XEN) 0000000037ed8000 - 0000000037f00000 (ACPI NVS)
(XEN) 0000000037f00000 - 00000000f0000000 (usable)
(XEN) 00000000f8000000 - 00000000fec00000 (reserved)
(XEN) 00000000fffc0000 - 0000000100000000 (reserved)
(XEN) 0000000100000000 - 0000001008000000 (usable)
(XEN) System RAM: 65407MB (66976820kB)
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
(XEN) init_heap_pages: attempting to initialize avail[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 0!
(XEN) init_heap_pages: attempting to initialize _heap[1]
(XEN) alloc_heap_pages: request for node 0
(XEN) Cannot handle page request order 2!
^ permalink raw reply [flat|nested] 11+ messages in thread
* Re: Numa=on broken?
2007-05-07 20:24 ` Subrahmanian, Raj
@ 2007-05-07 20:43 ` Ryan Harper
0 siblings, 0 replies; 11+ messages in thread
From: Ryan Harper @ 2007-05-07 20:43 UTC (permalink / raw)
To: Subrahmanian, Raj; +Cc: xen-devel
* Subrahmanian, Raj <raj.subrahmanian@unisys.com> [2007-05-07 15:25]:
> Ryan,
> >> > Ryan,
> >> > I did check out the mail discussion where you encountered
> >and fixed
> >> > this.
> >> > I am seeing this again on 15009, unstable and in 15021 3.1. rc7.
> >> > I have included the debug messages in both runs.
> >>
> >> Hrm, well, we need to see which node the request is failing in so we
> >> can figure out how to initialize the heap. I'm attaching a
> >patch that
> >> should hopefully dump out which node the memory request is
> >failing in.
> >>
> >> Last time, it was the xmalloc() in xen/common/page_alloc.c in
> >> init_heap_pages().
> >>
> >> Give this patch a spin and email me the output.
> >
> >Here is one that actually compiles.
> Here's the output..
> Unstable tree. Changeset 15009.
Do you have the SRAT table information, we need to know if you have any
memory in node0. I believe this code relies on the xenheap residing in
node0.
The call chain is likely:
xen/arch/x86/setup.c:calls init_xenheap_pages()
xen/common/page_alloc.c: init_heap_pages()
if the memory being added to the heap belongs to node1, the structure
is dynamically allocated using xmalloc() which will request a page
from the heap which is no memory is already in the heap, we fail.
That's what this looks like. You can confirm this by adding some debug
to the init_heap_pages() function in xen/common/page_alloc.c.
--
Ryan Harper
Software Engineer; Linux Technology Center
IBM Corp., Austin, Tx
(512) 838-9253 T/L: 678-9253
ryanh@us.ibm.com
^ permalink raw reply [flat|nested] 11+ messages in thread
end of thread, other threads:[~2007-05-07 20:43 UTC | newest]
Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <B05D2E415E8CC94897BB44233D14EE6805A01ED6@USTR-EXCH5.na.uis.unisys.com>
2007-05-03 19:25 ` Re: [Xen-staging] [xen-3.1-testing] xend: Fix use ofPIFIsPhysical (takes no arguments) Krysan, Susan
2007-05-03 20:29 ` Keir Fraser
2007-05-03 22:29 ` Numa=on broken? Subrahmanian, Raj
2007-05-03 23:10 ` Ryan Harper
2007-05-04 2:39 ` Subrahmanian, Raj
2007-05-04 6:23 ` Subrahmanian, Raj
2007-05-04 13:33 ` Ryan Harper
2007-05-04 13:38 ` Ryan Harper
2007-05-04 17:32 ` Subrahmanian, Raj
2007-05-07 20:24 ` Subrahmanian, Raj
2007-05-07 20:43 ` Ryan Harper
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.