* [Qemu-devel] Tracking hugepages usage
@ 2017-06-01 10:38 Vladyslav Drok
2017-06-01 10:56 ` Andrey Korolyov
0 siblings, 1 reply; 5+ messages in thread
From: Vladyslav Drok @ 2017-06-01 10:38 UTC (permalink / raw)
To: qemu-devel
Hello qemu community!
I come from openstack world, and one of our customers complains about an
issue with huge pages on compute nodes. From the "virsh frepages --all" and
"cat /proc/meminfo", they see that 4 huge pages are consumed:
http://paste.openstack.org/show/611186/
In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node
1. There are no VMs running at this point.
When trying to find out what consumes the 4 1G huge pages from node 0, I
was suggesting "grep 1048576 /proc/*/numa_maps" to find out which processes
are using 1G pages, but in this particular case it shows no processes.
While when some VM is running, I can see the qemu process that's consuming
huge pages, numa_maps reports the correct amount of pages, corresponding to
what has been requested for the VM's RAM.
Are there any recommended ways for trying to track what consumes these 4
"lost" pages? (I might be a bit slow providing more info, as I don't have
access to this environment :( )
Thanks,
Vlad
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage
2017-06-01 10:38 [Qemu-devel] Tracking hugepages usage Vladyslav Drok
@ 2017-06-01 10:56 ` Andrey Korolyov
2017-06-01 11:55 ` Vladyslav Drok
0 siblings, 1 reply; 5+ messages in thread
From: Andrey Korolyov @ 2017-06-01 10:56 UTC (permalink / raw)
To: Vladyslav Drok; +Cc: qemu-devel@nongnu.org
On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> wrote:
> Hello qemu community!
>
> I come from openstack world, and one of our customers complains about an
> issue with huge pages on compute nodes. From the "virsh frepages --all" and
> "cat /proc/meminfo", they see that 4 huge pages are consumed:
>
> http://paste.openstack.org/show/611186/
>
> In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node
> 1. There are no VMs running at this point.
>
> When trying to find out what consumes the 4 1G huge pages from node 0, I
> was suggesting "grep 1048576 /proc/*/numa_maps" to find out which processes
> are using 1G pages, but in this particular case it shows no processes.
> While when some VM is running, I can see the qemu process that's consuming
> huge pages, numa_maps reports the correct amount of pages, corresponding to
> what has been requested for the VM's RAM.
>
> Are there any recommended ways for trying to track what consumes these 4
> "lost" pages? (I might be a bit slow providing more info, as I don't have
> access to this environment :( )
>
> Thanks,
> Vlad
Could you please try to walk against /proc/[0-9]/smaps to check that
these pages are not claimed by any process?
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage
2017-06-01 10:56 ` Andrey Korolyov
@ 2017-06-01 11:55 ` Vladyslav Drok
2017-06-01 16:24 ` Vladyslav Drok
0 siblings, 1 reply; 5+ messages in thread
From: Vladyslav Drok @ 2017-06-01 11:55 UTC (permalink / raw)
To: qemu-devel@nongnu.org
On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote:
> On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> wrote:
> > Hello qemu community!
> >
> > I come from openstack world, and one of our customers complains about an
> > issue with huge pages on compute nodes. From the "virsh frepages --all"
> and
> > "cat /proc/meminfo", they see that 4 huge pages are consumed:
> >
> > http://paste.openstack.org/show/611186/
> >
> > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node
> > 1. There are no VMs running at this point.
> >
> > When trying to find out what consumes the 4 1G huge pages from node 0, I
> > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which
> processes
> > are using 1G pages, but in this particular case it shows no processes.
> > While when some VM is running, I can see the qemu process that's
> consuming
> > huge pages, numa_maps reports the correct amount of pages, corresponding
> to
> > what has been requested for the VM's RAM.
> >
> > Are there any recommended ways for trying to track what consumes these 4
> > "lost" pages? (I might be a bit slow providing more info, as I don't have
> > access to this environment :( )
> >
> > Thanks,
> > Vlad
>
> Could you please try to walk against /proc/[0-9]/smaps to check that
> these pages are not claimed by any process?
>
Thanks for the suggestion! Will provide the results as soon as I have it.
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage
2017-06-01 11:55 ` Vladyslav Drok
@ 2017-06-01 16:24 ` Vladyslav Drok
2017-06-02 11:30 ` Vladyslav Drok
0 siblings, 1 reply; 5+ messages in thread
From: Vladyslav Drok @ 2017-06-01 16:24 UTC (permalink / raw)
To: qemu-devel@nongnu.org
[-- Attachment #1: Type: text/plain, Size: 2485 bytes --]
On Thu, Jun 1, 2017 at 2:55 PM, Vladyslav Drok <vdrok@mirantis.com> wrote:
>
>
> On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote:
>
>> On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com>
>> wrote:
>> > Hello qemu community!
>> >
>> > I come from openstack world, and one of our customers complains about an
>> > issue with huge pages on compute nodes. From the "virsh frepages --all"
>> and
>> > "cat /proc/meminfo", they see that 4 huge pages are consumed:
>> >
>> > http://paste.openstack.org/show/611186/
>> >
>> > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa
>> node
>> > 1. There are no VMs running at this point.
>> >
>> > When trying to find out what consumes the 4 1G huge pages from node 0, I
>> > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which
>> processes
>> > are using 1G pages, but in this particular case it shows no processes.
>> > While when some VM is running, I can see the qemu process that's
>> consuming
>> > huge pages, numa_maps reports the correct amount of pages,
>> corresponding to
>> > what has been requested for the VM's RAM.
>> >
>> > Are there any recommended ways for trying to track what consumes these 4
>> > "lost" pages? (I might be a bit slow providing more info, as I don't
>> have
>> > access to this environment :( )
>> >
>> > Thanks,
>> > Vlad
>>
>> Could you please try to walk against /proc/[0-9]/smaps to check that
>> these pages are not claimed by any process?
>>
>
> Thanks for the suggestion! Will provide the results as soon as I have it.
>
> So, here (in the attachment, is is a bit lengthy so, sorry, was not able
to use paste :)) is an output of ps -F and smaps for processes that have
any entry with KernelPageSize: 1048576 kB. In this case, there are two
instances running on this compute node, 16 GB and 32 GB. The usage reported
by qemu processes seems to report huge page count correctly. In case of
ovs-vswitchd process, I'm not sure how to interpret the output, as if I
just add up the number of pages used by that, it is much bigger than it's
reported as used, any hint on that would be much appreciated :) Though the
ovs-vswitchd process is run with --huge-dir /mnt/huge_ovs_2M which is where
2 mb pages are mounted, so I supposed it should not use the 1G pages.
I'll also try to request the output for a compute that does not have any
instances running but still having some pages used, so the problem is a bit
more clear.
Thanks,
Vlad
[-- Attachment #2: ouput.tar.xz --]
[-- Type: application/x-xz, Size: 23476 bytes --]
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage
2017-06-01 16:24 ` Vladyslav Drok
@ 2017-06-02 11:30 ` Vladyslav Drok
0 siblings, 0 replies; 5+ messages in thread
From: Vladyslav Drok @ 2017-06-02 11:30 UTC (permalink / raw)
To: qemu-devel@nongnu.org, andrey
On Thu, Jun 1, 2017 at 7:24 PM, Vladyslav Drok <vdrok@mirantis.com> wrote:
>
>
> On Thu, Jun 1, 2017 at 2:55 PM, Vladyslav Drok <vdrok@mirantis.com> wrote:
>
>>
>>
>> On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote:
>>
>>> On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com>
>>> wrote:
>>> > Hello qemu community!
>>> >
>>> > I come from openstack world, and one of our customers complains about
>>> an
>>> > issue with huge pages on compute nodes. From the "virsh frepages
>>> --all" and
>>> > "cat /proc/meminfo", they see that 4 huge pages are consumed:
>>> >
>>> > http://paste.openstack.org/show/611186/
>>> >
>>> > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa
>>> node
>>> > 1. There are no VMs running at this point.
>>> >
>>> > When trying to find out what consumes the 4 1G huge pages from node 0,
>>> I
>>> > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which
>>> processes
>>> > are using 1G pages, but in this particular case it shows no processes.
>>> > While when some VM is running, I can see the qemu process that's
>>> consuming
>>> > huge pages, numa_maps reports the correct amount of pages,
>>> corresponding to
>>> > what has been requested for the VM's RAM.
>>> >
>>> > Are there any recommended ways for trying to track what consumes these
>>> 4
>>> > "lost" pages? (I might be a bit slow providing more info, as I don't
>>> have
>>> > access to this environment :( )
>>> >
>>> > Thanks,
>>> > Vlad
>>>
>>> Could you please try to walk against /proc/[0-9]/smaps to check that
>>> these pages are not claimed by any process?
>>>
>>
>> Thanks for the suggestion! Will provide the results as soon as I have it.
>>
>> So, here (in the attachment, is is a bit lengthy so, sorry, was not able
> to use paste :)) is an output of ps -F and smaps for processes that have
> any entry with KernelPageSize: 1048576 kB. In this case, there are two
> instances running on this compute node, 16 GB and 32 GB. The usage reported
> by qemu processes seems to report huge page count correctly. In case of
> ovs-vswitchd process, I'm not sure how to interpret the output, as if I
> just add up the number of pages used by that, it is much bigger than it's
> reported as used, any hint on that would be much appreciated :) Though the
> ovs-vswitchd process is run with --huge-dir /mnt/huge_ovs_2M which is where
> 2 mb pages are mounted, so I supposed it should not use the 1G pages.
>
Reuploaded it on google drive just in case, here -
https://drive.google.com/a/mirantis.com/file/d/0BwCsFeCyKJjMYjdTNFJ0Tnd6OFU/view?usp=sharing
>
> I'll also try to request the output for a compute that does not have any
> instances running but still having some pages used, so the problem is a bit
> more clear.
>
So here is the paste for the compute node that does not have any instances
(smaller and easier to notice a problem) -
http://paste.openstack.org/show/cvT96Bp0Lu1zpwqS0jDa/, smaps does not
report any processes using 1G pages, while there are 4 used by something as
reported by meminfo.
>
> Thanks,
> Vlad
>
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-06-02 11:30 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2017-06-01 10:38 [Qemu-devel] Tracking hugepages usage Vladyslav Drok
2017-06-01 10:56 ` Andrey Korolyov
2017-06-01 11:55 ` Vladyslav Drok
2017-06-01 16:24 ` Vladyslav Drok
2017-06-02 11:30 ` Vladyslav Drok
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).