* [Qemu-devel] Tracking hugepages usage @ 2017-06-01 10:38 Vladyslav Drok 2017-06-01 10:56 ` Andrey Korolyov 0 siblings, 1 reply; 5+ messages in thread From: Vladyslav Drok @ 2017-06-01 10:38 UTC (permalink / raw) To: qemu-devel Hello qemu community! I come from openstack world, and one of our customers complains about an issue with huge pages on compute nodes. From the "virsh frepages --all" and "cat /proc/meminfo", they see that 4 huge pages are consumed: http://paste.openstack.org/show/611186/ In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node 1. There are no VMs running at this point. When trying to find out what consumes the 4 1G huge pages from node 0, I was suggesting "grep 1048576 /proc/*/numa_maps" to find out which processes are using 1G pages, but in this particular case it shows no processes. While when some VM is running, I can see the qemu process that's consuming huge pages, numa_maps reports the correct amount of pages, corresponding to what has been requested for the VM's RAM. Are there any recommended ways for trying to track what consumes these 4 "lost" pages? (I might be a bit slow providing more info, as I don't have access to this environment :( ) Thanks, Vlad ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage 2017-06-01 10:38 [Qemu-devel] Tracking hugepages usage Vladyslav Drok @ 2017-06-01 10:56 ` Andrey Korolyov 2017-06-01 11:55 ` Vladyslav Drok 0 siblings, 1 reply; 5+ messages in thread From: Andrey Korolyov @ 2017-06-01 10:56 UTC (permalink / raw) To: Vladyslav Drok; +Cc: qemu-devel@nongnu.org On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> wrote: > Hello qemu community! > > I come from openstack world, and one of our customers complains about an > issue with huge pages on compute nodes. From the "virsh frepages --all" and > "cat /proc/meminfo", they see that 4 huge pages are consumed: > > http://paste.openstack.org/show/611186/ > > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node > 1. There are no VMs running at this point. > > When trying to find out what consumes the 4 1G huge pages from node 0, I > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which processes > are using 1G pages, but in this particular case it shows no processes. > While when some VM is running, I can see the qemu process that's consuming > huge pages, numa_maps reports the correct amount of pages, corresponding to > what has been requested for the VM's RAM. > > Are there any recommended ways for trying to track what consumes these 4 > "lost" pages? (I might be a bit slow providing more info, as I don't have > access to this environment :( ) > > Thanks, > Vlad Could you please try to walk against /proc/[0-9]/smaps to check that these pages are not claimed by any process? ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage 2017-06-01 10:56 ` Andrey Korolyov @ 2017-06-01 11:55 ` Vladyslav Drok 2017-06-01 16:24 ` Vladyslav Drok 0 siblings, 1 reply; 5+ messages in thread From: Vladyslav Drok @ 2017-06-01 11:55 UTC (permalink / raw) To: qemu-devel@nongnu.org On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote: > On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> wrote: > > Hello qemu community! > > > > I come from openstack world, and one of our customers complains about an > > issue with huge pages on compute nodes. From the "virsh frepages --all" > and > > "cat /proc/meminfo", they see that 4 huge pages are consumed: > > > > http://paste.openstack.org/show/611186/ > > > > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa node > > 1. There are no VMs running at this point. > > > > When trying to find out what consumes the 4 1G huge pages from node 0, I > > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which > processes > > are using 1G pages, but in this particular case it shows no processes. > > While when some VM is running, I can see the qemu process that's > consuming > > huge pages, numa_maps reports the correct amount of pages, corresponding > to > > what has been requested for the VM's RAM. > > > > Are there any recommended ways for trying to track what consumes these 4 > > "lost" pages? (I might be a bit slow providing more info, as I don't have > > access to this environment :( ) > > > > Thanks, > > Vlad > > Could you please try to walk against /proc/[0-9]/smaps to check that > these pages are not claimed by any process? > Thanks for the suggestion! Will provide the results as soon as I have it. ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage 2017-06-01 11:55 ` Vladyslav Drok @ 2017-06-01 16:24 ` Vladyslav Drok 2017-06-02 11:30 ` Vladyslav Drok 0 siblings, 1 reply; 5+ messages in thread From: Vladyslav Drok @ 2017-06-01 16:24 UTC (permalink / raw) To: qemu-devel@nongnu.org [-- Attachment #1: Type: text/plain, Size: 2485 bytes --] On Thu, Jun 1, 2017 at 2:55 PM, Vladyslav Drok <vdrok@mirantis.com> wrote: > > > On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote: > >> On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> >> wrote: >> > Hello qemu community! >> > >> > I come from openstack world, and one of our customers complains about an >> > issue with huge pages on compute nodes. From the "virsh frepages --all" >> and >> > "cat /proc/meminfo", they see that 4 huge pages are consumed: >> > >> > http://paste.openstack.org/show/611186/ >> > >> > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa >> node >> > 1. There are no VMs running at this point. >> > >> > When trying to find out what consumes the 4 1G huge pages from node 0, I >> > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which >> processes >> > are using 1G pages, but in this particular case it shows no processes. >> > While when some VM is running, I can see the qemu process that's >> consuming >> > huge pages, numa_maps reports the correct amount of pages, >> corresponding to >> > what has been requested for the VM's RAM. >> > >> > Are there any recommended ways for trying to track what consumes these 4 >> > "lost" pages? (I might be a bit slow providing more info, as I don't >> have >> > access to this environment :( ) >> > >> > Thanks, >> > Vlad >> >> Could you please try to walk against /proc/[0-9]/smaps to check that >> these pages are not claimed by any process? >> > > Thanks for the suggestion! Will provide the results as soon as I have it. > > So, here (in the attachment, is is a bit lengthy so, sorry, was not able to use paste :)) is an output of ps -F and smaps for processes that have any entry with KernelPageSize: 1048576 kB. In this case, there are two instances running on this compute node, 16 GB and 32 GB. The usage reported by qemu processes seems to report huge page count correctly. In case of ovs-vswitchd process, I'm not sure how to interpret the output, as if I just add up the number of pages used by that, it is much bigger than it's reported as used, any hint on that would be much appreciated :) Though the ovs-vswitchd process is run with --huge-dir /mnt/huge_ovs_2M which is where 2 mb pages are mounted, so I supposed it should not use the 1G pages. I'll also try to request the output for a compute that does not have any instances running but still having some pages used, so the problem is a bit more clear. Thanks, Vlad [-- Attachment #2: ouput.tar.xz --] [-- Type: application/x-xz, Size: 23476 bytes --] ^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [Qemu-devel] Tracking hugepages usage 2017-06-01 16:24 ` Vladyslav Drok @ 2017-06-02 11:30 ` Vladyslav Drok 0 siblings, 0 replies; 5+ messages in thread From: Vladyslav Drok @ 2017-06-02 11:30 UTC (permalink / raw) To: qemu-devel@nongnu.org, andrey On Thu, Jun 1, 2017 at 7:24 PM, Vladyslav Drok <vdrok@mirantis.com> wrote: > > > On Thu, Jun 1, 2017 at 2:55 PM, Vladyslav Drok <vdrok@mirantis.com> wrote: > >> >> >> On Thu, Jun 1, 2017 at 1:56 PM, Andrey Korolyov <andrey@xdel.ru> wrote: >> >>> On Thu, Jun 1, 2017 at 1:38 PM, Vladyslav Drok <vdrok@mirantis.com> >>> wrote: >>> > Hello qemu community! >>> > >>> > I come from openstack world, and one of our customers complains about >>> an >>> > issue with huge pages on compute nodes. From the "virsh frepages >>> --all" and >>> > "cat /proc/meminfo", they see that 4 huge pages are consumed: >>> > >>> > http://paste.openstack.org/show/611186/ >>> > >>> > In total there are 239 1G pages, 120 in numa node 0, and 119 in numa >>> node >>> > 1. There are no VMs running at this point. >>> > >>> > When trying to find out what consumes the 4 1G huge pages from node 0, >>> I >>> > was suggesting "grep 1048576 /proc/*/numa_maps" to find out which >>> processes >>> > are using 1G pages, but in this particular case it shows no processes. >>> > While when some VM is running, I can see the qemu process that's >>> consuming >>> > huge pages, numa_maps reports the correct amount of pages, >>> corresponding to >>> > what has been requested for the VM's RAM. >>> > >>> > Are there any recommended ways for trying to track what consumes these >>> 4 >>> > "lost" pages? (I might be a bit slow providing more info, as I don't >>> have >>> > access to this environment :( ) >>> > >>> > Thanks, >>> > Vlad >>> >>> Could you please try to walk against /proc/[0-9]/smaps to check that >>> these pages are not claimed by any process? >>> >> >> Thanks for the suggestion! Will provide the results as soon as I have it. >> >> So, here (in the attachment, is is a bit lengthy so, sorry, was not able > to use paste :)) is an output of ps -F and smaps for processes that have > any entry with KernelPageSize: 1048576 kB. In this case, there are two > instances running on this compute node, 16 GB and 32 GB. The usage reported > by qemu processes seems to report huge page count correctly. In case of > ovs-vswitchd process, I'm not sure how to interpret the output, as if I > just add up the number of pages used by that, it is much bigger than it's > reported as used, any hint on that would be much appreciated :) Though the > ovs-vswitchd process is run with --huge-dir /mnt/huge_ovs_2M which is where > 2 mb pages are mounted, so I supposed it should not use the 1G pages. > Reuploaded it on google drive just in case, here - https://drive.google.com/a/mirantis.com/file/d/0BwCsFeCyKJjMYjdTNFJ0Tnd6OFU/view?usp=sharing > > I'll also try to request the output for a compute that does not have any > instances running but still having some pages used, so the problem is a bit > more clear. > So here is the paste for the compute node that does not have any instances (smaller and easier to notice a problem) - http://paste.openstack.org/show/cvT96Bp0Lu1zpwqS0jDa/, smaps does not report any processes using 1G pages, while there are 4 used by something as reported by meminfo. > > Thanks, > Vlad > ^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2017-06-02 11:30 UTC | newest] Thread overview: 5+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2017-06-01 10:38 [Qemu-devel] Tracking hugepages usage Vladyslav Drok 2017-06-01 10:56 ` Andrey Korolyov 2017-06-01 11:55 ` Vladyslav Drok 2017-06-01 16:24 ` Vladyslav Drok 2017-06-02 11:30 ` Vladyslav Drok
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for NNTP newsgroup(s).