* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
[not found] ` <20130809131759.GB2868@redhat.com>
@ 2013-08-09 13:29 ` Michal Privoznik
2013-08-09 15:58 ` Anthony Liguori
0 siblings, 1 reply; 7+ messages in thread
From: Michal Privoznik @ 2013-08-09 13:29 UTC (permalink / raw)
To: Daniel P. Berrange; +Cc: libvir-list, QEMU Developers
[CC'ing qemu-devel list]
On 09.08.2013 15:17, Daniel P. Berrange wrote:
> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
>>> This function is to guess the correct limit for maximal memory
>>> usage by qemu for given domain. This can never be guessed
>>> correctly, not to mention all the pains and sleepless nights this
>>> code has caused. Once somebody discovers algorithm to solve the
>>> Halting Problem, we can compute the limit algorithmically. But
>>> till then, this code should never see the light of the release
>>> again.
>>> ---
>>> src/qemu/qemu_cgroup.c | 3 +--
>>> src/qemu/qemu_command.c | 2 +-
>>> src/qemu/qemu_domain.c | 49 -------------------------------------------------
>>> src/qemu/qemu_domain.h | 2 --
>>> src/qemu/qemu_hotplug.c | 2 +-
>>> 5 files changed, 3 insertions(+), 55 deletions(-)
>>
>> ACK. Users that put an explicit limit in their XML are taking on their
>> own risk at guessing correctly; all other users should not be forced to
>> suffer from a bad guess on our part killing their domain.
>
> If we don't understand how to calculate a default limit that works,
> how are users with even less knowledge than us, suppose to calculate
> an explicit level of their own ?
>
> This limit was designed so that the hosts are not vulnerable to DOS
> attack from a compromised QEMU, so removing this is arguably introducing
> a security weakness in our default deployment.
>
> I think I'd like to see some feedback / agreement from QEMU developers
> that this problem really can't be solved, before we remove it.
>
> Daniel
>
In libvirt I've introduced a heuristic to guess the maximum limit for a
memory for a given VM definition. The rationale was "it's better to be
safe by default" and not let leaking qemu trash the host. The heuristic
is only used if user has not configured any limit himself. However, over
the time the number of users reporting OOM kills due to my heuristic has
grown. Finally, I've full nose of this problem so I've made a patch [1]
that removes this 'functionality' completely (I'd say it's bug after
all). In the patch you can see the heuristic we've converged to. But Dan
has his point. If libvirt & qemu devels aren't able to come up with
proper heuristic, how can an ordinary user (who doesn't have any
knowledge of code) do so? So before I apply my patch, I want to ask you
guys, what do you think about it.
Michal
1: https://www.redhat.com/archives/libvir-list/2013-August/msg00437.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-09 13:29 ` [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit Michal Privoznik
@ 2013-08-09 15:58 ` Anthony Liguori
2013-08-09 16:29 ` Daniel P. Berrange
2013-08-09 16:32 ` Andreas Färber
0 siblings, 2 replies; 7+ messages in thread
From: Anthony Liguori @ 2013-08-09 15:58 UTC (permalink / raw)
To: Michal Privoznik, Daniel P. Berrange; +Cc: libvir-list, QEMU Developers
Michal Privoznik <mprivozn@redhat.com> writes:
> [CC'ing qemu-devel list]
> On 09.08.2013 15:17, Daniel P. Berrange wrote:
>> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
>>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
>>>> This function is to guess the correct limit for maximal memory
>>>> usage by qemu for given domain. This can never be guessed
>>>> correctly, not to mention all the pains and sleepless nights this
>>>> code has caused. Once somebody discovers algorithm to solve the
>>>> Halting Problem, we can compute the limit algorithmically. But
>>>> till then, this code should never see the light of the release
>>>> again.
>>>> ---
>>>> src/qemu/qemu_cgroup.c | 3 +--
>>>> src/qemu/qemu_command.c | 2 +-
>>>> src/qemu/qemu_domain.c | 49 -------------------------------------------------
>>>> src/qemu/qemu_domain.h | 2 --
>>>> src/qemu/qemu_hotplug.c | 2 +-
>>>> 5 files changed, 3 insertions(+), 55 deletions(-)
>>>
>>> ACK. Users that put an explicit limit in their XML are taking on their
>>> own risk at guessing correctly; all other users should not be forced to
>>> suffer from a bad guess on our part killing their domain.
>>
>> If we don't understand how to calculate a default limit that works,
>> how are users with even less knowledge than us, suppose to calculate
>> an explicit level of their own ?
>>
>> This limit was designed so that the hosts are not vulnerable to DOS
>> attack from a compromised QEMU, so removing this is arguably introducing
>> a security weakness in our default deployment.
>>
>> I think I'd like to see some feedback / agreement from QEMU developers
>> that this problem really can't be solved, before we remove it.
>>
>> Daniel
>>
>
> In libvirt I've introduced a heuristic to guess the maximum limit for a
> memory for a given VM definition. The rationale was "it's better to be
> safe by default" and not let leaking qemu trash the host. The heuristic
> is only used if user has not configured any limit himself. However, over
> the time the number of users reporting OOM kills due to my heuristic has
> grown. Finally, I've full nose of this problem so I've made a patch [1]
> that removes this 'functionality' completely (I'd say it's bug after
> all). In the patch you can see the heuristic we've converged to. But Dan
> has his point. If libvirt & qemu devels aren't able to come up with
> proper heuristic, how can an ordinary user (who doesn't have any
> knowledge of code) do so? So before I apply my patch, I want to ask you
> guys, what do you think about it.
Even if we had an algorithm for calculating memory overhead (we don't),
glibc will still introduce uncertainty since malloc(size) doesn't
translate to allocating size bytes from the kernel. When you throw in
fragmentation too it becomes extremely hard to predict.
The only practical way of doing this would be to have QEMU gracefully
handle malloc() == NULL so that you could set a limit and gracefully
degrade. We don't though so setting a limit is likely to get you in
trouble.
Regards,
Anthony Liguori
>
> Michal
>
> 1: https://www.redhat.com/archives/libvir-list/2013-August/msg00437.html
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-09 15:58 ` Anthony Liguori
@ 2013-08-09 16:29 ` Daniel P. Berrange
2013-08-19 8:29 ` Michal Privoznik
2013-08-09 16:32 ` Andreas Färber
1 sibling, 1 reply; 7+ messages in thread
From: Daniel P. Berrange @ 2013-08-09 16:29 UTC (permalink / raw)
To: Anthony Liguori; +Cc: libvir-list, Michal Privoznik, QEMU Developers
On Fri, Aug 09, 2013 at 10:58:55AM -0500, Anthony Liguori wrote:
> Michal Privoznik <mprivozn@redhat.com> writes:
>
> > [CC'ing qemu-devel list]
> > On 09.08.2013 15:17, Daniel P. Berrange wrote:
> >> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
> >>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
> >>>> This function is to guess the correct limit for maximal memory
> >>>> usage by qemu for given domain. This can never be guessed
> >>>> correctly, not to mention all the pains and sleepless nights this
> >>>> code has caused. Once somebody discovers algorithm to solve the
> >>>> Halting Problem, we can compute the limit algorithmically. But
> >>>> till then, this code should never see the light of the release
> >>>> again.
> >>>> ---
> >>>> src/qemu/qemu_cgroup.c | 3 +--
> >>>> src/qemu/qemu_command.c | 2 +-
> >>>> src/qemu/qemu_domain.c | 49 -------------------------------------------------
> >>>> src/qemu/qemu_domain.h | 2 --
> >>>> src/qemu/qemu_hotplug.c | 2 +-
> >>>> 5 files changed, 3 insertions(+), 55 deletions(-)
> >>>
> >>> ACK. Users that put an explicit limit in their XML are taking on their
> >>> own risk at guessing correctly; all other users should not be forced to
> >>> suffer from a bad guess on our part killing their domain.
> >>
> >> If we don't understand how to calculate a default limit that works,
> >> how are users with even less knowledge than us, suppose to calculate
> >> an explicit level of their own ?
> >>
> >> This limit was designed so that the hosts are not vulnerable to DOS
> >> attack from a compromised QEMU, so removing this is arguably introducing
> >> a security weakness in our default deployment.
> >>
> >> I think I'd like to see some feedback / agreement from QEMU developers
> >> that this problem really can't be solved, before we remove it.
> >>
> >> Daniel
> >>
> >
> > In libvirt I've introduced a heuristic to guess the maximum limit for a
> > memory for a given VM definition. The rationale was "it's better to be
> > safe by default" and not let leaking qemu trash the host. The heuristic
> > is only used if user has not configured any limit himself. However, over
> > the time the number of users reporting OOM kills due to my heuristic has
> > grown. Finally, I've full nose of this problem so I've made a patch [1]
> > that removes this 'functionality' completely (I'd say it's bug after
> > all). In the patch you can see the heuristic we've converged to. But Dan
> > has his point. If libvirt & qemu devels aren't able to come up with
> > proper heuristic, how can an ordinary user (who doesn't have any
> > knowledge of code) do so? So before I apply my patch, I want to ask you
> > guys, what do you think about it.
>
> Even if we had an algorithm for calculating memory overhead (we don't),
> glibc will still introduce uncertainty since malloc(size) doesn't
> translate to allocating size bytes from the kernel. When you throw in
> fragmentation too it becomes extremely hard to predict.
>
> The only practical way of doing this would be to have QEMU gracefully
> handle malloc() == NULL so that you could set a limit and gracefully
> degrade. We don't though so setting a limit is likely to get you in
> trouble.
So you're saying there's no way we can define a reasonable limit
on a QEMU guest to prevent a compomised QEMU exhausting all host
memory ? It rather sucks if that's the position we're in :-(
Daniel
--
|: http://berrange.com -o- http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org -o- http://virt-manager.org :|
|: http://autobuild.org -o- http://search.cpan.org/~danberr/ :|
|: http://entangle-photo.org -o- http://live.gnome.org/gtk-vnc :|
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-09 15:58 ` Anthony Liguori
2013-08-09 16:29 ` Daniel P. Berrange
@ 2013-08-09 16:32 ` Andreas Färber
1 sibling, 0 replies; 7+ messages in thread
From: Andreas Färber @ 2013-08-09 16:32 UTC (permalink / raw)
To: Anthony Liguori; +Cc: libvir-list, Michal Privoznik, QEMU Developers
Am 09.08.2013 17:58, schrieb Anthony Liguori:
> Even if we had an algorithm for calculating memory overhead (we don't),
> glibc will still introduce uncertainty since malloc(size) doesn't
> translate to allocating size bytes from the kernel. When you throw in
> fragmentation too it becomes extremely hard to predict.
>
> The only practical way of doing this would be to have QEMU gracefully
> handle malloc() == NULL so that you could set a limit and gracefully
> degrade. We don't though so setting a limit is likely to get you in
> trouble.
FWIW my QOM realize work is targetted at reducing likelihood that
device_add blows up QEMU due to OOM in object_new(). But before I can
change qdev-monitor.c I still need to tweak core QOM to either get at
TypeImpl::instance_size or to introduce an object_try_new() function
using g_try_malloc0() rather than g_malloc0(). That's where proper child
struct composition comes into play.
The major variance in runtime memory consumption was so far attributed
to block and network I/O, without ever getting exact proof points...
Regards,
Andreas
--
SUSE LINUX Products GmbH, Maxfeldstr. 5, 90409 Nürnberg, Germany
GF: Jeff Hawn, Jennifer Guild, Felix Imendörffer; HRB 16746 AG Nürnberg
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-09 16:29 ` Daniel P. Berrange
@ 2013-08-19 8:29 ` Michal Privoznik
2013-08-19 9:06 ` Daniel P. Berrange
0 siblings, 1 reply; 7+ messages in thread
From: Michal Privoznik @ 2013-08-19 8:29 UTC (permalink / raw)
To: Daniel P. Berrange; +Cc: libvir-list, QEMU Developers, Anthony Liguori
On 09.08.2013 18:29, Daniel P. Berrange wrote:
> On Fri, Aug 09, 2013 at 10:58:55AM -0500, Anthony Liguori wrote:
>> Michal Privoznik <mprivozn@redhat.com> writes:
>>
>>> [CC'ing qemu-devel list]
>>> On 09.08.2013 15:17, Daniel P. Berrange wrote:
>>>> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
>>>>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
So what's the conclusion? Should I push the patch until qemu gets
stabilized in mem consumption (which can take a while - no offense to
qemu devels, I can imagine it's nontrivial).
Michal
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-19 8:29 ` Michal Privoznik
@ 2013-08-19 9:06 ` Daniel P. Berrange
2013-08-19 10:06 ` Michal Privoznik
0 siblings, 1 reply; 7+ messages in thread
From: Daniel P. Berrange @ 2013-08-19 9:06 UTC (permalink / raw)
To: Michal Privoznik; +Cc: libvir-list, QEMU Developers, Anthony Liguori
On Mon, Aug 19, 2013 at 10:29:10AM +0200, Michal Privoznik wrote:
> On 09.08.2013 18:29, Daniel P. Berrange wrote:
> > On Fri, Aug 09, 2013 at 10:58:55AM -0500, Anthony Liguori wrote:
> >> Michal Privoznik <mprivozn@redhat.com> writes:
> >>
> >>> [CC'ing qemu-devel list]
> >>> On 09.08.2013 15:17, Daniel P. Berrange wrote:
> >>>> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
> >>>>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
>
>
> So what's the conclusion? Should I push the patch until qemu gets
> stabilized in mem consumption (which can take a while - no offense to
> qemu devels, I can imagine it's nontrivial).
Given the lack of any useful info to make memory limits work reliably,
I guess we don't have any choice but to remove this default memory limit,
and also recommend against people setting explicit memory limits too.
Regards,
Daniel
--
|: http://berrange.com -o- http://www.flickr.com/photos/dberrange/ :|
|: http://libvirt.org -o- http://virt-manager.org :|
|: http://autobuild.org -o- http://search.cpan.org/~danberr/ :|
|: http://entangle-photo.org -o- http://live.gnome.org/gtk-vnc :|
^ permalink raw reply [flat|nested] 7+ messages in thread
* Re: [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit
2013-08-19 9:06 ` Daniel P. Berrange
@ 2013-08-19 10:06 ` Michal Privoznik
0 siblings, 0 replies; 7+ messages in thread
From: Michal Privoznik @ 2013-08-19 10:06 UTC (permalink / raw)
To: Daniel P. Berrange; +Cc: libvir-list, QEMU Developers, Anthony Liguori
On 19.08.2013 11:06, Daniel P. Berrange wrote:
> On Mon, Aug 19, 2013 at 10:29:10AM +0200, Michal Privoznik wrote:
>> On 09.08.2013 18:29, Daniel P. Berrange wrote:
>>> On Fri, Aug 09, 2013 at 10:58:55AM -0500, Anthony Liguori wrote:
>>>> Michal Privoznik <mprivozn@redhat.com> writes:
>>>>
>>>>> [CC'ing qemu-devel list]
>>>>> On 09.08.2013 15:17, Daniel P. Berrange wrote:
>>>>>> On Fri, Aug 09, 2013 at 07:13:58AM -0600, Eric Blake wrote:
>>>>>>> On 08/09/2013 06:56 AM, Michal Privoznik wrote:
>>
>>
>> So what's the conclusion? Should I push the patch until qemu gets
>> stabilized in mem consumption (which can take a while - no offense to
>> qemu devels, I can imagine it's nontrivial).
>
> Given the lack of any useful info to make memory limits work reliably,
> I guess we don't have any choice but to remove this default memory limit,
> and also recommend against people setting explicit memory limits too.
>
> Regards,
> Daniel
>
Okay, I've pushed the patch. And I've just proposed patch to discourage
users setting this limit:
https://www.redhat.com/archives/libvir-list/2013-August/msg00820.html
Michal
^ permalink raw reply [flat|nested] 7+ messages in thread
end of thread, other threads:[~2013-08-19 10:06 UTC | newest]
Thread overview: 7+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
[not found] <ad6173415eb6c51c95645e42b4bafa67cac39986.1376052970.git.mprivozn@redhat.com>
[not found] ` <5204EB16.8020801@redhat.com>
[not found] ` <20130809131759.GB2868@redhat.com>
2013-08-09 13:29 ` [Qemu-devel] [libvirt] [PATCH] qemu: Drop qemuDomainMemoryLimit Michal Privoznik
2013-08-09 15:58 ` Anthony Liguori
2013-08-09 16:29 ` Daniel P. Berrange
2013-08-19 8:29 ` Michal Privoznik
2013-08-19 9:06 ` Daniel P. Berrange
2013-08-19 10:06 ` Michal Privoznik
2013-08-09 16:32 ` Andreas Färber
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).