From: Paolo Bonzini <pbonzini@redhat.com>
To: Markus Armbruster <armbru@redhat.com>
Cc: qemu-devel@nongnu.org, lcapitulino@redhat.com
Subject: Re: [Qemu-devel] [PATCH v2 4/4] json-streamer: Limit number of tokens in addition to total size
Date: Fri, 20 Nov 2015 09:50:43 +0100 [thread overview]
Message-ID: <564EDEE3.80000@redhat.com> (raw)
In-Reply-To: <87egflqjty.fsf@blackfin.pond.sub.org>
On 20/11/2015 07:13, Markus Armbruster wrote:
>>> @@ -64,6 +65,7 @@ static void json_message_process_token(JSONLexer *lexer, QString *token, JSONTok
>>> parser->bracket_count == 0)) {
>>> goto out_emit;
>>> } else if (parser->token_size > MAX_TOKEN_SIZE ||
>>> + qlist_size(parser->tokens) > MAX_TOKEN_COUNT ||
>>
>> This is O(n^2). I'd rather skip this patch, fix the memory hog and
>> possibly decrease MAX_TOKEN_SIZE a bit.
>
> I can't see the square right now.
It's hidden in qlist_size:
static void qlist_size_iter(QObject *obj, void *opaque)
{
size_t *count = opaque;
(*count)++;
}
size_t qlist_size(const QList *qlist)
{
size_t count = 0;
qlist_iter(qlist, qlist_size_iter, &count);
return count;
}
You process the whole list on every token, which makes it O(n^2) where n
is the token count.
Paolo
Anyway, O() is unchanged by my patch,
> only n is more limited. See also commit 65c0f1e, which claims to have
> fixed the quadradic behavior.
>
> Regardless, the memory consumption is still ridiculous, and we certainly
> need to fix that. Whether we can have one for 2.5 I don't know.
>
> Even with a fix, this patch has some value. As explained in the commit
> message, "total token size is a rather imprecise predictor of heap
> usage: many small tokens use more space than few large tokens with the
> same input size, because there's a constant per-token overhead." As
> long as we limit input to limit memory use, we better use a limit that's
> connected to actual memory use with reasonable accuracy This patch
> improves accuracy.
>
>>> parser->bracket_count + parser->brace_count > MAX_NESTING) {
>>> /* Security consideration, we limit total memory allocated per object
>>> * and the maximum recursion depth that a message can force.
>
>
next prev parent reply other threads:[~2015-11-20 8:51 UTC|newest]
Thread overview: 14+ messages / expand[flat|nested] mbox.gz Atom feed top
2015-11-19 15:29 [Qemu-devel] [PATCH v2 0/4] json-streamer: Fix up code to limit nesting and size Markus Armbruster
2015-11-19 15:29 ` [Qemu-devel] [PATCH v2 1/4] json-streamer: Apply nesting limit more sanely Markus Armbruster
2015-11-19 15:29 ` [Qemu-devel] [PATCH v2 2/4] json-streamer: Don't crash when input exceeds nesting limit Markus Armbruster
2015-11-19 15:29 ` [Qemu-devel] [PATCH v2 3/4] check-qjson: Add test for JSON nesting depth limit Markus Armbruster
2015-11-19 15:29 ` [Qemu-devel] [PATCH v2 4/4] json-streamer: Limit number of tokens in addition to total size Markus Armbruster
2015-11-19 22:01 ` Paolo Bonzini
2015-11-20 6:13 ` Markus Armbruster
2015-11-20 8:50 ` Paolo Bonzini [this message]
2015-11-20 17:32 ` Eric Blake
2015-11-23 14:27 ` Paolo Bonzini
2015-11-23 16:03 ` Eric Blake
2015-11-23 17:09 ` Markus Armbruster
2015-11-19 16:15 ` [Qemu-devel] [PATCH v2 0/4] json-streamer: Fix up code to limit nesting and size Eric Blake
2015-11-19 16:59 ` Markus Armbruster
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=564EDEE3.80000@redhat.com \
--to=pbonzini@redhat.com \
--cc=armbru@redhat.com \
--cc=lcapitulino@redhat.com \
--cc=qemu-devel@nongnu.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.