qemu-devel.nongnu.org archive mirror
 help / color / mirror / Atom feed
From: Luiz Capitulino <lcapitulino@redhat.com>
To: Anthony Liguori <anthony@codemonkey.ws>
Cc: aliguori@us.ibm.com, qemu-devel@nongnu.org
Subject: Re: [Qemu-devel] [PATCH 0/6]: QMP: Fix issues in parser/lexer
Date: Thu, 20 May 2010 10:35:52 -0300	[thread overview]
Message-ID: <20100520103552.1b266a5e@redhat.com> (raw)
In-Reply-To: <4BF45B6C.8000908@codemonkey.ws>

On Wed, 19 May 2010 16:43:08 -0500
Anthony Liguori <anthony@codemonkey.ws> wrote:

> On 05/19/2010 04:15 PM, Luiz Capitulino wrote:
> >   Hi Anthony,
> >
> >   While investigating a QMP bug reported by a user, I've found a few issues
> > in our parser/lexer.
> >
> >   The patches in this series fix the problems I was able to solve, but we
> > still have the following issues:
> >
> > 1. Our 'private extension' is open to the public
> >
> >     Eg. The following input issued by a client is valid:
> >
> >     { 'execute': 'query-pci' }
> >
> >     I don't think it's a good idea to have clients relying on this kind of
> >     JSON extension.
> >
> >     To fix this we could add a 'extension' flag to JSONLexer and set it to
> >     nonzero in internal functions (eg. qobject_from_jsonf()), of course that
> >     the lexer code should handle this too.
> >    
> 
> The JSON specification explicitly says:
> 
> "A JSON parser transforms a JSON text into another representation. A 
> JSON parser MUST accept all texts that conform to the JSON grammar.  A 
> JSON parser MAY accept non-JSON forms or extensions."
> 
> IOW, we're under no obligation to reject extensions and I can't think of 
> a reason why we should.

 I know we're legal, but what's the point to offer this extension to clients?

 The main motivation behind this was to write JSON in C strings w/o the
need of repetitive escapes. This is internal to QEMU, but it's also
available to clients for no reason.

 And you know, after 0.13 we won't be able to remove it.

> > 2. QMP doesn't check the return of json_message_parser_feed()
> >
> >     Which means we don't handle JSON syntax errors. While the fix might seem
> >     trivial (ie. just return an error!), I'm not sure what's the best way
> >     to handle this, because the streamer seems to return multiple errors for
> >     the same input string.
> >
> >     For example, this input:
> >
> >     { "execute": yy_uu }
> >
> >     Seems to return an error for each bad character (yy_uu), shouldn't it
> >     return only once and stop processing the whole string?
> >    
> 
> It probably should kill the connection.

 Ok.

> > 3. The lexer enter in ERROR state when processing is done
> >
> >     Not sure whether this is an issue, but I found it while reviewing the code
> >     and maybe this is related with item 2 above.
> >
> >     When json_lexer_feed_char() is finished scanning a string, (ie. ch='\0')
> >     the JSON_SKIP clause will set lexer->state to ERROR as there's no entry
> >     for '\0' in the IN_START array.
> >
> >     Shouldn't we have a LEXER_DONE or something like it instead?
> >    
> 
> No, you must have malformed input if an error occurs.

 Yes, json_message_parser_feed() returns OK.

> [IN_WHITESPACE] -> TERMINAL(JSON_SKIP)
> 
> JSON_SKIP is a terminal so once you're in that state, you go back to 
> IN_START.

 Yes, but what I'm trying to say is that when ch='\0' and you do:

     lexer->state = json_lexer[IN_START][(uint8_t)ch];

 Then 'lexer->state' becomes 0, which is what the code recognizes as ERROR.

 Again, not sure if this is an issue. Just caught my attention.

> > 4. Lexer expects a 'terminal' char to process a token
> >
> >     Which means clients must send a sort of end of line char, so that we
> >     process their input.
> >
> >     Maybe I'm missing something here, but I thought that the whole point of
> >     writing our own parser was to avoid this.
> >    
> 
> If the lexer gets:
> 
> "abc"
> 
> It has no way of knowing if that's a token or if we're going to get:
> 
> "abcd"
> 
> As a token.  You can fix this in two ways.  You can either flush() the 
> lexer to significant end of input or you can wait until there's some 
> other valid symbol to cause the previous symbol to be emitted.
> 
> IOW, a client either needs to: 1) send the request and follow it with a 
> newline or some form of whitespace or 2) close the connection to flush 
> the request

 Ok.

  reply	other threads:[~2010-05-20 13:37 UTC|newest]

Thread overview: 34+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-05-19 21:15 [Qemu-devel] [PATCH 0/6]: QMP: Fix issues in parser/lexer Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 1/6] json-lexer: Initialize 'x' and 'y' Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 2/6] json-lexer: Handle missing escapes Luiz Capitulino
2010-05-19 21:44   ` Anthony Liguori
2010-05-20 13:44     ` Luiz Capitulino
2010-05-20 15:16       ` [Qemu-devel] " Paolo Bonzini
2010-05-20 15:25         ` Luiz Capitulino
2010-05-20 15:26           ` Paolo Bonzini
2010-05-20 15:35             ` Luiz Capitulino
2010-05-20 15:54               ` Anthony Liguori
2010-05-20 16:27                 ` Luiz Capitulino
2010-05-20 15:50         ` Anthony Liguori
2010-05-20 16:27           ` Luiz Capitulino
2010-05-20 16:55             ` Anthony Liguori
2010-05-20 18:47               ` Luiz Capitulino
2010-05-20 18:52                 ` Anthony Liguori
2010-05-20 19:22                   ` Luiz Capitulino
2010-05-24 19:29                     ` Anthony Liguori
2010-05-24 19:38                       ` Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 3/6] qjson: Handle "\f" Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 4/6] check-qjson: Add more escape tests Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 5/6] json-lexer: Drop 'buf' Luiz Capitulino
2010-05-19 21:15 ` [Qemu-devel] [PATCH 6/6] json-streamer: Don't use qdict_put_obj() Luiz Capitulino
2010-05-19 21:43 ` [Qemu-devel] [PATCH 0/6]: QMP: Fix issues in parser/lexer Anthony Liguori
2010-05-20 13:35   ` Luiz Capitulino [this message]
2010-05-21 18:06     ` Luiz Capitulino
2010-05-20 15:18   ` [Qemu-devel] " Paolo Bonzini
2010-05-20 15:26     ` Luiz Capitulino
2010-05-20 15:52     ` Anthony Liguori
2010-05-20 16:29       ` Luiz Capitulino
2010-05-21  9:08       ` [Qemu-devel] [PATCH] do not require lookahead in json-lexer.c if not necessary Paolo Bonzini
2010-05-21 10:10         ` [Qemu-devel] [PATCH] do not require lookahead for escapes too Paolo Bonzini
2010-05-23  7:50           ` [Qemu-devel] " Paolo Bonzini
2010-05-20 19:49   ` [Qemu-devel] [PATCH 0/6]: QMP: Fix issues in parser/lexer Avi Kivity

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100520103552.1b266a5e@redhat.com \
    --to=lcapitulino@redhat.com \
    --cc=aliguori@us.ibm.com \
    --cc=anthony@codemonkey.ws \
    --cc=qemu-devel@nongnu.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).