* [PATCH v4] Do not decode url protocol.
@ 2010-06-22 20:16 Pascal Obry
2010-06-22 21:39 ` Matthieu Moy
` (2 more replies)
0 siblings, 3 replies; 5+ messages in thread
From: Pascal Obry @ 2010-06-22 20:16 UTC (permalink / raw)
To: git list, Matthieu Moy
When using the protocol git+ssh:// for example we do not want to
decode the '+' as a space. The url decoding must take place only
for the server name and parameters.
This fixes a regression introduced in 9d2e942.
---
url.c | 19 +++++++++++++++----
1 files changed, 15 insertions(+), 4 deletions(-)
Ok, so this is the fourth version of this patch. Thanks again Matthieu
for the review. I think this time I got the place for the message right :)
Anyway, I think this time we properly skip the protocol decoding when
needed.
diff --git a/url.c b/url.c
index cd32b92..0eb7fb3 100644
--- a/url.c
+++ b/url.c
@@ -67,12 +67,23 @@ static int url_decode_char(const char *q)
return val;
}
-static char *url_decode_internal(const char **query, const char *stop_at)
+static char *url_decode_internal(const char **query, const char *stop_at,
+ int with_protocol)
{
const char *q = *query;
+ const char *first_slash;
struct strbuf out;
strbuf_init(&out, 16);
+
+ /* Skip protocol if present. */
+ if (with_protocol) {
+ first_slash = strchr(*query, '/');
+
+ while (q < first_slash)
+ strbuf_addch(&out, *q++);
+ }
+
do {
unsigned char c = *q;
@@ -104,15 +115,15 @@ static char *url_decode_internal(const char
**query, const char *stop_at)
char *url_decode(const char *url)
{
- return url_decode_internal(&url, NULL);
+ return url_decode_internal(&url, NULL, 1);
}
char *url_decode_parameter_name(const char **query)
{
- return url_decode_internal(query, "&=");
+ return url_decode_internal(query, "&=", 0);
}
char *url_decode_parameter_value(const char **query)
{
- return url_decode_internal(query, "&");
+ return url_decode_internal(query, "&", 0);
}
--
1.7.1.426.gb436.dirty
--
--|------------------------------------------------------
--| Pascal Obry Team-Ada Member
--| 45, rue Gabriel Peri - 78114 Magny Les Hameaux FRANCE
--|------------------------------------------------------
--| http://www.obry.net - http://v2p.fr.eu.org
--| "The best way to travel is by means of imagination"
--|
--| gpg --keyserver keys.gnupg.net --recv-key F949BD3B
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v4] Do not decode url protocol.
2010-06-22 20:16 [PATCH v4] Do not decode url protocol Pascal Obry
@ 2010-06-22 21:39 ` Matthieu Moy
2010-06-22 22:46 ` Jeff King
2010-06-23 17:27 ` Junio C Hamano
2 siblings, 0 replies; 5+ messages in thread
From: Matthieu Moy @ 2010-06-22 21:39 UTC (permalink / raw)
To: pascal; +Cc: git list
Pascal Obry <pascal.obry@gmail.com> writes:
> When using the protocol git+ssh:// for example we do not want to
> decode the '+' as a space. The url decoding must take place only
> for the server name and parameters.
>
> This fixes a regression introduced in 9d2e942.
> ---
> url.c | 19 +++++++++++++++----
> 1 files changed, 15 insertions(+), 4 deletions(-)
>
> Ok, so this is the fourth version of this patch. Thanks again Matthieu
> for the review. I think this time I got the place for the message
> right :)
Hmm, what's so unclear in "between the --- (tripple) and the
diffstat." ;-) ? (especially the "between" part)
> + /* Skip protocol if present. */
> + if (with_protocol) {
> + first_slash = strchr(*query, '/');
> +
> + while (q < first_slash)
> + strbuf_addch(&out, *q++);
> + }
Nothing personal, but you messed up indentation. Git indents with tabs
(8 chars width), not 2 spaces.
--
Matthieu Moy
http://www-verimag.imag.fr/~moy/
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v4] Do not decode url protocol.
2010-06-22 20:16 [PATCH v4] Do not decode url protocol Pascal Obry
2010-06-22 21:39 ` Matthieu Moy
@ 2010-06-22 22:46 ` Jeff King
2010-06-23 17:27 ` Junio C Hamano
2 siblings, 0 replies; 5+ messages in thread
From: Jeff King @ 2010-06-22 22:46 UTC (permalink / raw)
To: pascal; +Cc: git list, Matthieu Moy
On Tue, Jun 22, 2010 at 10:16:57PM +0200, Pascal Obry wrote:
> When using the protocol git+ssh:// for example we do not want to
> decode the '+' as a space. The url decoding must take place only
> for the server name and parameters.
>
> This fixes a regression introduced in 9d2e942.
Sorry, this is completely my fault. I obviously didn't test with
git+ssh. I looked at adding a new test case, but I don't think there's
an easy way. The only way to trigger it is by using "git+ssh" or
"ssh+git"; any other protocol (real or fake) will be handled by a custom
protocol handler and won't execute this broken code at all.
The patch looks reasonable (and I agree with all of Matthieu's comments
thus far). With respect to the "what if there is no slash" issue
discussed earlier, I prefer to be a bit defensive about possible inputs
in library-ish functions like this. But:
> +
> + /* Skip protocol if present. */
> + if (with_protocol) {
> + first_slash = strchr(*query, '/');
> +
> + while (q < first_slash)
> + strbuf_addch(&out, *q++);
> + }
> +
I think this actually works OK, given that "q < first_slash" will be
false if first_slash is NULL, assuming NULL is all-bytes-zero or some
low number (which is not guaranteed by the standard, but is a pretty
practical assumption these days).
However, you could make things a little more efficient by handing the
whole thing to strbuf at once:
if (with_protocol) {
const char *first_slash = strchr(q, '/');
if (first_slash) {
strbuf_add(&out, q, first_slash - q);
q = first_slash;
}
}
which of course needs first_slash to be checked explicitly. Not that
the efficiency probably matters in practice.
-Peff
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: [PATCH v4] Do not decode url protocol.
2010-06-22 20:16 [PATCH v4] Do not decode url protocol Pascal Obry
2010-06-22 21:39 ` Matthieu Moy
2010-06-22 22:46 ` Jeff King
@ 2010-06-23 17:27 ` Junio C Hamano
2010-06-23 17:38 ` Jeff King
2 siblings, 1 reply; 5+ messages in thread
From: Junio C Hamano @ 2010-06-23 17:27 UTC (permalink / raw)
To: pascal; +Cc: git list, Jeff King, Matthieu Moy
Pascal Obry <pascal.obry@gmail.com> writes:
> When using the protocol git+ssh:// for example we do not want to
> decode the '+' as a space. The url decoding must take place only
> for the server name and parameters.
>
> This fixes a regression introduced in 9d2e942.
Sign-off?
As the patch was whitespace-broken, I'm proposing to rewrite it like the
following.
-- >8 --
url.c: "<scheme>://" part at the beginning should not be URL decoded
When using the protocol git+ssh:// for example we do not want to
decode the '+' as a space. The url decoding must take place only
for the server name and parameters.
This fixes a regression introduced in 9d2e942.
Initial-fix-by: Pascal Obry <pascal.obry@gmail.com>
Signed-off-by: Junio C Hamano <gitster@pobox.com>
---
url.c | 28 ++++++++++++++++++----------
1 files changed, 18 insertions(+), 10 deletions(-)
diff --git a/url.c b/url.c
index cd32b92..bf5bb9c 100644
--- a/url.c
+++ b/url.c
@@ -67,12 +67,10 @@ static int url_decode_char(const char *q)
return val;
}
-static char *url_decode_internal(const char **query, const char *stop_at)
+static char *url_decode_internal(const char **query, const char *stop_at, struct strbuf *out)
{
const char *q = *query;
- struct strbuf out;
- strbuf_init(&out, 16);
do {
unsigned char c = *q;
@@ -86,33 +84,43 @@ static char *url_decode_internal(const char **query, const char *stop_at)
if (c == '%') {
int val = url_decode_char(q + 1);
if (0 <= val) {
- strbuf_addch(&out, val);
+ strbuf_addch(out, val);
q += 3;
continue;
}
}
if (c == '+')
- strbuf_addch(&out, ' ');
+ strbuf_addch(out, ' ');
else
- strbuf_addch(&out, c);
+ strbuf_addch(out, c);
q++;
} while (1);
*query = q;
- return strbuf_detach(&out, NULL);
+ return strbuf_detach(out, NULL);
}
char *url_decode(const char *url)
{
- return url_decode_internal(&url, NULL);
+ struct strbuf out = STRBUF_INIT;
+ const char *slash = strchr(url, '/');
+
+ /* Skip protocol part if present */
+ if (slash && url < slash) {
+ strbuf_add(&out, url, slash - url);
+ url = slash;
+ }
+ return url_decode_internal(&url, NULL, &out);
}
char *url_decode_parameter_name(const char **query)
{
- return url_decode_internal(query, "&=");
+ struct strbuf out = STRBUF_INIT;
+ return url_decode_internal(query, "&=", &out);
}
char *url_decode_parameter_value(const char **query)
{
- return url_decode_internal(query, "&");
+ struct strbuf out = STRBUF_INIT;
+ return url_decode_internal(query, "&", &out);
}
^ permalink raw reply related [flat|nested] 5+ messages in thread
* Re: [PATCH v4] Do not decode url protocol.
2010-06-23 17:27 ` Junio C Hamano
@ 2010-06-23 17:38 ` Jeff King
0 siblings, 0 replies; 5+ messages in thread
From: Jeff King @ 2010-06-23 17:38 UTC (permalink / raw)
To: Junio C Hamano; +Cc: pascal, git list, Matthieu Moy
On Wed, Jun 23, 2010 at 10:27:39AM -0700, Junio C Hamano wrote:
> > When using the protocol git+ssh:// for example we do not want to
> > decode the '+' as a space. The url decoding must take place only
> > for the server name and parameters.
> >
> > This fixes a regression introduced in 9d2e942.
>
> Sign-off?
>
> As the patch was whitespace-broken, I'm proposing to rewrite it like the
> following.
>
> -- >8 --
> url.c: "<scheme>://" part at the beginning should not be URL decoded
I think this is cleaner than Pascal's patch.
Acked-by: Jeff King <peff@peff.net>
Thanks all for fixing my bug. :)
-Peff
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2010-06-23 17:39 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2010-06-22 20:16 [PATCH v4] Do not decode url protocol Pascal Obry
2010-06-22 21:39 ` Matthieu Moy
2010-06-22 22:46 ` Jeff King
2010-06-23 17:27 ` Junio C Hamano
2010-06-23 17:38 ` Jeff King
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).