From: Michael Buesch <mb@bu3sch.de>
To: Linus Torvalds <torvalds@linux-foundation.org>
Cc: "Uwe Kleine-König" <u.kleine-koenig@pengutronix.de>,
linux-kernel@vger.kernel.org,
"Peter Zijlstra" <a.p.zijlstra@chello.nl>,
"Andrew Morton" <akpm@linux-foundation.org>
Subject: Re: [PATCH] strcmp: fix overflow error
Date: Tue, 17 Nov 2009 19:16:40 +0100 [thread overview]
Message-ID: <200911171916.41727.mb@bu3sch.de> (raw)
In-Reply-To: <alpine.LFD.2.01.0911170922190.9384@localhost.localdomain>
On Tuesday 17 November 2009 18:41:58 Linus Torvalds wrote:
>
> On Tue, 17 Nov 2009, Uwe Kleine-König wrote:
> >
> > strcmp("\x01", "\xef") returns 18 but it should return something < 0.
> > The reason is that the variable holding the result of the subtraction is
> > too small and overflows.
>
> No. The reason is that whoever wrote that function is a moron and doesn't
> know the standard. And your fix is not correct _either_
>
> The comparison should be done as *unsigned char*. As specified by POSIX
>
> "The sign of a non-zero return value shall be determined by the sign of
> the difference between the values of the first pair of bytes (both
> interpreted as type unsigned char) that differ in the strings being
> compared."
>
> and both the original code and your change gets it wrong in different
> ways.
>
> > int strcmp(const char *cs, const char *ct)
> > {
> > - signed char __res;
> > + int __res;
> >
> > while (1) {
> > if ((__res = *cs - *ct++) != 0 || !*cs++)
>
> So this is fundamentally incorrect both with "signed char __res" _and_
> with "int __res", because '*cs' and '*ct' are both (possibly - it depends
> on the compiler and architecture) signed chars.
>
> So in the case you mention, strcmp() _should_ return a negative value,
> because "\x01" is smaller than "\xef", but you have:
>
> - *cs = 1, *ct = (char) 0xef = -17 _OR_ 239 depending on sign of 'char'
>
> and as a result:
>
> - signed char __res = 18 (incorrect, regardless: ct is larger)
>
> - int __res = 18 (incorrect) or -238 (correct) depending on sign of char
>
> so your patch doesn't actually help at all.
>
> What would help is something like the appended, but I have not tested it
> AT ALL. It may be total and utter crap too. Maybe it doesn't compile,
> maybe it buggers your pet hedgehog. I just don't know.
>
> Linus
>
> ---
> lib/string.c | 20 ++++++++++++++------
> 1 files changed, 14 insertions(+), 6 deletions(-)
>
> diff --git a/lib/string.c b/lib/string.c
> index b19b87a..e96421a 100644
> --- a/lib/string.c
> +++ b/lib/string.c
> @@ -246,13 +246,17 @@ EXPORT_SYMBOL(strlcat);
> #undef strcmp
> int strcmp(const char *cs, const char *ct)
> {
> - signed char __res;
> + unsigned char c1, c2;
>
> while (1) {
> - if ((__res = *cs - *ct++) != 0 || !*cs++)
> + c1 = *cs++;
> + c2 = *ct++;
> + if (c1 != c2)
> + return c1 < c2 ? -1 : 1;
> + if (!c1)
> break;
> }
> - return __res;
> + return 0;
> }
Well, that doesn't actually return the difference at all. Is that allowed?
What about this? (Manually hacked pseudo patch)
diff --git a/lib/string.c b/lib/string.c
index b19b87a..661ff06 100644
--- a/lib/string.c
+++ b/lib/string.c
@@ -246,7 +246,7 @@ EXPORT_SYMBOL(strlcat);
#undef strcmp
int strcmp(const char *cs, const char *ct)
{
- signed char __res;
+ int __res;
while (1) {
- if ((__res = *cs - *ct++) != 0 || !*cs++)
+ if ((__res = (unsigned char)(*cs) - (unsigned char)(*ct++)) != 0 || !*cs++)
@@ -266,7 +266,7 @@ EXPORT_SYMBOL(strcmp);
*/
--
Greetings, Michael.
next prev parent reply other threads:[~2009-11-17 18:16 UTC|newest]
Thread overview: 11+ messages / expand[flat|nested] mbox.gz Atom feed top
2009-11-17 16:51 [PATCH] strcmp: fix overflow error Uwe Kleine-König
2009-11-17 17:36 ` Andreas Schwab
2009-11-17 17:41 ` Linus Torvalds
2009-11-17 18:16 ` Michael Buesch [this message]
2009-11-17 18:40 ` Linus Torvalds
2009-11-17 20:34 ` Andreas Schwab
2009-11-17 18:55 ` Uwe Kleine-König
2009-11-17 19:02 ` Linus Torvalds
2009-11-17 19:12 ` Linus Torvalds
2009-11-17 19:19 ` Joe Perches
2009-11-18 21:31 ` [PATCH] strcmp: fix overflow and possibly signedness error Uwe Kleine-König
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=200911171916.41727.mb@bu3sch.de \
--to=mb@bu3sch.de \
--cc=a.p.zijlstra@chello.nl \
--cc=akpm@linux-foundation.org \
--cc=linux-kernel@vger.kernel.org \
--cc=torvalds@linux-foundation.org \
--cc=u.kleine-koenig@pengutronix.de \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.