* [PATCH] [BUILTIN] Require leading '0' on octal escapes in echo
@ 2014-11-01 17:14 John Keeping
2014-11-01 17:58 ` Harald van Dijk
0 siblings, 1 reply; 3+ messages in thread
From: John Keeping @ 2014-11-01 17:14 UTC
To: dash; +Cc: John Keeping
printf(1) supports octal escape sequences in its format argument which
are specified as (from POSIX):
"\ddd", where ddd is a one, two, or three-digit octal number
But the argument to the "%b" format specifier allows:
"\0ddd", where ddd is a zero, one, two, or three-digit octal
number
which is similar to the wording for echo(1) (for XSI-conformant
systems):
\0num Write an 8-bit value that is the zero, one, two, or
three-digit octal number num.
Because conv_escape() handles the first case, applying the second
behaviour in conv_escape_str() must also catch the characters '1'-'7' so
that they are not converted as octal numbers.
Signed-off-by: John Keeping <john@keeping.me.uk>
---
This was reported several years ago [1] but the patch proposed in that
case seems to have been a bit aggressive and breaks the handling of
"%b".
[1] http://thread.gmane.org/gmane.comp.shells.dash/664
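The two octal forms quoted in the commit message can be checked from any
POSIX shell. This is only a minimal sketch: hex_bytes is a hypothetical
helper (not part of dash or of this patch) that dumps its input as hex so
that non-printing bytes are visible.

```shell
# hex_bytes (hypothetical helper): print stdin as hex bytes, no separators.
hex_bytes() {
    od -An -tx1 | tr -d ' \n'
}

# Format-string form: backslash followed by one to three octal digits.
printf '\101' | hex_bytes; echo          # 41 (the letter A)

# %b-argument form: backslash, a literal 0, then zero to three octal digits.
printf '%b' '\0101' | hex_bytes; echo    # 41
printf '%b' '\0' | hex_bytes; echo       # 00 (zero digits after the 0)
```

Only forms that POSIX specifies are used here, so the output should not
depend on which shell runs the sketch.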
ChangeLog | 4 ++++
src/bltin/printf.c | 5 +++++
2 files changed, 9 insertions(+)
diff --git a/ChangeLog b/ChangeLog
index 2155764..3212995 100644
--- a/ChangeLog
+++ b/ChangeLog
@@ -1,3 +1,7 @@
+2014-11-01 John Keeping <john@keeping.me.uk>
+
+ * Require leading '0' on octal escapes in echo.
+
2014-10-13 Eric Blake <eblake@redhat.com>
* cd: support drive letters on Cygwin.
diff --git a/src/bltin/printf.c b/src/bltin/printf.c
index 5f9e81c..b17e6dc 100644
--- a/src/bltin/printf.c
+++ b/src/bltin/printf.c
@@ -260,6 +260,11 @@ conv_escape_str(char *str)
ch += k;
} while (--i);
continue;
+ } else if ('1' <= ch && ch < '8') {
+ /* Stop conv_escape handling these as octal. */
+ ch = '\\';
+ str--;
+ continue;
}
/* Finally test for sequences valid in the format string */
--
2.1.0.374.g390713e
* Re: [PATCH] [BUILTIN] Require leading '0' on octal escapes in echo
From: Harald van Dijk @ 2014-11-01 17:58 UTC
To: John Keeping, dash
On 11/1/2014 6:14 PM, John Keeping wrote:
> printf(1) supports octal escape sequences in its format argument which
> are specified as (from POSIX):
>
> "\ddd", where ddd is a one, two, or three-digit octal number
>
> But the argument to the "%b" format specifier allows:
>
> "\0ddd", where ddd is a zero, one, two, or three-digit octal
> number
>
> which is similar to the wording for echo(1) (for XSI-conformant
> systems):
>
> \0num Write an 8-bit value that is the zero, one, two, or
> three-digit octal number num.
>
> Because conv_escape() handles the first case, applying the second
> behaviour in conv_escape_str() must also catch the characters '1'-'7' so
> that they are not converted as octal numbers.
Your patch seems to have addressed the clear bugs of the patch in that
other thread. Let me attempt to summarise the status:
- POSIX does not specify the behaviour of \1 in echo and in printf %b.
POSIX does not define the behaviour of escape sequences other than the
ones it explicitly specifies. It does not require \1 to be handled as
\\1. It allows it, but it allows the current dash behaviour too.
To quote from the echo specification: "if any of the operands contain a
backslash ( '\' ) character, the results are implementation-defined",
and the bit about XSI doesn't include an exception for \1.
To quote from the printf %b specification: "The interpretation of a
backslash followed by any other sequence of characters is unspecified."
- bash treats \1 as \\1 in echo, but as \01 in printf %b.
- dash treats \1 as \01 in both echo and in printf %b.
- Your patch makes dash treat \1 as \\1 in both echo and printf %b.
- The aim of the patch in the other thread was to make dash be more like
bash.
If that is your aim too, if you want dash to behave like bash, in order
to achieve that the code must no longer be shared between echo and
printf %b. Here is a simple test you can run, where dash is without your
patch, and ./src/dash is with your patch:
$ bash -c 'printf "%b" "\1"' | cat -v
^A
$ dash -c 'printf "%b" "\1"' | cat -v
^A
$ ./src/dash -c 'printf "%b" "\1"' | cat -v
\1
If that isn't your aim, if your aim is only to make dash meet POSIX
requirements, then don't worry, it already does so.
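The comparison above can be extended to cover echo as well. The following
is a rough sketch, not a definitive test: probe is a hypothetical helper,
the shell names are simply the two discussed in this thread, and od -c is
used in place of cat -v because od is specified by POSIX. No expected
output is given, since the result for \1 depends on the implementation.

```shell
# probe (hypothetical helper): run a command string in the given shell and
# dump the raw bytes it writes.
#   $1 = shell to run, $2 = command string passed to that shell's -c option
probe() {
    if command -v "$1" >/dev/null 2>&1; then
        printf '%s -c %s: ' "$1" "$2"
        "$1" -c "$2" | od -An -c
    else
        printf '%s: not installed\n' "$1"
    fi
}

for shell in bash dash; do
    probe "$shell" 'echo "\1"'
    probe "$shell" 'printf "%b" "\1"'
done
```

A patched build can be added to the loop by giving its path, for example
./src/dash, as another entry in the list.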
Cheers,
Harald van Dijk
* Re: [PATCH] [BUILTIN] Require leading '0' on octal escapes in echo
From: John Keeping @ 2014-11-01 18:39 UTC
To: Harald van Dijk; +Cc: dash
On Sat, Nov 01, 2014 at 06:58:47PM +0100, Harald van Dijk wrote:
> On 11/1/2014 6:14 PM, John Keeping wrote:
> > printf(1) supports octal escape sequences in its format argument which
> > are specified as (from POSIX):
> >
> > "\ddd", where ddd is a one, two, or three-digit octal number
> >
> > But the argument to the "%b" format specifier allows:
> >
> > "\0ddd", where ddd is a zero, one, two, or three-digit octal
> > number
> >
> > which is similar to the wording for echo(1) (for XSI-conformant
> > systems):
> >
> > \0num Write an 8-bit value that is the zero, one, two, or
> > three-digit octal number num.
> >
> > Because conv_escape() handles the first case, applying the second
> > behaviour in conv_escape_str() must also catch the characters '1'-'7' so
> > that they are not converted as octal numbers.
>
> Your patch seems to have addressed the clear bugs of the patch in that
> other thread. Let me attempt to summarise the status:
>
> - POSIX does not specify the behaviour of \1 in echo and in printf %b.
>
> POSIX does not define the behaviour of escape sequences other than the
> ones it explicitly specifies. It does not require \1 to be handled as
> \\1. It allows it, but it allows the current dash behaviour too.
>
> To quote from the echo specification: "if any of the operands contain a
> backslash ( '\' ) character, the results are implementation-defined",
> and the bit about XSI doesn't include an exception for \1.
>
> To quote from the printf %b specification: "The interpretation of a
> backslash followed by any other sequence of characters is unspecified."
>
> - bash treats \1 as \\1 in echo, but as \01 in printf %b.
>
> - dash treats \1 as \01 in both echo and in printf %b.
>
> - Your patch makes dash treat \1 as \\1 in both echo and printf %b.
>
> - The aim of the patch in the other thread was to make dash be more like
> bash.
>
> If that is your aim too, if you want dash to behave like bash, in order
> to achieve that the code must no longer be shared between echo and
> printf %b. Here is a simple test you can run, where dash is without your
> patch, and ./src/dash is with your patch:
>
> $ bash -c 'printf "%b" "\1"' | cat -v
> ^A
> $ dash -c 'printf "%b" "\1"' | cat -v
> ^A
> $ ./src/dash -c 'printf "%b" "\1"' | cat -v
> \1
>
> If that isn't your aim, if your aim is only to make dash meet POSIX
> requirements, then don't worry, it already does so.
My primary aim is to get the previous patch removed from Gentoo (since
it breaks `printf '%b' '\0204'`) and I thought I might have more success
if upstream included a patch that fixed the original use case [1] ;-)
OTOH, it appears that the original issue was also fixed in
autoconf-archive (by you, in fact!) [2] so I hope Gentoo will just drop
the broken patch.
In fact, it appears that bash does treat \1 as \01 in echo if you pass
"-e" (as does coreutils), so the main difference between bash and dash
is whether or not backslash escapes are handled by default. POSIX
indicates that no options shall be supported by echo(1) and XSI says
that backslash escapes should be interpreted, so I agree with dash's
behaviour here.
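Since the default handling of backslashes in echo differs between shells,
a script that needs the same output everywhere can avoid echo for such
strings. A minimal sketch, assuming only a POSIX printf:

```shell
# %s never interprets backslashes in its argument, so the text is literal.
printf '%s\n' 'a\1b'                   # a\1b

# %b interprets escapes; the \0ddd form is the one POSIX specifies.
# Octal 204 is the single byte 0x84.
printf '%b' '\0204' | od -An -tx1
```

The second command is the case mentioned above that the Gentoo patch
breaks; it should write exactly one byte.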
I agree that dash is conformant in all cases, so I withdraw the patch.
[1] https://bugs.gentoo.org/show_bug.cgi?id=337329
[2] http://git.savannah.gnu.org/cgit/autoconf-archive.git/commit/m4/ax_prefix_config_h.m4?id=c9d670b1d7f3d30780996f8ec1bdad2dd98b8592