From mboxrd@z Thu Jan 1 00:00:00 1970 From: Harald van Dijk Subject: Re: [PATCH] [BUILTIN] Require leading '0' on octal escapes in echo Date: Sat, 01 Nov 2014 18:58:47 +0100 Message-ID: <54551F57.6090708@gigawatt.nl> References: <53785239c041de620223d4a7beba523e1627b65a.1414861984.git.john@keeping.me.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Return-path: Received: from hosting12.csv-networks.nl ([84.244.151.217]:50744 "EHLO hosting12.csv-networks.nl" rhost-flags-OK-FAIL-OK-FAIL) by vger.kernel.org with ESMTP id S1758762AbaKAR6y (ORCPT ); Sat, 1 Nov 2014 13:58:54 -0400 In-Reply-To: <53785239c041de620223d4a7beba523e1627b65a.1414861984.git.john@keeping.me.uk> Sender: dash-owner@vger.kernel.org List-Id: dash@vger.kernel.org To: John Keeping , dash@vger.kernel.org On 11/1/2014 6:14 PM, John Keeping wrote: > printf(1) supports octal escape sequences in its format argument which > are specified as (from POSIX): > > "\ddd", where ddd is a one, two, or three-digit octal number > > But the argument to the "%b" format specifier allows: > > "\0ddd", where ddd is a zero, one, two, or three-digit octal > number > > which is similar to the wording for echo(1) (for XSI-conformant > systems): > > \0num Write an 8-bit value that is the zero, one, two, or > three-digit octal number num. > > Because conv_escape() handles the first case, applying the second > behaviour in conv_escape_str() must also catch the characters '1'-'7' so > that they are not converted as octal numbers. Your patch seems to have addressed the clear bugs of the patch in that other thread. Let me attempt to summarise the status: - POSIX does not specify the behaviour of \1 in echo and in printf %b. POSIX does not define the behaviour of escape sequences other than the ones it explicitly specifies. It does not require \1 to be handled as \\1. It allows it, but it allows the current dash behaviour too. To quote from the echo specification: "if any of the operands contain a backslash ( '\' ) character, the results are implementation-defined", and the bit about XSI doesn't include an exception for \1. To quote from the printf %b specification: "The interpretation of a backslash followed by any other sequence of characters is unspecified." - bash treats \1 as \\1 in echo, but as \01 in printf %b. - dash treats \1 as \01 in both echo and in printf %b. - Your patch makes dash treat \1 as \01 in both echo and printf %b. - The aim of the patch in the other thread was to make dash be more like bash. If that is your aim too, if you want dash to behave like bash, in order to achieve that the code must no longer be shared between echo and printf %b. Here is a simple test you can run, where dash is without your patch, and ./src/dash is with your patch: $ bash -c 'printf "%b" "\1"' | cat -v ^A $ dash -c 'printf "%b" "\1"' | cat -v ^A $ ./src/dash -c 'printf "%b" "\1"' | cat -v \1 If that isn't your aim, if your aim is only to make dash meet POSIX requirements, then don't worry, it already does so. Cheers, Harald van Dijk