public inbox for linux-man@vger.kernel.org
 help / color / mirror / Atom feed
From: bugzilla-daemon@kernel.org
To: linux-man@vger.kernel.org
Subject: [Bug 217059] New: Please document behaviour of iconv(3) when input is untranslatable
Date: Sun, 19 Feb 2023 10:31:37 +0000	[thread overview]
Message-ID: <bug-217059-11311@https.bugzilla.kernel.org/> (raw)

https://bugzilla.kernel.org/show_bug.cgi?id=217059

            Bug ID: 217059
           Summary: Please document behaviour of iconv(3) when input is
                    untranslatable
           Product: Documentation
           Version: unspecified
          Hardware: All
                OS: Linux
            Status: NEW
          Severity: normal
          Priority: P1
         Component: man-pages
          Assignee: documentation_man-pages@kernel-bugs.osdl.org
          Reporter: rrt@sc3d.org
        Regression: No

See https://sourceware.org/bugzilla/show_bug.cgi?id=29913

The issue is that the man page does not fully reflect the behaviour of glibc's
iconv. The man page says:

The conversion can stop for four reasons:

       1. An invalid multibyte sequence is encountered in the input.  In this
case, it sets errno to EILSEQ and
          returns (size_t) -1.  *inbuf is left pointing to the beginning of the
invalid multibyte sequence.

The phrase "An invalid multibyte sequence is encountered in the input" is
confusing, because it suggests to me (and other readers, see the bug above!)
that it refers only to the validity of the input per se (e.g. a non-UTF-8
sequence in input purporting to be UTF-8).

However, according to the original author of the man page, Bruno Haible (see
the bug above), it also refers to input that cannot be translated to the
desired output encoding; and indeed, glibc's iconv returns EILSEQ when the
input cannot be translated, even though it is valid.

Please clarify the man page to reflect this behaviour. On the one hand, it is
confusing and surprising when compared to the POSIX standard (again, for this
reader and others); on the other hand, it is useful (because it enables
untranslatable input to be detected).

-- 
You may reply to this email to add a comment.

You are receiving this mail because:
You are watching the assignee of the bug.

             reply	other threads:[~2023-02-19 10:31 UTC|newest]

Thread overview: 4+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2023-02-19 10:31 bugzilla-daemon [this message]
2023-05-19 12:09 ` [Bug 217059] Please document behaviour of iconv(3) when input is untranslatable bugzilla-daemon
2023-05-19 12:09 ` bugzilla-daemon
2023-05-20 11:18 ` bugzilla-daemon

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=bug-217059-11311@https.bugzilla.kernel.org/ \
    --to=bugzilla-daemon@kernel.org \
    --cc=linux-man@vger.kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox