* [Bug 215737] New: uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279
@ 2022-03-24 12:01 bugzilla-daemon
2022-03-24 12:16 ` [Bug 215737] " bugzilla-daemon
2022-03-28 18:57 ` bugzilla-daemon
0 siblings, 2 replies; 3+ messages in thread
From: bugzilla-daemon @ 2022-03-24 12:01 UTC (permalink / raw)
To: linux-man
https://bugzilla.kernel.org/show_bug.cgi?id=215737
Bug ID: 215737
Summary: uri.7: DESCRIPTION: Character encoding: Reference to
obsolete IETF RFCs 2718 and 2279
Product: Documentation
Version: unspecified
Hardware: All
OS: Linux
Status: NEW
Severity: normal
Priority: P1
Component: man-pages
Assignee: documentation_man-pages@kernel-bugs.osdl.org
Reporter: alx.manpages@gmail.com
Regression: No
[uri(7)::DESCRIPTION::Character encoding] reads as:
```
Character encoding
URIs use a limited number of characters so that they can
be typed in and used in a variety of situations.
The following characters are reserved, that is, they may
appear in a URI but their use is limited to their re-
served purpose (conflicting data must be escaped before
forming the URI):
; / ? : @ & = + $ ,
Unreserved characters may be included in a URI. Unre-
served characters include uppercase and lowercase Latin
letters, decimal digits, and the following limited set of
punctuation marks and symbols:
- _ . ! ~ * ' ( )
All other characters must be escaped. An escaped octet
is encoded as a character triplet, consisting of the per-
cent character "%" followed by the two hexadecimal digits
representing the octet code (you can use uppercase or
lowercase letters for the hexadecimal digits). For exam-
ple, a blank space must be escaped as "%20", a tab char-
acter as "%09", and the "&" as "%26". Because the per-
cent "%" character always has the reserved purpose of be-
ing the escape indicator, it must be escaped as "%25".
It is common practice to escape space characters as the
plus symbol (+) in query text; this practice isn't uni-
formly defined in the relevant RFCs (which recommend %20
instead) but any tool accepting URIs with query text
should be prepared for them. A URI is always shown in
its "escaped" form.
Unreserved characters can be escaped without changing the
semantics of the URI, but this should not be done unless
the URI is being used in a context that does not allow
the unescaped character to appear. For example, "%7e" is
sometimes used instead of "~" in an HTTP URL path, but
the two are equivalent for an HTTP URL.
For URIs which must handle characters outside the US
ASCII character set, the HTML 4.01 specification (section
B.2) and IETF RFC 2718 (section 2.2.5) recommend the fol-
lowing approach:
1. translate the character sequences into UTF-8 (IETF
RFC 2279)--see utf-8(7)--and then
2. use the URI escaping mechanism, that is, use the %HH
encoding for unsafe octets.
```
It refers to obsolete RFCs[1][2]. We should update the info there.
[1]: <https://www.rfc-editor.org/rfc/rfc2718>
[2]: <https://www.rfc-editor.org/rfc/rfc2279>
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 3+ messages in thread
* [Bug 215737] uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279
2022-03-24 12:01 [Bug 215737] New: uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279 bugzilla-daemon
@ 2022-03-24 12:16 ` bugzilla-daemon
2022-03-28 18:57 ` bugzilla-daemon
1 sibling, 0 replies; 3+ messages in thread
From: bugzilla-daemon @ 2022-03-24 12:16 UTC (permalink / raw)
To: linux-man
https://bugzilla.kernel.org/show_bug.cgi?id=215737
--- Comment #1 from Alejandro Colomar (man-pages) (alx.manpages@gmail.com) ---
Should check the following RFCs that replace them:
2279:
<https://www.rfc-editor.org/rfc/rfc3629>
2718:
<https://www.rfc-editor.org/rfc/rfc7595>
<https://www.rfc-editor.org/rfc/rfc8615> (updates 7595)
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 3+ messages in thread
* [Bug 215737] uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279
2022-03-24 12:01 [Bug 215737] New: uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279 bugzilla-daemon
2022-03-24 12:16 ` [Bug 215737] " bugzilla-daemon
@ 2022-03-28 18:57 ` bugzilla-daemon
1 sibling, 0 replies; 3+ messages in thread
From: bugzilla-daemon @ 2022-03-28 18:57 UTC (permalink / raw)
To: linux-man
https://bugzilla.kernel.org/show_bug.cgi?id=215737
Alejandro Colomar (man-pages) (alx.manpages@gmail.com) changed:
What |Removed |Added
----------------------------------------------------------------------------
Status|NEW |RESOLVED
Resolution|--- |DOCUMENTED
--- Comment #2 from Alejandro Colomar (man-pages) (alx.manpages@gmail.com) ---
2279:
2718:
- It doesn't introduce changes relevant for the manual page.
Update the reference number with no other changes.
2718:
7595:
- It deprecates URL in favor of URI.
Replace the terms in the manual page,
which were (sometimes) ambiguously used.
- It stays the same regarding character encoding.
Update the reference number with no other changes.
8615:
- It doesn't introduce changes relevant for the manual page.
--
You may reply to this email to add a comment.
You are receiving this mail because:
You are watching the assignee of the bug.
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2022-03-28 18:57 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2022-03-24 12:01 [Bug 215737] New: uri.7: DESCRIPTION: Character encoding: Reference to obsolete IETF RFCs 2718 and 2279 bugzilla-daemon
2022-03-24 12:16 ` [Bug 215737] " bugzilla-daemon
2022-03-28 18:57 ` bugzilla-daemon
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox