From: Grigory Batalov <grisxa@mail.ru>
To: linux-msdos@vger.kernel.org
Subject: cyrillic next draft
Date: Thu, 13 Mar 2003 08:50:39 +0300 [thread overview]
Message-ID: <20030313085039.30fb284d.grisxa@mail.ru> (raw)
[-- Attachment #1: Type: text/plain, Size: 2542 bytes --]
Hi, this is my next thoughts about different cyrillic charsets
in dosemu.
CP1125
------
Andy Shevchenko <andy@work.smile.org.ua> kindly reported that
there is nice DOS encoding for Ukrainian usage called CP1125.
It contains all Ukrainian symbols and is approved of by Ukraine
government. Great job for supporting it is done in ASPLinux's
dosemu RPM package.
I didn't found better visual description of CP1125 so used this
page for reference:
http://www.ic-chernobyl.kiev.ua/~porokh/cyr/index.htm
It seems to be quite correct.
CP1125 differs from CP866 in most upper characters with codes
0xF2-0xF9:
0x0490, /* 0xF2 - CYRILLIC CAPITAL LETTER GHE WITH UPTURN */
0x0491, /* 0xF3 - CYRILLIC SMALL LETTER GHE WITH UPTURN */
0x0404, /* 0xF4 - CYRILLIC CAPITAL LETTER UKRAINIAN IE */
0x0454, /* 0xF5 - CYRILLIC SMALL LETTER UKRAINIAN IE */
0x0406, /* 0xF6 - CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I */
0x0456, /* 0xF7 - CYRILLIC SMALL LETTER BYELORUSSIAN-UKRAINIAN I */
0x0407, /* 0xF8 - CYRILLIC CAPITAL LETTER YI */
0x0457, /* 0xF9 - CYRILLIC SMALL LETTER YI */
So I made cp1125.c by changing unicode values for these
characters in cp866.c.
KOI8-U
------
KOI8-U is described in RFC2319: http://rfc.net/rfc2319.html
According to it, Perl Unicode::Map8 module gives wrong
value for character 0xB4 - 0x0403 when it must be
0x0404 - CYRILLIC CAPITAL LETTER UKRAINIAN IE.
CP1251, CP866
-------------
cp866.c and cp1251.c are also generated by Unicode::Map8 and
I hope they are correct =). You can find listings at:
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/PC/CP866.TXT
http://www.unicode.org/Public/MAPPINGS/VENDORS/MICSFT/WINDOWS/CP1251.TXT
Again I vote for returning back characters 0xF2-0xF7, 0xFC and
0xFD in cp866 because we have to comply some common rules
(Unicode in this case).
KOI8-RU
-------
KOI8-RU is described in RFC draft:
http://cad.ntu-kpi.kiev.ua/multiling/koi8-ru/rfc-draft-koi8-ru.txt
Table is derived from koi8-r.c by replacing changed codes.
Unicode::CharName Perl module was used for Unicode names.
Character 0xB4 points to 0x0403 while must point to 0x0404.
External/internal
-----------------
Encodings above combine in following charset pairs:
$_external_char_set $internal_char_set
Russian:
koi8-r cp866
cp1251 cp866
cp866 cp866
Ukrainian:
cp1251 cp1125
cp1125 cp1125
koi8-u cp1125
koi8-ru cp1125
Files
-----
cp866.tar.bz2 - changes in cp866 table and fonts,
cyr_ua.tar.bz2 - other tables and cp1125 Xfonts derived from
cp866 Xfonts.
--
Grigory Batalov.
[-- Attachment #2: cp866.tar.bz2 --]
[-- Type: application/x-bzip2, Size: 7176 bytes --]
[-- Attachment #3: cyr_ua.tar.bz2 --]
[-- Type: application/x-bzip2, Size: 9175 bytes --]
reply other threads:[~2003-03-13 5:50 UTC|newest]
Thread overview: [no followups] expand[flat|nested] mbox.gz Atom feed
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=20030313085039.30fb284d.grisxa@mail.ru \
--to=grisxa@mail.ru \
--cc=linux-msdos@vger.kernel.org \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox