view etc/unicode/README @ 5913:1b2fdcc3cc5c

Remove an old, buggy attempt to preserve East Asian chars, encode_unicode_char src/ChangeLog addition: 2015-05-11 Aidan Kehoe <kehoea@parhasard.net> * unicode.c (encode_unicode_char): If we have no information on the Unicode mapping of a character, write U+FFFD unconditionally. Remove an old, incomplete, inappropriate attempt to preserve certain East Asian characters. tests/ChangeLog addition: 2015-05-11 Aidan Kehoe <kehoea@parhasard.net> * automated/mule-tests.el: Assert that a character which we know has no Unicode mapping is represented as REPLACEMENT CHARACTER in UTF-8, rather than as a character corresponding to a very large codepoint.
author Aidan Kehoe <kehoea@parhasard.net>
date Mon, 11 May 2015 18:04:58 +0100
parents a29c4eef8f00
children
line wrap: on
line source

This directory contains Unicode translation tables for most of the
charsets in XEmacs.

The tables in unicode-consortium/ come from:

http://www.unicode.org/Public/MAPPINGS/

The tables in ibm/ come from:

http://oss.software.ibm.com/icu/charset/

The tables in unicode-consortium/ should be used as source data; the ones
in ibm/ can be used to supplement or check the accuracy of the others.