annotate etc/unicode/README @ 5913:1b2fdcc3cc5c
Remove an old, buggy attempt to preserve East Asian chars, encode_unicode_char
src/ChangeLog addition:
2015-05-11 Aidan Kehoe <kehoea@parhasard.net>
* unicode.c (encode_unicode_char):
If we have no information on the Unicode mapping of a character,
write U+FFFD unconditionally. Remove an old, incomplete,
inappropriate attempt to preserve certain East Asian characters.
tests/ChangeLog addition:
2015-05-11 Aidan Kehoe <kehoea@parhasard.net>
* automated/mule-tests.el:
Assert that a character which we know has no Unicode mapping is
represented as REPLACEMENT CHARACTER in UTF-8, rather than as a
character corresponding to a very large codepoint.
author |
Aidan Kehoe <kehoea@parhasard.net> |
date |
Mon, 11 May 2015 18:04:58 +0100 |
parents |
a29c4eef8f00 |
children |
|
rev |
line source |
771
|
1 This directory contains Unicode translation tables for most of the
|
|
2 charsets in XEmacs.
|
|
3
|
|
4 The tables in unicode-consortium/ come from:
|
|
5
|
|
6 http://www.unicode.org/Public/MAPPINGS/
|
|
7
|
|
8 The tables in ibm/ come from:
|
|
9
|
|
10 http://oss.software.ibm.com/icu/charset/
|
|
11
|
779
|
12 The tables in unicode-consortium/ should be used as source data; the ones
|
|
13 in ibm/ can be used to supplement or check the accuracy of the others.
|