Mercurial > hg > xemacs-beta
diff etc/unicode/unicode-consortium/JIS0208.TXT @ 5118:e0db3c197671 ben-lisp-object
merge up to latest default branch, doesn't compile yet
author | Ben Wing <ben@xemacs.org> |
---|---|
date | Sat, 26 Dec 2009 21:18:49 -0600 |
parents | e51807f9eedd |
children |
line wrap: on
line diff
--- a/etc/unicode/unicode-consortium/JIS0208.TXT Sat Dec 26 00:20:27 2009 -0600 +++ b/etc/unicode/unicode-consortium/JIS0208.TXT Sat Dec 26 21:18:49 2009 -0600 @@ -4,8 +4,6 @@ # Table version: 0.9 # Table format: Format A # Date: 8 March 1994 -# Authors: Glenn Adams <glenn@metis.com> -# John H. Jenkins <John_Jenkins@taligent.com> # # Copyright (c) 1991-1994 Unicode, Inc. All Rights reserved. # @@ -25,21 +23,35 @@ # # General notes: # -# This table contains the data the Unicode Consortium has on how -# JIS X 0208 (1983) characters map into Unicode. +# +# This table contains one set of mappings from JIS X 0208 (1990) into Unicode. +# Note that these data are *possible* mappings only and may not be the +# same as those used by actual products, nor may they be the best suited +# for all uses. For more information on the mappings between various code +# pages incorporating the repertoire of JIS X 0208 (1990) and Unicode, consult the +# VENDORS mapping data. Normative information on the mapping between +# JIS X 0208 (1990) and Unicode may be found in the Unihan.txt file in the +# latest Unicode Character Database. +# +# If you have carefully considered the fact that the mappings in +# this table are only one possible set of mappings between JIS X 0208 (1990) +# and Unicode and have no normative status, but still feel that you +# have located an error in the table that requires fixing, you may +# report any such error to errata@unicode.org. +# # # Format: Four tab-separated columns # Column #1 is the shift-JIS code (in hex) # Column #2 is the JIS X 0208 code (in hex as 0xXXXX) # Column #3 is the Unicode (in hex as 0xXXXX) # Column #4 the Unicode name (follows a comment sign, '#') -# The official names for Unicode characters U+4E00 -# to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX", -# where XXXX is the code point. Including all these -# names in this file increases its size substantially -# and needlessly. The token "<CJK>" is used for the -# name of these characters. If necessary, it can be -# expanded algorithmically by a parser or editor. +# The official names for Unicode characters U+4E00 +# to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX", +# where XXXX is the code point. Including all these +# names in this file increases its size substantially +# and needlessly. The token "<CJK>" is used for the +# name of these characters. If necessary, it can be +# expanded algorithmically by a parser or editor. # # The entries are in JIS X 0208 order #