Mercurial > hg > xemacs-beta

diff etc/unicode/unicode-consortium/JIS0208.TXT @ 5118:e0db3c197671 ben-lisp-object
merge up to latest default branch, doesn't compile yet
author: Ben Wing <ben@xemacs.org>
date: Sat, 26 Dec 2009 21:18:49 -0600
parents: e51807f9eedd
--- a/etc/unicode/unicode-consortium/JIS0208.TXT	Sat Dec 26 00:20:27 2009 -0600
+++ b/etc/unicode/unicode-consortium/JIS0208.TXT	Sat Dec 26 21:18:49 2009 -0600
@@ -4,8 +4,6 @@
 #	Table version:    0.9
 #	Table format:     Format A
 #	Date:             8 March 1994
-#	Authors:          Glenn Adams <glenn@metis.com>
-#                     John H. Jenkins <John_Jenkins@taligent.com>
 #
 #	Copyright (c) 1991-1994 Unicode, Inc.  All Rights reserved.
 #
@@ -25,21 +23,35 @@
 #
 #	General notes:
 #
-#	This table contains the data the Unicode Consortium has on how
-#       JIS X 0208 (1983) characters map into Unicode.
+#
+# This table contains one set of mappings from JIS X 0208 (1990) into Unicode.
+# Note that these data are *possible* mappings only and may not be the
+# same as those used by actual products, nor may they be the best suited
+# for all uses.  For more information on the mappings between various code
+# pages incorporating the repertoire of JIS X 0208 (1990) and Unicode, consult the
+# VENDORS mapping data.  Normative information on the mapping between
+# JIS X 0208 (1990) and Unicode may be found in the Unihan.txt file in the
+# latest Unicode Character Database.
+#
+# If you have carefully considered the fact that the mappings in
+# this table are only one possible set of mappings between JIS X 0208 (1990)
+# and Unicode and have no normative status, but still feel that you
+# have located an error in the table that requires fixing, you may
+# report any such error to errata@unicode.org.
+#
 #
 #	Format:  Four tab-separated columns
 #		 Column #1 is the shift-JIS code (in hex)
 #		 Column #2 is the JIS X 0208 code (in hex as 0xXXXX)
 #		 Column #3 is the Unicode (in hex as 0xXXXX)
 #		 Column #4 the Unicode name (follows a comment sign, '#')
-#					The official names for Unicode characters U+4E00
-#					to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
-#					where XXXX is the code point.  Including all these
-#					names in this file increases its size substantially
-#					and needlessly.  The token "<CJK>" is used for the
-#					name of these characters.  If necessary, it can be
-#					expanded algorithmically by a parser or editor.
+#			The official names for Unicode characters U+4E00
+#			to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
+#			where XXXX is the code point.  Including all these
+#			names in this file increases its size substantially
+#			and needlessly.  The token "<CJK>" is used for the
+#			name of these characters.  If necessary, it can be
+#			expanded algorithmically by a parser or editor.
 #
 #	The entries are in JIS X 0208 order
 #
author	Ben Wing <ben@xemacs.org>
date	Sat, 26 Dec 2009 21:18:49 -0600
parents	e51807f9eedd
children