diff etc/unicode/unicode-consortium/SHIFTJIS.TXT @ 5118:e0db3c197671 ben-lisp-object

merge up to latest default branch, doesn't compile yet
author Ben Wing <ben@xemacs.org>
date Sat, 26 Dec 2009 21:18:49 -0600
parents e51807f9eedd
children
--- a/etc/unicode/unicode-consortium/SHIFTJIS.TXT	Sat Dec 26 00:20:27 2009 -0600
+++ b/etc/unicode/unicode-consortium/SHIFTJIS.TXT	Sat Dec 26 21:18:49 2009 -0600
@@ -4,8 +4,6 @@
 #	Table version:    0.9
 #	Table format:     Format A
 #	Date:             8 March 1994
-#	Authors:          Glenn Adams <glenn@metis.com>
-#                     John H. Jenkins <John_Jenkins@taligent.com>
 #
 #	Copyright (c) 1991-1994 Unicode, Inc.  All Rights reserved.
 #
@@ -25,20 +23,34 @@
 #
 #	General notes:
 #
-#	This table contains the data the Unicode Consortium has on how
-#       Shift-JIS (a combination of JIS 0201 and JIS 0208) maps into Unicode.
+#
+# This table contains one set of mappings from Shift-JIS into Unicode.
+# Note that these data are *possible* mappings only and may not be the
+# same as those used by actual products, nor may they be the best suited
+# for all uses.  For more information on the mappings between various code
+# pages incorporating the repertoire of Shift-JIS and Unicode, consult the
+# VENDORS mapping data.  Normative information on the mapping between
+# Shift-JIS and Unicode may be found in the Unihan.txt file in the
+# latest Unicode Character Database.
+#
+# If you have carefully considered the fact that the mappings in
+# this table are only one possible set of mappings between Shift-JIS and
+# Unicode and have no normative status, but still feel that you
+# have located an error in the table that requires fixing, you may
+# report any such error to errata@unicode.org.
+#
 #
 #	Format:  Three tab-separated columns
 #		 Column #1 is the shift-JIS code (in hex)
 #		 Column #2 is the Unicode (in hex as 0xXXXX)
 #		 Column #3 the Unicode name (follows a comment sign, '#')
-#					The official names for Unicode characters U+4E00
-#					to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
-#					where XXXX is the code point.  Including all these
-#					names in this file increases its size substantially
-#					and needlessly.  The token "<CJK>" is used for the
-#					name of these characters.  If necessary, it can be
-#					expanded algorithmically by a parser or editor.
+#			The official names for Unicode characters U+4E00
+#			to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
+#			where XXXX is the code point.  Including all these
+#			names in this file increases its size substantially
+#			and needlessly.  The token "<CJK>" is used for the
+#			name of these characters.  If necessary, it can be
+#			expanded algorithmically by a parser or editor.
 #
 #	The entries are ordered by their Shift-JIS codes as follows:
 #		Single-byte characters precede double-byte characters
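
Reader's note on the "Format" comment in the hunk above: the three-column layout and the "<CJK>" token can be handled with a few lines of code. The following is a minimal sketch in Python, not part of this changeset or of XEmacs. The function name parse_mapping_line is hypothetical, and the sketch assumes that data lines use literal tab separators and that comment or blank lines should simply be skipped.

```python
def parse_mapping_line(line):
    """Parse one data line of the table.

    Returns (sjis_code, unicode_code, name), or None for blank lines,
    comment lines, and lines with fewer than three columns.
    """
    line = line.rstrip("\r\n")
    # Header and note lines start with '#'; skip them and blank lines.
    if not line.strip() or line.lstrip().startswith("#"):
        return None
    fields = line.split("\t")
    if len(fields) < 3:
        return None
    # int(..., 16) accepts hex digits with or without a leading "0x".
    sjis_code = int(fields[0], 16)
    unicode_code = int(fields[1], 16)
    # Column 3 is the Unicode name, written after a '#' comment sign.
    name = fields[2].lstrip("#").strip()
    # Expand the "<CJK>" placeholder algorithmically, as the notes suggest.
    if name == "<CJK>":
        name = "CJK UNIFIED IDEOGRAPH-%04X" % unicode_code
    return sjis_code, unicode_code, name
```

Calling parse_mapping_line on a tab-separated line whose third column is "# <CJK>" yields a name built from the code point in column 2. The sketch expands the token wherever it appears and does not check that the code point lies in the U+4E00 to U+9FA5 range named in the notes.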