xemacs-beta: etc/unicode/unicode-consortium/BIG5.TXT comparison

comparison etc/unicode/unicode-consortium/BIG5.TXT @ 3803:e51807f9eedd

[xemacs-hg @ 2007-01-27 18:28:57 by stephent] Fix up copying situation in etc/unicode/unicode-consortium. <87mz4471zg.fsf@uwakimon.sk.tsukuba.ac.jp>

author	stephent
date	Sat, 27 Jan 2007 18:29:06 +0000
parents	943eaba38521
children	49c847ce8aa6

comparison

equal deleted inserted replaced

-:d6f975442bd3
+:e51807f9eedd
 #	Name:             BIG5 to Unicode table (complete)
 #	Unicode version:  1.1
 #	Table version:    0.0d3
 #	Table format:     Format A
 #	Date:             11 February 1994
-#	Authors:          Glenn Adams <glenn@metis.com>
-#                     John H. Jenkins <John_Jenkins@taligent.com>
 #
 #	Copyright (c) 1991-1994 Unicode, Inc.  All Rights reserved.
 #
 #	This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
 #	No claims are made as to fitness for any particular purpose.  No
 #	specifically excludes the right to re-distribute this file directly
 #	to third parties or other organizations whether for profit or not.
 #
 #	General notes:
 #
-#	This table contains the data Metis and Taligent currently have on how
+#
-#       BIG5 characters map into Unicode.
+# This table contains one set of mappings from BIG5 into Unicode.
+# Note that these data are *possible* mappings only and may not be the
+# same as those used by actual products, nor may they be the best suited
+# for all uses.  For more information on the mappings between various code
+# pages incorporating the repertoire of BIG5 and Unicode, consult the
+# VENDORS mapping data.  Normative information on the mapping between
+# BIG5 and Unicode may be found in the Unihan.txt file in the
+# latest Unicode Character Database.
+#
+# If you have carefully considered the fact that the mappings in
+# this table are only one possible set of mappings between BIG5 and
+# Unicode and have no normative status, but still feel that you
+# have located an error in the table that requires fixing, you may
+# report any such error to errata@unicode.org.
 #
 #	WARNING!  It is currently impossible to provide round-trip compatibility
 #		between BIG5 and Unicode.
 #
 #	A number of characters are not currently mapped because
 #
 #	Notes:
 #
 #	1. In addition to the above, there is some uncertainty about the
 #       mappings in the range C6A1 - C8FE, and F9DD - F9FE.  The ETEN
-#		version of BIG5 organizes the former range differently, and adds
+#	version of BIG5 organizes the former range differently, and adds
-#		additional characters in the latter range.  The correct mappings
+#	additional characters in the latter range.  The correct mappings
-#		these ranges need to be determined.
+#	these ranges need to be determined.
 #
 #	2.  There is an uncertainty in the mapping of the Big Five character
-#		0xA3BC.  This character occurs within the Big Five block of tone marks
+#	0xA3BC.  This character occurs within the Big Five block of tone marks
-#		for bopomofo and is intended to be the tone mark for the first tone in
+#	for bopomofo and is intended to be the tone mark for the first tone in
-#		Mandarin Chinese.  We have selected the mapping U+02C9 MODIFIER LETTER
+#	Mandarin Chinese.  We have selected the mapping U+02C9 MODIFIER LETTER
-#		MACRON (Mandarin Chinese first tone) to reflect this semantic.
+#	MACRON (Mandarin Chinese first tone) to reflect this semantic.
-#		However, because bopomofo uses the absense of a tone mark to indicate
+#	However, because bopomofo uses the absense of a tone mark to indicate
-#		the first Mandarin tone, most implementations of Big Five represent
+#	the first Mandarin tone, most implementations of Big Five represent
-#		this character with a blank space, and so a mapping such as U+2003 EM SPACE
+#	this character with a blank space, and so a mapping such as U+2003 EM
-#		might be preferred.
+#	SPACE might be preferred.
-#
-#
 #
 #	Format:  Three tab-separated columns
 #		 Column #1 is the BIG5 code (in hex as 0xXXXX)
 #		 Column #2 is the Unicode (in hex as 0xXXXX)
 #		 Column #3  is the Unicode name (follows a comment sign, '#')
-#					The official names for Unicode characters U+4E00
+#			The official names for Unicode characters U+4E00
-#					to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
+#			to U+9FA5, inclusive, is "CJK UNIFIED IDEOGRAPH-XXXX",
-#					where XXXX is the code point.  Including all these
+#			where XXXX is the code point.  Including all these
-#					names in this file increases its size substantially
+#			names in this file increases its size substantially
-#					and needlessly.  The token "<CJK>" is used for the
+#			and needlessly.  The token "<CJK>" is used for the
-#					name of these characters.  If necessary, it can be
+#			name of these characters.  If necessary, it can be
-#					expanded algorithmically by a parser or editor.
+#			expanded algorithmically by a parser or editor.
 #
 #	The entries are in BIG5 order
-#
-#	Any comments or problems, contact <John_Jenkins@taligent.com>
 #
 #
 0xA140	0x3000	# IDEOGRAPHIC SPACE
 0xA141	0xFF0C	# FULLWIDTH COMMA
 0xA142	0x3001	# IDEOGRAPHIC COMMA

Mercurial > hg > xemacs-beta

comparison etc/unicode/unicode-consortium/BIG5.TXT @ 3803:e51807f9eedd