comparison etc/unicode/unicode-consortium/8859-7.TXT @ 3767:6b2ef948e140

[xemacs-hg @ 2006-12-29 18:09:38 by aidan] etc/ChangeLog addition: 2006-12-21 Aidan Kehoe <kehoea@parhasard.net> * unicode/unicode-consortium/8859-7.TXT: Update the mapping to the 2003 version of ISO 8859-7. lisp/ChangeLog addition: 2006-12-21 Aidan Kehoe <kehoea@parhasard.net> * mule/cyrillic.el: * mule/cyrillic.el (iso-8859-5): * mule/cyrillic.el (cyrillic-koi8-r-encode-table): Add syntax, case support for Cyrillic; make some parentheses more Lispy. * mule/european.el: Content moved to latin.el, file deleted. * mule/general-late.el: If Unicode tables are to be loaded at dump time, do it here, not in loadup.el. * mule/greek.el: Add syntax, case support for Greek. * mule/latin.el: Move the content of european.el here. Change the case table mappings to use hexadecimal codes, to make cross reference to the standards easier. In all cases, take character syntax from similar characters in Latin-1 , rather than deciding separately what syntax they should take. Add (incomplete) support for case with Turkish. Remove description of the character sets used from the language environments' doc strings, since now that we create variant language environments on the fly, such descriptions will often be inaccurate. Set the native-coding-system language info property while setting the other coding-system properties of the language. * mule/misc-lang.el (ipa): Remove the language environment. The International Phonetic _Alphabet_ is not a language, it's inane to have a corresponding language environment in XEmacs. * mule/mule-cmds.el (create-variant-language-environment): Also modify the coding-priority when creating a new language environment; document that. * mule/mule-cmds.el (get-language-environment-from-locale): Recognise that the 'native-coding-system language-info property can be a list, interpret it correctly when it is one. 2006-12-21 Aidan Kehoe <kehoea@parhasard.net> * coding.el (coding-system-category): Use the new 'unicode-type property for finding what sort of Unicode coding system subtype a coding system is, instead of the overshadowed 'type property. * dumped-lisp.el (preloaded-file-list): mule/european.el has been removed. * loadup.el (really-early-error-handler): Unicode tables loaded at dump time are now in mule/general-late.el. * simple.el (count-lines): Add some backslashes to to parentheses in docstrings to help fontification along. * simple.el (what-cursor-position): Wrap a line to fit in 80 characters. * unicode.el: Use the 'unicode-type property, not 'type, for setting the Unicode coding-system subtype. src/ChangeLog addition: 2006-12-21 Aidan Kehoe <kehoea@parhasard.net> * file-coding.c: Update the make-coding-system docstring to reflect unicode-type * general-slots.h: New symbol, unicode-type, since 'type was being overridden when accessing a coding system's Unicode subtype. * intl-win32.c: Backslash a few parentheses, to help fontification along. * intl-win32.c (complex_vars_of_intl_win32): Use the 'unicode-type symbol, not 'type, when creating the Microsoft Unicode coding system. * unicode.c (unicode_putprop): * unicode.c (unicode_getprop): * unicode.c (unicode_print): Using 'type as the property name when working out what Unicode subtype a given coding system is was broken, since there's a general coding system property called 'type. Change the former to use 'unicode-type instead.
author aidan
date Fri, 29 Dec 2006 18:09:51 +0000
parents 943eaba38521
children
comparison
equal deleted inserted replaced
3766:a3dcf9d17a40 3767:6b2ef948e140
1 # 1 #
2 # Name: ISO 8859-7:1987 to Unicode 2 # Name: ISO 8859-7:2003 to Unicode
3 # Unicode version: 3.0 3 # Unicode version: 4.0
4 # Table version: 1.0 4 # Table version: 2.0
5 # Table format: Format A 5 # Table format: Format A
6 # Date: 1999 July 27 6 # Date: 2003-Nov-12
7 # Authors: Ken Whistler <kenw@sybase.com> 7 # Authors: Ken Whistler <kenw@sybase.com>
8 # 8 #
9 # Copyright (c) 1991-1999 Unicode, Inc. All Rights reserved. 9 # Copyright (c) 1991-2003 Unicode, Inc. All Rights reserved.
10 # 10 #
11 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium). 11 # This file is provided as-is by Unicode, Inc. (The Unicode Consortium).
12 # No claims are made as to fitness for any particular purpose. No 12 # No claims are made as to fitness for any particular purpose. No
13 # warranties of any kind are expressed or implied. The recipient 13 # warranties of any kind are expressed or implied. The recipient
14 # agrees to determine applicability of information provided. If this 14 # agrees to determine applicability of information provided. If this
23 # attached. 23 # attached.
24 # 24 #
25 # General notes: 25 # General notes:
26 # 26 #
27 # This table contains the data the Unicode Consortium has on how 27 # This table contains the data the Unicode Consortium has on how
28 # ISO 8859-7:1987 characters map into Unicode. 28 # ISO 8859-7:2003 characters map into Unicode.
29 # 29 #
30 # ISO 8859-7:1987 is equivalent to ISO-IR-126, ELOT 928, 30 # ISO 8859-7:1987 is equivalent to ISO-IR-126, ELOT 928,
31 # and ECMA 118. 31 # and ECMA 118. ISO 8859-7:2003 adds two currency signs
32 # and one other character not in the earlier standard.
32 # 33 #
33 # Format: Three tab-separated columns 34 # Format: Three tab-separated columns
34 # Column #1 is the ISO 8859-7 code (in hex as 0xXX) 35 # Column #1 is the ISO 8859-7 code (in hex as 0xXX)
35 # Column #2 is the Unicode (in hex as 0xXXXX) 36 # Column #2 is the Unicode (in hex as 0xXXXX)
36 # Column #3 the Unicode name (follows a comment sign, '#') 37 # Column #3 the Unicode name (follows a comment sign, '#')
41 # 1.0 version updates 0.1 version by adding mappings for all 42 # 1.0 version updates 0.1 version by adding mappings for all
42 # control characters. 43 # control characters.
43 # Remap 0xA1 to U+2018 (instead of 0x02BD) to match text of 8859-7 44 # Remap 0xA1 to U+2018 (instead of 0x02BD) to match text of 8859-7
44 # Remap 0xA2 to U+2019 (instead of 0x02BC) to match text of 8859-7 45 # Remap 0xA2 to U+2019 (instead of 0x02BC) to match text of 8859-7
45 # 46 #
47 # 2.0 version updates 1.0 version by adding mappings for the
48 # three newly added characters 0xA4, 0xA5, 0xAA.
49 #
46 # Updated versions of this file may be found in: 50 # Updated versions of this file may be found in:
47 # <ftp://ftp.unicode.org/Public/MAPPINGS/> 51 # <http://www.unicode.org/Public/MAPPINGS/>
48 # 52 #
49 # Any comments or problems, contact <errata@unicode.org> 53 # Any comments or problems, contact the Unicode Consortium at:
50 # Please note that <errata@unicode.org> is an archival address; 54 # <http://www.unicode.org/reporting.html>
51 # notices will be checked, but do not expect an immediate response.
52 # 55 #
53 0x00 0x0000 # NULL 56 0x00 0x0000 # NULL
54 0x01 0x0001 # START OF HEADING 57 0x01 0x0001 # START OF HEADING
55 0x02 0x0002 # START OF TEXT 58 0x02 0x0002 # START OF TEXT
56 0x03 0x0003 # END OF TEXT 59 0x03 0x0003 # END OF TEXT
212 0x9F 0x009F # <control> 215 0x9F 0x009F # <control>
213 0xA0 0x00A0 # NO-BREAK SPACE 216 0xA0 0x00A0 # NO-BREAK SPACE
214 0xA1 0x2018 # LEFT SINGLE QUOTATION MARK 217 0xA1 0x2018 # LEFT SINGLE QUOTATION MARK
215 0xA2 0x2019 # RIGHT SINGLE QUOTATION MARK 218 0xA2 0x2019 # RIGHT SINGLE QUOTATION MARK
216 0xA3 0x00A3 # POUND SIGN 219 0xA3 0x00A3 # POUND SIGN
220 0xA4 0x20AC # EURO SIGN
221 0xA5 0x20AF # DRACHMA SIGN
217 0xA6 0x00A6 # BROKEN BAR 222 0xA6 0x00A6 # BROKEN BAR
218 0xA7 0x00A7 # SECTION SIGN 223 0xA7 0x00A7 # SECTION SIGN
219 0xA8 0x00A8 # DIAERESIS 224 0xA8 0x00A8 # DIAERESIS
220 0xA9 0x00A9 # COPYRIGHT SIGN 225 0xA9 0x00A9 # COPYRIGHT SIGN
226 0xAA 0x037A # GREEK YPOGEGRAMMENI
221 0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK 227 0xAB 0x00AB # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
222 0xAC 0x00AC # NOT SIGN 228 0xAC 0x00AC # NOT SIGN
223 0xAD 0x00AD # SOFT HYPHEN 229 0xAD 0x00AD # SOFT HYPHEN
224 0xAF 0x2015 # HORIZONTAL BAR 230 0xAF 0x2015 # HORIZONTAL BAR
225 0xB0 0x00B0 # DEGREE SIGN 231 0xB0 0x00B0 # DEGREE SIGN