comparison lisp/mule/greek.el @ 4604:e0a8715fdb1f

Support new IGNORE-INVALID-SEQUENCESP argument, #'query-coding-region. lisp/ChangeLog addition: 2009-02-07 Aidan Kehoe <kehoea@parhasard.net> * coding.el (query-coding-clear-highlights): Rename the BUFFER argument to BUFFER-OR-STRING, describe it as possibly being a string in its documentation. (default-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document that this function does not support it. Bind case-fold-search to nil, we don't want this to influence what the function thinks is encodable or not. (query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does; reflect this new argument in the associated compiler macro. (query-coding-string): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does. Support the HIGHLIGHT argument correctly. * unicode.el (unicode-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does, implement this. Document a potential problem. Use #'query-coding-clear-highlights instead of reimplementing it ourselves. Remove some debugging messages. * mule/arabic.el (iso-8859-6): * mule/cyrillic.el (iso-8859-5): * mule/greek.el (iso-8859-7): * mule/hebrew.el (iso-8859-8): * mule/latin.el (iso-8859-2): * mule/latin.el (iso-8859-3): * mule/latin.el (iso-8859-4): * mule/latin.el (iso-8859-14): * mule/latin.el (iso-8859-15): * mule/latin.el (iso-8859-16): * mule/latin.el (iso-8859-9): * mule/latin.el (windows-1252): * mule/mule-coding.el (iso-8859-1): Avoid the assumption that characters not given an explicit mapping in these coding systems map to the ISO 8859-1 characters corresponding to the octets on disk; this makes it much more reasonable to implement the IGNORE-INVALID-SEQUENCESP argument to query-coding-region. * mule/mule-cmds.el (set-language-info): Correct the docstring. * mule/mule-cmds.el (finish-set-language-environment): Treat invalid Unicode sequences produced from invalid-sequence-coding-system and corresponding to control characters the same as control characters in redisplay. * mule/mule-cmds.el: Document that encode-coding-char is available in coding.el * mule/mule-coding.el (make-8-bit-generate-helper): Change to return the both the encode-program generated and the relevant non-ASCII charset; update the docstring to reflect this. * mule/mule-coding.el (make-8-bit-generate-encode-program-and-skip-chars-strings): Rename this function; have it return skip-chars-strings as well as the encode program. Have these skip-chars-strings use ranges for charsets, where possible. * mule/mule-coding.el (make-8-bit-create-decode-encode-tables): Revise this to allow people to specify explicitly characters that should be undefined (= corresponding to keys in unicode-error-default-translation-table), and treating unspecified octets above #x7f as undefined by default. * mule/mule-coding.el (8-bit-fixed-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, implement support for it using the 8-bit-fixed-invalid-sequences-skip-chars coding system property; remove some debugging messages. * mule/mule-coding.el (make-8-bit-coding-system): This function is dumped, autoloading it makes no sense. Document what happens when characters above #x7f are not specified, implement this. * mule/vietnamese.el: Correct spelling. tests/ChangeLog addition: 2009-02-07 Aidan Kehoe <kehoea@parhasard.net> * automated/query-coding-tests.el: Add FAILING-CASE arguments to the Assert calls, making #'q-c-debug mostly unnecessary. Remove #'q-c-debug. Add new tests that use the IGNORE-INVALID-SEQUENCESP argument to #'query-coding-region; rework the existing ones to respect it.
author Aidan Kehoe <kehoea@parhasard.net>
date Sat, 07 Feb 2009 17:13:37 +0000
parents 67fbcaf3dbdc
children 257b468bf2ca
comparison
equal deleted inserted replaced
4603:202cb69c4d87 4604:e0a8715fdb1f
118 (string (char-syntax (make-char 'latin-iso8859-1 latin-1))) 118 (string (char-syntax (make-char 'latin-iso8859-1 latin-1)))
119 syntax-table)) 119 syntax-table))
120 120
121 (make-8-bit-coding-system 121 (make-8-bit-coding-system
122 'iso-8859-7 122 'iso-8859-7
123 '((#xA1 ?\u2018) ;; LEFT SINGLE QUOTATION MARK 123 '((#x80 ?\u0080) ;; <control>
124 (#x81 ?\u0081) ;; <control>
125 (#x82 ?\u0082) ;; <control>
126 (#x83 ?\u0083) ;; <control>
127 (#x84 ?\u0084) ;; <control>
128 (#x85 ?\u0085) ;; <control>
129 (#x86 ?\u0086) ;; <control>
130 (#x87 ?\u0087) ;; <control>
131 (#x88 ?\u0088) ;; <control>
132 (#x89 ?\u0089) ;; <control>
133 (#x8A ?\u008A) ;; <control>
134 (#x8B ?\u008B) ;; <control>
135 (#x8C ?\u008C) ;; <control>
136 (#x8D ?\u008D) ;; <control>
137 (#x8E ?\u008E) ;; <control>
138 (#x8F ?\u008F) ;; <control>
139 (#x90 ?\u0090) ;; <control>
140 (#x91 ?\u0091) ;; <control>
141 (#x92 ?\u0092) ;; <control>
142 (#x93 ?\u0093) ;; <control>
143 (#x94 ?\u0094) ;; <control>
144 (#x95 ?\u0095) ;; <control>
145 (#x96 ?\u0096) ;; <control>
146 (#x97 ?\u0097) ;; <control>
147 (#x98 ?\u0098) ;; <control>
148 (#x99 ?\u0099) ;; <control>
149 (#x9A ?\u009A) ;; <control>
150 (#x9B ?\u009B) ;; <control>
151 (#x9C ?\u009C) ;; <control>
152 (#x9D ?\u009D) ;; <control>
153 (#x9E ?\u009E) ;; <control>
154 (#x9F ?\u009F) ;; <control>
155 (#xA0 ?\u00A0) ;; NO-BREAK SPACE
156 (#xA1 ?\u2018) ;; LEFT SINGLE QUOTATION MARK
124 (#xA2 ?\u2019) ;; RIGHT SINGLE QUOTATION MARK 157 (#xA2 ?\u2019) ;; RIGHT SINGLE QUOTATION MARK
158 (#xA3 ?\u00A3) ;; POUND SIGN
125 (#xA4 ?\u20AC) ;; EURO SIGN 159 (#xA4 ?\u20AC) ;; EURO SIGN
126 (#xA5 ?\u20AF) ;; DRACHMA SIGN 160 (#xA5 ?\u20AF) ;; DRACHMA SIGN
161 (#xA6 ?\u00A6) ;; BROKEN BAR
162 (#xA7 ?\u00A7) ;; SECTION SIGN
163 (#xA8 ?\u00A8) ;; DIAERESIS
164 (#xA9 ?\u00A9) ;; COPYRIGHT SIGN
127 (#xAA ?\u037A) ;; GREEK YPOGEGRAMMENI 165 (#xAA ?\u037A) ;; GREEK YPOGEGRAMMENI
166 (#xAB ?\u00AB) ;; LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
167 (#xAC ?\u00AC) ;; NOT SIGN
168 (#xAD ?\u00AD) ;; SOFT HYPHEN
128 (#xAF ?\u2015) ;; HORIZONTAL BAR 169 (#xAF ?\u2015) ;; HORIZONTAL BAR
170 (#xB0 ?\u00B0) ;; DEGREE SIGN
171 (#xB1 ?\u00B1) ;; PLUS-MINUS SIGN
172 (#xB2 ?\u00B2) ;; SUPERSCRIPT TWO
173 (#xB3 ?\u00B3) ;; SUPERSCRIPT THREE
129 (#xB4 ?\u0384) ;; GREEK TONOS 174 (#xB4 ?\u0384) ;; GREEK TONOS
130 (#xB5 ?\u0385) ;; GREEK DIALYTIKA TONOS 175 (#xB5 ?\u0385) ;; GREEK DIALYTIKA TONOS
131 (#xB6 ?\u0386) ;; GREEK CAPITAL LETTER ALPHA WITH TONOS 176 (#xB6 ?\u0386) ;; GREEK CAPITAL LETTER ALPHA WITH TONOS
177 (#xB7 ?\u00B7) ;; MIDDLE DOT
132 (#xB8 ?\u0388) ;; GREEK CAPITAL LETTER EPSILON WITH TONOS 178 (#xB8 ?\u0388) ;; GREEK CAPITAL LETTER EPSILON WITH TONOS
133 (#xB9 ?\u0389) ;; GREEK CAPITAL LETTER ETA WITH TONOS 179 (#xB9 ?\u0389) ;; GREEK CAPITAL LETTER ETA WITH TONOS
134 (#xBA ?\u038A) ;; GREEK CAPITAL LETTER IOTA WITH TONOS 180 (#xBA ?\u038A) ;; GREEK CAPITAL LETTER IOTA WITH TONOS
181 (#xBB ?\u00BB) ;; RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK
135 (#xBC ?\u038C) ;; GREEK CAPITAL LETTER OMICRON WITH TONOS 182 (#xBC ?\u038C) ;; GREEK CAPITAL LETTER OMICRON WITH TONOS
183 (#xBD ?\u00BD) ;; VULGAR FRACTION ONE HALF
136 (#xBE ?\u038E) ;; GREEK CAPITAL LETTER UPSILON WITH TONOS 184 (#xBE ?\u038E) ;; GREEK CAPITAL LETTER UPSILON WITH TONOS
137 (#xBF ?\u038F) ;; GREEK CAPITAL LETTER OMEGA WITH TONOS 185 (#xBF ?\u038F) ;; GREEK CAPITAL LETTER OMEGA WITH TONOS
138 (#xC0 ?\u0390) ;; GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS 186 (#xC0 ?\u0390) ;; GREEK SMALL LETTER IOTA WITH DIALYTIKA AND TONOS
139 (#xC1 ?\u0391) ;; GREEK CAPITAL LETTER ALPHA 187 (#xC1 ?\u0391) ;; GREEK CAPITAL LETTER ALPHA
140 (#xC2 ?\u0392) ;; GREEK CAPITAL LETTER BETA 188 (#xC2 ?\u0392) ;; GREEK CAPITAL LETTER BETA
194 (#xF9 ?\u03C9) ;; GREEK SMALL LETTER OMEGA 242 (#xF9 ?\u03C9) ;; GREEK SMALL LETTER OMEGA
195 (#xFA ?\u03CA) ;; GREEK SMALL LETTER IOTA WITH DIALYTIKA 243 (#xFA ?\u03CA) ;; GREEK SMALL LETTER IOTA WITH DIALYTIKA
196 (#xFB ?\u03CB) ;; GREEK SMALL LETTER UPSILON WITH DIALYTIKA 244 (#xFB ?\u03CB) ;; GREEK SMALL LETTER UPSILON WITH DIALYTIKA
197 (#xFC ?\u03CC) ;; GREEK SMALL LETTER OMICRON WITH TONOS 245 (#xFC ?\u03CC) ;; GREEK SMALL LETTER OMICRON WITH TONOS
198 (#xFD ?\u03CD) ;; GREEK SMALL LETTER UPSILON WITH TONOS 246 (#xFD ?\u03CD) ;; GREEK SMALL LETTER UPSILON WITH TONOS
199 (#xFE ?\u03CE)) ;; GREEK SMALL LETTER OMEGA WITH TONOS 247 (#xFE ?\u03CE));; GREEK SMALL LETTER OMEGA WITH TONOS
200 "ISO-8859-7 (Greek)" 248 "ISO-8859-7 (Greek)"
201 '(mnemonic "Grk" 249 '(mnemonic "Grk"
202 aliases (greek-iso-8bit))) 250 aliases (greek-iso-8bit)))
203 251
204 (make-8-bit-coding-system 252 (make-8-bit-coding-system