Mercurial > hg > xemacs-beta
comparison lisp/mule/cyrillic.el @ 4604:e0a8715fdb1f
Support new IGNORE-INVALID-SEQUENCESP argument, #'query-coding-region.
lisp/ChangeLog addition:
2009-02-07 Aidan Kehoe <kehoea@parhasard.net>
* coding.el (query-coding-clear-highlights):
Rename the BUFFER argument to BUFFER-OR-STRING, describe it as
possibly being a string in its documentation.
(default-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document that this
function does not support it.
Bind case-fold-search to nil, we don't want this to influence what the
function thinks is encodable or not.
(query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does; reflect this new argument in the associated compiler macro.
(query-coding-string):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does. Support the HIGHLIGHT argument correctly.
* unicode.el (unicode-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does, implement this. Document a potential problem.
Use #'query-coding-clear-highlights instead of reimplementing it
ourselves.
Remove some debugging messages.
* mule/arabic.el (iso-8859-6):
* mule/cyrillic.el (iso-8859-5):
* mule/greek.el (iso-8859-7):
* mule/hebrew.el (iso-8859-8):
* mule/latin.el (iso-8859-2):
* mule/latin.el (iso-8859-3):
* mule/latin.el (iso-8859-4):
* mule/latin.el (iso-8859-14):
* mule/latin.el (iso-8859-15):
* mule/latin.el (iso-8859-16):
* mule/latin.el (iso-8859-9):
* mule/latin.el (windows-1252):
* mule/mule-coding.el (iso-8859-1):
Avoid the assumption that characters not given an explicit mapping
in these coding systems map to the ISO 8859-1 characters
corresponding to the octets on disk; this makes it much more
reasonable to implement the IGNORE-INVALID-SEQUENCESP argument to
query-coding-region.
* mule/mule-cmds.el (set-language-info):
Correct the docstring.
* mule/mule-cmds.el (finish-set-language-environment):
Treat invalid Unicode sequences produced from
invalid-sequence-coding-system and corresponding to control
characters the same as control characters in redisplay.
* mule/mule-cmds.el:
Document that encode-coding-char is available in coding.el
* mule/mule-coding.el (make-8-bit-generate-helper):
Change to return the both the encode-program generated and the
relevant non-ASCII charset; update the docstring to reflect this.
* mule/mule-coding.el
(make-8-bit-generate-encode-program-and-skip-chars-strings):
Rename this function; have it return skip-chars-strings as well as
the encode program. Have these skip-chars-strings use ranges for
charsets, where possible.
* mule/mule-coding.el (make-8-bit-create-decode-encode-tables):
Revise this to allow people to specify explicitly characters that
should be undefined (= corresponding to keys in
unicode-error-default-translation-table), and treating unspecified
octets above #x7f as undefined by default.
* mule/mule-coding.el (8-bit-fixed-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, implement support
for it using the 8-bit-fixed-invalid-sequences-skip-chars coding
system property; remove some debugging messages.
* mule/mule-coding.el (make-8-bit-coding-system):
This function is dumped, autoloading it makes no sense.
Document what happens when characters above #x7f are not
specified, implement this.
* mule/vietnamese.el:
Correct spelling.
tests/ChangeLog addition:
2009-02-07 Aidan Kehoe <kehoea@parhasard.net>
* automated/query-coding-tests.el:
Add FAILING-CASE arguments to the Assert calls, making #'q-c-debug
mostly unnecessary. Remove #'q-c-debug.
Add new tests that use the IGNORE-INVALID-SEQUENCESP argument to
#'query-coding-region; rework the existing ones to respect it.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Sat, 07 Feb 2009 17:13:37 +0000 |
parents | 67fbcaf3dbdc |
children | 257b468bf2ca |
comparison
equal
deleted
inserted
replaced
4603:202cb69c4d87 | 4604:e0a8715fdb1f |
---|---|
106 (standard-syntax-table)) | 106 (standard-syntax-table)) |
107 | 107 |
108 ;; And create the coding system. | 108 ;; And create the coding system. |
109 (make-8-bit-coding-system | 109 (make-8-bit-coding-system |
110 'iso-8859-5 | 110 'iso-8859-5 |
111 '((#xA1 ?\u0401) ;; CYRILLIC CAPITAL LETTER IO | 111 '((#x80 ?\u0080) ;; <control> |
112 (#x81 ?\u0081) ;; <control> | |
113 (#x82 ?\u0082) ;; <control> | |
114 (#x83 ?\u0083) ;; <control> | |
115 (#x84 ?\u0084) ;; <control> | |
116 (#x85 ?\u0085) ;; <control> | |
117 (#x86 ?\u0086) ;; <control> | |
118 (#x87 ?\u0087) ;; <control> | |
119 (#x88 ?\u0088) ;; <control> | |
120 (#x89 ?\u0089) ;; <control> | |
121 (#x8A ?\u008A) ;; <control> | |
122 (#x8B ?\u008B) ;; <control> | |
123 (#x8C ?\u008C) ;; <control> | |
124 (#x8D ?\u008D) ;; <control> | |
125 (#x8E ?\u008E) ;; <control> | |
126 (#x8F ?\u008F) ;; <control> | |
127 (#x90 ?\u0090) ;; <control> | |
128 (#x91 ?\u0091) ;; <control> | |
129 (#x92 ?\u0092) ;; <control> | |
130 (#x93 ?\u0093) ;; <control> | |
131 (#x94 ?\u0094) ;; <control> | |
132 (#x95 ?\u0095) ;; <control> | |
133 (#x96 ?\u0096) ;; <control> | |
134 (#x97 ?\u0097) ;; <control> | |
135 (#x98 ?\u0098) ;; <control> | |
136 (#x99 ?\u0099) ;; <control> | |
137 (#x9A ?\u009A) ;; <control> | |
138 (#x9B ?\u009B) ;; <control> | |
139 (#x9C ?\u009C) ;; <control> | |
140 (#x9D ?\u009D) ;; <control> | |
141 (#x9E ?\u009E) ;; <control> | |
142 (#x9F ?\u009F) ;; <control> | |
143 (#xA0 ?\u00A0) ;; NO-BREAK SPACE | |
144 (#xA1 ?\u0401) ;; CYRILLIC CAPITAL LETTER IO | |
112 (#xA2 ?\u0402) ;; CYRILLIC CAPITAL LETTER DJE | 145 (#xA2 ?\u0402) ;; CYRILLIC CAPITAL LETTER DJE |
113 (#xA3 ?\u0403) ;; CYRILLIC CAPITAL LETTER GJE | 146 (#xA3 ?\u0403) ;; CYRILLIC CAPITAL LETTER GJE |
114 (#xA4 ?\u0404) ;; CYRILLIC CAPITAL LETTER UKRAINIAN IE | 147 (#xA4 ?\u0404) ;; CYRILLIC CAPITAL LETTER UKRAINIAN IE |
115 (#xA5 ?\u0405) ;; CYRILLIC CAPITAL LETTER DZE | 148 (#xA5 ?\u0405) ;; CYRILLIC CAPITAL LETTER DZE |
116 (#xA6 ?\u0406) ;; CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I | 149 (#xA6 ?\u0406) ;; CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I |
118 (#xA8 ?\u0408) ;; CYRILLIC CAPITAL LETTER JE | 151 (#xA8 ?\u0408) ;; CYRILLIC CAPITAL LETTER JE |
119 (#xA9 ?\u0409) ;; CYRILLIC CAPITAL LETTER LJE | 152 (#xA9 ?\u0409) ;; CYRILLIC CAPITAL LETTER LJE |
120 (#xAA ?\u040A) ;; CYRILLIC CAPITAL LETTER NJE | 153 (#xAA ?\u040A) ;; CYRILLIC CAPITAL LETTER NJE |
121 (#xAB ?\u040B) ;; CYRILLIC CAPITAL LETTER TSHE | 154 (#xAB ?\u040B) ;; CYRILLIC CAPITAL LETTER TSHE |
122 (#xAC ?\u040C) ;; CYRILLIC CAPITAL LETTER KJE | 155 (#xAC ?\u040C) ;; CYRILLIC CAPITAL LETTER KJE |
156 (#xAD ?\u00AD) ;; SOFT HYPHEN | |
123 (#xAE ?\u040E) ;; CYRILLIC CAPITAL LETTER SHORT U | 157 (#xAE ?\u040E) ;; CYRILLIC CAPITAL LETTER SHORT U |
124 (#xAF ?\u040F) ;; CYRILLIC CAPITAL LETTER DZHE | 158 (#xAF ?\u040F) ;; CYRILLIC CAPITAL LETTER DZHE |
125 (#xB0 ?\u0410) ;; CYRILLIC CAPITAL LETTER A | 159 (#xB0 ?\u0410) ;; CYRILLIC CAPITAL LETTER A |
126 (#xB1 ?\u0411) ;; CYRILLIC CAPITAL LETTER BE | 160 (#xB1 ?\u0411) ;; CYRILLIC CAPITAL LETTER BE |
127 (#xB2 ?\u0412) ;; CYRILLIC CAPITAL LETTER VE | 161 (#xB2 ?\u0412) ;; CYRILLIC CAPITAL LETTER VE |
203 (#xFE ?\u045E) ;; CYRILLIC SMALL LETTER SHORT U | 237 (#xFE ?\u045E) ;; CYRILLIC SMALL LETTER SHORT U |
204 (#xFF ?\u045F)) ;; CYRILLIC SMALL LETTER DZHE | 238 (#xFF ?\u045F)) ;; CYRILLIC SMALL LETTER DZHE |
205 "ISO-8859-5 (Cyrillic)" | 239 "ISO-8859-5 (Cyrillic)" |
206 '(mnemonic "ISO8/Cyr" | 240 '(mnemonic "ISO8/Cyr" |
207 documentation "The ISO standard for encoding Cyrillic. Not used in practice. | 241 documentation "The ISO standard for encoding Cyrillic. Not used in practice. |
208 See `koi8-r' and `windows-1250'. " | 242 See `koi8-r' and `windows-1251'. " |
209 aliases (cyrillic-iso-8bit))) | 243 aliases (cyrillic-iso-8bit))) |
210 | 244 |
211 ;; Provide this locale; but don't allow it to be picked up from the Unix | 245 ;; Provide this locale; but don't allow it to be picked up from the Unix |
212 ;; locale (it has no locale entry in the alist), we leave that to Russian. | 246 ;; locale (it has no locale entry in the alist), we leave that to Russian. |
213 (set-language-info-alist | 247 (set-language-info-alist |