xemacs-beta: lisp/mule/cyrillic.el comparison

comparison lisp/mule/cyrillic.el @ 4604:e0a8715fdb1f

Support new IGNORE-INVALID-SEQUENCESP argument, #'query-coding-region. lisp/ChangeLog addition: 2009-02-07 Aidan Kehoe <kehoea@parhasard.net> * coding.el (query-coding-clear-highlights): Rename the BUFFER argument to BUFFER-OR-STRING, describe it as possibly being a string in its documentation. (default-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document that this function does not support it. Bind case-fold-search to nil, we don't want this to influence what the function thinks is encodable or not. (query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does; reflect this new argument in the associated compiler macro. (query-coding-string): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does. Support the HIGHLIGHT argument correctly. * unicode.el (unicode-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, document what it does, implement this. Document a potential problem. Use #'query-coding-clear-highlights instead of reimplementing it ourselves. Remove some debugging messages. * mule/arabic.el (iso-8859-6): * mule/cyrillic.el (iso-8859-5): * mule/greek.el (iso-8859-7): * mule/hebrew.el (iso-8859-8): * mule/latin.el (iso-8859-2): * mule/latin.el (iso-8859-3): * mule/latin.el (iso-8859-4): * mule/latin.el (iso-8859-14): * mule/latin.el (iso-8859-15): * mule/latin.el (iso-8859-16): * mule/latin.el (iso-8859-9): * mule/latin.el (windows-1252): * mule/mule-coding.el (iso-8859-1): Avoid the assumption that characters not given an explicit mapping in these coding systems map to the ISO 8859-1 characters corresponding to the octets on disk; this makes it much more reasonable to implement the IGNORE-INVALID-SEQUENCESP argument to query-coding-region. * mule/mule-cmds.el (set-language-info): Correct the docstring. * mule/mule-cmds.el (finish-set-language-environment): Treat invalid Unicode sequences produced from invalid-sequence-coding-system and corresponding to control characters the same as control characters in redisplay. * mule/mule-cmds.el: Document that encode-coding-char is available in coding.el * mule/mule-coding.el (make-8-bit-generate-helper): Change to return the both the encode-program generated and the relevant non-ASCII charset; update the docstring to reflect this. * mule/mule-coding.el (make-8-bit-generate-encode-program-and-skip-chars-strings): Rename this function; have it return skip-chars-strings as well as the encode program. Have these skip-chars-strings use ranges for charsets, where possible. * mule/mule-coding.el (make-8-bit-create-decode-encode-tables): Revise this to allow people to specify explicitly characters that should be undefined (= corresponding to keys in unicode-error-default-translation-table), and treating unspecified octets above #x7f as undefined by default. * mule/mule-coding.el (8-bit-fixed-query-coding-region): Add a new IGNORE-INVALID-SEQUENCESP argument, implement support for it using the 8-bit-fixed-invalid-sequences-skip-chars coding system property; remove some debugging messages. * mule/mule-coding.el (make-8-bit-coding-system): This function is dumped, autoloading it makes no sense. Document what happens when characters above #x7f are not specified, implement this. * mule/vietnamese.el: Correct spelling. tests/ChangeLog addition: 2009-02-07 Aidan Kehoe <kehoea@parhasard.net> * automated/query-coding-tests.el: Add FAILING-CASE arguments to the Assert calls, making #'q-c-debug mostly unnecessary. Remove #'q-c-debug. Add new tests that use the IGNORE-INVALID-SEQUENCESP argument to #'query-coding-region; rework the existing ones to respect it.

author	Aidan Kehoe <kehoea@parhasard.net>
date	Sat, 07 Feb 2009 17:13:37 +0000
parents	67fbcaf3dbdc
children	257b468bf2ca

comparison

equal deleted inserted replaced

-:202cb69c4d87
+:e0a8715fdb1f
 (standard-syntax-table))
 ;; And create the coding system.
 (make-8-bit-coding-system
 'iso-8859-5
-'((#xA1 ?\u0401) ;; CYRILLIC CAPITAL LETTER IO
+'((#x80 ?\u0080) ;; <control>
+(#x81 ?\u0081) ;; <control>
+(#x82 ?\u0082) ;; <control>
+(#x83 ?\u0083) ;; <control>
+(#x84 ?\u0084) ;; <control>
+(#x85 ?\u0085) ;; <control>
+(#x86 ?\u0086) ;; <control>
+(#x87 ?\u0087) ;; <control>
+(#x88 ?\u0088) ;; <control>
+(#x89 ?\u0089) ;; <control>
+(#x8A ?\u008A) ;; <control>
+(#x8B ?\u008B) ;; <control>
+(#x8C ?\u008C) ;; <control>
+(#x8D ?\u008D) ;; <control>
+(#x8E ?\u008E) ;; <control>
+(#x8F ?\u008F) ;; <control>
+(#x90 ?\u0090) ;; <control>
+(#x91 ?\u0091) ;; <control>
+(#x92 ?\u0092) ;; <control>
+(#x93 ?\u0093) ;; <control>
+(#x94 ?\u0094) ;; <control>
+(#x95 ?\u0095) ;; <control>
+(#x96 ?\u0096) ;; <control>
+(#x97 ?\u0097) ;; <control>
+(#x98 ?\u0098) ;; <control>
+(#x99 ?\u0099) ;; <control>
+(#x9A ?\u009A) ;; <control>
+(#x9B ?\u009B) ;; <control>
+(#x9C ?\u009C) ;; <control>
+(#x9D ?\u009D) ;; <control>
+(#x9E ?\u009E) ;; <control>
+(#x9F ?\u009F) ;; <control>
+(#xA0 ?\u00A0) ;; NO-BREAK SPACE
+(#xA1 ?\u0401) ;; CYRILLIC CAPITAL LETTER IO
 (#xA2 ?\u0402) ;; CYRILLIC CAPITAL LETTER DJE
 (#xA3 ?\u0403) ;; CYRILLIC CAPITAL LETTER GJE
 (#xA4 ?\u0404) ;; CYRILLIC CAPITAL LETTER UKRAINIAN IE
 (#xA5 ?\u0405) ;; CYRILLIC CAPITAL LETTER DZE
 (#xA6 ?\u0406) ;; CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I
 (#xA8 ?\u0408) ;; CYRILLIC CAPITAL LETTER JE
 (#xA9 ?\u0409) ;; CYRILLIC CAPITAL LETTER LJE
 (#xAA ?\u040A) ;; CYRILLIC CAPITAL LETTER NJE
 (#xAB ?\u040B) ;; CYRILLIC CAPITAL LETTER TSHE
 (#xAC ?\u040C) ;; CYRILLIC CAPITAL LETTER KJE
+(#xAD ?\u00AD) ;; SOFT HYPHEN
 (#xAE ?\u040E) ;; CYRILLIC CAPITAL LETTER SHORT U
 (#xAF ?\u040F) ;; CYRILLIC CAPITAL LETTER DZHE
 (#xB0 ?\u0410) ;; CYRILLIC CAPITAL LETTER A
 (#xB1 ?\u0411) ;; CYRILLIC CAPITAL LETTER BE
 (#xB2 ?\u0412) ;; CYRILLIC CAPITAL LETTER VE
 (#xFE ?\u045E) ;; CYRILLIC SMALL LETTER SHORT U
 (#xFF ?\u045F)) ;; CYRILLIC SMALL LETTER DZHE
 "ISO-8859-5 (Cyrillic)"
 '(mnemonic "ISO8/Cyr"
 documentation "The ISO standard for encoding Cyrillic. Not used in practice.
-See `koi8-r' and `windows-1250'.  "
+See `koi8-r' and `windows-1251'.  "
 aliases (cyrillic-iso-8bit)))
 ;; Provide this locale; but don't allow it to be picked up from the Unix
 ;; locale (it has no locale entry in the alist), we leave that to Russian.
 (set-language-info-alist

Mercurial > hg > xemacs-beta

comparison lisp/mule/cyrillic.el @ 4604:e0a8715fdb1f