Mercurial > hg > xemacs-beta
view lisp/mule/mule-win32-init.el @ 4604:e0a8715fdb1f
Support new IGNORE-INVALID-SEQUENCESP argument, #'query-coding-region.
lisp/ChangeLog addition:
2009-02-07 Aidan Kehoe <kehoea@parhasard.net>
* coding.el (query-coding-clear-highlights):
Rename the BUFFER argument to BUFFER-OR-STRING, describe it as
possibly being a string in its documentation.
(default-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document that this
function does not support it.
Bind case-fold-search to nil, we don't want this to influence what the
function thinks is encodable or not.
(query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does; reflect this new argument in the associated compiler macro.
(query-coding-string):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does. Support the HIGHLIGHT argument correctly.
* unicode.el (unicode-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, document what it
does, implement this. Document a potential problem.
Use #'query-coding-clear-highlights instead of reimplementing it
ourselves.
Remove some debugging messages.
* mule/arabic.el (iso-8859-6):
* mule/cyrillic.el (iso-8859-5):
* mule/greek.el (iso-8859-7):
* mule/hebrew.el (iso-8859-8):
* mule/latin.el (iso-8859-2):
* mule/latin.el (iso-8859-3):
* mule/latin.el (iso-8859-4):
* mule/latin.el (iso-8859-14):
* mule/latin.el (iso-8859-15):
* mule/latin.el (iso-8859-16):
* mule/latin.el (iso-8859-9):
* mule/latin.el (windows-1252):
* mule/mule-coding.el (iso-8859-1):
Avoid the assumption that characters not given an explicit mapping
in these coding systems map to the ISO 8859-1 characters
corresponding to the octets on disk; this makes it much more
reasonable to implement the IGNORE-INVALID-SEQUENCESP argument to
query-coding-region.
* mule/mule-cmds.el (set-language-info):
Correct the docstring.
* mule/mule-cmds.el (finish-set-language-environment):
Treat invalid Unicode sequences produced from
invalid-sequence-coding-system and corresponding to control
characters the same as control characters in redisplay.
* mule/mule-cmds.el:
Document that encode-coding-char is available in coding.el
* mule/mule-coding.el (make-8-bit-generate-helper):
Change to return the both the encode-program generated and the
relevant non-ASCII charset; update the docstring to reflect this.
* mule/mule-coding.el
(make-8-bit-generate-encode-program-and-skip-chars-strings):
Rename this function; have it return skip-chars-strings as well as
the encode program. Have these skip-chars-strings use ranges for
charsets, where possible.
* mule/mule-coding.el (make-8-bit-create-decode-encode-tables):
Revise this to allow people to specify explicitly characters that
should be undefined (= corresponding to keys in
unicode-error-default-translation-table), and treating unspecified
octets above #x7f as undefined by default.
* mule/mule-coding.el (8-bit-fixed-query-coding-region):
Add a new IGNORE-INVALID-SEQUENCESP argument, implement support
for it using the 8-bit-fixed-invalid-sequences-skip-chars coding
system property; remove some debugging messages.
* mule/mule-coding.el (make-8-bit-coding-system):
This function is dumped, autoloading it makes no sense.
Document what happens when characters above #x7f are not
specified, implement this.
* mule/vietnamese.el:
Correct spelling.
tests/ChangeLog addition:
2009-02-07 Aidan Kehoe <kehoea@parhasard.net>
* automated/query-coding-tests.el:
Add FAILING-CASE arguments to the Assert calls, making #'q-c-debug
mostly unnecessary. Remove #'q-c-debug.
Add new tests that use the IGNORE-INVALID-SEQUENCESP argument to
#'query-coding-region; rework the existing ones to respect it.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Sat, 07 Feb 2009 17:13:37 +0000 |
parents | d402d7b18bd8 |
children | a63e666bb68a 308d34e9f07d |
line wrap: on
line source
;;; mule-win32-init.el --- initialization code for MS Windows/Cygwin under MULE ;;; Copyright (C) 2001, 2002 Ben Wing. ;; This file is part of XEmacs. ;; XEmacs is free software; you can redistribute it and/or modify it ;; under the terms of the GNU General Public License as published by ;; the Free Software Foundation; either version 2, or (at your option) ;; any later version. ;; XEmacs is distributed in the hope that it will be useful, but ;; WITHOUT ANY WARRANTY; without even the implied warranty of ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU ;; General Public License for more details. ;; You should have received a copy of the GNU General Public License ;; along with XEmacs; see the file COPYING. If not, write to the ;; Free Software Foundation, Inc., 59 Temple Place - Suite 330, ;; Boston, MA 02111-1307, USA. (make-coding-system 'mswindows-multibyte 'mswindows-multibyte "MS Windows Multibyte (current code page)" '(mnemonic "MSW-MB" documentation "MS Windows multibyte -- current code page. This implements the encoding specified by the current code page -- i.e. the ANSI code page corresponding to the current locale, as returned by (mswindows-locale-code-page (mswindows-current-locale)) " locale current code-page ansi)) ;; we temporarily aliased this to raw-text in intl-win32.c. (define-coding-system-alias 'mswindows-multibyte-system-default nil) (make-coding-system 'mswindows-multibyte-system-default 'mswindows-multibyte "MS Windows Multibyte (system default code page)" '(mnemonic "MSW-MB-SysDef" documentation "MS Windows multibyte -- system default code page. This implements the encoding specified by the system default code page -- i.e. the ANSI code page corresponding to the system default locale, as returned by (mswindows-locale-code-page (mswindows-system-default-locale)) " locale system-default code-page ansi)) (make-coding-system 'mswindows-multibyte-user-default 'mswindows-multibyte "MS Windows Multibyte (user default code page)" '(mnemonic "MSW-MB-UserDef" documentation "MS Windows multibyte -- user default code page. This implements the encoding specified by the user default code page -- i.e. the ANSI code page corresponding to the user default locale, as returned by (mswindows-locale-code-page (mswindows-user-default-locale)) " locale user-default code-page ansi)) (make-coding-system 'mswindows-multibyte-oem 'mswindows-multibyte "MS Windows Multibyte (current OEM code page)" '(mnemonic "MSW-MB-OEM" documentation "MS Windows multibyte -- current OEM code page. This implements the encoding specified by the current OEM code page -- i.e. the OEM code page corresponding to the current locale, as returned by (mswindows-locale-oem-code-page (mswindows-current-locale)) " locale current code-page oem)) (make-coding-system 'mswindows-multibyte-oem-system-default 'mswindows-multibyte "MS Windows Multibyte (system default OEM code page)" '(mnemonic "MSW-MB-OEM-SysDef" documentation "MS Windows multibyte -- system default OEM code page. This implements the encoding specified by the system default OEM code page -- i.e. the OEM code page corresponding to the system default locale, as returned by (mswindows-locale-oem-code-page (mswindows-system-default-locale)) " locale system-default code-page oem)) (make-coding-system 'mswindows-multibyte-oem-user-default 'mswindows-multibyte "MS Windows Multibyte (user default OEM code page)" '(mnemonic "MSW-MB-OEM-UserDef" documentation "MS Windows multibyte -- user default OEM code page. This implements the encoding specified by the user default OEM code page -- i.e. the OEM code page corresponding to the user default locale, as returned by (mswindows-locale-oem-code-page (mswindows-user-default-locale)) " locale user-default code-page oem)) (loop for (ansioem cp category name) in '(("EBCDIC" 037 no-conversion "EBCDIC") ("OEM" 437 no-conversion "MS-DOS United States") ("EBCDIC" 500 no-conversion "EBCDIC \"500V1\"") ;; This is ISO-8859-6. ;; ("OEM" 708 "Arabic (ASMO 708)") ("OEM" 709 no-conversion "Arabic (ASMO 449+, BCON V4)") ("OEM" 710 no-conversion "Arabic (Transparent Arabic)") ("OEM" 720 no-conversion "Arabic (Transparent ASMO)") ("OEM" 737 no-conversion "Greek (formerly 437G)") ("OEM" 775 no-conversion "Baltic") ("OEM" 850 no-conversion "MS-DOS Multilingual (Latin I)") ("OEM" 852 no-conversion "MS-DOS Slavic (Latin II)") ("OEM" 855 no-conversion "IBM Cyrillic (primarily Russian)") ("OEM" 857 no-conversion "IBM Turkish") ("OEM" 860 no-conversion "MS-DOS Portuguese") ("OEM" 861 no-conversion "MS-DOS Icelandic") ("OEM" 862 no-conversion "Hebrew") ("OEM" 863 no-conversion "MS-DOS Canadian-French") ("OEM" 864 no-conversion "Arabic") ("OEM" 865 no-conversion "MS-DOS Nordic") ; ("OEM" 866 no-conversion "MS-DOS Russian") ("OEM" 869 no-conversion "IBM Modern Greek") ("Ansi/OEM" 874 no-conversion "Thai") ("EBCDIC" 875 no-conversion "EBCDIC") ("Ansi/OEM" 932 shift_jis "Japanese") ("Ansi/OEM" 936 iso_8_2 "Chinese (PRC, Singapore)") ("Ansi/OEM" 949 iso_8_2 "Korean") ("Ansi/OEM" 950 big5 "Chinese (Taiwan; Hong Kong SAR, PRC)") ("EBCDIC" 1026 no-conversion "EBCDIC") ;; This code page doesn't work. See ;; http://blogs.msdn.com/michkap/archive/2005/08/01/446475.aspx ; ("ANSI" 1200 utf-16-little-endian "Unicode (BMP of ISO 10646)") ;; We implement this ourselves, and keeping the same implementation ;; across platforms means behaviour is a bit more consistent. ; ("ANSI" 1250 no-conversion "Windows 3.1 Eastern European") ; ("ANSI" 1251 no-conversion "Windows 3.1 Cyrillic") ; ("ANSI" 1252 no-conversion "Windows 3.1 US (ANSI)") ; ("ANSI" 1253 no-conversion "Windows 3.1 Greek") ("ANSI" 1254 no-conversion "Windows 3.1 Turkish") ("ANSI" 1255 no-conversion "Hebrew") ;; We implement these ourselves. ; ("ANSI" 1256 no-conversion "Arabic") ("ANSI" 1257 no-conversion "Baltic") ("ANSI" 1258 no-conversion "VietNam") ;; #### Is this category right? I don't have Lunde to hand, and the ;; online information on Johab is scant. ("Ansi/OEM" 1361 iso_8_2 "Korean (Johab)") ("Mac" 10000 no-conversion "Macintosh Roman") ("Mac" 10001 shift_jis "Macintosh Japanese") ("Mac" 10006 no-conversion "Macintosh Greek I") ("Mac" 10007 no-conversion "Macintosh Cyrillic") ("Mac" 10029 no-conversion "Macintosh Latin 2") ("Mac" 10079 no-conversion "Macintosh Icelandic") ("Mac" 10081 no-conversion "Macintosh Turkish")) do (make-coding-system (intern (format "windows-%s" cp)) 'mswindows-multibyte (format "MS Windows code page %s (%s, %s)" cp ansioem name) `(mnemonic ,(format "MSW-%s" cp) code-page ,cp documentation ,(format "MS Windows Multibyte -- code page %s (%s, %s). This implements the encoding specified by code page %s. For more information on code pages, see `mswindows-charset-code-page'." cp ansioem name cp))) (define-coding-system-alias (intern (format "cp%s" cp)) (intern (format "windows-%s" cp))) (coding-system-put (intern (format "windows-%s" cp)) 'category category))