Mercurial > hg > xemacs-beta
view lisp/mule/chinese.el @ 1315:70921960b980
[xemacs-hg @ 2003-02-20 08:19:28 by ben]
check in makefile fixes et al
Makefile.in.in: Major surgery. Move all stuff related to building anything in the
src/ directory into src/. Simplify the dependencies -- everything
in src/ is dependent on the single entry `src' in MAKE_SUBDIRS.
Remove weirdo targets like `all-elc[s]', dump-elc[s], etc.
mule/mule-msw-init.el: Removed.
Delete this file.
mule/mule-win32-init.el: New file, with stuff from mule-msw-init.el -- not just for MS Windows
native, boys and girls!
bytecomp.el: Change code inserted to catch trying to load a Mule-only .elc
file in a non-Mule XEmacs. Formerly you got the rather cryptic
"The required feature `mule' cannot be provided". Now you get
"Loading this file requires Mule support".
finder.el: Remove dependency on which directory this function is invoked
from.
update-elc.el: Don't mess around with ../src/BYTECOMPILE_CHANGE. Now that
Makefile.in.in and xemacs.mak are in sync, both of them use
NEEDTODUMP and the other one isn't used.
dumped-lisp.el: Rewrite in terms of `list' and `nconc' instead of assemble-list, so
we can have arbitrary forms, not just `when-feature'.
very-early-lisp.el: Nuke this file.
finder-inf.el, packages.el, update-elc.el, update-elc-2.el, loadup.el, make-docfile.el: Eliminate references to very-early-lisp.
msw-glyphs.el: Comment clarification.
xemacs.mak: Add macros DO_TEMACS, DO_XEMACS, and a few others; this macro
section is now completely in sync with src/Makefile.in.in. Copy
check-features, load-shadows, and rebuilding finder-inf.el from
src/Makefile.in.in. The main build/dump/recompile process is now
synchronized with src/Makefile.in.in. Change `WARNING' to `NOTE'
and `error checking' to `error-checking' TO avoid tripping
faux warnings and errors in the VC++ IDE.
Makefile.in.in: Major surgery. Move all stuff related to building anything in the
src/ directory from top-level Makefile.in.in to here. Simplify
the dependencies. Rearrange into logical subsections.
Synchronize the main compile/dump/build-elcs section with
xemacs.mak, which is already clean and in good working order.
Remove weirdo targets like `all-elc[s]', dump-elc[s], etc. Add
additional levels of macros \(e.g. DO_TEMACS, DO_XEMACS,
TEMACS_BATCH, XEMACS_BATCH, XEMACS_BATCH_PACKAGES) to factor out
duplicated stuff. Clean up handling of "HEAP_IN_DATA" (Cygwin) so
it doesn't need to ignore the return value from dumping. Add
.NO_PARALLEL since various aspects of building and dumping must be
serialized but do not always have dependencies between them
(this is impossible in some cases). Everything related to src/
now gets built in one pass in this directory by just running
`make' (except the Makefiles themselves and config.h, paths.h,
Emacs.ad.h, and other generated .h files).
console.c: Update list of possibly valid console types.
emacs.c: Rationalize the specifying and handling of the type of the first
frame. This was originally prompted by a workspace in which I got
GTK to compile under C++ and in the process fixed it so it could
coexist with X in the same build -- hence, a combined
TTY/X/MS-Windows/GTK build is now possible under Cygwin. (However,
you can't simultaneously *display* more than one kind of device
connection -- but getting that to work is not that difficult.
Perhaps a project for a bored grad student. I (ben) would do it
but don't see the use.) To make sense of this, I added new
switches that can be used to specifically indicate the window
system: -x [aka --use-x], -tty \[aka --use-tty], -msw [aka
--use-ms-windows], -gtk [aka --use-gtk], and -gnome [aka
--use-gnome, same as --use-gtk]. -nw continues as an alias for
-tty. When none have been given, XEmacs checks for other
parameters implying particular device types (-t -> tty, -display
-> x [or should it have same treatment as DISPLAY below?]), and
has ad-hoc logic afterwards: if env var DISPLAY is set, use x (or
gtk? perhaps should check whether gnome is running), else MS
Windows if it exsits, else TTY if it exists, else stream, and you
must be running in batch mode. This also fixes an existing bug
whereby compiling with no x, no mswin, no tty, when running non-
interactively (e.g. to dump) I get "sorry, must have TTY support".
emacs.c: Turn on Vstack_trace_on_error so that errors are debuggable even
when occurring extremely early in reinitialization.
emacs.c: Try to make sure that the user can see message output under
Windows (i.e. it doesn't just disappear right away) regardless of
when it occurs, e.g. in the middle of creating the first frame.
emacs.c: Define new function `emacs-run-status', indicating whether XEmacs
is noninteractive or interactive, whether raw,
post-dump/pdump-load or run-temacs, whether we are dumping,
whether pdump is in effect.
event-stream.c: It's "mommas are fat", not "momas are fat".
Fix other typo.
event-stream.c: Conditionalize in_menu_callback check on HAVE_MENUBARS,
because it won't exist on w/o menubar support,
lisp.h: More hackery on RETURN_NOT_REACHED. Cygwin v3.2 DOES complain here
if RETURN_NOT_REACHED() is blank, as it is for GCC 2.5+. So make it
blank only for GCC 2.5 through 2.999999999999999.
Declare Vstack_trace_on_error.
profile.c: Need to include "profile.h" to fix warnings.
sheap.c: Don't fatal() when need to rerun Make, just stderr_out() and exit(0).
That way we can distinguish between a dumping failing expectedly
(due to lack of stack space, triggering another dump) and unexpectedly,
in which case, we want to stop building. (or go on, if -K is given)
syntax.c, syntax.h: Use ints where they belong, and enum syntaxcode's where they belong,
and fix warnings thereby.
syntax.h: Fix crash caused by an edge condition in the syntax-cache macros.
text.h: Spacing fixes.
xmotif.h: New file, to get around shadowing warnings.
EmacsManager.c, event-Xt.c, glyphs-x.c, gui-x.c, input-method-motif.c, xmmanagerp.h, xmprimitivep.h: Include xmotif.h.
alloc.c: Conditionalize in_malloc on ERROR_CHECK_MALLOC.
config.h.in, file-coding.h, fileio.c, getloadavg.c, select-x.c, signal.c, sysdep.c, sysfile.h, systime.h, text.c, unicode.c: Eliminate HAVE_WIN32_CODING_SYSTEMS, use WIN32_ANY instead.
Replace defined (WIN32_NATIVE) || defined (CYGWIN) with WIN32_ANY.
lisp.h: More futile attempts to walk and chew gum at the same time when
dealing with subr's that don't return.
author | ben |
---|---|
date | Thu, 20 Feb 2003 08:19:44 +0000 |
parents | 0f42d0a17667 |
children | c112a340ac91 |
line wrap: on
line source
;;; chinese.el --- Support for Chinese -*- coding: iso-2022-7bit; -*- ;; Copyright (C) 1995 Electrotechnical Laboratory, JAPAN. ;; Licensed to the Free Software Foundation. ;; Copyright (C) 1997 MORIOKA Tomohiko ;; Copyright (C) 2000, 2001, 2002 Ben Wing. ;; Keywords: multilingual, Chinese ;; This file is part of XEmacs. ;; XEmacs is free software; you can redistribute it and/or modify it ;; under the terms of the GNU General Public License as published by ;; the Free Software Foundation; either version 2, or (at your option) ;; any later version. ;; XEmacs is distributed in the hope that it will be useful, but ;; WITHOUT ANY WARRANTY; without even the implied warranty of ;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU ;; General Public License for more details. ;; You should have received a copy of the GNU General Public License ;; along with XEmacs; see the file COPYING. If not, write to the Free ;; Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA ;; 02111-1307, USA. ;;; Commentary: ;; For Chinese, three character sets GB2312, BIG5, and CNS11643 are ;; supported. ;;; Code: (eval-when-compile (require 'china-util)) ; (make-charset 'chinese-gb2312 ; "GB2312 Chinese simplified: ISO-IR-58" ; '(dimension ; 2 ; registry "GB2312.1980" ; chars 94 ; columns 2 ; direction l2r ; final ?A ; graphic 0 ; short-name "GB2312" ; long-name "GB2312: ISO-IR-58" ; )) ; (make-charset 'chinese-cns11643-1 ; "CNS11643 Plane 1 Chinese traditional: ISO-IR-171" ; '(dimension ; 2 ; registry "CNS11643.1992-1" ; chars 94 ; columns 2 ; direction l2r ; final ?G ; graphic 0 ; short-name "CNS11643-1" ; long-name "CNS11643-1 (Chinese traditional): ISO-IR-171" ; )) ; (make-charset 'chinese-cns11643-2 ; "CNS11643 Plane 2 Chinese traditional: ISO-IR-172" ; '(dimension ; 2 ; registry "CNS11643.1992-2" ; chars 94 ; columns 2 ; direction l2r ; final ?H ; graphic 0 ; short-name "CNS11643-2" ; long-name "CNS11643-2 (Chinese traditional): ISO-IR-172" ; )) ; (make-charset 'chinese-big5-1 ; "Frequently used part (A141-C67F) of Big5 (Chinese traditional)" ; '(dimension ; 2 ; registry "Big5" ; chars 94 ; columns 2 ; direction l2r ; final ?0 ; graphic 0 ; short-name "Big5 (Level-1)" ; long-name "Big5 (Level-1) A141-C67F" ; )) ; (make-charset 'chinese-big5-2 ; "Less frequently used part (C940-FEFE) of Big5 (Chinese traditional)" ; '(dimension ; 2 ; registry "Big5" ; chars 94 ; columns 2 ; direction l2r ; final ?1 ; graphic 0 ; short-name "Big5 (Level-2)" ; long-name "Big5 (Level-2) C940-FEFE" ; )) ;; Syntax of Chinese characters. (modify-syntax-entry 'chinese-gb2312 "w") (loop for row in '(33 34 41) do (modify-syntax-entry `[chinese-gb2312 ,row] ".")) ;;(loop for row from 35 to 40 ;; do (modify-syntax-entry `[chinese-gb2312 ,row] "w")) ;;(loop for row from 42 to 126 ;; do (modify-syntax-entry `[chinese-gb2312 ,row] "w")) (modify-syntax-entry 'chinese-cns11643-1 "w") (modify-syntax-entry 'chinese-cns11643-2 "w") (modify-syntax-entry 'chinese-big5-1 "w") (modify-syntax-entry 'chinese-big5-2 "w") ; ;; Chinese CNS11643 Plane3 thru Plane7. Although these are official ; ;; character sets, the use is rare and don't have to be treated ; ;; space-efficiently in the buffer. ; (make-charset 'chinese-cns11643-3 ; "CNS11643 Plane 3 Chinese Traditional: ISO-IR-183" ; '(dimension ; 2 ; registry "CNS11643.1992-3" ; chars 94 ; columns 2 ; direction l2r ; final ?I ; graphic 0 ; short-name "CNS11643-3" ; long-name "CNS11643-3 (Chinese traditional): ISO-IR-183" ; )) ;; CNS11643 Plane3 thru Plane7 ;; These represent more and more obscure Chinese characters. ;; By the time you get to Plane 7, we're talking about characters ;; that appear once in some ancient manuscript and whose meaning ;; is unknown. (flet ((make-chinese-cns11643-charset (name plane final) (make-charset name (concat "CNS 11643 Plane " plane " (Chinese traditional)") `(registry ,(concat "CNS11643[.-]\\(.*[.-]\\)?" plane "$") dimension 2 chars 94 final ,final graphic 0 short-name ,(concat "CNS11643-" plane) long-name ,(format "CNS11643-%s (Chinese traditional): ISO-IR-183" plane))) (modify-syntax-entry name "w") (modify-category-entry name ?t) )) (make-chinese-cns11643-charset 'chinese-cns11643-3 "3" ?I) (make-chinese-cns11643-charset 'chinese-cns11643-4 "4" ?J) (make-chinese-cns11643-charset 'chinese-cns11643-5 "5" ?K) (make-chinese-cns11643-charset 'chinese-cns11643-6 "6" ?L) (make-chinese-cns11643-charset 'chinese-cns11643-7 "7" ?M) ) ;; ISO-IR-165 (CCITT Extended GB) ;; It is based on CCITT Recommendation T.101, includes GB 2312-80 + ;; GB 8565-88 table A4 + 293 characters. (make-charset ;; not in FSF 21.1 'chinese-isoir165 "ISO-IR-165 (CCITT Extended GB; Chinese simplified)" `(registry "isoir165" dimension 2 chars 94 final ?E graphic 0 short-name "ISO-IR-165" long-name "ISO-IR-165 (CCITT Extended GB; Chinese simplified)")) ;; PinYin-ZhuYin (make-charset 'chinese-sisheng "SiSheng characters for PinYin/ZhuYin" '(dimension 1 ;; XEmacs addition: second half of registry spec registry "sisheng_cwnn\\|OMRON_UDC_ZH" chars 94 columns 1 direction l2r final ?0 graphic 0 short-name "SiSheng" long-name "SiSheng (PinYin/ZhuYin)" )) ;; If you prefer QUAIL to EGG, please modify below as you wish. ;;(when (and (featurep 'egg) (featurep 'wnn)) ;; (setq wnn-server-type 'cserver) ;; (load "pinyin") ;; (setq its:*standard-modes* ;; (cons (its:get-mode-map "PinYin") its:*standard-modes*))) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;;; Chinese (general) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; (make-coding-system ;; 'iso-2022-cn 2 ?C ;; "ISO 2022 based 7bit encoding for Chinese GB and CNS (MIME:ISO-2022-CN)" ;; '(ascii ;; (nil chinese-gb2312 chinese-cns11643-1) ;; (nil chinese-cns11643-2) ;; nil ;; nil ascii-eol ascii-cntl seven locking-shift single-shift nil nil nil ;; init-bol) ;; '((safe-charsets ascii chinese-gb2312 chinese-cns11643-1 chinese-cns11643-2) ;; (mime-charset . iso-2022-cn))) ;; (define-coding-system-alias 'chinese-iso-7bit 'iso-2022-cn) ;; (make-coding-system ;; 'iso-2022-cn-ext 2 ?C ;; "ISO 2022 based 7bit encoding for Chinese GB and CNS (MIME:ISO-2022-CN-EXT)" ;; '(ascii ;; (nil chinese-gb2312 chinese-cns11643-1) ;; (nil chinese-cns11643-2) ;; (nil chinese-cns11643-3 chinese-cns11643-4 chinese-cns11643-5 ;; chinese-cns11643-6 chinese-cns11643-7) ;; nil ascii-eol ascii-cntl seven locking-shift single-shift nil nil nil ;; init-bol) ;; '((safe-charsets ascii chinese-gb2312 chinese-cns11643-1 chinese-cns11643-2 ;; chinese-cns11643-3 chinese-cns11643-4 chinese-cns11643-5 ;; chinese-cns11643-6 chinese-cns11643-7) ;; (mime-charset . iso-2022-cn-ext))) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;;; Chinese GB2312 (simplified) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; (make-coding-system ;; 'chinese-iso-8bit 2 ?c ;; "ISO 2022 based EUC encoding for Chinese GB2312 (MIME:CN-GB-2312)" ;; '(ascii chinese-gb2312 nil nil ;; nil ascii-eol ascii-cntl nil nil nil nil) ;; '((safe-charsets ascii chinese-gb2312) ;; (mime-charset . cn-gb-2312))) (make-coding-system 'cn-gb-2312 'iso2022 "Chinese EUC" '(charset-g0 ascii charset-g1 chinese-gb2312 charset-g2 chinese-sisheng charset-g3 t mnemonic "Zh-GB/EUC" documentation "Chinese EUC (Extended Unix Code), the standard Chinese encoding on Unix. This follows the same overall EUC principles as Japanese EUC (see the description under Japanese EUC), but specifies different character sets: G0: ASCII G1: Chinese-GB2312 G2: Sisheng (PinYin - ZhuYin)" )) ;; (define-coding-system-alias 'cn-gb-2312 'chinese-iso-8bit) ;; (define-coding-system-alias 'euc-china 'chinese-iso-8bit) ;; (define-coding-system-alias 'euc-cn 'chinese-iso-8bit) (define-coding-system-alias 'gb2312 'cn-gb-2312) (define-coding-system-alias 'chinese-euc 'cn-gb-2312) ;; (make-coding-system ;; 'chinese-hz 0 ?z ;; "Hz/ZW 7-bit encoding for Chinese GB2312 (MIME:HZ-GB-2312)" ;; nil ;; '((safe-charsets ascii chinese-gb2312) ;; (mime-charset . hz-gb-2312) ;; (post-read-conversion . post-read-decode-hz) ;; (pre-write-conversion . pre-write-encode-hz))) ;; (put 'chinese-hz 'post-read-conversion 'post-read-decode-hz) ;; (put 'chinese-hz 'pre-write-conversion 'pre-write-encode-hz) (make-coding-system 'hz-gb-2312 'no-conversion "Hz/ZW (Chinese)" '(mnemonic "Zh-GB/Hz" eol-type lf post-read-conversion post-read-decode-hz pre-write-conversion pre-write-encode-hz documentation "Hz/ZW 7-bit encoding for Chinese GB2312 (MIME:HZ-GB-2312)" )) ;; (define-coding-system-alias 'hz-gb-2312 'chinese-hz) ;; (define-coding-system-alias 'hz 'chinese-hz) (define-coding-system-alias 'hz 'hz-gb-2312) (defun post-read-decode-hz (len) (let ((pos (point)) (buffer-modified-p (buffer-modified-p)) ;last-coding-system-used ) (prog1 (decode-hz-region pos (+ pos len)) (set-buffer-modified-p buffer-modified-p)))) (defun pre-write-encode-hz (from to) (let ((buf (current-buffer))) (set-buffer (generate-new-buffer " *temp*")) (if (stringp from) (insert from) (insert-buffer-substring buf from to)) ;(let (last-coding-system-used) (encode-hz-region 1 (point-max)) nil)) (set-language-info-alist "Chinese-GB" '((setup-function . setup-chinese-gb-environment-internal) (charset chinese-gb2312 chinese-sisheng) (coding-system cn-gb-2312 iso-2022-7bit hz-gb-2312) (coding-priority cn-gb-2312 big5 iso-2022-7bit) (cygwin-locale "zh") (locale "zh_CN.eucCN" "zh_CN.EUC" "zh_CN" "chinese-s" "zh" (lambda (arg) (and arg (let ((case-fold-search t)) (string-match "^zh_.*.GB.*" arg))))) (mswindows-locale ("CHINESE" . "CHINESE_SIMPLIFIED")) (native-coding-system cn-gb-2312) (input-method . "chinese-py-punct") (features china-util) (sample-text . "Chinese ($AVPND(B,$AFUM(;0(B,$A::So(B) $ADc:C(B") (documentation . "Supports Simplified Chinese, used in mainland China. Uses the GB2312 character set.")) '("Chinese")) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; Chinese BIG5 (traditional) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; (make-coding-system ;; 'chinese-big5 3 ?B "BIG5 8-bit encoding for Chinese (MIME:CN-BIG5)" ;; nil ;; '((safe-charsets ascii chinese-big5-1 chinese-big5-2) ;; (mime-charset . cn-big5) ;; (charset-origin-alist (chinese-big5-1 "BIG5" encode-big5-char) ;; (chinese-big5-2 "BIG5" encode-big5-char)))) (make-coding-system 'big5 'big5 "Big5" '(mnemonic "Zh/Big5" documentation "A non-modal encoding formed by five large Taiwanese companies \(hence \"Big5\") to produce a character set and encoding for traditional Chinese writing. Big5 encodes some 13,000+ characters. ASCII is encoded as normal, and Chinese characters as two bytes, but Chinese characters do not exclusively use the high half. The first byte is in the high half standard position A1-FE, but the second byte is in either low 40-7E or high A1-FE. Thus Big5 suffers from the classic \"it might look like a slash, but it's really the second byte of a Chinese character\".")) ;; (define-coding-system-alias 'big5 'chinese-big5) ;; (define-coding-system-alias 'cn-big5 'chinese-big5) (define-coding-system-alias 'cn-big5 'big5) ;; Big5 font requires special encoding. (define-ccl-program ccl-encode-big5-font `(0 ;; In: R0:chinese-big5-1 or chinese-big5-2 ;; R1:position code 1 ;; R2:position code 2 ;; Out: R1:font code point 1 ;; R2:font code point 2 ((r2 = ((((r1 - ?\x21) * 94) + r2) - ?\x21)) (if (r0 == ,(charset-id 'chinese-big5-2)) (r2 += 6280)) (r1 = ((r2 / 157) + ?\xA1)) (r2 %= 157) (if (r2 < ?\x3F) (r2 += ?\x40) (r2 += ?\x62)))) "CCL program to encode a Big5 code to code point of Big5 font.") ;; (setq font-ccl-encoder-alist ;; (cons (cons "big5" ccl-encode-big5-font) font-ccl-encoder-alist)) (set-charset-ccl-program 'chinese-big5-1 'ccl-encode-big5-font) (set-charset-ccl-program 'chinese-big5-2 'ccl-encode-big5-font) (set-language-info-alist "Chinese-BIG5" '((charset chinese-big5-1 chinese-big5-2) (coding-system big5 iso-2022-7bit) (coding-priority big5 cn-gb-2312 iso-2022-7bit) (cygwin-locale "zh_TW") (locale "zh_TW.Big5" "zh_TW.big5" "zh_CN.big5" "zh_TW" "chinese-t" (lambda (arg) (and arg (let ((case-fold-search t)) (string-match "^zh_.*.BIG5.*" arg))))) (mswindows-locale ("CHINESE" . "CHINESE_TRADITIONAL")) (native-coding-system big5) (input-method . "chinese-py-punct-b5") (features china-util) (sample-text . "Cantonese ($(0GnM$(B,$(0N]0*Hd(B) $(0*/=((B, $(0+$)p(B") (documentation . "Supports Traditional Chinese, used in Taiwan, Hong Kong, and Singapore. Uses the Chinese Big5 character set." )) '("Chinese")) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; Chinese CNS11643 (traditional) ;;;;;;;;;;;;;;;;;;;;;;;;;;;;;; ;; (set-language-info-alist ;; "Chinese-CNS" '((charset chinese-cns11643-1 chinese-cns11643-2 ;; chinese-cns11643-3 chinese-cns11643-4 ;; chinese-cns11643-5 chinese-cns11643-6 ;; chinese-cns11643-7) ;; (coding-system iso-2022-cn) ;; (coding-priority iso-2022-cn chinese-big5 chinese-iso-8bit) ;; (features china-util) ;; (input-method . "chinese-cns-quick") ;; (documentation . "Support for Chinese CNS character sets.")) ;; '("Chinese")) ;;; chinese.el ends here