view lisp/mule/thai-xtis.el @ 4568:1d74a1d115ee
Add #'query-coding-region tests; do the work necessary to get them running.
lisp/ChangeLog addition:
2008-12-28 Aidan Kehoe <kehoea@parhasard.net>
* coding.el (default-query-coding-region):
Declare using defun*, so we can #'return-from to it on
encountering a safe-charsets value of t. Comment out a few
debug messages.
(query-coding-region):
Correct the docstring, it deals with a region, not a string.
(unencodable-char-position):
Correct the implementation for non-nil COUNT, special-case a zero
value for count, treat it as one. Don't rely on dynamic scope when
calling the main lambda.
* unicode.el (unicode-query-coding-region):
Comment out some debug messages here.
* mule/mule-coding.el (8-bit-fixed-query-coding-region):
Comment out some debug messages here.
* code-init.el (raw-text):
Add a safe-charsets property to this coding system.
* mule/korean.el (iso-2022-int-1):
* mule/korean.el (euc-kr):
* mule/korean.el (iso-2022-kr):
Add safe-charsets properties for these coding systems.
* mule/japanese.el (iso-2022-jp):
* mule/japanese.el (jis7):
* mule/japanese.el (jis8):
* mule/japanese.el (shift-jis):
* mule/japanese.el (iso-2022-jp-1978-irv):
* mule/japanese.el (euc-jp):
Add safe-charsets properties for all these coding systems.
* mule/iso-with-esc.el:
Add safe-charsets properties to all the coding systems in
here. Comment on the downside of a safe-charsets value of t for
iso-latin-1-with-esc.
* mule/hebrew.el (ctext-hebrew):
Add a safe-charsets property for this coding system.
* mule/devanagari.el (in-is13194-devanagari):
Add a safe-charsets property for this coding system.
* mule/chinese.el (cn-gb-2312):
* mule/chinese.el (hz-gb-2312):
* mule/chinese.el (big5):
Add safe-charsets properties for these coding systems.
* mule/latin.el (iso-8859-14):
Add an implementation for this, using #'make-8-bit-coding-system.
* mule/mule-coding.el (ctext):
* mule/mule-coding.el (iso-2022-8bit-ss2):
* mule/mule-coding.el (iso-2022-7bit-ss2):
* mule/mule-coding.el (iso-2022-jp-2):
* mule/mule-coding.el (iso-2022-7bit):
* mule/mule-coding.el (iso-2022-8):
* mule/mule-coding.el (escape-quoted):
* mule/mule-coding.el (iso-2022-lock):
Add safe-charsets properties for all these coding systems.
src/ChangeLog addition:
2008-12-28 Aidan Kehoe <kehoea@parhasard.net>
* file-coding.c (Fmake_coding_system):
Document our use of the safe-chars and safe-charsets properties,
and the differences compared to GNU.
(make_coding_system_1): Don't drop the safe-chars and
safe-charsets properties.
(Fcoding_system_property): Return the safe-chars and safe-charsets
properties when asked for them.
* file-coding.h (CODING_SYSTEM_SAFE_CHARSETS):
* coding-system-slots.h:
Make the safe-chars and safe-charsets slots available in these
headers.
tests/ChangeLog addition:
2008-12-28 Aidan Kehoe <kehoea@parhasard.net>
* automated/query-coding-tests.el:
New file, testing the functionality of #'query-coding-region and
#'query-coding-string.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Sun, 28 Dec 2008 14:46:24 +0000 |
parents | 98af8a976fc3 |
children | 257b468bf2ca |
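The CCL programs in thai-xtis.el pack a vowel and a tone mark into the second byte of a precomposed XTIS character: the bits above the low three carry the vowel (#x30 meaning "no vowel"), and the low three bits carry the tone. The following is a minimal sketch of that arithmetic in plain Emacs Lisp, assuming valid TIS-620 input bytes. The `xtis-sketch-` names are hypothetical and are not defined in the file, and the sketch works on the 7-bit form of the byte, whereas `ccl-encode-thai-xtis` sees it with the high bit set (hence its #xB0 rather than #x30).

```elisp
;; Hypothetical helpers for illustration only; thai-xtis.el does not
;; define them.  The two tables are copied from `ccl-encode-thai-xtis'.

(defconst xtis-sketch-vowel-table [0 209 212 213 214 215 216 217 218 238]
  "TIS-620 vowel bytes, indexed by the vowel field of the second byte.")

(defconst xtis-sketch-tone-table [0 231 232 233 234 235 236 237]
  "TIS-620 tone bytes, indexed by the tone field of the second byte.")

(defun xtis-sketch-pack (vowel tone)
  "Return the 7-bit XTIS second byte for TIS-620 bytes VOWEL and TONE.
Either argument may be 0, meaning no vowel or no tone.  No range
checking is done; VOWEL is assumed to be 0, 209, 212-218 or 238 and
TONE to be 0 or 231-237."
  (let ((vowel-bits (cond ((= vowel 0) #x30)            ; r3, no vowel
                          ((= vowel 209) #x38)          ; as in ccl-thai-xtis-vowel-d1
                          ((= vowel 238) #x78)          ; as in ccl-thai-xtis-vowel-ee
                          (t (ash (- vowel 204) 3))))   ; as in ccl-thai-xtis-vowel
        (tone-bits (if (= tone 0) 0 (- tone #xE6))))    ; as in ccl-thai-xtis-tone
    (logior vowel-bits tone-bits)))

(defun xtis-sketch-unpack (byte)
  "Split the 7-bit XTIS second byte BYTE into a list (VOWEL TONE).
A 0 in either position means that no vowel or no tone was present."
  (list (aref xtis-sketch-vowel-table (ash (- byte #x30) -3))
        (aref xtis-sketch-tone-table (logand byte 7))))
```

For example, `(xtis-sketch-pack 212 233)` gives #x43, and `(xtis-sketch-unpack #x43)` gives `(212 233)` again.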
;;; thai-xtis.el --- Support for Thai (XTIS) -*- coding: iso-2022-7bit; -*-

;; Copyright (C) 1999 Electrotechnical Laboratory, JAPAN.
;; Licensed to the Free Software Foundation.

;; Author: TAKAHASHI Naoto <ntakahas@etl.go.jp>
;;         MORIOKA Tomohiko <tomo@etl.go.jp>
;; Created: 1998-03-27 for Emacs-20.3 by TAKAHASHI Naoto
;;          1999-03-29 imported and modified for XEmacs by MORIOKA Tomohiko

;; Keywords: mule, multilingual, Thai, XTIS

;; This file is part of XEmacs.

;; XEmacs is free software; you can redistribute it and/or modify it
;; under the terms of the GNU General Public License as published by
;; the Free Software Foundation; either version 2, or (at your option)
;; any later version.

;; XEmacs is distributed in the hope that it will be useful, but
;; WITHOUT ANY WARRANTY; without even the implied warranty of
;; MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the GNU
;; General Public License for more details.

;; You should have received a copy of the GNU General Public License
;; along with XEmacs; see the file COPYING.  If not, write to the Free
;; Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA
;; 02111-1307, USA.

;;; Commentary:

;; For Thai, the pre-composed character set proposed by
;; Virach Sornlertlamvanich <virach@links.nectec.or.th> is supported.

;;; Code:

(make-charset 'thai-xtis "Precomposed Thai (XTIS by Virach)."
              '(registries ["xtis-0"]
                dimension 2
                columns 1
                chars 94
                final ??
                graphic 0))

(define-category ?x "Precomposed Thai character.")
(modify-category-entry 'thai-xtis ?x)

(when (featurep 'xemacs)
  (let ((deflist '(;; chars syntax
                   ("$(?!0(B-$(?NxP0R0S0`0(B-$(?e0(B" "w")
                   ("$(?p0(B-$(?y0(B" "w")
                   ("$(?O0f0_0o0z0{0(B" "_")
                   ))
        elm chars len syntax to ch i)
    (while deflist
      (setq elm (car deflist))
      (setq chars (car elm)
            len (length chars)
            syntax (nth 1 elm)
            i 0)
      (while (< i len)
        (if (= (aref chars i) ?-)
            (setq i (1+ i)
                  to (nth 1 (split-char (aref chars i))))
          (setq ch (nth 1 (split-char (aref chars i)))
                to ch))
        (while (<= ch to)
          (modify-syntax-entry (vector 'thai-xtis ch) syntax)
          (setq ch (1+ ch)))
        (setq i (1+ i)))
      (setq deflist (cdr deflist))))

  (put-charset-property 'thai-xtis 'preferred-coding-system 'tis-620)
  )

;; This is the ccl-decode-thai-xtis automaton.
;;
;; "WRITE x y" == (insert (make-char 'thai-xtis x y))
;; "write x"   == (insert x)
;; rx'         == (tis620-to-thai-xtis-second-byte-bitpattern rx)
;; r3          == "no vowel nor tone"
;; r4          == (charset-id 'thai-xtis)
;;
;;          |               input (= r0)
;;  state   |--------------------------------------------
;;          |  consonant  |    vowel    |    tone
;; ---------+-------------+-------------+----------------
;;  r1 == 0 | r1 = r0     | WRITE r0,r3 | WRITE r0,r3
;;  r2 == 0 |             |             |
;; ---------+-------------+-------------+----------------
;;  r1 == C | WRITE r1,r3 | r2 = r0'    | WRITE r1,r3|r0'
;;  r2 == 0 | r1 = r0     |             | r1 = 0
;; ---------+-------------+-------------+----------------
;;  r1 == C | WRITE r1,r2 | WRITE r1,r2 | WRITE r1,r2|r0'
;;  r2 == V | r1 = r0     | WRITE r0,r3 | r1 = r2 = 0
;;          | r2 = 0      | r1 = r2 = 0 |
;;
;;
;;          |               input (= r0)
;;  state   |-----------------------------------------
;;          |   symbol    |    ASCII    |    EOF
;; ---------+-------------+-------------+-------------
;;  r1 == 0 | WRITE r0,r3 | write r0    |
;;  r2 == 0 |             |             |
;; ---------+-------------+-------------+-------------
;;  r1 == C | WRITE r1,r3 | WRITE r1,r3 | WRITE r1,r3
;;  r2 == 0 | WRITE r0,r3 | write r0    |
;;          | r1 = 0      | r1 = 0      |
;; ---------+-------------+-------------+-------------
;;  r1 == C | WRITE r1,r2 | WRITE r1,r2 | WRITE r1,r2
;;  r2 == V | WRITE r0,r3 | write r0    |
;;          | r1 = r2 = 0 | r1 = r2 = 0 |

(eval-and-compile

;; input  : r5 = 1st byte, r6 = 2nd byte
;; Their values will be destroyed.
(define-ccl-program ccl-thai-xtis-write
  '(0
    ((r5 = ((r5 & #x7F) << 7))
     (r6 = ((r6 & #x7F) | r5))
     (write-multibyte-character r4 r6))))

(define-ccl-program ccl-thai-xtis-consonant
  '(0
    (if (r1 == 0)
        (r1 = r0)
      (if (r2 == 0)
          ((r5 = r1) (r6 = r3) (call ccl-thai-xtis-write)
           (r1 = r0))
        ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
         (r1 = r0)
         (r2 = 0))))))

(define-ccl-program ccl-thai-xtis-vowel
  '(0
    ((if (r1 == 0)
         ((r5 = r0) (r6 = r3) (call ccl-thai-xtis-write))
       ((if (r2 == 0)
            (r2 = ((r0 - 204) << 3))
          ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
           (r5 = r0) (r6 = r3) (call ccl-thai-xtis-write)
           (r1 = 0)
           (r2 = 0))))))))

(define-ccl-program ccl-thai-xtis-vowel-d1
  '(0
    ((if (r1 == 0)
         ((r5 = r0) (r6 = r3) (call ccl-thai-xtis-write))
       ((if (r2 == 0)
            (r2 = #x38)
          ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
           (r5 = r0) (r6 = r3) (call ccl-thai-xtis-write)
           (r1 = 0)
           (r2 = 0))))))))

(define-ccl-program ccl-thai-xtis-vowel-ee
  '(0
    ((if (r1 == 0)
         ((r5 = r0) (r6 = r3) (call ccl-thai-xtis-write))
       ((if (r2 == 0)
            (r2 = #x78)
          ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
           (r5 = r0) (r6 = r3) (call ccl-thai-xtis-write)
           (r1 = 0)
           (r2 = 0))))))))

(define-ccl-program ccl-thai-xtis-tone
  '(0
    (if (r1 == 0)
        ((r5 = r0) (r6 = r3) (call ccl-thai-xtis-write))
      (if (r2 == 0)
          ((r5 = r1) (r6 = ((r0 - #xE6) | r3)) (call ccl-thai-xtis-write)
           (r1 = 0))
        ((r5 = r1) (r6 = ((r0 - #xE6) | r2)) (call ccl-thai-xtis-write)
         (r1 = 0)
         (r2 = 0))))))

(define-ccl-program ccl-thai-xtis-symbol
  '(0
    (if (r1 == 0)
        ((r5 = r0) (r6 = r3) (call ccl-thai-xtis-write))
      (if (r2 == 0)
          ((r5 = r1) (r6 = r3) (call ccl-thai-xtis-write)
           (r5 = r0) (r6 = r3) (call ccl-thai-xtis-write)
           (r1 = 0))
        ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
         (r5 = r0) (r6 = r3) (call ccl-thai-xtis-write)
         (r1 = 0)
         (r2 = 0))))))

(define-ccl-program ccl-thai-xtis-ascii
  '(0
    (if (r1 == 0)
        (write r0)
      (if (r2 == 0)
          ((r5 = r1) (r6 = r3) (call ccl-thai-xtis-write)
           (write r0)
           (r1 = 0))
        ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write)
         (write r0)
         (r1 = 0)
         (r2 = 0))))))

(define-ccl-program ccl-thai-xtis-eof
  '(0
    (if (r1 != 0)
        (if (r2 == 0)
            ((r5 = r1) (r6 = r3) (call ccl-thai-xtis-write))
          ((r5 = r1) (r6 = r2) (call ccl-thai-xtis-write))))))

(define-ccl-program ccl-decode-thai-xtis
  `(4
    ((read r0)
     (r1 = 0)
     (r2 = 0)
     (r3 = #x30)
     (r4 = ,(charset-id 'thai-xtis))
     (loop
      (if (r0 < 161)
          (call ccl-thai-xtis-ascii)
        (branch (r0 - 161)
                ;; 161-195
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                ;; 196-198
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-symbol)
                ;; 199-206
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                (call ccl-thai-xtis-consonant)
                ;; 207-211
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-vowel-d1)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                ;; 212-218
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                (call ccl-thai-xtis-vowel)
                ;; 219-222
                nil
                nil
                nil
                nil
                ;; 223-230
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                ;; 231-237
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                (call ccl-thai-xtis-tone)
                ;; 238
                (call ccl-thai-xtis-vowel-ee)
                ;; 239-251
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                (call ccl-thai-xtis-symbol)
                ;; 252-254
                nil
                nil
                nil))
      (read r0)
      (repeat)))

    (call ccl-thai-xtis-eof)))

)

(defconst leading-code-private-21 #x9F)

(define-ccl-program ccl-encode-thai-xtis
  `(1
    ((read r0)
     (loop
      (if (r0 == ,leading-code-private-21)
          ((read r1)
           (if (r1 == ,(charset-id 'thai-xtis))
               ((read r0)
                (write r0)
                (read r0)
                (r1 = (r0 & 7))
                (r0 = ((r0 - #xB0) >> 3))
                (if (r0 != 0)
                    (write r0 [0 209 212 213 214 215 216 217 218 238]))
                (if (r1 != 0)
                    (write r1 [0 231 232 233 234 235 236 237]))
                (read r0)
                (repeat))
             ((write r0 r1)
              (read r0)
              (repeat))))
        (write-read-repeat r0))))))

(if (featurep 'xemacs)
    (progn
      (make-coding-system
       'tis-620 'ccl
       "TIS620 (Thai)"
       `(mnemonic "TIS620"
         decode ccl-decode-thai-xtis
         encode ccl-encode-thai-xtis
         safe-charsets (ascii thai-xtis)
         documentation "external=tis620, internal=thai-xtis"))
      (coding-system-put 'tis-620 'category 'iso-8-1))
  (make-coding-system
   'tis-620 4 ?T "external=tis620, internal=thai-xtis"
   '(ccl-decode-thai-xtis . ccl-encode-thai-xtis)
   '((safe-charsets . t)))
  )

(set-language-info-alist
 "Thai-XTIS" '((charset thai-xtis)
               (coding-system tis-620 iso-2022-7bit)
               (tutorial . "TUTORIAL.th")
               (tutorial-coding-system . tis-620)
               (coding-priority tis-620 iso-2022-7bit)
               (sample-text . "$(?!:(B")
               (documentation . t)))

;; thai-xtis.el ends here.