Mercurial > hg > xemacs-beta

diff lisp/mule/misc-lang.el @ 5662:bed39edf91ba
Be better about word boundaries with JIT unicode characters, mule-category.el lisp/ChangeLog addition: 2012-05-10 Aidan Kehoe <kehoea@parhasard.net> * mule/mule-category.el (word-combining-categories): Be better about default word boundaries when text contains just-in-time-allocated Unicode code points. Document what we should do instead once we have Unicode internally. * mule/misc-lang.el: IPA characters are Latin.
author: Aidan Kehoe <kehoea@parhasard.net>
date: Thu, 10 May 2012 13:53:06 +0100
parents: 308d34e9f07d
--- a/lisp/mule/misc-lang.el	Tue May 08 09:47:41 2012 +0100
+++ b/lisp/mule/misc-lang.el	Thu May 10 13:53:06 2012 +0100
@@ -41,4 +41,26 @@
 		short-name "IPA"
 		long-name "IPA"))
 
+;; XEmacs; these are Latin, it's not useful to put word boundaries between
+;; them and ASCII.
+(modify-category-entry 'ipa ?l nil t)
+
+;; XEmacs; why are these Latin? See the following:
+;;
+;; (let ((scripts
+;;        (mapcar #'(lambda (character)
+;;                    (car
+;;                     (split-string
+;;                      (cadr (assoc "Name" (describe-char-unicode-data
+;;                                           character))))))
+;;                (loop
+;;                  for i from 33 to 127
+;;                  if (not (eql -1 (char-to-unicode (make-char 'ipa i))))
+;;                  nconc (list (make-char 'ipa i))))))
+;;   (mapcar #'(lambda (script)
+;;               (cons script (count script scripts :test #'equal)))
+;;           (remove-duplicates scripts :test #'equal)))
+;; => (("GREEK" . 1) ("LATIN" . 55) ("MODIFIER" . 3))
+
+
 ;;; misc-lang.el ends here
author	Aidan Kehoe <kehoea@parhasard.net>
date	Thu, 10 May 2012 13:53:06 +0100
parents	308d34e9f07d
children