Mercurial > hg > xemacs-beta
view etc/unicode/mule-ucs/README @ 5867:e0f1dfaa821e
Add the non-ASCII digit support, now #'parse-integer can handle it.
lisp/ChangeLog addition:
2015-03-08 Aidan Kehoe <kehoea@parhasard.net>
* mule/digit.el: New file, specifying integer weights for those
decimal digits specified by Unicode, and for the Latin characters
in ASCII.
* dumped-lisp.el (preloaded-file-list):
Dump this file, so those weights are available at runtime.
tests/ChangeLog addition:
2015-03-16 Aidan Kehoe <kehoea@parhasard.net>
* automated/lisp-tests.el:
Extend the tests for #'parse-integer, #'digit-char, #'digit-char-p
substantially, testing the Unicode decimal digits in detail.
src/ChangeLog addition:
2015-03-08 Aidan Kehoe <kehoea@parhasard.net>
* data.c (Fparse_integer):
Document a complication that arises when a leading minus sign has
a digit weight but is nonetheles interpreted as a minus.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Mon, 16 Mar 2015 00:28:18 +0000 |
parents | a29c4eef8f00 |
children |
line wrap: on
line source
The files in this directory were converted from data in the mule-ucs package (mule-ucs/lisp/reldata/*), using this code: (defun convert-mule-ucs-file (start end) (interactive "r") (with-output-to-temp-buffer "*mule-ucs-convert*" (save-excursion (goto-char start) (while (re-search-forward "(\\?\\(.\\) \\. \"\\(.*\\)\") ;+ \\(.*\\)$" end t) (let ((ch (string-to-char (match-string 1))) (codepoint (match-string 2)) (name (match-string 3))) (if (= 1 (charset-dimension (char-charset ch))) (princ (format "0x%x %s # %s\n" (char-octet ch) codepoint name)) (princ (format "0x%x%x %s # %s\n" (char-octet ch 0) (char-octet ch 1) codepoint name)))))))) Each file is named after the XEmacs charset it represents. The CNS files contain more codepoints than those in unicode-consortium/ because they list codepoints above 0xFFFF, those handled by surrogates (supported starting in Windows 2000, I think, but not yet by XEmacs).