comparison lisp/unicode.el @ 4568:1d74a1d115ee

Add #'query-coding-region tests; do the work necessary to get them running. lisp/ChangeLog addition: 2008-12-28 Aidan Kehoe <kehoea@parhasard.net> * coding.el (default-query-coding-region): Declare using defun*, so we can #'return-from to it on encountering a safe-charsets value of t. Comment out a few debug messages. (query-coding-region): Correct the docstring, it deals with a region, not a string. (unencodable-char-position): Correct the implementation for non-nil COUNT, special-case a zero value for count, treat it as one. Don't rely on dynamic scope when calling the main lambda. * unicode.el (unicode-query-coding-region): Comment out some debug messages here. * mule/mule-coding.el (8-bit-fixed-query-coding-region): Comment out some debug messages here. * code-init.el (raw-text): Add a safe-charsets property to this coding system. * mule/korean.el (iso-2022-int-1): * mule/korean.el (euc-kr): * mule/korean.el (iso-2022-kr): Add safe-charsets properties for these coding systems. * mule/japanese.el (iso-2022-jp): * mule/japanese.el (jis7): * mule/japanese.el (jis8): * mule/japanese.el (shift-jis): * mule/japanese.el (iso-2022-jp-1978-irv): * mule/japanese.el (euc-jp): Add safe-charsets properties for all these coding systems. * mule/iso-with-esc.el: Add safe-charsets properties to all the coding systems in here. Comment on the downside of a safe-charsets value of t for iso-latin-1-with-esc. * mule/hebrew.el (ctext-hebrew): Add a safe-charsets property for this coding system. * mule/devanagari.el (in-is13194-devanagari): Add a safe-charsets property for this coding system. * mule/chinese.el (cn-gb-2312): * mule/chinese.el (hz-gb-2312): * mule/chinese.el (big5): Add safe-charsets properties for these coding systems. * mule/latin.el (iso-8859-14): Add an implementation for this, using #'make-8-bit-coding-system. * mule/mule-coding.el (ctext): * mule/mule-coding.el (iso-2022-8bit-ss2): * mule/mule-coding.el (iso-2022-7bit-ss2): * mule/mule-coding.el (iso-2022-jp-2): * mule/mule-coding.el (iso-2022-7bit): * mule/mule-coding.el (iso-2022-8): * mule/mule-coding.el (escape-quoted): * mule/mule-coding.el (iso-2022-lock): Add safe-charsets properties for all these coding systems. src/ChangeLog addition: 2008-12-28 Aidan Kehoe <kehoea@parhasard.net> * file-coding.c (Fmake_coding_system): Document our use of the safe-chars and safe-charsets properties, and the differences compared to GNU. (make_coding_system_1): Don't drop the safe-chars and safe-charsets properties. (Fcoding_system_property): Return the safe-chars and safe-charsets properties when asked for them. * file-coding.h (CODING_SYSTEM_SAFE_CHARSETS): * coding-system-slots.h: Make the safe-chars and safe-charsets slots available in these headers. tests/ChangeLog addition: 2008-12-28 Aidan Kehoe <kehoea@parhasard.net> * automated/query-coding-tests.el: New file, testing the functionality of #'query-coding-region and #'query-coding-string.
author Aidan Kehoe <kehoea@parhasard.net>
date Sun, 28 Dec 2008 14:46:24 +0000
parents 26aae3bacf99
children e6a7054a9c30
comparison
equal deleted inserted replaced
4567:84d618b355f5 4568:1d74a1d115ee
624 (check-argument-type #'integer-or-marker-p begin) 624 (check-argument-type #'integer-or-marker-p begin)
625 (check-argument-type #'integer-or-marker-p end) 625 (check-argument-type #'integer-or-marker-p end)
626 (let* ((skip-chars-arg unicode-query-coding-skip-chars-arg) 626 (let* ((skip-chars-arg unicode-query-coding-skip-chars-arg)
627 (ranges (make-range-table)) 627 (ranges (make-range-table))
628 (looking-at-arg (concat "[" skip-chars-arg "]")) 628 (looking-at-arg (concat "[" skip-chars-arg "]"))
629 fail-range-start fail-range-end previous-fail char-after failed 629 fail-range-start fail-range-end char-after failed
630 extent) 630 extent)
631 (save-excursion 631 (save-excursion
632 (when highlightp 632 (when highlightp
633 (map-extents #'(lambda (extent ignored-arg) 633 (map-extents #'(lambda (extent ignored-arg)
634 (when (eq 'query-coding-warning-face 634 (when (eq 'query-coding-warning-face
636 (delete-extent extent))) buffer begin end)) 636 (delete-extent extent))) buffer begin end))
637 (goto-char begin buffer) 637 (goto-char begin buffer)
638 (skip-chars-forward skip-chars-arg end buffer) 638 (skip-chars-forward skip-chars-arg end buffer)
639 (while (< (point buffer) end) 639 (while (< (point buffer) end)
640 ; (message 640 ; (message
641 ; "fail-range-start is %S, previous-fail %S, point is %S, end is %S" 641 ; "fail-range-start is %S, point is %S, end is %S"
642 ; fail-range-start previous-fail (point buffer) end) 642 ; fail-range-start (point buffer) end)
643 (setq char-after (char-after (point buffer) buffer) 643 (setq char-after (char-after (point buffer) buffer)
644 fail-range-start (point buffer)) 644 fail-range-start (point buffer))
645 (while (and 645 (while (and
646 (< (point buffer) end) 646 (< (point buffer) end)
647 (not (looking-at looking-at-arg)) 647 (not (looking-at looking-at-arg))
648 (= -1 (char-to-unicode char-after))) 648 (= -1 (char-to-unicode char-after)))
649 (forward-char 1 buffer) 649 (forward-char 1 buffer)
650 (message "what?!?")
651 (setq char-after (char-after (point buffer) buffer) 650 (setq char-after (char-after (point buffer) buffer)
652 failed t)) 651 failed t))
653 (if (= fail-range-start (point buffer)) 652 (if (= fail-range-start (point buffer))
654 ;; The character can actually be encoded by the coding 653 ;; The character can actually be encoded by the coding
655 ;; system; check the characters past it. 654 ;; system; check the characters past it.