comparison lisp/unicode.el @ 985:7f62a956b825

[xemacs-hg @ 2002-09-01 06:41:40 by youngs]

2002-08-31  Seiichi Ikiuo  <ikiuo@polyphony.co.jp>

  * code-init.el (reset-coding-categories-to-default): Add new coding category 'utf-8-bom'.
  * coding.el (coding-system-category): Add check for 'utf-8-bom'.
  * unicode.el: Add new coding system 'utf-8-bom'.

2002-08-31  Seiichi Ikiuo  <ikiuo@polyphony.co.jp>

  * file-coding.h (detection_result): fix DET_LOWEST.
  * mule-coding.c (big5_detector): Add member 'seen_euc_char'.
  (big5_detect): It was made not to become the trouble of EUC.
  (iso2022_detector): Add member 'longest_even_high_byte'.
  (iso2022_detect): Fix checking for even/odd_high_byte_group. The judgment of EUC is changed.
  * unicode.c: Add DETECTOR_CATEGORY utf_8_bom.
  (utf_8_detector): Add member byteno, first_byte and second_byte.
  (utf_8_detect): The judgment of UTF-8-BOM is added.
author youngs
date Sun, 01 Sep 2002 06:41:45 +0000
parents e54d47b2d736
children b531bf8658e9
comparison
984:c55a519aa13f 985:7f62a956b825
 -- Given only the leading byte, you know how many following bytes
    are present.
 "
    type utf-8))
 
+(make-coding-system
+ 'utf-8-bom 'unicode
+ "UTF-8 w/BOM"
+ '(mnemonic "MSW-UTF8"
+   documentation
+   "UTF-8 Unicode encoding, with byte order mark.
+Standard encoding for representing UTF-8 under MS Windows."
+   type utf-8
+   little-endian t
+   need-bom t))
+
 ;; #### UTF-7 is not yet implemented, and it's tricky to do. There's
 ;; an implementation in appendix A.1 of the Unicode Standard, Version
 ;; 2.0, but I don't know its licensing characteristics.
 
 ; (make-coding-system