annotate src/file-coding.c @ 665:fdefd0186b75

[xemacs-hg @ 2001-09-20 06:28:42 by ben] The great integral types renaming. The purpose of this is to rationalize the names used for various integral types, so that they match their intended uses and follow consist conventions, and eliminate types that were not semantically different from each other. The conventions are: -- All integral types that measure quantities of anything are signed. Some people disagree vociferously with this, but their arguments are mostly theoretical, and are vastly outweighed by the practical headaches of mixing signed and unsigned values, and more importantly by the far increased likelihood of inadvertent bugs: Because of the broken "viral" nature of unsigned quantities in C (operations involving mixed signed/unsigned are done unsigned, when exactly the opposite is nearly always wanted), even a single error in declaring a quantity unsigned that should be signed, or even the even more subtle error of comparing signed and unsigned values and forgetting the necessary cast, can be catastrophic, as comparisons will yield wrong results. -Wsign-compare is turned on specifically to catch this, but this tends to result in a great number of warnings when mixing signed and unsigned, and the casts are annoying. More has been written on this elsewhere. -- All such quantity types just mentioned boil down to EMACS_INT, which is 32 bits on 32-bit machines and 64 bits on 64-bit machines. This is guaranteed to be the same size as Lisp objects of type `int', and (as far as I can tell) of size_t (unsigned!) and ssize_t. The only type below that is not an EMACS_INT is Hashcode, which is an unsigned value of the same size as EMACS_INT. -- Type names should be relatively short (no more than 10 characters or so), with the first letter capitalized and no underscores if they can at all be avoided. -- "count" == a zero-based measurement of some quantity. Includes sizes, offsets, and indexes. -- "bpos" == a one-based measurement of a position in a buffer. "Charbpos" and "Bytebpos" count text in the buffer, rather than bytes in memory; thus Bytebpos does not directly correspond to the memory representation. Use "Membpos" for this. -- "Char" refers to internal-format characters, not to the C type "char", which is really a byte. -- For the actual name changes, see the script below. I ran the following script to do the conversion. (NOTE: This script is idempotent. You can safely run it multiple times and it will not screw up previous results -- in fact, it will do nothing if nothing has changed. Thus, it can be run repeatedly as necessary to handle patches coming in from old workspaces, or old branches.) There are two tags, just before and just after the change: `pre-integral-type-rename' and `post-integral-type-rename'. When merging code from the main trunk into a branch, the best thing to do is first merge up to `pre-integral-type-rename', then apply the script and associated changes, then merge from `post-integral-type-change' to the present. (Alternatively, just do the merging in one operation; but you may then have a lot of conflicts needing to be resolved by hand.) Script `fixtypes.sh' follows: ----------------------------------- cut ------------------------------------ files="*.[ch] s/*.h m/*.h config.h.in ../configure.in Makefile.in.in ../lib-src/*.[ch] ../lwlib/*.[ch]" gr Memory_Count Bytecount $files gr Lstream_Data_Count Bytecount $files gr Element_Count Elemcount $files gr Hash_Code Hashcode $files gr extcount bytecount $files gr bufpos charbpos $files gr bytind bytebpos $files gr memind membpos $files gr bufbyte intbyte $files gr Extcount Bytecount $files gr Bufpos Charbpos $files gr Bytind Bytebpos $files gr Memind Membpos $files gr Bufbyte Intbyte $files gr EXTCOUNT BYTECOUNT $files gr BUFPOS CHARBPOS $files gr BYTIND BYTEBPOS $files gr MEMIND MEMBPOS $files gr BUFBYTE INTBYTE $files gr MEMORY_COUNT BYTECOUNT $files gr LSTREAM_DATA_COUNT BYTECOUNT $files gr ELEMENT_COUNT ELEMCOUNT $files gr HASH_CODE HASHCODE $files ----------------------------------- cut ------------------------------------ `fixtypes.sh' is a Bourne-shell script; it uses 'gr': ----------------------------------- cut ------------------------------------ #!/bin/sh # Usage is like this: # gr FROM TO FILES ... # globally replace FROM with TO in FILES. FROM and TO are regular expressions. # backup files are stored in the `backup' directory. from="$1" to="$2" shift 2 echo ${1+"$@"} | xargs global-replace "s/$from/$to/g" ----------------------------------- cut ------------------------------------ `gr' in turn uses a Perl script to do its real work, `global-replace', which follows: ----------------------------------- cut ------------------------------------ : #-*- Perl -*- ### global-modify --- modify the contents of a file by a Perl expression ## Copyright (C) 1999 Martin Buchholz. ## Copyright (C) 2001 Ben Wing. ## Authors: Martin Buchholz <martin@xemacs.org>, Ben Wing <ben@xemacs.org> ## Maintainer: Ben Wing <ben@xemacs.org> ## Current Version: 1.0, May 5, 2001 # This program is free software; you can redistribute it and/or modify # it under the terms of the GNU General Public License as published by # the Free Software Foundation; either version 2, or (at your option) # any later version. # # This program is distributed in the hope that it will be useful, but # WITHOUT ANY WARRANTY; without even the implied warranty of # MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU # General Public License for more details. # # You should have received a copy of the GNU General Public License # along with XEmacs; see the file COPYING. If not, write to the Free # Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA # 02111-1307, USA. eval 'exec perl -w -S $0 ${1+"$@"}' if 0; use strict; use FileHandle; use Carp; use Getopt::Long; use File::Basename; (my $myName = $0) =~ s@.*/@@; my $usage=" Usage: $myName [--help] [--backup-dir=DIR] [--line-mode] [--hunk-mode] PERLEXPR FILE ... Globally modify a file, either line by line or in one big hunk. Typical usage is like this: [with GNU print, GNU xargs: guaranteed to handle spaces, quotes, etc. in file names] find . -name '*.[ch]' -print0 | xargs -0 $0 's/\bCONST\b/const/g'\n [with non-GNU print, xargs] find . -name '*.[ch]' -print | xargs $0 's/\bCONST\b/const/g'\n The file is read in, either line by line (with --line-mode specified) or in one big hunk (with --hunk-mode specified; it's the default), and the Perl expression is then evalled with \$_ set to the line or hunk of text, including the terminating newline if there is one. It should destructively modify the value there, storing the changed result in \$_. Files in which any modifications are made are backed up to the directory specified using --backup-dir, or to `backup' by default. To disable this, use --backup-dir= with no argument. Hunk mode is the default because it is MUCH MUCH faster than line-by-line. Use line-by-line only when it matters, e.g. you want to do a replacement only once per line (the default without the `g' argument). Conversely, when using hunk mode, *ALWAYS* use `g'; otherwise, you will only make one replacement in the entire file! "; my %options = (); $Getopt::Long::ignorecase = 0; &GetOptions ( \%options, 'help', 'backup-dir=s', 'line-mode', 'hunk-mode', ); die $usage if $options{"help"} or @ARGV <= 1; my $code = shift; die $usage if grep (-d || ! -w, @ARGV); sub SafeOpen { open ((my $fh = new FileHandle), $_[0]); confess "Can't open $_[0]: $!" if ! defined $fh; return $fh; } sub SafeClose { close $_[0] or confess "Can't close $_[0]: $!"; } sub FileContents { my $fh = SafeOpen ("< $_[0]"); my $olddollarslash = $/; local $/ = undef; my $contents = <$fh>; $/ = $olddollarslash; return $contents; } sub WriteStringToFile { my $fh = SafeOpen ("> $_[0]"); binmode $fh; print $fh $_[1] or confess "$_[0]: $!\n"; SafeClose $fh; } foreach my $file (@ARGV) { my $changed_p = 0; my $new_contents = ""; if ($options{"line-mode"}) { my $fh = SafeOpen $file; while (<$fh>) { my $save_line = $_; eval $code; $changed_p = 1 if $save_line ne $_; $new_contents .= $_; } } else { my $orig_contents = $_ = FileContents $file; eval $code; if ($_ ne $orig_contents) { $changed_p = 1; $new_contents = $_; } } if ($changed_p) { my $backdir = $options{"backup-dir"}; $backdir = "backup" if !defined ($backdir); if ($backdir) { my ($name, $path, $suffix) = fileparse ($file, ""); my $backfulldir = $path . $backdir; my $backfile = "$backfulldir/$name"; mkdir $backfulldir, 0755 unless -d $backfulldir; print "modifying $file (original saved in $backfile)\n"; rename $file, $backfile; } WriteStringToFile ($file, $new_contents); } } ----------------------------------- cut ------------------------------------ In addition to those programs, I needed to fix up a few other things, particularly relating to the duplicate definitions of types, now that some types merged with others. Specifically: 1. in lisp.h, removed duplicate declarations of Bytecount. The changed code should now look like this: (In each code snippet below, the first and last lines are the same as the original, as are all lines outside of those lines. That allows you to locate the section to be replaced, and replace the stuff in that section, verifying that there isn't anything new added that would need to be kept.) --------------------------------- snip ------------------------------------- /* Counts of bytes or chars */ typedef EMACS_INT Bytecount; typedef EMACS_INT Charcount; /* Counts of elements */ typedef EMACS_INT Elemcount; /* Hash codes */ typedef unsigned long Hashcode; /* ------------------------ dynamic arrays ------------------- */ --------------------------------- snip ------------------------------------- 2. in lstream.h, removed duplicate declaration of Bytecount. Rewrote the comment about this type. The changed code should now look like this: --------------------------------- snip ------------------------------------- #endif /* The have been some arguments over the what the type should be that specifies a count of bytes in a data block to be written out or read in, using Lstream_read(), Lstream_write(), and related functions. Originally it was long, which worked fine; Martin "corrected" these to size_t and ssize_t on the grounds that this is theoretically cleaner and is in keeping with the C standards. Unfortunately, this practice is horribly error-prone due to design flaws in the way that mixed signed/unsigned arithmetic happens. In fact, by doing this change, Martin introduced a subtle but fatal error that caused the operation of sending large mail messages to the SMTP server under Windows to fail. By putting all values back to be signed, avoiding any signed/unsigned mixing, the bug immediately went away. The type then in use was Lstream_Data_Count, so that it be reverted cleanly if a vote came to that. Now it is Bytecount. Some earlier comments about why the type must be signed: This MUST BE SIGNED, since it also is used in functions that return the number of bytes actually read to or written from in an operation, and these functions can return -1 to signal error. Note that the standard Unix read() and write() functions define the count going in as a size_t, which is UNSIGNED, and the count going out as an ssize_t, which is SIGNED. This is a horrible design flaw. Not only is it highly likely to lead to logic errors when a -1 gets interpreted as a large positive number, but operations are bound to fail in all sorts of horrible ways when a number in the upper-half of the size_t range is passed in -- this number is unrepresentable as an ssize_t, so code that checks to see how many bytes are actually written (which is mandatory if you are dealing with certain types of devices) will get completely screwed up. --ben */ typedef enum lstream_buffering --------------------------------- snip ------------------------------------- 3. in dumper.c, there are four places, all inside of switch() statements, where XD_BYTECOUNT appears twice as a case tag. In each case, the two case blocks contain identical code, and you should *REMOVE THE SECOND* and leave the first.
author ben
date Thu, 20 Sep 2001 06:31:11 +0000
parents b39c14581166
children 943eaba38521
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1 /* Code conversion functions.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2 Copyright (C) 1991, 1995 Free Software Foundation, Inc.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3 Copyright (C) 1995 Sun Microsystems, Inc.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5 This file is part of XEmacs.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
6
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
7 XEmacs is free software; you can redistribute it and/or modify it
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
8 under the terms of the GNU General Public License as published by the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
9 Free Software Foundation; either version 2, or (at your option) any
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
10 later version.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
11
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
12 XEmacs is distributed in the hope that it will be useful, but WITHOUT
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
13 ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
14 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
15 for more details.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
16
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
17 You should have received a copy of the GNU General Public License
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
18 along with XEmacs; see the file COPYING. If not, write to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
19 the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
20 Boston, MA 02111-1307, USA. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
21
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
22 /* Synched up with: Mule 2.3. Not in FSF. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
23
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
24 /* Rewritten by Ben Wing <ben@xemacs.org>. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
25
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
26 #include <config.h>
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
27 #include "lisp.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
28
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
29 #include "buffer.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
30 #include "elhash.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
31 #include "insdel.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
32 #include "lstream.h"
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
33 #include "opaque.h"
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
34 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
35 #include "mule-ccl.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
36 #include "chartab.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
37 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
38 #include "file-coding.h"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
39
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
40 Lisp_Object Vkeyboard_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
41 Lisp_Object Vterminal_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
42 Lisp_Object Vcoding_system_for_read;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
43 Lisp_Object Vcoding_system_for_write;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
44 Lisp_Object Vfile_name_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
45
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
46 /* Table of symbols identifying each coding category. */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
47 Lisp_Object coding_category_symbol[CODING_CATEGORY_LAST];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
48
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
49
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
50
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
51 struct file_coding_dump {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
52 /* Coding system currently associated with each coding category. */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
53 Lisp_Object coding_category_system[CODING_CATEGORY_LAST];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
54
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
55 /* Table of all coding categories in decreasing order of priority.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
56 This describes a permutation of the possible coding categories. */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
57 int coding_category_by_priority[CODING_CATEGORY_LAST];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
58
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
59 #ifdef MULE
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
60 Lisp_Object ucs_to_mule_table[65536];
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
61 #endif
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
62 } *fcd;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
63
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
64 static const struct lrecord_description fcd_description_1[] = {
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
65 { XD_LISP_OBJECT_ARRAY, offsetof (struct file_coding_dump, coding_category_system), CODING_CATEGORY_LAST },
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
66 #ifdef MULE
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
67 { XD_LISP_OBJECT_ARRAY, offsetof (struct file_coding_dump, ucs_to_mule_table), countof (fcd->ucs_to_mule_table) },
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
68 #endif
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
69 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
70 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
71
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
72 static const struct struct_description fcd_description = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
73 sizeof (struct file_coding_dump),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
74 fcd_description_1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
75 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
76
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
77 Lisp_Object mule_to_ucs_table;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
78
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
79 Lisp_Object Qcoding_systemp;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
80
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
81 Lisp_Object Qraw_text, Qno_conversion, Qccl, Qiso2022;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
82 /* Qinternal in general.c */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
83
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
84 Lisp_Object Qmnemonic, Qeol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
85 Lisp_Object Qcr, Qcrlf, Qlf;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
86 Lisp_Object Qeol_cr, Qeol_crlf, Qeol_lf;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
87 Lisp_Object Qpost_read_conversion;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
88 Lisp_Object Qpre_write_conversion;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
89
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
90 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
91 Lisp_Object Qucs4, Qutf8;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
92 Lisp_Object Qbig5, Qshift_jis;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
93 Lisp_Object Qcharset_g0, Qcharset_g1, Qcharset_g2, Qcharset_g3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
94 Lisp_Object Qforce_g0_on_output, Qforce_g1_on_output;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
95 Lisp_Object Qforce_g2_on_output, Qforce_g3_on_output;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
96 Lisp_Object Qno_iso6429;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
97 Lisp_Object Qinput_charset_conversion, Qoutput_charset_conversion;
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
98 Lisp_Object Qescape_quoted;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
99 Lisp_Object Qshort, Qno_ascii_eol, Qno_ascii_cntl, Qseven, Qlock_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
100 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
101 Lisp_Object Qencode, Qdecode;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
102
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
103 Lisp_Object Vcoding_system_hash_table;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
104
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
105 int enable_multibyte_characters;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
106
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
107 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
108 /* Additional information used by the ISO2022 decoder and detector. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
109 struct iso2022_decoder
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
110 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
111 /* CHARSET holds the character sets currently assigned to the G0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
112 through G3 variables. It is initialized from the array
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
113 INITIAL_CHARSET in CODESYS. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
114 Lisp_Object charset[4];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
115
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
116 /* Which registers are currently invoked into the left (GL) and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
117 right (GR) halves of the 8-bit encoding space? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
118 int register_left, register_right;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
119
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
120 /* ISO_ESC holds a value indicating part of an escape sequence
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
121 that has already been seen. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
122 enum iso_esc_flag esc;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
123
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
124 /* This records the bytes we've seen so far in an escape sequence,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
125 in case the sequence is invalid (we spit out the bytes unchanged). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
126 unsigned char esc_bytes[8];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
127
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
128 /* Index for next byte to store in ISO escape sequence. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
129 int esc_bytes_index;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
130
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
131 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
132 /* Stuff seen so far when composing a string. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
133 unsigned_char_dynarr *composite_chars;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
134 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
135
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
136 /* If we saw an invalid designation sequence for a particular
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
137 register, we flag it here and switch to ASCII. The next time we
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
138 see a valid designation for this register, we turn off the flag
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
139 and do the designation normally, but pretend the sequence was
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
140 invalid. The effect of all this is that (most of the time) the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
141 escape sequences for both the switch to the unknown charset, and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
142 the switch back to the known charset, get inserted literally into
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
143 the buffer and saved out as such. The hope is that we can
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
144 preserve the escape sequences so that the resulting written out
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
145 file makes sense. If we don't do any of this, the designation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
146 to the invalid charset will be preserved but that switch back
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
147 to the known charset will probably get eaten because it was
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
148 the same charset that was already present in the register. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
149 unsigned char invalid_designated[4];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
150
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
151 /* We try to do similar things as above for direction-switching
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
152 sequences. If we encountered a direction switch while an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
153 invalid designation was present, or an invalid designation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
154 just after a direction switch (i.e. no valid designation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
155 encountered yet), we insert the direction-switch escape
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
156 sequence literally into the output stream, and later on
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
157 insert the corresponding direction-restoring escape sequence
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
158 literally also. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
159 unsigned int switched_dir_and_no_valid_charset_yet :1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
160 unsigned int invalid_switch_dir :1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
161
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
162 /* Tells the decoder to output the escape sequence literally
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
163 even though it was valid. Used in the games we play to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
164 avoid lossage when we encounter invalid designations. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
165 unsigned int output_literally :1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
166 /* We encountered a direction switch followed by an invalid
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
167 designation. We didn't output the direction switch
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
168 literally because we didn't know about the invalid designation;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
169 but we have to do so now. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
170 unsigned int output_direction_sequence :1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
171 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
172 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
173 EXFUN (Fcopy_coding_system, 2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
174 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
175 struct detection_state;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
176 static int detect_coding_sjis (struct detection_state *st,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
177 const Extbyte *src, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
178 static void decode_coding_sjis (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
179 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
180 static void encode_coding_sjis (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
181 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
182 static int detect_coding_big5 (struct detection_state *st,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
183 const Extbyte *src, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
184 static void decode_coding_big5 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
185 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
186 static void encode_coding_big5 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
187 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
188 static int detect_coding_ucs4 (struct detection_state *st,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
189 const Extbyte *src, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
190 static void decode_coding_ucs4 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
191 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
192 static void encode_coding_ucs4 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
193 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
194 static int detect_coding_utf8 (struct detection_state *st,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
195 const Extbyte *src, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
196 static void decode_coding_utf8 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
197 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
198 static void encode_coding_utf8 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
199 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
200 static int postprocess_iso2022_mask (int mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
201 static void reset_iso2022 (Lisp_Object coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
202 struct iso2022_decoder *iso);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
203 static int detect_coding_iso2022 (struct detection_state *st,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
204 const Extbyte *src, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
205 static void decode_coding_iso2022 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
206 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
207 static void encode_coding_iso2022 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
208 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
209 #endif /* MULE */
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
210 static void decode_coding_no_conversion (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
211 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
212 static void encode_coding_no_conversion (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
213 unsigned_char_dynarr *dst, Bytecount n);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
214 static void mule_decode (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
215 unsigned_char_dynarr *dst, Bytecount n);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
216 static void mule_encode (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
217 unsigned_char_dynarr *dst, Bytecount n);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
218
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
219 typedef struct codesys_prop codesys_prop;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
220 struct codesys_prop
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
221 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
222 Lisp_Object sym;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
223 int prop_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
224 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
225
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
226 typedef struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
227 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
228 Dynarr_declare (codesys_prop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
229 } codesys_prop_dynarr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
230
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
231 static const struct lrecord_description codesys_prop_description_1[] = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
232 { XD_LISP_OBJECT, offsetof (codesys_prop, sym) },
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
233 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
234 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
235
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
236 static const struct struct_description codesys_prop_description = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
237 sizeof (codesys_prop),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
238 codesys_prop_description_1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
239 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
240
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
241 static const struct lrecord_description codesys_prop_dynarr_description_1[] = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
242 XD_DYNARR_DESC (codesys_prop_dynarr, &codesys_prop_description),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
243 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
244 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
245
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
246 static const struct struct_description codesys_prop_dynarr_description = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
247 sizeof (codesys_prop_dynarr),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
248 codesys_prop_dynarr_description_1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
249 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
250
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
251 codesys_prop_dynarr *the_codesys_prop_dynarr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
252
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
253 enum codesys_prop_enum
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
254 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
255 CODESYS_PROP_ALL_OK,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
256 CODESYS_PROP_ISO2022,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
257 CODESYS_PROP_CCL
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
258 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
259
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
260
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
261 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
262 /* Coding system functions */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
263 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
264
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
265 static Lisp_Object mark_coding_system (Lisp_Object);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
266 static void print_coding_system (Lisp_Object, Lisp_Object, int);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
267 static void finalize_coding_system (void *header, int for_disksave);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
268
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
269 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
270 static const struct lrecord_description ccs_description_1[] = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
271 { XD_LISP_OBJECT, offsetof (charset_conversion_spec, from_charset) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
272 { XD_LISP_OBJECT, offsetof (charset_conversion_spec, to_charset) },
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
273 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
274 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
275
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
276 static const struct struct_description ccs_description = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
277 sizeof (charset_conversion_spec),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
278 ccs_description_1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
279 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
280
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
281 static const struct lrecord_description ccsd_description_1[] = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
282 XD_DYNARR_DESC (charset_conversion_spec_dynarr, &ccs_description),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
283 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
284 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
285
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
286 static const struct struct_description ccsd_description = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
287 sizeof (charset_conversion_spec_dynarr),
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
288 ccsd_description_1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
289 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
290 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
291
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
292 static const struct lrecord_description coding_system_description[] = {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
293 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, name) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
294 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, doc_string) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
295 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, mnemonic) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
296 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, post_read_conversion) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
297 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, pre_write_conversion) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
298 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, eol_lf) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
299 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, eol_crlf) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
300 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, eol_cr) },
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
301 #ifdef MULE
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
302 { XD_LISP_OBJECT_ARRAY, offsetof (Lisp_Coding_System, iso2022.initial_charset), 4 },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
303 { XD_STRUCT_PTR, offsetof (Lisp_Coding_System, iso2022.input_conv), 1, &ccsd_description },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
304 { XD_STRUCT_PTR, offsetof (Lisp_Coding_System, iso2022.output_conv), 1, &ccsd_description },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
305 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, ccl.decode) },
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
306 { XD_LISP_OBJECT, offsetof (Lisp_Coding_System, ccl.encode) },
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
307 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
308 { XD_END }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
309 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
310
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
311 DEFINE_LRECORD_IMPLEMENTATION ("coding-system", coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
312 mark_coding_system, print_coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
313 finalize_coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
314 0, 0, coding_system_description,
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
315 Lisp_Coding_System);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
316
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
317 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
318 mark_coding_system (Lisp_Object obj)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
319 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
320 Lisp_Coding_System *codesys = XCODING_SYSTEM (obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
321
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
322 mark_object (CODING_SYSTEM_NAME (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
323 mark_object (CODING_SYSTEM_DOC_STRING (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
324 mark_object (CODING_SYSTEM_MNEMONIC (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
325 mark_object (CODING_SYSTEM_EOL_LF (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
326 mark_object (CODING_SYSTEM_EOL_CRLF (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
327 mark_object (CODING_SYSTEM_EOL_CR (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
328
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
329 switch (CODING_SYSTEM_TYPE (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
330 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
331 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
332 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
333 case CODESYS_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
334 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
335 mark_object (CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, i));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
336 if (codesys->iso2022.input_conv)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
337 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
338 for (i = 0; i < Dynarr_length (codesys->iso2022.input_conv); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
339 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
340 struct charset_conversion_spec *ccs =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
341 Dynarr_atp (codesys->iso2022.input_conv, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
342 mark_object (ccs->from_charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
343 mark_object (ccs->to_charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
344 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
345 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
346 if (codesys->iso2022.output_conv)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
347 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
348 for (i = 0; i < Dynarr_length (codesys->iso2022.output_conv); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
349 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
350 struct charset_conversion_spec *ccs =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
351 Dynarr_atp (codesys->iso2022.output_conv, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
352 mark_object (ccs->from_charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
353 mark_object (ccs->to_charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
354 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
355 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
356 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
357
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
358 case CODESYS_CCL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
359 mark_object (CODING_SYSTEM_CCL_DECODE (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
360 mark_object (CODING_SYSTEM_CCL_ENCODE (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
361 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
362 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
363 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
364 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
365 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
366
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
367 mark_object (CODING_SYSTEM_PRE_WRITE_CONVERSION (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
368 return CODING_SYSTEM_POST_READ_CONVERSION (codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
369 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
370
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
371 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
372 print_coding_system (Lisp_Object obj, Lisp_Object printcharfun,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
373 int escapeflag)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
374 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
375 Lisp_Coding_System *c = XCODING_SYSTEM (obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
376 if (print_readably)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
377 printing_unreadable_object ("#<coding_system 0x%x>", c->header.uid);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
378
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
379 write_c_string ("#<coding_system ", printcharfun);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
380 print_internal (c->name, printcharfun, 1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
381 write_c_string (">", printcharfun);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
382 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
383
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
384 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
385 finalize_coding_system (void *header, int for_disksave)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
386 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
387 Lisp_Coding_System *c = (Lisp_Coding_System *) header;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
388 /* Since coding systems never go away, this function is not
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
389 necessary. But it would be necessary if we changed things
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
390 so that coding systems could go away. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
391 if (!for_disksave) /* see comment in lstream.c */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
392 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
393 switch (CODING_SYSTEM_TYPE (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
394 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
395 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
396 case CODESYS_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
397 if (c->iso2022.input_conv)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
398 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
399 Dynarr_free (c->iso2022.input_conv);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
400 c->iso2022.input_conv = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
401 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
402 if (c->iso2022.output_conv)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
403 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
404 Dynarr_free (c->iso2022.output_conv);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
405 c->iso2022.output_conv = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
406 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
407 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
408 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
409 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
410 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
411 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
412 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
413 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
414
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
415 static eol_type_t
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
416 symbol_to_eol_type (Lisp_Object symbol)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
417 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
418 CHECK_SYMBOL (symbol);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
419 if (NILP (symbol)) return EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
420 if (EQ (symbol, Qlf)) return EOL_LF;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
421 if (EQ (symbol, Qcrlf)) return EOL_CRLF;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
422 if (EQ (symbol, Qcr)) return EOL_CR;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
423
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
424 invalid_constant ("Unrecognized eol type", symbol);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
425 return EOL_AUTODETECT; /* not reached */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
426 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
427
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
428 static Lisp_Object
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
429 eol_type_to_symbol (eol_type_t type)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
430 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
431 switch (type)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
432 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
433 default: abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
434 case EOL_LF: return Qlf;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
435 case EOL_CRLF: return Qcrlf;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
436 case EOL_CR: return Qcr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
437 case EOL_AUTODETECT: return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
438 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
439 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
440
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
441 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
442 setup_eol_coding_systems (Lisp_Coding_System *codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
443 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
444 Lisp_Object codesys_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
445 int len = string_length (XSYMBOL (CODING_SYSTEM_NAME (codesys))->name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
446 char *codesys_name = (char *) alloca (len + 7);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
447 int mlen = -1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
448 char *codesys_mnemonic=0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
449
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
450 Lisp_Object codesys_name_sym, sub_codesys_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
451
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
452 /* kludge */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
453
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
454 XSETCODING_SYSTEM (codesys_obj, codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
455
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
456 memcpy (codesys_name,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
457 string_data (XSYMBOL (CODING_SYSTEM_NAME (codesys))->name), len);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
458
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
459 if (STRINGP (CODING_SYSTEM_MNEMONIC (codesys)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
460 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
461 mlen = XSTRING_LENGTH (CODING_SYSTEM_MNEMONIC (codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
462 codesys_mnemonic = (char *) alloca (mlen + 7);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
463 memcpy (codesys_mnemonic,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
464 XSTRING_DATA (CODING_SYSTEM_MNEMONIC (codesys)), mlen);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
465 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
466
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
467 #define DEFINE_SUB_CODESYS(op_sys, op_sys_abbr, Type) do { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
468 strcpy (codesys_name + len, "-" op_sys); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
469 if (mlen != -1) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
470 strcpy (codesys_mnemonic + mlen, op_sys_abbr); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
471 codesys_name_sym = intern (codesys_name); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
472 sub_codesys_obj = Fcopy_coding_system (codesys_obj, codesys_name_sym); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
473 XCODING_SYSTEM_EOL_TYPE (sub_codesys_obj) = Type; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
474 if (mlen != -1) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
475 XCODING_SYSTEM_MNEMONIC(sub_codesys_obj) = \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
476 build_string (codesys_mnemonic); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
477 CODING_SYSTEM_##Type (codesys) = sub_codesys_obj; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
478 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
479
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
480 DEFINE_SUB_CODESYS("unix", "", EOL_LF);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
481 DEFINE_SUB_CODESYS("dos", ":T", EOL_CRLF);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
482 DEFINE_SUB_CODESYS("mac", ":t", EOL_CR);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
483 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
484
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
485 DEFUN ("coding-system-p", Fcoding_system_p, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
486 Return t if OBJECT is a coding system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
487 A coding system is an object that defines how text containing multiple
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
488 character sets is encoded into a stream of (typically 8-bit) bytes.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
489 The coding system is used to decode the stream into a series of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
490 characters (which may be from multiple charsets) when the text is read
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
491 from a file or process, and is used to encode the text back into the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
492 same format when it is written out to a file or process.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
493
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
494 For example, many ISO2022-compliant coding systems (such as Compound
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
495 Text, which is used for inter-client data under the X Window System)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
496 use escape sequences to switch between different charsets -- Japanese
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
497 Kanji, for example, is invoked with "ESC $ ( B"; ASCII is invoked
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
498 with "ESC ( B"; and Cyrillic is invoked with "ESC - L". See
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
499 `make-coding-system' for more information.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
500
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
501 Coding systems are normally identified using a symbol, and the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
502 symbol is accepted in place of the actual coding system object whenever
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
503 a coding system is called for. (This is similar to how faces work.)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
504 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
505 (object))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
506 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
507 return CODING_SYSTEMP (object) ? Qt : Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
508 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
509
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
510 DEFUN ("find-coding-system", Ffind_coding_system, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
511 Retrieve the coding system of the given name.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
512
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
513 If CODING-SYSTEM-OR-NAME is a coding-system object, it is simply
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
514 returned. Otherwise, CODING-SYSTEM-OR-NAME should be a symbol.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
515 If there is no such coding system, nil is returned. Otherwise the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
516 associated coding system object is returned.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
517 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
518 (coding_system_or_name))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
519 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
520 if (NILP (coding_system_or_name))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
521 coding_system_or_name = Qbinary;
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
522 else if (CODING_SYSTEMP (coding_system_or_name))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
523 return coding_system_or_name;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
524 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
525 CHECK_SYMBOL (coding_system_or_name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
526
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
527 while (1)
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
528 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
529 coding_system_or_name =
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
530 Fgethash (coding_system_or_name, Vcoding_system_hash_table, Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
531
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
532 if (CODING_SYSTEMP (coding_system_or_name) || NILP (coding_system_or_name))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
533 return coding_system_or_name;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
534 }
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
535 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
536
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
537 DEFUN ("get-coding-system", Fget_coding_system, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
538 Retrieve the coding system of the given name.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
539 Same as `find-coding-system' except that if there is no such
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
540 coding system, an error is signaled instead of returning nil.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
541 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
542 (name))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
543 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
544 Lisp_Object coding_system = Ffind_coding_system (name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
545
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
546 if (NILP (coding_system))
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
547 invalid_argument ("No such coding system", name);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
548 return coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
549 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
550
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
551 /* We store the coding systems in hash tables with the names as the key and the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
552 actual coding system object as the value. Occasionally we need to use them
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
553 in a list format. These routines provide us with that. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
554 struct coding_system_list_closure
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
555 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
556 Lisp_Object *coding_system_list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
557 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
558
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
559 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
560 add_coding_system_to_list_mapper (Lisp_Object key, Lisp_Object value,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
561 void *coding_system_list_closure)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
562 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
563 /* This function can GC */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
564 struct coding_system_list_closure *cscl =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
565 (struct coding_system_list_closure *) coding_system_list_closure;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
566 Lisp_Object *coding_system_list = cscl->coding_system_list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
567
432
3a7e78e1142d Import from CVS: tag r21-2-24
cvs
parents: 428
diff changeset
568 *coding_system_list = Fcons (key, *coding_system_list);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
569 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
570 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
571
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
572 DEFUN ("coding-system-list", Fcoding_system_list, 0, 0, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
573 Return a list of the names of all defined coding systems.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
574 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
575 ())
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
576 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
577 Lisp_Object coding_system_list = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
578 struct gcpro gcpro1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
579 struct coding_system_list_closure coding_system_list_closure;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
580
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
581 GCPRO1 (coding_system_list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
582 coding_system_list_closure.coding_system_list = &coding_system_list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
583 elisp_maphash (add_coding_system_to_list_mapper, Vcoding_system_hash_table,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
584 &coding_system_list_closure);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
585 UNGCPRO;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
586
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
587 return coding_system_list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
588 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
589
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
590 DEFUN ("coding-system-name", Fcoding_system_name, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
591 Return the name of the given coding system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
592 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
593 (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
594 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
595 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
596 return XCODING_SYSTEM_NAME (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
597 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
598
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
599 static Lisp_Coding_System *
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
600 allocate_coding_system (enum coding_system_type type, Lisp_Object name)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
601 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
602 Lisp_Coding_System *codesys =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
603 alloc_lcrecord_type (Lisp_Coding_System, &lrecord_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
604
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
605 zero_lcrecord (codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
606 CODING_SYSTEM_PRE_WRITE_CONVERSION (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
607 CODING_SYSTEM_POST_READ_CONVERSION (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
608 CODING_SYSTEM_EOL_TYPE (codesys) = EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
609 CODING_SYSTEM_EOL_CRLF (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
610 CODING_SYSTEM_EOL_CR (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
611 CODING_SYSTEM_EOL_LF (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
612 CODING_SYSTEM_TYPE (codesys) = type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
613 CODING_SYSTEM_MNEMONIC (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
614 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
615 if (type == CODESYS_ISO2022)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
616 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
617 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
618 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
619 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, i) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
620 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
621 else if (type == CODESYS_CCL)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
622 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
623 CODING_SYSTEM_CCL_DECODE (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
624 CODING_SYSTEM_CCL_ENCODE (codesys) = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
625 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
626 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
627 CODING_SYSTEM_NAME (codesys) = name;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
628
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
629 return codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
630 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
631
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
632 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
633 /* Given a list of charset conversion specs as specified in a Lisp
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
634 program, parse it into STORE_HERE. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
635
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
636 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
637 parse_charset_conversion_specs (charset_conversion_spec_dynarr *store_here,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
638 Lisp_Object spec_list)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
639 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
640 Lisp_Object rest;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
641
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
642 EXTERNAL_LIST_LOOP (rest, spec_list)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
643 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
644 Lisp_Object car = XCAR (rest);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
645 Lisp_Object from, to;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
646 struct charset_conversion_spec spec;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
647
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
648 if (!CONSP (car) || !CONSP (XCDR (car)) || !NILP (XCDR (XCDR (car))))
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
649 invalid_argument ("Invalid charset conversion spec", car);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
650 from = Fget_charset (XCAR (car));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
651 to = Fget_charset (XCAR (XCDR (car)));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
652 if (XCHARSET_TYPE (from) != XCHARSET_TYPE (to))
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
653 invalid_operation_2
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
654 ("Attempted conversion between different charset types",
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
655 from, to);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
656 spec.from_charset = from;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
657 spec.to_charset = to;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
658
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
659 Dynarr_add (store_here, spec);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
660 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
661 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
662
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
663 /* Given a dynarr LOAD_HERE of internally-stored charset conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
664 specs, return the equivalent as the Lisp programmer would see it.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
665
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
666 If LOAD_HERE is 0, return Qnil. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
667
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
668 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
669 unparse_charset_conversion_specs (charset_conversion_spec_dynarr *load_here)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
670 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
671 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
672 Lisp_Object result;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
673
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
674 if (!load_here)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
675 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
676 for (i = 0, result = Qnil; i < Dynarr_length (load_here); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
677 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
678 struct charset_conversion_spec *ccs = Dynarr_atp (load_here, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
679 result = Fcons (list2 (ccs->from_charset, ccs->to_charset), result);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
680 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
681
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
682 return Fnreverse (result);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
683 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
684
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
685 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
686
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
687 DEFUN ("make-coding-system", Fmake_coding_system, 2, 4, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
688 Register symbol NAME as a coding system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
689
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
690 TYPE describes the conversion method used and should be one of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
691
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
692 nil or 'undecided
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
693 Automatic conversion. XEmacs attempts to detect the coding system
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
694 used in the file.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
695 'no-conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
696 No conversion. Use this for binary files and such. On output,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
697 graphic characters that are not in ASCII or Latin-1 will be
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
698 replaced by a ?. (For a no-conversion-encoded buffer, these
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
699 characters will only be present if you explicitly insert them.)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
700 'shift-jis
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
701 Shift-JIS (a Japanese encoding commonly used in PC operating systems).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
702 'ucs-4
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
703 ISO 10646 UCS-4 encoding.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
704 'utf-8
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
705 ISO 10646 UTF-8 encoding.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
706 'iso2022
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
707 Any ISO2022-compliant encoding. Among other things, this includes
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
708 JIS (the Japanese encoding commonly used for e-mail), EUC (the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
709 standard Unix encoding for Japanese and other languages), and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
710 Compound Text (the encoding used in X11). You can specify more
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
711 specific information about the conversion with the PROPS argument.
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
712 'big5
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
713 Big5 (the encoding commonly used for Taiwanese).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
714 'ccl
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
715 The conversion is performed using a user-written pseudo-code
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
716 program. CCL (Code Conversion Language) is the name of this
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
717 pseudo-code.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
718 'internal
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
719 Write out or read in the raw contents of the memory representing
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
720 the buffer's text. This is primarily useful for debugging
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
721 purposes, and is only enabled when XEmacs has been compiled with
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
722 DEBUG_XEMACS defined (via the --debug configure option).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
723 WARNING: Reading in a file using 'internal conversion can result
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
724 in an internal inconsistency in the memory representing a
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
725 buffer's text, which will produce unpredictable results and may
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
726 cause XEmacs to crash. Under normal circumstances you should
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
727 never use 'internal conversion.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
728
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
729 DOC-STRING is a string describing the coding system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
730
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
731 PROPS is a property list, describing the specific nature of the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
732 character set. Recognized properties are:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
733
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
734 'mnemonic
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
735 String to be displayed in the modeline when this coding system is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
736 active.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
737
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
738 'eol-type
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
739 End-of-line conversion to be used. It should be one of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
740
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
741 nil
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
742 Automatically detect the end-of-line type (LF, CRLF,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
743 or CR). Also generate subsidiary coding systems named
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
744 `NAME-unix', `NAME-dos', and `NAME-mac', that are
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
745 identical to this coding system but have an EOL-TYPE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
746 value of 'lf, 'crlf, and 'cr, respectively.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
747 'lf
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
748 The end of a line is marked externally using ASCII LF.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
749 Since this is also the way that XEmacs represents an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
750 end-of-line internally, specifying this option results
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
751 in no end-of-line conversion. This is the standard
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
752 format for Unix text files.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
753 'crlf
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
754 The end of a line is marked externally using ASCII
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
755 CRLF. This is the standard format for MS-DOS text
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
756 files.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
757 'cr
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
758 The end of a line is marked externally using ASCII CR.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
759 This is the standard format for Macintosh text files.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
760 t
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
761 Automatically detect the end-of-line type but do not
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
762 generate subsidiary coding systems. (This value is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
763 converted to nil when stored internally, and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
764 `coding-system-property' will return nil.)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
765
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
766 'post-read-conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
767 Function called after a file has been read in, to perform the
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
768 decoding. Called with two arguments, START and END, denoting
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
769 a region of the current buffer to be decoded.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
770
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
771 'pre-write-conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
772 Function called before a file is written out, to perform the
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
773 encoding. Called with two arguments, START and END, denoting
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
774 a region of the current buffer to be encoded.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
775
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
776
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
777 The following additional properties are recognized if TYPE is 'iso2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
778
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
779 'charset-g0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
780 'charset-g1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
781 'charset-g2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
782 'charset-g3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
783 The character set initially designated to the G0 - G3 registers.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
784 The value should be one of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
785
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
786 -- A charset object (designate that character set)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
787 -- nil (do not ever use this register)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
788 -- t (no character set is initially designated to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
789 the register, but may be later on; this automatically
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
790 sets the corresponding `force-g*-on-output' property)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
791
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
792 'force-g0-on-output
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
793 'force-g1-on-output
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
794 'force-g2-on-output
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
795 'force-g2-on-output
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
796 If non-nil, send an explicit designation sequence on output before
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
797 using the specified register.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
798
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
799 'short
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
800 If non-nil, use the short forms "ESC $ @", "ESC $ A", and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
801 "ESC $ B" on output in place of the full designation sequences
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
802 "ESC $ ( @", "ESC $ ( A", and "ESC $ ( B".
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
803
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
804 'no-ascii-eol
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
805 If non-nil, don't designate ASCII to G0 at each end of line on output.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
806 Setting this to non-nil also suppresses other state-resetting that
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
807 normally happens at the end of a line.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
808
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
809 'no-ascii-cntl
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
810 If non-nil, don't designate ASCII to G0 before control chars on output.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
811
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
812 'seven
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
813 If non-nil, use 7-bit environment on output. Otherwise, use 8-bit
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
814 environment.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
815
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
816 'lock-shift
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
817 If non-nil, use locking-shift (SO/SI) instead of single-shift
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
818 or designation by escape sequence.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
819
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
820 'no-iso6429
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
821 If non-nil, don't use ISO6429's direction specification.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
822
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
823 'escape-quoted
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
824 If non-nil, literal control characters that are the same as
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
825 the beginning of a recognized ISO2022 or ISO6429 escape sequence
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
826 (in particular, ESC (0x1B), SO (0x0E), SI (0x0F), SS2 (0x8E),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
827 SS3 (0x8F), and CSI (0x9B)) are "quoted" with an escape character
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
828 so that they can be properly distinguished from an escape sequence.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
829 (Note that doing this results in a non-portable encoding.) This
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
830 encoding flag is used for byte-compiled files. Note that ESC
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
831 is a good choice for a quoting character because there are no
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
832 escape sequences whose second byte is a character from the Control-0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
833 or Control-1 character sets; this is explicitly disallowed by the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
834 ISO2022 standard.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
835
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
836 'input-charset-conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
837 A list of conversion specifications, specifying conversion of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
838 characters in one charset to another when decoding is performed.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
839 Each specification is a list of two elements: the source charset,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
840 and the destination charset.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
841
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
842 'output-charset-conversion
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
843 A list of conversion specifications, specifying conversion of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
844 characters in one charset to another when encoding is performed.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
845 The form of each specification is the same as for
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
846 'input-charset-conversion.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
847
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
848
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
849 The following additional properties are recognized (and required)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
850 if TYPE is 'ccl:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
851
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
852 'decode
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
853 CCL program used for decoding (converting to internal format).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
854
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
855 'encode
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
856 CCL program used for encoding (converting to external format).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
857 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
858 (name, type, doc_string, props))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
859 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
860 Lisp_Coding_System *codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
861 enum coding_system_type ty;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
862 int need_to_setup_eol_systems = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
863
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
864 /* Convert type to constant */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
865 if (NILP (type) || EQ (type, Qundecided))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
866 { ty = CODESYS_AUTODETECT; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
867 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
868 else if (EQ (type, Qshift_jis)) { ty = CODESYS_SHIFT_JIS; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
869 else if (EQ (type, Qiso2022)) { ty = CODESYS_ISO2022; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
870 else if (EQ (type, Qbig5)) { ty = CODESYS_BIG5; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
871 else if (EQ (type, Qucs4)) { ty = CODESYS_UCS4; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
872 else if (EQ (type, Qutf8)) { ty = CODESYS_UTF8; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
873 else if (EQ (type, Qccl)) { ty = CODESYS_CCL; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
874 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
875 else if (EQ (type, Qno_conversion)) { ty = CODESYS_NO_CONVERSION; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
876 #ifdef DEBUG_XEMACS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
877 else if (EQ (type, Qinternal)) { ty = CODESYS_INTERNAL; }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
878 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
879 else
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
880 invalid_constant ("Invalid coding system type", type);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
881
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
882 CHECK_SYMBOL (name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
883
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
884 codesys = allocate_coding_system (ty, name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
885
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
886 if (NILP (doc_string))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
887 doc_string = build_string ("");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
888 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
889 CHECK_STRING (doc_string);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
890 CODING_SYSTEM_DOC_STRING (codesys) = doc_string;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
891
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
892 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
893 EXTERNAL_PROPERTY_LIST_LOOP_3 (key, value, props)
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
894 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
895 if (EQ (key, Qmnemonic))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
896 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
897 if (!NILP (value))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
898 CHECK_STRING (value);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
899 CODING_SYSTEM_MNEMONIC (codesys) = value;
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
900 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
901
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
902 else if (EQ (key, Qeol_type))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
903 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
904 need_to_setup_eol_systems = NILP (value);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
905 if (EQ (value, Qt))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
906 value = Qnil;
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
907 CODING_SYSTEM_EOL_TYPE (codesys) = symbol_to_eol_type (value);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
908 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
909
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
910 else if (EQ (key, Qpost_read_conversion)) CODING_SYSTEM_POST_READ_CONVERSION (codesys) = value;
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
911 else if (EQ (key, Qpre_write_conversion)) CODING_SYSTEM_PRE_WRITE_CONVERSION (codesys) = value;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
912 #ifdef MULE
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
913 else if (ty == CODESYS_ISO2022)
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
914 {
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
915 #define FROB_INITIAL_CHARSET(charset_num) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
916 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, charset_num) = \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
917 ((EQ (value, Qt) || EQ (value, Qnil)) ? value : Fget_charset (value))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
918
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
919 if (EQ (key, Qcharset_g0)) FROB_INITIAL_CHARSET (0);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
920 else if (EQ (key, Qcharset_g1)) FROB_INITIAL_CHARSET (1);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
921 else if (EQ (key, Qcharset_g2)) FROB_INITIAL_CHARSET (2);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
922 else if (EQ (key, Qcharset_g3)) FROB_INITIAL_CHARSET (3);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
923
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
924 #define FROB_FORCE_CHARSET(charset_num) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
925 CODING_SYSTEM_ISO2022_FORCE_CHARSET_ON_OUTPUT (codesys, charset_num) = !NILP (value)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
926
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
927 else if (EQ (key, Qforce_g0_on_output)) FROB_FORCE_CHARSET (0);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
928 else if (EQ (key, Qforce_g1_on_output)) FROB_FORCE_CHARSET (1);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
929 else if (EQ (key, Qforce_g2_on_output)) FROB_FORCE_CHARSET (2);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
930 else if (EQ (key, Qforce_g3_on_output)) FROB_FORCE_CHARSET (3);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
931
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
932 #define FROB_BOOLEAN_PROPERTY(prop) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
933 CODING_SYSTEM_ISO2022_##prop (codesys) = !NILP (value)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
934
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
935 else if (EQ (key, Qshort)) FROB_BOOLEAN_PROPERTY (SHORT);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
936 else if (EQ (key, Qno_ascii_eol)) FROB_BOOLEAN_PROPERTY (NO_ASCII_EOL);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
937 else if (EQ (key, Qno_ascii_cntl)) FROB_BOOLEAN_PROPERTY (NO_ASCII_CNTL);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
938 else if (EQ (key, Qseven)) FROB_BOOLEAN_PROPERTY (SEVEN);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
939 else if (EQ (key, Qlock_shift)) FROB_BOOLEAN_PROPERTY (LOCK_SHIFT);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
940 else if (EQ (key, Qno_iso6429)) FROB_BOOLEAN_PROPERTY (NO_ISO6429);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
941 else if (EQ (key, Qescape_quoted)) FROB_BOOLEAN_PROPERTY (ESCAPE_QUOTED);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
942
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
943 else if (EQ (key, Qinput_charset_conversion))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
944 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
945 codesys->iso2022.input_conv =
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
946 Dynarr_new (charset_conversion_spec);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
947 parse_charset_conversion_specs (codesys->iso2022.input_conv,
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
948 value);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
949 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
950 else if (EQ (key, Qoutput_charset_conversion))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
951 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
952 codesys->iso2022.output_conv =
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
953 Dynarr_new (charset_conversion_spec);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
954 parse_charset_conversion_specs (codesys->iso2022.output_conv,
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
955 value);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
956 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
957 else
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
958 invalid_constant ("Unrecognized property", key);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
959 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
960 else if (EQ (type, Qccl))
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
961 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
962 Lisp_Object sym;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
963 struct ccl_program test_ccl;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
964 Extbyte *suffix;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
965
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
966 /* Check key first. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
967 if (EQ (key, Qdecode))
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
968 suffix = "-ccl-decode";
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
969 else if (EQ (key, Qencode))
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
970 suffix = "-ccl-encode";
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
971 else
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
972 invalid_constant ("Unrecognized property", key);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
973
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
974 /* If value is vector, register it as a ccl program
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
975 associated with an newly created symbol for
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
976 backward compatibility. */
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
977 if (VECTORP (value))
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
978 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
979 sym = Fintern (concat2 (Fsymbol_name (name),
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
980 build_string (suffix)),
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
981 Qnil);
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
982 Fregister_ccl_program (sym, value);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
983 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
984 else
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
985 {
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
986 CHECK_SYMBOL (value);
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
987 sym = value;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
988 }
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
989 /* check if the given ccl programs are valid. */
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
990 if (setup_ccl_program (&test_ccl, sym) < 0)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
991 invalid_argument ("Invalid CCL program", value);
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
992
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
993 if (EQ (key, Qdecode))
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
994 CODING_SYSTEM_CCL_DECODE (codesys) = sym;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
995 else if (EQ (key, Qencode))
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
996 CODING_SYSTEM_CCL_ENCODE (codesys) = sym;
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
997
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
998 }
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
999 #endif /* MULE */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1000 else
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1001 invalid_constant ("Unrecognized property", key);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1002 }
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1003 }
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1004
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1005 if (need_to_setup_eol_systems)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1006 setup_eol_coding_systems (codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1007
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1008 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1009 Lisp_Object codesys_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1010 XSETCODING_SYSTEM (codesys_obj, codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1011 Fputhash (name, codesys_obj, Vcoding_system_hash_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1012 return codesys_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1013 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1014 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1015
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1016 DEFUN ("copy-coding-system", Fcopy_coding_system, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1017 Copy OLD-CODING-SYSTEM to NEW-NAME.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1018 If NEW-NAME does not name an existing coding system, a new one will
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1019 be created.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1020 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1021 (old_coding_system, new_name))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1022 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1023 Lisp_Object new_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1024 old_coding_system = Fget_coding_system (old_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1025 new_coding_system = Ffind_coding_system (new_name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1026 if (NILP (new_coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1027 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1028 XSETCODING_SYSTEM (new_coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1029 allocate_coding_system
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1030 (XCODING_SYSTEM_TYPE (old_coding_system),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1031 new_name));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1032 Fputhash (new_name, new_coding_system, Vcoding_system_hash_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1033 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1034
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1035 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1036 Lisp_Coding_System *to = XCODING_SYSTEM (new_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1037 Lisp_Coding_System *from = XCODING_SYSTEM (old_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1038 memcpy (((char *) to ) + sizeof (to->header),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1039 ((char *) from) + sizeof (from->header),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1040 sizeof (*from) - sizeof (from->header));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1041 to->name = new_name;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1042 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1043 return new_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1044 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1045
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1046 DEFUN ("coding-system-canonical-name-p", Fcoding_system_canonical_name_p, 1, 1, 0, /*
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1047 Return t if OBJECT names a coding system, and is not a coding system alias.
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1048 */
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1049 (object))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1050 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1051 return CODING_SYSTEMP (Fgethash (object, Vcoding_system_hash_table, Qnil))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1052 ? Qt : Qnil;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1053 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1054
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1055 DEFUN ("coding-system-alias-p", Fcoding_system_alias_p, 1, 1, 0, /*
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1056 Return t if OBJECT is a coding system alias.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1057 All coding system aliases are created by `define-coding-system-alias'.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1058 */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1059 (object))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1060 {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1061 return SYMBOLP (Fgethash (object, Vcoding_system_hash_table, Qzero))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1062 ? Qt : Qnil;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1063 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1064
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1065 DEFUN ("coding-system-aliasee", Fcoding_system_aliasee, 1, 1, 0, /*
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1066 Return the coding-system symbol for which symbol ALIAS is an alias.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1067 */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1068 (alias))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1069 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1070 Lisp_Object aliasee = Fgethash (alias, Vcoding_system_hash_table, Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1071 if (SYMBOLP (aliasee))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1072 return aliasee;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1073 else
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1074 invalid_argument ("Symbol is not a coding system alias", alias);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1075 return Qnil; /* To keep the compiler happy */
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1076 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1077
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1078 static Lisp_Object
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1079 append_suffix_to_symbol (Lisp_Object symbol, const char *ascii_string)
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1080 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1081 return Fintern (concat2 (Fsymbol_name (symbol), build_string (ascii_string)),
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1082 Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1083 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1084
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1085 /* A maphash function, for removing dangling coding system aliases. */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1086 static int
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1087 dangling_coding_system_alias_p (Lisp_Object alias,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1088 Lisp_Object aliasee,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1089 void *dangling_aliases)
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1090 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1091 if (SYMBOLP (aliasee)
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1092 && NILP (Fgethash (aliasee, Vcoding_system_hash_table, Qnil)))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1093 {
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1094 (*(int *) dangling_aliases)++;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1095 return 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1096 }
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1097 else
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1098 return 0;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1099 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1100
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1101 DEFUN ("define-coding-system-alias", Fdefine_coding_system_alias, 2, 2, 0, /*
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1102 Define symbol ALIAS as an alias for coding system ALIASEE.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1103
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1104 You can use this function to redefine an alias that has already been defined,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1105 but you cannot redefine a name which is the canonical name for a coding system.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1106 \(a canonical name of a coding system is what is returned when you call
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1107 `coding-system-name' on a coding system).
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1108
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1109 ALIASEE itself can be an alias, which allows you to define nested aliases.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1110
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1111 You are forbidden, however, from creating alias loops or `dangling' aliases.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1112 These will be detected, and an error will be signaled if you attempt to do so.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1113
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1114 If ALIASEE is nil, then ALIAS will simply be undefined.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1115
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1116 See also `coding-system-alias-p', `coding-system-aliasee',
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1117 and `coding-system-canonical-name-p'.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1118 */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1119 (alias, aliasee))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1120 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1121 Lisp_Object real_coding_system, probe;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1122
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1123 CHECK_SYMBOL (alias);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1124
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1125 if (!NILP (Fcoding_system_canonical_name_p (alias)))
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1126 invalid_change
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1127 ("Symbol is the canonical name of a coding system and cannot be redefined",
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1128 alias);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1129
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1130 if (NILP (aliasee))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1131 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1132 Lisp_Object subsidiary_unix = append_suffix_to_symbol (alias, "-unix");
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1133 Lisp_Object subsidiary_dos = append_suffix_to_symbol (alias, "-dos");
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1134 Lisp_Object subsidiary_mac = append_suffix_to_symbol (alias, "-mac");
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1135
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1136 Fremhash (alias, Vcoding_system_hash_table);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1137
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1138 /* Undefine subsidiary aliases,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1139 presumably created by a previous call to this function */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1140 if (! NILP (Fcoding_system_alias_p (subsidiary_unix)) &&
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1141 ! NILP (Fcoding_system_alias_p (subsidiary_dos)) &&
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1142 ! NILP (Fcoding_system_alias_p (subsidiary_mac)))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1143 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1144 Fdefine_coding_system_alias (subsidiary_unix, Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1145 Fdefine_coding_system_alias (subsidiary_dos, Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1146 Fdefine_coding_system_alias (subsidiary_mac, Qnil);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1147 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1148
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1149 /* Undefine dangling coding system aliases. */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1150 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1151 int dangling_aliases;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1152
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1153 do {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1154 dangling_aliases = 0;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1155 elisp_map_remhash (dangling_coding_system_alias_p,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1156 Vcoding_system_hash_table,
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1157 &dangling_aliases);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1158 } while (dangling_aliases > 0);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1159 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1160
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1161 return Qnil;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1162 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1163
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1164 if (CODING_SYSTEMP (aliasee))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1165 aliasee = XCODING_SYSTEM_NAME (aliasee);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1166
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1167 /* Checks that aliasee names a coding-system */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1168 real_coding_system = Fget_coding_system (aliasee);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1169
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1170 /* Check for coding system alias loops */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1171 if (EQ (alias, aliasee))
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1172 alias_loop: invalid_operation_2
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1173 ("Attempt to create a coding system alias loop", alias, aliasee);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1174
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1175 for (probe = aliasee;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1176 SYMBOLP (probe);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1177 probe = Fgethash (probe, Vcoding_system_hash_table, Qzero))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1178 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1179 if (EQ (probe, alias))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1180 goto alias_loop;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1181 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1182
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1183 Fputhash (alias, aliasee, Vcoding_system_hash_table);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1184
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1185 /* Set up aliases for subsidiaries.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1186 #### There must be a better way to handle subsidiary coding systems. */
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1187 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1188 static const char *suffixes[] = { "-unix", "-dos", "-mac" };
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1189 int i;
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1190 for (i = 0; i < countof (suffixes); i++)
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1191 {
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1192 Lisp_Object alias_subsidiary =
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1193 append_suffix_to_symbol (alias, suffixes[i]);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1194 Lisp_Object aliasee_subsidiary =
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1195 append_suffix_to_symbol (aliasee, suffixes[i]);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1196
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1197 if (! NILP (Ffind_coding_system (aliasee_subsidiary)))
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1198 Fdefine_coding_system_alias (alias_subsidiary, aliasee_subsidiary);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1199 }
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
1200 }
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1201 /* FSF return value is a vector of [ALIAS-unix ALIAS-dos ALIAS-mac],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1202 but it doesn't look intentional, so I'd rather return something
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1203 meaningful or nothing at all. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1204 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1205 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1206
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1207 static Lisp_Object
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
1208 subsidiary_coding_system (Lisp_Object coding_system, eol_type_t type)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1209 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1210 Lisp_Coding_System *cs = XCODING_SYSTEM (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1211 Lisp_Object new_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1212
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1213 if (CODING_SYSTEM_EOL_TYPE (cs) != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1214 return coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1215
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1216 switch (type)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1217 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1218 case EOL_AUTODETECT: return coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1219 case EOL_LF: new_coding_system = CODING_SYSTEM_EOL_LF (cs); break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1220 case EOL_CR: new_coding_system = CODING_SYSTEM_EOL_CR (cs); break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1221 case EOL_CRLF: new_coding_system = CODING_SYSTEM_EOL_CRLF (cs); break;
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1222 default: abort (); return Qnil;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1223 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1224
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1225 return NILP (new_coding_system) ? coding_system : new_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1226 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1227
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1228 DEFUN ("subsidiary-coding-system", Fsubsidiary_coding_system, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1229 Return the subsidiary coding system of CODING-SYSTEM with eol type EOL-TYPE.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1230 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1231 (coding_system, eol_type))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1232 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1233 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1234
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1235 return subsidiary_coding_system (coding_system,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1236 symbol_to_eol_type (eol_type));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1237 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1238
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1239
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1240 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1241 /* Coding system accessors */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1242 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1243
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1244 DEFUN ("coding-system-doc-string", Fcoding_system_doc_string, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1245 Return the doc string for CODING-SYSTEM.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1246 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1247 (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1248 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1249 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1250 return XCODING_SYSTEM_DOC_STRING (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1251 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1252
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1253 DEFUN ("coding-system-type", Fcoding_system_type, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1254 Return the type of CODING-SYSTEM.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1255 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1256 (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1257 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1258 switch (XCODING_SYSTEM_TYPE (Fget_coding_system (coding_system)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1259 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1260 default: abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1261 case CODESYS_AUTODETECT: return Qundecided;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1262 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1263 case CODESYS_SHIFT_JIS: return Qshift_jis;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1264 case CODESYS_ISO2022: return Qiso2022;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1265 case CODESYS_BIG5: return Qbig5;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1266 case CODESYS_UCS4: return Qucs4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1267 case CODESYS_UTF8: return Qutf8;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1268 case CODESYS_CCL: return Qccl;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1269 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1270 case CODESYS_NO_CONVERSION: return Qno_conversion;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1271 #ifdef DEBUG_XEMACS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1272 case CODESYS_INTERNAL: return Qinternal;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1273 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1274 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1275 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1276
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1277 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1278 static
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1279 Lisp_Object coding_system_charset (Lisp_Object coding_system, int gnum)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1280 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1281 Lisp_Object cs
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1282 = XCODING_SYSTEM_ISO2022_INITIAL_CHARSET (coding_system, gnum);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1283
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1284 return CHARSETP (cs) ? XCHARSET_NAME (cs) : Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1285 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1286 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1287
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1288 DEFUN ("coding-system-property", Fcoding_system_property, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1289 Return the PROP property of CODING-SYSTEM.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1290 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1291 (coding_system, prop))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1292 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1293 int i, ok = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1294 enum coding_system_type type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1295
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1296 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1297 CHECK_SYMBOL (prop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1298 type = XCODING_SYSTEM_TYPE (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1299
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1300 for (i = 0; !ok && i < Dynarr_length (the_codesys_prop_dynarr); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1301 if (EQ (Dynarr_at (the_codesys_prop_dynarr, i).sym, prop))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1302 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1303 ok = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1304 switch (Dynarr_at (the_codesys_prop_dynarr, i).prop_type)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1305 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1306 case CODESYS_PROP_ALL_OK:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1307 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1308 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1309 case CODESYS_PROP_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1310 if (type != CODESYS_ISO2022)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1311 invalid_argument
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1312 ("Property only valid in ISO2022 coding systems",
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1313 prop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1314 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1315
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1316 case CODESYS_PROP_CCL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1317 if (type != CODESYS_CCL)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1318 invalid_argument
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1319 ("Property only valid in CCL coding systems",
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1320 prop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1321 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1322 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1323 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1324 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1325 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1326 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1327
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1328 if (!ok)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1329 invalid_constant ("Unrecognized property", prop);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1330
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1331 if (EQ (prop, Qname))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1332 return XCODING_SYSTEM_NAME (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1333 else if (EQ (prop, Qtype))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1334 return Fcoding_system_type (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1335 else if (EQ (prop, Qdoc_string))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1336 return XCODING_SYSTEM_DOC_STRING (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1337 else if (EQ (prop, Qmnemonic))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1338 return XCODING_SYSTEM_MNEMONIC (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1339 else if (EQ (prop, Qeol_type))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1340 return eol_type_to_symbol (XCODING_SYSTEM_EOL_TYPE (coding_system));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1341 else if (EQ (prop, Qeol_lf))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1342 return XCODING_SYSTEM_EOL_LF (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1343 else if (EQ (prop, Qeol_crlf))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1344 return XCODING_SYSTEM_EOL_CRLF (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1345 else if (EQ (prop, Qeol_cr))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1346 return XCODING_SYSTEM_EOL_CR (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1347 else if (EQ (prop, Qpost_read_conversion))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1348 return XCODING_SYSTEM_POST_READ_CONVERSION (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1349 else if (EQ (prop, Qpre_write_conversion))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1350 return XCODING_SYSTEM_PRE_WRITE_CONVERSION (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1351 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1352 else if (type == CODESYS_ISO2022)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1353 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1354 if (EQ (prop, Qcharset_g0))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1355 return coding_system_charset (coding_system, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1356 else if (EQ (prop, Qcharset_g1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1357 return coding_system_charset (coding_system, 1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1358 else if (EQ (prop, Qcharset_g2))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1359 return coding_system_charset (coding_system, 2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1360 else if (EQ (prop, Qcharset_g3))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1361 return coding_system_charset (coding_system, 3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1362
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1363 #define FORCE_CHARSET(charset_num) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1364 (XCODING_SYSTEM_ISO2022_FORCE_CHARSET_ON_OUTPUT \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1365 (coding_system, charset_num) ? Qt : Qnil)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1366
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1367 else if (EQ (prop, Qforce_g0_on_output)) return FORCE_CHARSET (0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1368 else if (EQ (prop, Qforce_g1_on_output)) return FORCE_CHARSET (1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1369 else if (EQ (prop, Qforce_g2_on_output)) return FORCE_CHARSET (2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1370 else if (EQ (prop, Qforce_g3_on_output)) return FORCE_CHARSET (3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1371
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1372 #define LISP_BOOLEAN(prop) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1373 (XCODING_SYSTEM_ISO2022_##prop (coding_system) ? Qt : Qnil)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1374
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1375 else if (EQ (prop, Qshort)) return LISP_BOOLEAN (SHORT);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1376 else if (EQ (prop, Qno_ascii_eol)) return LISP_BOOLEAN (NO_ASCII_EOL);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1377 else if (EQ (prop, Qno_ascii_cntl)) return LISP_BOOLEAN (NO_ASCII_CNTL);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1378 else if (EQ (prop, Qseven)) return LISP_BOOLEAN (SEVEN);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1379 else if (EQ (prop, Qlock_shift)) return LISP_BOOLEAN (LOCK_SHIFT);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1380 else if (EQ (prop, Qno_iso6429)) return LISP_BOOLEAN (NO_ISO6429);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1381 else if (EQ (prop, Qescape_quoted)) return LISP_BOOLEAN (ESCAPE_QUOTED);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1382
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1383 else if (EQ (prop, Qinput_charset_conversion))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1384 return
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1385 unparse_charset_conversion_specs
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1386 (XCODING_SYSTEM (coding_system)->iso2022.input_conv);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1387 else if (EQ (prop, Qoutput_charset_conversion))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1388 return
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1389 unparse_charset_conversion_specs
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1390 (XCODING_SYSTEM (coding_system)->iso2022.output_conv);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1391 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1392 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1393 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1394 else if (type == CODESYS_CCL)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1395 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1396 if (EQ (prop, Qdecode))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1397 return XCODING_SYSTEM_CCL_DECODE (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1398 else if (EQ (prop, Qencode))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1399 return XCODING_SYSTEM_CCL_ENCODE (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1400 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1401 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1402 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1403 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1404 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1405 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1406
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1407 return Qnil; /* not reached */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1408 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1409
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1410
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1411 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1412 /* Coding category functions */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1413 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1414
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1415 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1416 decode_coding_category (Lisp_Object symbol)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1417 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1418 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1419
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1420 CHECK_SYMBOL (symbol);
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1421 for (i = 0; i < CODING_CATEGORY_LAST; i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1422 if (EQ (coding_category_symbol[i], symbol))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1423 return i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1424
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1425 invalid_constant ("Unrecognized coding category", symbol);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1426 return 0; /* not reached */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1427 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1429 DEFUN ("coding-category-list", Fcoding_category_list, 0, 0, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1430 Return a list of all recognized coding categories.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1431 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1432 ())
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1433 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1434 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1435 Lisp_Object list = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1436
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1437 for (i = CODING_CATEGORY_LAST - 1; i >= 0; i--)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1438 list = Fcons (coding_category_symbol[i], list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1439 return list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1440 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1441
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1442 DEFUN ("set-coding-priority-list", Fset_coding_priority_list, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1443 Change the priority order of the coding categories.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1444 LIST should be list of coding categories, in descending order of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1445 priority. Unspecified coding categories will be lower in priority
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1446 than all specified ones, in the same relative order they were in
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1447 previously.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1448 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1449 (list))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1450 {
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1451 int category_to_priority[CODING_CATEGORY_LAST];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1452 int i, j;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1453 Lisp_Object rest;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1454
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1455 /* First generate a list that maps coding categories to priorities. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1456
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1457 for (i = 0; i < CODING_CATEGORY_LAST; i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1458 category_to_priority[i] = -1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1459
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1460 /* Highest priority comes from the specified list. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1461 i = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1462 EXTERNAL_LIST_LOOP (rest, list)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1463 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1464 int cat = decode_coding_category (XCAR (rest));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1465
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1466 if (category_to_priority[cat] >= 0)
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
1467 sferror ("Duplicate coding category in list", XCAR (rest));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1468 category_to_priority[cat] = i++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1469 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1470
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1471 /* Now go through the existing categories by priority to retrieve
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1472 the categories not yet specified and preserve their priority
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1473 order. */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1474 for (j = 0; j < CODING_CATEGORY_LAST; j++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1475 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1476 int cat = fcd->coding_category_by_priority[j];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1477 if (category_to_priority[cat] < 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1478 category_to_priority[cat] = i++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1479 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1480
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1481 /* Now we need to construct the inverse of the mapping we just
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1482 constructed. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1483
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1484 for (i = 0; i < CODING_CATEGORY_LAST; i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1485 fcd->coding_category_by_priority[category_to_priority[i]] = i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1486
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1487 /* Phew! That was confusing. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1488 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1489 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1490
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1491 DEFUN ("coding-priority-list", Fcoding_priority_list, 0, 0, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1492 Return a list of coding categories in descending order of priority.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1493 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1494 ())
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1495 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1496 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1497 Lisp_Object list = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1498
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1499 for (i = CODING_CATEGORY_LAST - 1; i >= 0; i--)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1500 list = Fcons (coding_category_symbol[fcd->coding_category_by_priority[i]],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1501 list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1502 return list;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1503 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1504
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1505 DEFUN ("set-coding-category-system", Fset_coding_category_system, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1506 Change the coding system associated with a coding category.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1507 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1508 (coding_category, coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1509 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1510 int cat = decode_coding_category (coding_category);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1511
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1512 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1513 fcd->coding_category_system[cat] = coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1514 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1515 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1516
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1517 DEFUN ("coding-category-system", Fcoding_category_system, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1518 Return the coding system associated with a coding category.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1519 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1520 (coding_category))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1521 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1522 int cat = decode_coding_category (coding_category);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1523 Lisp_Object sys = fcd->coding_category_system[cat];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1524
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1525 if (!NILP (sys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1526 return XCODING_SYSTEM_NAME (sys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1527 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1528 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1529
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1530
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1531 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1532 /* Detecting the encoding of data */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1533 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1534
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1535 struct detection_state
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1536 {
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
1537 eol_type_t eol_type;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1538 int seen_non_ascii;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1539 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1540 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1541 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1542 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1543 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1544 int in_second_byte;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1545 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1546 big5;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1547
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1548 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1549 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1550 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1551 int in_second_byte;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1552 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1553 shift_jis;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1554
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1555 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1556 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1557 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1558 int in_byte;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1559 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1560 ucs4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1561
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1562 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1563 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1564 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1565 int in_byte;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1566 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1567 utf8;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1568
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1569 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1570 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1571 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1572 int initted;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1573 struct iso2022_decoder iso;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1574 unsigned int flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1575 int high_byte_count;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1576 unsigned int saw_single_shift:1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1577 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1578 iso2022;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1579 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1580 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1581 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1582 int seen_anything;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1583 int just_saw_cr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1584 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1585 eol;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1586 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1587
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1588 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1589 acceptable_control_char_p (int c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1590 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1591 switch (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1592 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1593 /* Allow and ignore control characters that you might
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1594 reasonably see in a text file */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1595 case '\r':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1596 case '\n':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1597 case '\t':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1598 case 7: /* bell */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1599 case 8: /* backspace */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1600 case 11: /* vertical tab */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1601 case 12: /* form feed */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1602 case 26: /* MS-DOS C-z junk */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1603 case 31: /* '^_' -- for info */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1604 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1605 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1606 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1607 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1608 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1609
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1610 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1611 mask_has_at_most_one_bit_p (int mask)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1612 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1613 /* Perhaps the only thing useful you learn from intensive Microsoft
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1614 technical interviews */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1615 return (mask & (mask - 1)) == 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1616 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1617
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
1618 static eol_type_t
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1619 detect_eol_type (struct detection_state *st, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
1620 Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1621 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1622 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1623 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1624 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1625 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1626 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1627 if (st->eol.just_saw_cr)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1628 return EOL_CRLF;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1629 else if (st->eol.seen_anything)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1630 return EOL_LF;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1631 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1632 else if (st->eol.just_saw_cr)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1633 return EOL_CR;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1634 else if (c == '\r')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1635 st->eol.just_saw_cr = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1636 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1637 st->eol.just_saw_cr = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1638 st->eol.seen_anything = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1639 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1640
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1641 return EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1642 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1643
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1644 /* Attempt to determine the encoding and EOL type of the given text.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1645 Before calling this function for the first type, you must initialize
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1646 st->eol_type as appropriate and initialize st->mask to ~0.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1647
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1648 st->eol_type holds the determined EOL type, or EOL_AUTODETECT if
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1649 not yet known.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1650
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1651 st->mask holds the determined coding category mask, or ~0 if only
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1652 ASCII has been seen so far.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1653
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1654 Returns:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1655
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1656 0 == st->eol_type is EOL_AUTODETECT and/or more than coding category
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1657 is present in st->mask
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1658 1 == definitive answers are here for both st->eol_type and st->mask
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1659 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1660
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1661 static int
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
1662 detect_coding_type (struct detection_state *st, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
1663 Bytecount n, int just_do_eol)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1664 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1665 if (st->eol_type == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1666 st->eol_type = detect_eol_type (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1667
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1668 if (just_do_eol)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1669 return st->eol_type != EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1670
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1671 if (!st->seen_non_ascii)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1672 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1673 for (; n; n--, src++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1674 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1675 unsigned char c = *(unsigned char *) src;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1676 if ((c < 0x20 && !acceptable_control_char_p (c)) || c >= 0x80)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1677 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1678 st->seen_non_ascii = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1679 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1680 st->shift_jis.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1681 st->big5.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1682 st->ucs4.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1683 st->utf8.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1684 st->iso2022.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1685 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1686 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1687 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1688 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1689 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1690
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1691 if (!n)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1692 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1693 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1694 if (!mask_has_at_most_one_bit_p (st->iso2022.mask))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1695 st->iso2022.mask = detect_coding_iso2022 (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1696 if (!mask_has_at_most_one_bit_p (st->shift_jis.mask))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1697 st->shift_jis.mask = detect_coding_sjis (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1698 if (!mask_has_at_most_one_bit_p (st->big5.mask))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1699 st->big5.mask = detect_coding_big5 (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1700 if (!mask_has_at_most_one_bit_p (st->utf8.mask))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1701 st->utf8.mask = detect_coding_utf8 (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1702 if (!mask_has_at_most_one_bit_p (st->ucs4.mask))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1703 st->ucs4.mask = detect_coding_ucs4 (st, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1704
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1705 st->mask
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1706 = st->iso2022.mask | st->shift_jis.mask | st->big5.mask
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1707 | st->utf8.mask | st->ucs4.mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1708 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1709 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1710 int retval = mask_has_at_most_one_bit_p (st->mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1711 st->mask |= CODING_CATEGORY_NO_CONVERSION_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1712 return retval && st->eol_type != EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1713 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1714 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1715
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1716 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1717 coding_system_from_mask (int mask)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1718 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1719 if (mask == ~0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1720 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1721 /* If the file was entirely or basically ASCII, use the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1722 default value of `buffer-file-coding-system'. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1723 Lisp_Object retval =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1724 XBUFFER (Vbuffer_defaults)->buffer_file_coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1725 if (!NILP (retval))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1726 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1727 retval = Ffind_coding_system (retval);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1728 if (NILP (retval))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1729 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1730 warn_when_safe
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1731 (Qbad_variable, Qwarning,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1732 "Invalid `default-buffer-file-coding-system', set to nil");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1733 XBUFFER (Vbuffer_defaults)->buffer_file_coding_system = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1734 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1735 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1736 if (NILP (retval))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1737 retval = Fget_coding_system (Qraw_text);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1738 return retval;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1739 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1740 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1741 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1742 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1743 int cat = -1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1744 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1745 mask = postprocess_iso2022_mask (mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1746 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1747 /* Look through the coding categories by priority and find
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1748 the first one that is allowed. */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1749 for (i = 0; i < CODING_CATEGORY_LAST; i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1750 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1751 cat = fcd->coding_category_by_priority[i];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1752 if ((mask & (1 << cat)) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1753 !NILP (fcd->coding_category_system[cat]))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1754 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1755 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1756 if (cat >= 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1757 return fcd->coding_category_system[cat];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1758 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1759 return Fget_coding_system (Qraw_text);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1760 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1761 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1762
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1763 /* Given a seekable read stream and potential coding system and EOL type
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1764 as specified, do any autodetection that is called for. If the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1765 coding system and/or EOL type are not `autodetect', they will be left
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1766 alone; but this function will never return an autodetect coding system
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1767 or EOL type.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1768
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1769 This function does not automatically fetch subsidiary coding systems;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1770 that should be unnecessary with the explicit eol-type argument. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1771
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1772 #define LENGTH(string_constant) (sizeof (string_constant) - 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1773
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1774 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1775 determine_real_coding_system (Lstream *stream, Lisp_Object *codesys_in_out,
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
1776 eol_type_t *eol_type_in_out)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1777 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1778 struct detection_state decst;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1779
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1780 if (*eol_type_in_out == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1781 *eol_type_in_out = XCODING_SYSTEM_EOL_TYPE (*codesys_in_out);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1782
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1783 xzero (decst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1784 decst.eol_type = *eol_type_in_out;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1785 decst.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1786
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1787 /* If autodetection is called for, do it now. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1788 if (XCODING_SYSTEM_TYPE (*codesys_in_out) == CODESYS_AUTODETECT
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1789 || *eol_type_in_out == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1790 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1791 Extbyte buf[4096];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1792 Lisp_Object coding_system = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1793 Extbyte *p;
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
1794 Bytecount nread = Lstream_read (stream, buf, sizeof (buf));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1795 Extbyte *scan_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1796
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1797 /* Look for initial "-*-"; mode line prefix */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1798 for (p = buf,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1799 scan_end = buf + nread - LENGTH ("-*-coding:?-*-");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1800 p <= scan_end
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1801 && *p != '\n'
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1802 && *p != '\r';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1803 p++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1804 if (*p == '-' && *(p+1) == '*' && *(p+2) == '-')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1805 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1806 Extbyte *local_vars_beg = p + 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1807 /* Look for final "-*-"; mode line suffix */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1808 for (p = local_vars_beg,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1809 scan_end = buf + nread - LENGTH ("-*-");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1810 p <= scan_end
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1811 && *p != '\n'
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1812 && *p != '\r';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1813 p++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1814 if (*p == '-' && *(p+1) == '*' && *(p+2) == '-')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1815 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1816 Extbyte *suffix = p;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1817 /* Look for "coding:" */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1818 for (p = local_vars_beg,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1819 scan_end = suffix - LENGTH ("coding:?");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1820 p <= scan_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1821 p++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1822 if (memcmp ("coding:", p, LENGTH ("coding:")) == 0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1823 && (p == local_vars_beg
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1824 || (*(p-1) == ' ' ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1825 *(p-1) == '\t' ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1826 *(p-1) == ';')))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1827 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1828 Extbyte save;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1829 int n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1830 p += LENGTH ("coding:");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1831 while (*p == ' ' || *p == '\t') p++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1832
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1833 /* Get coding system name */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1834 save = *suffix; *suffix = '\0';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1835 /* Characters valid in a MIME charset name (rfc 1521),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1836 and in a Lisp symbol name. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1837 n = strspn ( (char *) p,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1838 "ABCDEFGHIJKLMNOPQRSTUVWXYZ"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1839 "abcdefghijklmnopqrstuvwxyz"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1840 "0123456789"
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1841 "!$%&*+-.^_{|}~");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1842 *suffix = save;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1843 if (n > 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1844 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1845 save = p[n]; p[n] = '\0';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1846 coding_system =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1847 Ffind_coding_system (intern ((char *) p));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1848 p[n] = save;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1849 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1850 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1851 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1852 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1853 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1854 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1855 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1856
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1857 if (NILP (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1858 do
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1859 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1860 if (detect_coding_type (&decst, buf, nread,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1861 XCODING_SYSTEM_TYPE (*codesys_in_out)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1862 != CODESYS_AUTODETECT))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1863 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1864 nread = Lstream_read (stream, buf, sizeof (buf));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1865 if (nread == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1866 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1867 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1868 while (1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1869
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1870 else if (XCODING_SYSTEM_TYPE (*codesys_in_out) == CODESYS_AUTODETECT
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1871 && XCODING_SYSTEM_EOL_TYPE (coding_system) == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1872 do
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1873 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1874 if (detect_coding_type (&decst, buf, nread, 1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1875 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1876 nread = Lstream_read (stream, buf, sizeof (buf));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1877 if (!nread)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1878 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1879 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1880 while (1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1881
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1882 *eol_type_in_out = decst.eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1883 if (XCODING_SYSTEM_TYPE (*codesys_in_out) == CODESYS_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1884 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1885 if (NILP (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1886 *codesys_in_out = coding_system_from_mask (decst.mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1887 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1888 *codesys_in_out = coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1889 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1890 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1891
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1892 /* If we absolutely can't determine the EOL type, just assume LF. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1893 if (*eol_type_in_out == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1894 *eol_type_in_out = EOL_LF;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1895
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1896 Lstream_rewind (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1897 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1898
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1899 DEFUN ("detect-coding-region", Fdetect_coding_region, 2, 3, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1900 Detect coding system of the text in the region between START and END.
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1901 Return a list of possible coding systems ordered by priority.
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1902 If only ASCII characters are found, return 'undecided or one of
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1903 its subsidiary coding systems according to a detected end-of-line
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1904 type. Optional arg BUFFER defaults to the current buffer.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1905 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1906 (start, end, buffer))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1907 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1908 Lisp_Object val = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1909 struct buffer *buf = decode_buffer (buffer, 0);
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
1910 Charbpos b, e;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1911 Lisp_Object instream, lb_instream;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1912 Lstream *istr, *lb_istr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1913 struct detection_state decst;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1914 struct gcpro gcpro1, gcpro2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1915
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1916 get_buffer_range_char (buf, start, end, &b, &e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1917 lb_instream = make_lisp_buffer_input_stream (buf, b, e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1918 lb_istr = XLSTREAM (lb_instream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1919 instream = make_encoding_input_stream (lb_istr, Fget_coding_system (Qbinary));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1920 istr = XLSTREAM (instream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1921 GCPRO2 (instream, lb_instream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1922 xzero (decst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1923 decst.eol_type = EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1924 decst.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1925 while (1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1926 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
1927 Extbyte random_buffer[4096];
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
1928 Bytecount nread = Lstream_read (istr, random_buffer, sizeof (random_buffer));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1929
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1930 if (!nread)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1931 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1932 if (detect_coding_type (&decst, random_buffer, nread, 0))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1933 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1934 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1935
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1936 if (decst.mask == ~0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1937 val = subsidiary_coding_system (Fget_coding_system (Qundecided),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1938 decst.eol_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1939 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1940 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1941 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1942
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1943 val = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1944 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1945 decst.mask = postprocess_iso2022_mask (decst.mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1946 #endif
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
1947 for (i = CODING_CATEGORY_LAST - 1; i >= 0; i--)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1948 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1949 int sys = fcd->coding_category_by_priority[i];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1950 if (decst.mask & (1 << sys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1951 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1952 Lisp_Object codesys = fcd->coding_category_system[sys];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1953 if (!NILP (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1954 codesys = subsidiary_coding_system (codesys, decst.eol_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1955 val = Fcons (codesys, val);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1956 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1957 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1958 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1959 Lstream_close (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1960 UNGCPRO;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1961 Lstream_delete (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1962 Lstream_delete (lb_istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1963 return val;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1964 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1965
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1966
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1967 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1968 /* Converting to internal Mule format ("decoding") */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1969 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1970
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1971 /* A decoding stream is a stream used for decoding text (i.e.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1972 converting from some external format to internal format).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1973 The decoding-stream object keeps track of the actual coding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1974 stream, the stream that is at the other end, and data that
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1975 needs to be persistent across the lifetime of the stream. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1976
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1977 /* Handle the EOL stuff related to just-read-in character C.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1978 EOL_TYPE is the EOL type of the coding stream.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1979 FLAGS is the current value of FLAGS in the coding stream, and may
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1980 be modified by this macro. (The macro only looks at the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1981 CODING_STATE_CR flag.) DST is the Dynarr to which the decoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1982 bytes are to be written. You need to also define a local goto
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1983 label "label_continue_loop" that is at the end of the main
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1984 character-reading loop.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1985
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1986 If C is a CR character, then this macro handles it entirely and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1987 jumps to label_continue_loop. Otherwise, this macro does not add
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1988 anything to DST, and continues normally. You should continue
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1989 processing C normally after this macro. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1990
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1991 #define DECODE_HANDLE_EOL_TYPE(eol_type, c, flags, dst) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1992 do { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1993 if (c == '\r') \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1994 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1995 if (eol_type == EOL_CR) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1996 Dynarr_add (dst, '\n'); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1997 else if (eol_type != EOL_CRLF || flags & CODING_STATE_CR) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1998 Dynarr_add (dst, c); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1999 else \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2000 flags |= CODING_STATE_CR; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2001 goto label_continue_loop; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2002 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2003 else if (flags & CODING_STATE_CR) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2004 { /* eol_type == CODING_SYSTEM_EOL_CRLF */ \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2005 if (c != '\n') \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2006 Dynarr_add (dst, '\r'); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2007 flags &= ~CODING_STATE_CR; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2008 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2009 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2010
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2011 /* C should be a binary character in the range 0 - 255; convert
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2012 to internal format and add to Dynarr DST. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2013
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2014 #define DECODE_ADD_BINARY_CHAR(c, dst) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2015 do { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2016 if (BYTE_ASCII_P (c)) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2017 Dynarr_add (dst, c); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2018 else if (BYTE_C1_P (c)) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2019 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2020 Dynarr_add (dst, LEADING_BYTE_CONTROL_1); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2021 Dynarr_add (dst, c + 0x20); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2022 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2023 else \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2024 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2025 Dynarr_add (dst, LEADING_BYTE_LATIN_ISO8859_1); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2026 Dynarr_add (dst, c); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2027 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2028 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2029
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2030 #define DECODE_OUTPUT_PARTIAL_CHAR(ch) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2031 do { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2032 if (ch) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2033 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2034 DECODE_ADD_BINARY_CHAR (ch, dst); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2035 ch = 0; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2036 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2037 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2038
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2039 #define DECODE_HANDLE_END_OF_CONVERSION(flags, ch, dst) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2040 do { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2041 if (flags & CODING_STATE_END) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2042 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2043 DECODE_OUTPUT_PARTIAL_CHAR (ch); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2044 if (flags & CODING_STATE_CR) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2045 Dynarr_add (dst, '\r'); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2046 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2047 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2048
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2049 #define DECODING_STREAM_DATA(stream) LSTREAM_TYPE_DATA (stream, decoding)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2050
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2051 struct decoding_stream
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2052 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2053 /* Coding system that governs the conversion. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2054 Lisp_Coding_System *codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2055
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2056 /* Stream that we read the encoded data from or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2057 write the decoded data to. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2058 Lstream *other_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2059
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2060 /* If we are reading, then we can return only a fixed amount of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2061 data, so if the conversion resulted in too much data, we store it
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2062 here for retrieval the next time around. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2063 unsigned_char_dynarr *runoff;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2064
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2065 /* FLAGS holds flags indicating the current state of the decoding.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2066 Some of these flags are dependent on the coding system. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2067 unsigned int flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2068
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2069 /* CH holds a partially built-up character. Since we only deal
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2070 with one- and two-byte characters at the moment, we only use
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2071 this to store the first byte of a two-byte character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2072 unsigned int ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2073
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2074 /* EOL_TYPE specifies the type of end-of-line conversion that
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2075 currently applies. We need to keep this separate from the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2076 EOL type stored in CODESYS because the latter might indicate
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2077 automatic EOL-type detection while the former will always
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2078 indicate a particular EOL type. */
438
84b14dcb0985 Import from CVS: tag r21-2-27
cvs
parents: 432
diff changeset
2079 eol_type_t eol_type;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2080 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2081 /* Additional ISO2022 information. We define the structure above
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2082 because it's also needed by the detection routines. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2083 struct iso2022_decoder iso2022;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2084
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2085 /* Additional information (the state of the running CCL program)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2086 used by the CCL decoder. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2087 struct ccl_program ccl;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2088
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2089 /* counter for UTF-8 or UCS-4 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2090 unsigned char counter;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2091 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2092 struct detection_state decst;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2093 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2094
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2095 static Bytecount decoding_reader (Lstream *stream,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2096 unsigned char *data, Bytecount size);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2097 static Bytecount decoding_writer (Lstream *stream,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2098 const unsigned char *data, Bytecount size);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2099 static int decoding_rewinder (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2100 static int decoding_seekable_p (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2101 static int decoding_flusher (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2102 static int decoding_closer (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2103
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2104 static Lisp_Object decoding_marker (Lisp_Object stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2105
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2106 DEFINE_LSTREAM_IMPLEMENTATION ("decoding", lstream_decoding,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2107 sizeof (struct decoding_stream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2108
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2109 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2110 decoding_marker (Lisp_Object stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2111 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2112 Lstream *str = DECODING_STREAM_DATA (XLSTREAM (stream))->other_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2113 Lisp_Object str_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2114
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2115 /* We do not need to mark the coding systems or charsets stored
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2116 within the stream because they are stored in a global list
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2117 and automatically marked. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2118
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2119 XSETLSTREAM (str_obj, str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2120 mark_object (str_obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2121 if (str->imp->marker)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2122 return (str->imp->marker) (str_obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2123 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2124 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2125 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2126
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2127 /* Read SIZE bytes of data and store it into DATA. We are a decoding stream
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2128 so we read data from the other end, decode it, and store it into DATA. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2129
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2130 static Bytecount
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2131 decoding_reader (Lstream *stream, unsigned char *data, Bytecount size)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2132 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2133 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2134 unsigned char *orig_data = data;
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2135 Bytecount read_size;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2136 int error_occurred = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2137
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2138 /* We need to interface to mule_decode(), which expects to take some
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2139 amount of data and store the result into a Dynarr. We have
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2140 mule_decode() store into str->runoff, and take data from there
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2141 as necessary. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2142
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2143 /* We loop until we have enough data, reading chunks from the other
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2144 end and decoding it. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2145 while (1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2146 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2147 /* Take data from the runoff if we can. Make sure to take at
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2148 most SIZE bytes, and delete the data from the runoff. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2149 if (Dynarr_length (str->runoff) > 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2150 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2151 Bytecount chunk = min (size, (Bytecount) Dynarr_length (str->runoff));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2152 memcpy (data, Dynarr_atp (str->runoff, 0), chunk);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2153 Dynarr_delete_many (str->runoff, 0, chunk);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2154 data += chunk;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2155 size -= chunk;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2156 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2157
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2158 if (size == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2159 break; /* No more room for data */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2160
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2161 if (str->flags & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2162 /* This means that on the previous iteration, we hit the EOF on
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2163 the other end. We loop once more so that mule_decode() can
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2164 output any final stuff it may be holding, or any "go back
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2165 to a sane state" escape sequences. (This latter makes sense
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2166 during encoding.) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2167 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2168
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2169 /* Exhausted the runoff, so get some more. DATA has at least
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2170 SIZE bytes left of storage in it, so it's OK to read directly
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2171 into it. (We'll be overwriting above, after we've decoded it
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2172 into the runoff.) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2173 read_size = Lstream_read (str->other_end, data, size);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2174 if (read_size < 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2175 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2176 error_occurred = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2177 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2178 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2179 if (read_size == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2180 /* There might be some more end data produced in the translation.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2181 See the comment above. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2182 str->flags |= CODING_STATE_END;
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2183 mule_decode (stream, (Extbyte *) data, str->runoff, read_size);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2184 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2185
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2186 if (data - orig_data == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2187 return error_occurred ? -1 : 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2188 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2189 return data - orig_data;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2190 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2191
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2192 static Bytecount
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2193 decoding_writer (Lstream *stream, const unsigned char *data, Bytecount size)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2194 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2195 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2196 Bytecount retval;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2197
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2198 /* Decode all our data into the runoff, and then attempt to write
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2199 it all out to the other end. Remove whatever chunk we succeeded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2200 in writing. */
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2201 mule_decode (stream, (Extbyte *) data, str->runoff, size);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2202 retval = Lstream_write (str->other_end, Dynarr_atp (str->runoff, 0),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2203 Dynarr_length (str->runoff));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2204 if (retval > 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2205 Dynarr_delete_many (str->runoff, 0, retval);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2206 /* Do NOT return retval. The return value indicates how much
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2207 of the incoming data was written, not how many bytes were
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2208 written. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2209 return size;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2210 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2211
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2212 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2213 reset_decoding_stream (struct decoding_stream *str)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2214 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2215 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2216 if (CODING_SYSTEM_TYPE (str->codesys) == CODESYS_ISO2022)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2217 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2218 Lisp_Object coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2219 XSETCODING_SYSTEM (coding_system, str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2220 reset_iso2022 (coding_system, &str->iso2022);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2221 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2222 else if (CODING_SYSTEM_TYPE (str->codesys) == CODESYS_CCL)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2223 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2224 setup_ccl_program (&str->ccl, CODING_SYSTEM_CCL_DECODE (str->codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2225 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2226 str->counter = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2227 #endif /* MULE */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2228 if (CODING_SYSTEM_TYPE (str->codesys) == CODESYS_AUTODETECT
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2229 || CODING_SYSTEM_EOL_TYPE (str->codesys) == EOL_AUTODETECT)
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2230 {
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2231 xzero (str->decst);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2232 str->decst.eol_type = EOL_AUTODETECT;
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2233 str->decst.mask = ~0;
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2234 }
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2235 str->flags = str->ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2236 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2237
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2238 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2239 decoding_rewinder (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2240 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2241 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2242 reset_decoding_stream (str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2243 Dynarr_reset (str->runoff);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2244 return Lstream_rewind (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2245 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2246
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2247 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2248 decoding_seekable_p (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2249 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2250 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2251 return Lstream_seekable_p (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2252 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2253
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2254 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2255 decoding_flusher (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2256 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2257 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2258 return Lstream_flush (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2259 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2260
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2261 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2262 decoding_closer (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2263 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2264 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2265 if (stream->flags & LSTREAM_FL_WRITE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2266 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2267 str->flags |= CODING_STATE_END;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2268 decoding_writer (stream, 0, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2269 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2270 Dynarr_free (str->runoff);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2271 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2272 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2273 if (str->iso2022.composite_chars)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2274 Dynarr_free (str->iso2022.composite_chars);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2275 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2276 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2277 return Lstream_close (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2278 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2279
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2280 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2281 decoding_stream_coding_system (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2282 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2283 Lisp_Object coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2284 struct decoding_stream *str = DECODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2285
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2286 XSETCODING_SYSTEM (coding_system, str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2287 return subsidiary_coding_system (coding_system, str->eol_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2288 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2289
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2290 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2291 set_decoding_stream_coding_system (Lstream *lstr, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2292 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2293 Lisp_Coding_System *cs = XCODING_SYSTEM (codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2294 struct decoding_stream *str = DECODING_STREAM_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2295 str->codesys = cs;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2296 if (CODING_SYSTEM_EOL_TYPE (cs) != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2297 str->eol_type = CODING_SYSTEM_EOL_TYPE (cs);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2298 reset_decoding_stream (str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2299 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2300
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2301 /* WARNING WARNING WARNING WARNING!!!!! If you open up a decoding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2302 stream for writing, no automatic code detection will be performed.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2303 The reason for this is that automatic code detection requires a
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2304 seekable input. Things will also fail if you open a decoding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2305 stream for reading using a non-fully-specified coding system and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2306 a non-seekable input stream. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2307
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2308 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2309 make_decoding_stream_1 (Lstream *stream, Lisp_Object codesys,
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2310 const char *mode)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2311 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2312 Lstream *lstr = Lstream_new (lstream_decoding, mode);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2313 struct decoding_stream *str = DECODING_STREAM_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2314 Lisp_Object obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2315
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2316 xzero (*str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2317 str->other_end = stream;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2318 str->runoff = (unsigned_char_dynarr *) Dynarr_new (unsigned_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2319 str->eol_type = EOL_AUTODETECT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2320 if (!strcmp (mode, "r")
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2321 && Lstream_seekable_p (stream))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2322 /* We can determine the coding system now. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2323 determine_real_coding_system (stream, &codesys, &str->eol_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2324 set_decoding_stream_coding_system (lstr, codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2325 str->decst.eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2326 str->decst.mask = ~0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2327 XSETLSTREAM (obj, lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2328 return obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2329 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2330
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2331 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2332 make_decoding_input_stream (Lstream *stream, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2333 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2334 return make_decoding_stream_1 (stream, codesys, "r");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2335 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2336
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2337 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2338 make_decoding_output_stream (Lstream *stream, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2339 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2340 return make_decoding_stream_1 (stream, codesys, "w");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2341 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2342
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2343 /* Note: the decode_coding_* functions all take the same
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2344 arguments as mule_decode(), which is to say some SRC data of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2345 size N, which is to be stored into dynamic array DST.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2346 DECODING is the stream within which the decoding is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2347 taking place, but no data is actually read from or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2348 written to that stream; that is handled in decoding_reader()
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2349 or decoding_writer(). This allows the same functions to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2350 be used for both reading and writing. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2351
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2352 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2353 mule_decode (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2354 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2355 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2356 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2357
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2358 /* If necessary, do encoding-detection now. We do this when
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2359 we're a writing stream or a non-seekable reading stream,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2360 meaning that we can't just process the whole input,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2361 rewind, and start over. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2362
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2363 if (CODING_SYSTEM_TYPE (str->codesys) == CODESYS_AUTODETECT ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2364 str->eol_type == EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2365 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2366 Lisp_Object codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2367
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2368 XSETCODING_SYSTEM (codesys, str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2369 detect_coding_type (&str->decst, src, n,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2370 CODING_SYSTEM_TYPE (str->codesys) !=
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2371 CODESYS_AUTODETECT);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2372 if (CODING_SYSTEM_TYPE (str->codesys) == CODESYS_AUTODETECT &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2373 str->decst.mask != ~0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2374 /* #### This is cheesy. What we really ought to do is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2375 buffer up a certain amount of data so as to get a
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2376 less random result. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2377 codesys = coding_system_from_mask (str->decst.mask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2378 str->eol_type = str->decst.eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2379 if (XCODING_SYSTEM (codesys) != str->codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2380 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2381 /* Preserve the CODING_STATE_END flag in case it was set.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2382 If we erase it, bad things might happen. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2383 int was_end = str->flags & CODING_STATE_END;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2384 set_decoding_stream_coding_system (decoding, codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2385 if (was_end)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2386 str->flags |= CODING_STATE_END;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2387 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2388 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2389
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2390 switch (CODING_SYSTEM_TYPE (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2391 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2392 #ifdef DEBUG_XEMACS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2393 case CODESYS_INTERNAL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2394 Dynarr_add_many (dst, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2395 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2396 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2397 case CODESYS_AUTODETECT:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2398 /* If we got this far and still haven't decided on the coding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2399 system, then do no conversion. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2400 case CODESYS_NO_CONVERSION:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2401 decode_coding_no_conversion (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2402 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2403 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2404 case CODESYS_SHIFT_JIS:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2405 decode_coding_sjis (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2406 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2407 case CODESYS_BIG5:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2408 decode_coding_big5 (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2409 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2410 case CODESYS_UCS4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2411 decode_coding_ucs4 (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2412 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2413 case CODESYS_UTF8:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2414 decode_coding_utf8 (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2415 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2416 case CODESYS_CCL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2417 str->ccl.last_block = str->flags & CODING_STATE_END;
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2418 /* When applying ccl program to stream, MUST NOT set NULL
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2419 pointer to src. */
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2420 ccl_driver (&str->ccl, (src ? (unsigned char *)src : (unsigned char*)""),
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2421 dst, n, 0, CCL_MODE_DECODING);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2422 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2423 case CODESYS_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2424 decode_coding_iso2022 (decoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2425 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2426 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2427 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2428 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2429 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2430 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2431
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2432 DEFUN ("decode-coding-region", Fdecode_coding_region, 3, 4, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2433 Decode the text between START and END which is encoded in CODING-SYSTEM.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2434 This is useful if you've read in encoded text from a file without decoding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2435 it (e.g. you read in a JIS-formatted file but used the `binary' or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2436 `no-conversion' coding system, so that it shows up as "^[$B!<!+^[(B").
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2437 Return length of decoded text.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2438 BUFFER defaults to the current buffer if unspecified.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2439 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2440 (start, end, coding_system, buffer))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2441 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2442 Charbpos b, e;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2443 struct buffer *buf = decode_buffer (buffer, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2444 Lisp_Object instream, lb_outstream, de_outstream, outstream;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2445 Lstream *istr, *ostr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2446 struct gcpro gcpro1, gcpro2, gcpro3, gcpro4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2447
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2448 get_buffer_range_char (buf, start, end, &b, &e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2449
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2450 barf_if_buffer_read_only (buf, b, e);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2451
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2452 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2453 instream = make_lisp_buffer_input_stream (buf, b, e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2454 lb_outstream = make_lisp_buffer_output_stream (buf, b, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2455 de_outstream = make_decoding_output_stream (XLSTREAM (lb_outstream),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2456 coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2457 outstream = make_encoding_output_stream (XLSTREAM (de_outstream),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2458 Fget_coding_system (Qbinary));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2459 istr = XLSTREAM (instream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2460 ostr = XLSTREAM (outstream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2461 GCPRO4 (instream, lb_outstream, de_outstream, outstream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2462
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2463 /* The chain of streams looks like this:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2464
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2465 [BUFFER] <----- send through
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2466 ------> [ENCODE AS BINARY]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2467 ------> [DECODE AS SPECIFIED]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2468 ------> [BUFFER]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2469 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2470
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2471 while (1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2472 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2473 char tempbuf[1024]; /* some random amount */
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2474 Charbpos newpos, even_newer_pos;
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2475 Charbpos oldpos = lisp_buffer_stream_startpos (istr);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2476 Bytecount size_in_bytes = Lstream_read (istr, tempbuf, sizeof (tempbuf));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2477
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2478 if (!size_in_bytes)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2479 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2480 newpos = lisp_buffer_stream_startpos (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2481 Lstream_write (ostr, tempbuf, size_in_bytes);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2482 even_newer_pos = lisp_buffer_stream_startpos (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2483 buffer_delete_range (buf, even_newer_pos - (newpos - oldpos),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2484 even_newer_pos, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2485 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2486 Lstream_close (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2487 Lstream_close (ostr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2488 UNGCPRO;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2489 Lstream_delete (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2490 Lstream_delete (ostr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2491 Lstream_delete (XLSTREAM (de_outstream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2492 Lstream_delete (XLSTREAM (lb_outstream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2493 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2494 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2495
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2496
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2497 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2498 /* Converting to an external encoding ("encoding") */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2499 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2500
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2501 /* An encoding stream is an output stream. When you create the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2502 stream, you specify the coding system that governs the encoding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2503 and another stream that the resulting encoded data is to be
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2504 sent to, and then start sending data to it. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2505
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2506 #define ENCODING_STREAM_DATA(stream) LSTREAM_TYPE_DATA (stream, encoding)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2507
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2508 struct encoding_stream
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2509 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2510 /* Coding system that governs the conversion. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2511 Lisp_Coding_System *codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2512
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2513 /* Stream that we read the encoded data from or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2514 write the decoded data to. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2515 Lstream *other_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2516
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2517 /* If we are reading, then we can return only a fixed amount of
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2518 data, so if the conversion resulted in too much data, we store it
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2519 here for retrieval the next time around. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2520 unsigned_char_dynarr *runoff;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2521
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2522 /* FLAGS holds flags indicating the current state of the encoding.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2523 Some of these flags are dependent on the coding system. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2524 unsigned int flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2525
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2526 /* CH holds a partially built-up character. Since we only deal
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2527 with one- and two-byte characters at the moment, we only use
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2528 this to store the first byte of a two-byte character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2529 unsigned int ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2530 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2531 /* Additional information used by the ISO2022 encoder. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2532 struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2533 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2534 /* CHARSET holds the character sets currently assigned to the G0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2535 through G3 registers. It is initialized from the array
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2536 INITIAL_CHARSET in CODESYS. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2537 Lisp_Object charset[4];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2538
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2539 /* Which registers are currently invoked into the left (GL) and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2540 right (GR) halves of the 8-bit encoding space? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2541 int register_left, register_right;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2542
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2543 /* Whether we need to explicitly designate the charset in the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2544 G? register before using it. It is initialized from the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2545 array FORCE_CHARSET_ON_OUTPUT in CODESYS. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2546 unsigned char force_charset_on_output[4];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2547
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2548 /* Other state variables that need to be preserved across
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2549 invocations. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2550 Lisp_Object current_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2551 int current_half;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2552 int current_char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2553 } iso2022;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2554
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2555 /* Additional information (the state of the running CCL program)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2556 used by the CCL encoder. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2557 struct ccl_program ccl;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2558 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2559 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2560
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2561 static Bytecount encoding_reader (Lstream *stream, unsigned char *data, Bytecount size);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2562 static Bytecount encoding_writer (Lstream *stream, const unsigned char *data,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2563 Bytecount size);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2564 static int encoding_rewinder (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2565 static int encoding_seekable_p (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2566 static int encoding_flusher (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2567 static int encoding_closer (Lstream *stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2568
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2569 static Lisp_Object encoding_marker (Lisp_Object stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2570
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2571 DEFINE_LSTREAM_IMPLEMENTATION ("encoding", lstream_encoding,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2572 sizeof (struct encoding_stream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2573
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2574 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2575 encoding_marker (Lisp_Object stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2576 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2577 Lstream *str = ENCODING_STREAM_DATA (XLSTREAM (stream))->other_end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2578 Lisp_Object str_obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2579
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2580 /* We do not need to mark the coding systems or charsets stored
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2581 within the stream because they are stored in a global list
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2582 and automatically marked. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2583
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2584 XSETLSTREAM (str_obj, str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2585 mark_object (str_obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2586 if (str->imp->marker)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2587 return (str->imp->marker) (str_obj);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2588 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2589 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2590 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2591
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2592 /* Read SIZE bytes of data and store it into DATA. We are a encoding stream
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2593 so we read data from the other end, encode it, and store it into DATA. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2594
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2595 static Bytecount
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2596 encoding_reader (Lstream *stream, unsigned char *data, Bytecount size)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2597 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2598 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2599 unsigned char *orig_data = data;
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2600 Bytecount read_size;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2601 int error_occurred = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2602
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2603 /* We need to interface to mule_encode(), which expects to take some
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2604 amount of data and store the result into a Dynarr. We have
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2605 mule_encode() store into str->runoff, and take data from there
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2606 as necessary. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2607
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2608 /* We loop until we have enough data, reading chunks from the other
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2609 end and encoding it. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2610 while (1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2611 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2612 /* Take data from the runoff if we can. Make sure to take at
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2613 most SIZE bytes, and delete the data from the runoff. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2614 if (Dynarr_length (str->runoff) > 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2615 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2616 int chunk = min ((int) size, Dynarr_length (str->runoff));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2617 memcpy (data, Dynarr_atp (str->runoff, 0), chunk);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2618 Dynarr_delete_many (str->runoff, 0, chunk);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2619 data += chunk;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2620 size -= chunk;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2621 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2622
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2623 if (size == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2624 break; /* No more room for data */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2625
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2626 if (str->flags & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2627 /* This means that on the previous iteration, we hit the EOF on
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2628 the other end. We loop once more so that mule_encode() can
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2629 output any final stuff it may be holding, or any "go back
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2630 to a sane state" escape sequences. (This latter makes sense
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2631 during encoding.) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2632 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2633
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2634 /* Exhausted the runoff, so get some more. DATA at least SIZE bytes
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2635 left of storage in it, so it's OK to read directly into it.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2636 (We'll be overwriting above, after we've encoded it into the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2637 runoff.) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2638 read_size = Lstream_read (str->other_end, data, size);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2639 if (read_size < 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2640 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2641 error_occurred = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2642 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2643 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2644 if (read_size == 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2645 /* There might be some more end data produced in the translation.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2646 See the comment above. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2647 str->flags |= CODING_STATE_END;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2648 mule_encode (stream, data, str->runoff, read_size);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2649 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2650
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2651 if (data == orig_data)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2652 return error_occurred ? -1 : 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2653 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2654 return data - orig_data;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2655 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2656
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2657 static Bytecount
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2658 encoding_writer (Lstream *stream, const unsigned char *data, Bytecount size)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2659 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2660 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2661 Bytecount retval;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2662
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2663 /* Encode all our data into the runoff, and then attempt to write
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2664 it all out to the other end. Remove whatever chunk we succeeded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2665 in writing. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2666 mule_encode (stream, data, str->runoff, size);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2667 retval = Lstream_write (str->other_end, Dynarr_atp (str->runoff, 0),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2668 Dynarr_length (str->runoff));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2669 if (retval > 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2670 Dynarr_delete_many (str->runoff, 0, retval);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2671 /* Do NOT return retval. The return value indicates how much
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2672 of the incoming data was written, not how many bytes were
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2673 written. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2674 return size;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2675 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2676
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2677 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2678 reset_encoding_stream (struct encoding_stream *str)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2679 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2680 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2681 switch (CODING_SYSTEM_TYPE (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2682 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2683 case CODESYS_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2684 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2685 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2686
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2687 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2688 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2689 str->iso2022.charset[i] =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2690 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (str->codesys, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2691 str->iso2022.force_charset_on_output[i] =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2692 CODING_SYSTEM_ISO2022_FORCE_CHARSET_ON_OUTPUT (str->codesys, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2693 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2694 str->iso2022.register_left = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2695 str->iso2022.register_right = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2696 str->iso2022.current_charset = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2697 str->iso2022.current_half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2698 str->iso2022.current_char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2699 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2700 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2701 case CODESYS_CCL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2702 setup_ccl_program (&str->ccl, CODING_SYSTEM_CCL_ENCODE (str->codesys));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2703 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2704 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2705 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2706 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2707 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2708
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2709 str->flags = str->ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2710 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2711
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2712 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2713 encoding_rewinder (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2714 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2715 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2716 reset_encoding_stream (str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2717 Dynarr_reset (str->runoff);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2718 return Lstream_rewind (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2719 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2720
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2721 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2722 encoding_seekable_p (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2723 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2724 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2725 return Lstream_seekable_p (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2726 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2727
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2728 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2729 encoding_flusher (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2730 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2731 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2732 return Lstream_flush (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2733 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2734
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2735 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2736 encoding_closer (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2737 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2738 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2739 if (stream->flags & LSTREAM_FL_WRITE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2740 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2741 str->flags |= CODING_STATE_END;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2742 encoding_writer (stream, 0, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2743 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2744 Dynarr_free (str->runoff);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2745 return Lstream_close (str->other_end);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2746 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2747
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2748 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2749 encoding_stream_coding_system (Lstream *stream)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2750 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2751 Lisp_Object coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2752 struct encoding_stream *str = ENCODING_STREAM_DATA (stream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2753
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2754 XSETCODING_SYSTEM (coding_system, str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2755 return coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2756 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2757
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2758 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2759 set_encoding_stream_coding_system (Lstream *lstr, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2760 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2761 Lisp_Coding_System *cs = XCODING_SYSTEM (codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2762 struct encoding_stream *str = ENCODING_STREAM_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2763 str->codesys = cs;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2764 reset_encoding_stream (str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2765 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2766
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2767 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2768 make_encoding_stream_1 (Lstream *stream, Lisp_Object codesys,
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
2769 const char *mode)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2770 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2771 Lstream *lstr = Lstream_new (lstream_encoding, mode);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2772 struct encoding_stream *str = ENCODING_STREAM_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2773 Lisp_Object obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2774
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2775 xzero (*str);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2776 str->runoff = Dynarr_new (unsigned_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2777 str->other_end = stream;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2778 set_encoding_stream_coding_system (lstr, codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2779 XSETLSTREAM (obj, lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2780 return obj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2781 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2782
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2783 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2784 make_encoding_input_stream (Lstream *stream, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2785 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2786 return make_encoding_stream_1 (stream, codesys, "r");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2787 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2788
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2789 Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2790 make_encoding_output_stream (Lstream *stream, Lisp_Object codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2791 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2792 return make_encoding_stream_1 (stream, codesys, "w");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2793 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2794
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2795 /* Convert N bytes of internally-formatted data stored in SRC to an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2796 external format, according to the encoding stream ENCODING.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2797 Store the encoded data into DST. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2798
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2799 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2800 mule_encode (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2801 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2802 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2803 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2804
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2805 switch (CODING_SYSTEM_TYPE (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2806 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2807 #ifdef DEBUG_XEMACS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2808 case CODESYS_INTERNAL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2809 Dynarr_add_many (dst, src, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2810 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2811 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2812 case CODESYS_AUTODETECT:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2813 /* If we got this far and still haven't decided on the coding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2814 system, then do no conversion. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2815 case CODESYS_NO_CONVERSION:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2816 encode_coding_no_conversion (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2817 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2818 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2819 case CODESYS_SHIFT_JIS:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2820 encode_coding_sjis (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2821 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2822 case CODESYS_BIG5:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2823 encode_coding_big5 (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2824 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2825 case CODESYS_UCS4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2826 encode_coding_ucs4 (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2827 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2828 case CODESYS_UTF8:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2829 encode_coding_utf8 (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2830 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2831 case CODESYS_CCL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2832 str->ccl.last_block = str->flags & CODING_STATE_END;
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2833 /* When applying ccl program to stream, MUST NOT set NULL
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2834 pointer to src. */
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2835 ccl_driver (&str->ccl, ((src) ? src : (unsigned char*)""),
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2836 dst, n, 0, CCL_MODE_ENCODING);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2837 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2838 case CODESYS_ISO2022:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2839 encode_coding_iso2022 (encoding, src, dst, n);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2840 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2841 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2842 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2843 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2844 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2845 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2846
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2847 DEFUN ("encode-coding-region", Fencode_coding_region, 3, 4, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2848 Encode the text between START and END using CODING-SYSTEM.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2849 This will, for example, convert Japanese characters into stuff such as
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2850 "^[$B!<!+^[(B" if you use the JIS encoding. Return length of encoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2851 text. BUFFER defaults to the current buffer if unspecified.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2852 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2853 (start, end, coding_system, buffer))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2854 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2855 Charbpos b, e;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2856 struct buffer *buf = decode_buffer (buffer, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2857 Lisp_Object instream, lb_outstream, de_outstream, outstream;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2858 Lstream *istr, *ostr;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2859 struct gcpro gcpro1, gcpro2, gcpro3, gcpro4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2860
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2861 get_buffer_range_char (buf, start, end, &b, &e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2862
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2863 barf_if_buffer_read_only (buf, b, e);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2864
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2865 coding_system = Fget_coding_system (coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2866 instream = make_lisp_buffer_input_stream (buf, b, e, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2867 lb_outstream = make_lisp_buffer_output_stream (buf, b, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2868 de_outstream = make_decoding_output_stream (XLSTREAM (lb_outstream),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2869 Fget_coding_system (Qbinary));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2870 outstream = make_encoding_output_stream (XLSTREAM (de_outstream),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2871 coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2872 istr = XLSTREAM (instream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2873 ostr = XLSTREAM (outstream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2874 GCPRO4 (instream, outstream, de_outstream, lb_outstream);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2875 /* The chain of streams looks like this:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2876
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2877 [BUFFER] <----- send through
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2878 ------> [ENCODE AS SPECIFIED]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2879 ------> [DECODE AS BINARY]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2880 ------> [BUFFER]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2881 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2882 while (1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2883 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2884 char tempbuf[1024]; /* some random amount */
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2885 Charbpos newpos, even_newer_pos;
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2886 Charbpos oldpos = lisp_buffer_stream_startpos (istr);
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2887 Bytecount size_in_bytes = Lstream_read (istr, tempbuf, sizeof (tempbuf));
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2888
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2889 if (!size_in_bytes)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2890 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2891 newpos = lisp_buffer_stream_startpos (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2892 Lstream_write (ostr, tempbuf, size_in_bytes);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2893 even_newer_pos = lisp_buffer_stream_startpos (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2894 buffer_delete_range (buf, even_newer_pos - (newpos - oldpos),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2895 even_newer_pos, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2896 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2897
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2898 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2899 Charcount retlen =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2900 lisp_buffer_stream_startpos (XLSTREAM (instream)) - b;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2901 Lstream_close (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2902 Lstream_close (ostr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2903 UNGCPRO;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2904 Lstream_delete (istr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2905 Lstream_delete (ostr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2906 Lstream_delete (XLSTREAM (de_outstream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2907 Lstream_delete (XLSTREAM (lb_outstream));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2908 return make_int (retlen);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2909 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2910 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2911
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2912 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2913
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2914 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2915 /* Shift-JIS methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2916 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2917
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2918 /* Shift-JIS is a coding system encoding three character sets: ASCII, right
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2919 half of JISX0201-Kana, and JISX0208. An ASCII character is encoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2920 as is. A character of JISX0201-Kana (DIMENSION1_CHARS94 character set) is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2921 encoded by "position-code + 0x80". A character of JISX0208
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2922 (DIMENSION2_CHARS94 character set) is encoded in 2-byte but two
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2923 position-codes are divided and shifted so that it fit in the range
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2924 below.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2925
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2926 --- CODE RANGE of Shift-JIS ---
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2927 (character set) (range)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2928 ASCII 0x00 .. 0x7F
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2929 JISX0201-Kana 0xA0 .. 0xDF
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2930 JISX0208 (1st byte) 0x80 .. 0x9F and 0xE0 .. 0xEF
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2931 (2nd byte) 0x40 .. 0x7E and 0x80 .. 0xFC
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2932 -------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2933
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2934 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2935
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2936 /* Is this the first byte of a Shift-JIS two-byte char? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2937
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2938 #define BYTE_SJIS_TWO_BYTE_1_P(c) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2939 (((c) >= 0x81 && (c) <= 0x9F) || ((c) >= 0xE0 && (c) <= 0xEF))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2940
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2941 /* Is this the second byte of a Shift-JIS two-byte char? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2942
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2943 #define BYTE_SJIS_TWO_BYTE_2_P(c) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2944 (((c) >= 0x40 && (c) <= 0x7E) || ((c) >= 0x80 && (c) <= 0xFC))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2945
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2946 #define BYTE_SJIS_KATAKANA_P(c) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2947 ((c) >= 0xA1 && (c) <= 0xDF)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2948
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2949 static int
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2950 detect_coding_sjis (struct detection_state *st, const Extbyte *src, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2951 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2952 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2953 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2954 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2955 if (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2956 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2957 if (st->shift_jis.in_second_byte)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2958 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2959 st->shift_jis.in_second_byte = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2960 if (c < 0x40)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2961 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2962 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2963 else if ((c >= 0x80 && c < 0xA0) || c >= 0xE0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2964 st->shift_jis.in_second_byte = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2965 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2966 return CODING_CATEGORY_SHIFT_JIS_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2967 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2968
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2969 /* Convert Shift-JIS data to internal format. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2970
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2971 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2972 decode_coding_sjis (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
2973 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2974 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2975 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2976 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2977 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2978 eol_type_t eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2979
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2980 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2981 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
2982 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2983
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2984 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2985 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2986 /* Previous character was first byte of Shift-JIS Kanji char. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2987 if (BYTE_SJIS_TWO_BYTE_2_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2988 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2989 unsigned char e1, e2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2990
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2991 Dynarr_add (dst, LEADING_BYTE_JAPANESE_JISX0208);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2992 DECODE_SJIS (ch, c, e1, e2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2993 Dynarr_add (dst, e1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2994 Dynarr_add (dst, e2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2995 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2996 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2997 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2998 DECODE_ADD_BINARY_CHAR (ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
2999 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3000 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3001 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3002 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3003 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3004 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3005 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3006 if (BYTE_SJIS_TWO_BYTE_1_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3007 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3008 else if (BYTE_SJIS_KATAKANA_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3009 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3010 Dynarr_add (dst, LEADING_BYTE_KATAKANA_JISX0201);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3011 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3012 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3013 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3014 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3015 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3016 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3017 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3018
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3019 DECODE_HANDLE_END_OF_CONVERSION (flags, ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3020
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3021 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3022 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3023 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3024
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3025 /* Convert internally-formatted data to Shift-JIS. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3026
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3027 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3028 encode_coding_sjis (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3029 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3030 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3031 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3032 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3033 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3034 eol_type_t eol_type = CODING_SYSTEM_EOL_TYPE (str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3035
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3036 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3037 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3038 Intbyte c = *src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3039 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3040 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3041 if (eol_type != EOL_LF && eol_type != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3042 Dynarr_add (dst, '\r');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3043 if (eol_type != EOL_CR)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3044 Dynarr_add (dst, '\n');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3045 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3046 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3047 else if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3048 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3049 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3050 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3051 }
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3052 else if (INTBYTE_LEADING_BYTE_P (c))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3053 ch = (c == LEADING_BYTE_KATAKANA_JISX0201 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3054 c == LEADING_BYTE_JAPANESE_JISX0208_1978 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3055 c == LEADING_BYTE_JAPANESE_JISX0208) ? c : 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3056 else if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3057 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3058 if (ch == LEADING_BYTE_KATAKANA_JISX0201)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3059 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3060 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3061 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3062 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3063 else if (ch == LEADING_BYTE_JAPANESE_JISX0208_1978 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3064 ch == LEADING_BYTE_JAPANESE_JISX0208)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3065 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3066 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3067 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3068 unsigned char j1, j2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3069 ENCODE_SJIS (ch, c, j1, j2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3070 Dynarr_add (dst, j1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3071 Dynarr_add (dst, j2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3072 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3073 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3074 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3075 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3076
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3077 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3078 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3079 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3080
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3081 DEFUN ("decode-shift-jis-char", Fdecode_shift_jis_char, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3082 Decode a JISX0208 character of Shift-JIS coding-system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3083 CODE is the character code in Shift-JIS as a cons of type bytes.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3084 Return the corresponding character.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3085 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3086 (code))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3087 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3088 unsigned char c1, c2, s1, s2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3089
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3090 CHECK_CONS (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3091 CHECK_INT (XCAR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3092 CHECK_INT (XCDR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3093 s1 = XINT (XCAR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3094 s2 = XINT (XCDR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3095 if (BYTE_SJIS_TWO_BYTE_1_P (s1) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3096 BYTE_SJIS_TWO_BYTE_2_P (s2))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3097 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3098 DECODE_SJIS (s1, s2, c1, c2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3099 return make_char (MAKE_CHAR (Vcharset_japanese_jisx0208,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3100 c1 & 0x7F, c2 & 0x7F));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3101 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3102 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3103 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3104 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3105
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3106 DEFUN ("encode-shift-jis-char", Fencode_shift_jis_char, 1, 1, 0, /*
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3107 Encode a JISX0208 character CHARACTER to SHIFT-JIS coding-system.
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3108 Return the corresponding character code in SHIFT-JIS as a cons of two bytes.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3109 */
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3110 (character))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3111 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3112 Lisp_Object charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3113 int c1, c2, s1, s2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3114
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3115 CHECK_CHAR_COERCE_INT (character);
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3116 BREAKUP_CHAR (XCHAR (character), charset, c1, c2);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3117 if (EQ (charset, Vcharset_japanese_jisx0208))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3118 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3119 ENCODE_SJIS (c1 | 0x80, c2 | 0x80, s1, s2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3120 return Fcons (make_int (s1), make_int (s2));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3121 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3122 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3123 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3124 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3125
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3126
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3127 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3128 /* Big5 methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3129 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3130
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3131 /* BIG5 is a coding system encoding two character sets: ASCII and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3132 Big5. An ASCII character is encoded as is. Big5 is a two-byte
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3133 character set and is encoded in two-byte.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3134
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3135 --- CODE RANGE of BIG5 ---
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3136 (character set) (range)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3137 ASCII 0x00 .. 0x7F
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3138 Big5 (1st byte) 0xA1 .. 0xFE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3139 (2nd byte) 0x40 .. 0x7E and 0xA1 .. 0xFE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3140 --------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3141
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3142 Since the number of characters in Big5 is larger than maximum
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3143 characters in Emacs' charset (96x96), it can't be handled as one
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3144 charset. So, in Emacs, Big5 is divided into two: `charset-big5-1'
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3145 and `charset-big5-2'. Both <type>s are DIMENSION2_CHARS94. The former
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3146 contains frequently used characters and the latter contains less
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3147 frequently used characters. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3148
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3149 #define BYTE_BIG5_TWO_BYTE_1_P(c) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3150 ((c) >= 0xA1 && (c) <= 0xFE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3151
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3152 /* Is this the second byte of a Shift-JIS two-byte char? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3153
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3154 #define BYTE_BIG5_TWO_BYTE_2_P(c) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3155 (((c) >= 0x40 && (c) <= 0x7E) || ((c) >= 0xA1 && (c) <= 0xFE))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3156
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3157 /* Number of Big5 characters which have the same code in 1st byte. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3158
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3159 #define BIG5_SAME_ROW (0xFF - 0xA1 + 0x7F - 0x40)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3160
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3161 /* Code conversion macros. These are macros because they are used in
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3162 inner loops during code conversion.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3163
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3164 Note that temporary variables in macros introduce the classic
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3165 dynamic-scoping problems with variable names. We use capital-
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3166 lettered variables in the assumption that XEmacs does not use
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3167 capital letters in variables except in a very formalized way
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3168 (e.g. Qstring). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3169
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3170 /* Convert Big5 code (b1, b2) into its internal string representation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3171 (lb, c1, c2). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3172
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3173 /* There is a much simpler way to split the Big5 charset into two.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3174 For the moment I'm going to leave the algorithm as-is because it
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3175 claims to separate out the most-used characters into a single
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3176 charset, which perhaps will lead to optimizations in various
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3177 places.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3178
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3179 The way the algorithm works is something like this:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3180
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3181 Big5 can be viewed as a 94x157 charset, where the row is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3182 encoded into the bytes 0xA1 .. 0xFE and the column is encoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3183 into the bytes 0x40 .. 0x7E and 0xA1 .. 0xFE. As for frequency,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3184 the split between low and high column numbers is apparently
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3185 meaningless; ascending rows produce less and less frequent chars.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3186 Therefore, we assign the lower half of rows (0xA1 .. 0xC8) to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3187 the first charset, and the upper half (0xC9 .. 0xFE) to the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3188 second. To do the conversion, we convert the character into
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3189 a single number where 0 .. 156 is the first row, 157 .. 313
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3190 is the second, etc. That way, the characters are ordered by
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3191 decreasing frequency. Then we just chop the space in two
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3192 and coerce the result into a 94x94 space.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3193 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3194
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3195 #define DECODE_BIG5(b1, b2, lb, c1, c2) do \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3196 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3197 int B1 = b1, B2 = b2; \
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
3198 int I \
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3199 = (B1 - 0xA1) * BIG5_SAME_ROW + B2 - (B2 < 0x7F ? 0x40 : 0x62); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3200 \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3201 if (B1 < 0xC9) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3202 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3203 lb = LEADING_BYTE_CHINESE_BIG5_1; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3204 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3205 else \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3206 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3207 lb = LEADING_BYTE_CHINESE_BIG5_2; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3208 I -= (BIG5_SAME_ROW) * (0xC9 - 0xA1); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3209 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3210 c1 = I / (0xFF - 0xA1) + 0xA1; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3211 c2 = I % (0xFF - 0xA1) + 0xA1; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3212 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3213
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3214 /* Convert the internal string representation of a Big5 character
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3215 (lb, c1, c2) into Big5 code (b1, b2). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3216
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3217 #define ENCODE_BIG5(lb, c1, c2, b1, b2) do \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3218 { \
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
3219 int I = ((c1) - 0xA1) * (0xFF - 0xA1) + ((c2) - 0xA1); \
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3220 \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3221 if (lb == LEADING_BYTE_CHINESE_BIG5_2) \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3222 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3223 I += BIG5_SAME_ROW * (0xC9 - 0xA1); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3224 } \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3225 b1 = I / BIG5_SAME_ROW + 0xA1; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3226 b2 = I % BIG5_SAME_ROW; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3227 b2 += b2 < 0x3F ? 0x40 : 0x62; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3228 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3229
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3230 static int
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3231 detect_coding_big5 (struct detection_state *st, const Extbyte *src, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3232 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3233 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3234 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3235 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3236 if (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3237 (c >= 0x80 && c <= 0xA0))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3238 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3239 if (st->big5.in_second_byte)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3240 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3241 st->big5.in_second_byte = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3242 if (c < 0x40 || (c >= 0x80 && c <= 0xA0))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3243 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3244 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3245 else if (c >= 0xA1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3246 st->big5.in_second_byte = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3247 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3248 return CODING_CATEGORY_BIG5_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3249 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3250
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3251 /* Convert Big5 data to internal format. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3252
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3253 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3254 decode_coding_big5 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3255 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3256 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3257 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3258 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3259 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3260 eol_type_t eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3261
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3262 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3263 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3264 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3265 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3266 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3267 /* Previous character was first byte of Big5 char. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3268 if (BYTE_BIG5_TWO_BYTE_2_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3269 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3270 unsigned char b1, b2, b3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3271 DECODE_BIG5 (ch, c, b1, b2, b3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3272 Dynarr_add (dst, b1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3273 Dynarr_add (dst, b2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3274 Dynarr_add (dst, b3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3275 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3276 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3277 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3278 DECODE_ADD_BINARY_CHAR (ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3279 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3280 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3281 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3282 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3283 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3284 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3285 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3286 if (BYTE_BIG5_TWO_BYTE_1_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3287 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3288 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3289 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3290 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3291 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3292 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3293
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3294 DECODE_HANDLE_END_OF_CONVERSION (flags, ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3295
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3296 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3297 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3298 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3299
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3300 /* Convert internally-formatted data to Big5. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3301
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3302 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3303 encode_coding_big5 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3304 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3305 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3306 unsigned char c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3307 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3308 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3309 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3310 eol_type_t eol_type = CODING_SYSTEM_EOL_TYPE (str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3311
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3312 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3313 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3314 c = *src++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3315 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3316 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3317 if (eol_type != EOL_LF && eol_type != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3318 Dynarr_add (dst, '\r');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3319 if (eol_type != EOL_CR)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3320 Dynarr_add (dst, '\n');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3321 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3322 else if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3323 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3324 /* ASCII. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3325 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3326 }
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3327 else if (INTBYTE_LEADING_BYTE_P (c))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3328 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3329 if (c == LEADING_BYTE_CHINESE_BIG5_1 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3330 c == LEADING_BYTE_CHINESE_BIG5_2)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3331 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3332 /* A recognized leading byte. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3333 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3334 continue; /* not done with this character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3335 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3336 /* otherwise just ignore this character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3337 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3338 else if (ch == LEADING_BYTE_CHINESE_BIG5_1 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3339 ch == LEADING_BYTE_CHINESE_BIG5_2)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3340 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3341 /* Previous char was a recognized leading byte. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3342 ch = (ch << 8) | c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3343 continue; /* not done with this character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3344 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3345 else if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3346 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3347 /* Encountering second byte of a Big5 character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3348 unsigned char b1, b2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3349
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3350 ENCODE_BIG5 (ch >> 8, ch & 0xFF, c, b1, b2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3351 Dynarr_add (dst, b1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3352 Dynarr_add (dst, b2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3353 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3354
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3355 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3356 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3357
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3358 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3359 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3360 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3361
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3362
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3363 DEFUN ("decode-big5-char", Fdecode_big5_char, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3364 Decode a Big5 character CODE of BIG5 coding-system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3365 CODE is the character code in BIG5, a cons of two integers.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3366 Return the corresponding character.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3367 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3368 (code))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3369 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3370 unsigned char c1, c2, b1, b2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3371
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3372 CHECK_CONS (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3373 CHECK_INT (XCAR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3374 CHECK_INT (XCDR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3375 b1 = XINT (XCAR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3376 b2 = XINT (XCDR (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3377 if (BYTE_BIG5_TWO_BYTE_1_P (b1) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3378 BYTE_BIG5_TWO_BYTE_2_P (b2))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3379 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3380 int leading_byte;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3381 Lisp_Object charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3382 DECODE_BIG5 (b1, b2, leading_byte, c1, c2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3383 charset = CHARSET_BY_LEADING_BYTE (leading_byte);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3384 return make_char (MAKE_CHAR (charset, c1 & 0x7F, c2 & 0x7F));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3385 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3386 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3387 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3388 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3389
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3390 DEFUN ("encode-big5-char", Fencode_big5_char, 1, 1, 0, /*
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3391 Encode the Big5 character CHARACTER in the BIG5 coding-system.
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3392 Return the corresponding character code in Big5.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3393 */
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3394 (character))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3395 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3396 Lisp_Object charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3397 int c1, c2, b1, b2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3398
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3399 CHECK_CHAR_COERCE_INT (character);
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3400 BREAKUP_CHAR (XCHAR (character), charset, c1, c2);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3401 if (EQ (charset, Vcharset_chinese_big5_1) ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3402 EQ (charset, Vcharset_chinese_big5_2))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3403 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3404 ENCODE_BIG5 (XCHARSET_LEADING_BYTE (charset), c1 | 0x80, c2 | 0x80,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3405 b1, b2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3406 return Fcons (make_int (b1), make_int (b2));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3407 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3408 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3409 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3410 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3411
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3412
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3413 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3414 /* UCS-4 methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3415 /* */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3416 /* UCS-4 character codes are implemented as nonnegative integers. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3417 /* */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3418 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3419
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3420
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3421 DEFUN ("set-ucs-char", Fset_ucs_char, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3422 Map UCS-4 code CODE to Mule character CHARACTER.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3423
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3424 Return T on success, NIL on failure.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3425 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3426 (code, character))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3427 {
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
3428 EMACS_INT c;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3429
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3430 CHECK_CHAR (character);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
3431 CHECK_NATNUM (code);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3432 c = XINT (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3433
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
3434 if (c < countof (fcd->ucs_to_mule_table))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3435 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3436 fcd->ucs_to_mule_table[c] = character;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3437 return Qt;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3438 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3439 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3440 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3441 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3442
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3443 static Lisp_Object
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3444 ucs_to_char (unsigned long code)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3445 {
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
3446 if (code < countof (fcd->ucs_to_mule_table))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3447 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3448 return fcd->ucs_to_mule_table[code];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3449 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3450 else if ((0xe00000 <= code) && (code <= 0xe00000 + 94 * 94 * 14))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3451 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3452 unsigned int c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3453
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3454 code -= 0xe00000;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3455 c = code % (94 * 94);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3456 return make_char
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3457 (MAKE_CHAR (CHARSET_BY_ATTRIBUTES
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3458 (CHARSET_TYPE_94X94, code / (94 * 94) + '@',
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3459 CHARSET_LEFT_TO_RIGHT),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3460 c / 94 + 33, c % 94 + 33));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3461 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3462 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3463 return Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3464 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3465
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3466 DEFUN ("ucs-char", Fucs_char, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3467 Return Mule character corresponding to UCS code CODE (a positive integer).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3468 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3469 (code))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3470 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3471 CHECK_NATNUM (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3472 return ucs_to_char (XINT (code));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3473 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3474
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3475 DEFUN ("set-char-ucs", Fset_char_ucs, 2, 2, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3476 Map Mule character CHARACTER to UCS code CODE (a positive integer).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3477 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3478 (character, code))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3479 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3480 /* #### Isn't this gilding the lily? Fput_char_table checks its args.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3481 Fset_char_ucs is more restrictive on index arg, but should
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3482 check code arg in a char_table method. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3483 CHECK_CHAR (character);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3484 CHECK_NATNUM (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3485 return Fput_char_table (character, code, mule_to_ucs_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3486 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3487
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3488 DEFUN ("char-ucs", Fchar_ucs, 1, 1, 0, /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3489 Return the UCS code (a positive integer) corresponding to CHARACTER.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3490 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3491 (character))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3492 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3493 return Fget_char_table (character, mule_to_ucs_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3494 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3495
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3496 /* Decode a UCS-4 character into a buffer. If the lookup fails, use
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3497 <GETA MARK> (U+3013) of JIS X 0208, which means correct character
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3498 is not found, instead.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3499 #### do something more appropriate (use blob?)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3500 Danger, Will Robinson! Data loss. Should we signal user? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3501 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3502 decode_ucs4 (unsigned long ch, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3503 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3504 Lisp_Object chr = ucs_to_char (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3505
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3506 if (! NILP (chr))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3507 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3508 Intbyte work[MAX_EMCHAR_LEN];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3509 int len;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3510
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3511 ch = XCHAR (chr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3512 len = (ch < 128) ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3513 simple_set_charptr_emchar (work, ch) :
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3514 non_ascii_set_charptr_emchar (work, ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3515 Dynarr_add_many (dst, work, len);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3516 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3517 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3518 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3519 Dynarr_add (dst, LEADING_BYTE_JAPANESE_JISX0208);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3520 Dynarr_add (dst, 34 + 128);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3521 Dynarr_add (dst, 46 + 128);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3522 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3523 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3524
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3525 static unsigned long
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3526 mule_char_to_ucs4 (Lisp_Object charset,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3527 unsigned char h, unsigned char l)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3528 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3529 Lisp_Object code
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3530 = Fget_char_table (make_char (MAKE_CHAR (charset, h & 127, l & 127)),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3531 mule_to_ucs_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3532
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3533 if (INTP (code))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3534 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3535 return XINT (code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3536 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3537 else if ( (XCHARSET_DIMENSION (charset) == 2) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3538 (XCHARSET_CHARS (charset) == 94) )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3539 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3540 unsigned char final = XCHARSET_FINAL (charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3541
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3542 if ( ('@' <= final) && (final < 0x7f) )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3543 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3544 return 0xe00000 + (final - '@') * 94 * 94
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3545 + ((h & 127) - 33) * 94 + (l & 127) - 33;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3546 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3547 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3548 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3549 return '?';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3550 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3551 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3552 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3553 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3554 return '?';
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3555 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3556 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3557
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3558 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3559 encode_ucs4 (Lisp_Object charset,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3560 unsigned char h, unsigned char l, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3561 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3562 unsigned long code = mule_char_to_ucs4 (charset, h, l);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3563 Dynarr_add (dst, code >> 24);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3564 Dynarr_add (dst, (code >> 16) & 255);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3565 Dynarr_add (dst, (code >> 8) & 255);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3566 Dynarr_add (dst, code & 255);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3567 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3568
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3569 static int
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3570 detect_coding_ucs4 (struct detection_state *st, const Extbyte *src, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3571 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3572 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3573 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3574 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3575 switch (st->ucs4.in_byte)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3576 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3577 case 0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3578 if (c >= 128)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3579 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3580 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3581 st->ucs4.in_byte++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3582 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3583 case 3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3584 st->ucs4.in_byte = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3585 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3586 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3587 st->ucs4.in_byte++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3588 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3589 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3590 return CODING_CATEGORY_UCS4_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3591 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3592
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3593 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3594 decode_coding_ucs4 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3595 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3596 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3597 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3598 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3599 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3600 unsigned char counter = str->counter;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3601
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3602 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3603 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3604 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3605 switch (counter)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3606 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3607 case 0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3608 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3609 counter = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3610 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3611 case 1:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3612 decode_ucs4 ( ( ch << 8 ) | c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3613 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3614 counter = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3615 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3616 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3617 ch = ( ch << 8 ) | c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3618 counter--;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3619 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3620 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3621 if (counter & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3622 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3623
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3624 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3625 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3626 str->counter = counter;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3627 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3628
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3629 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3630 encode_coding_ucs4 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3631 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3632 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3633 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3634 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3635 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3636 unsigned char char_boundary = str->iso2022.current_char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3637 Lisp_Object charset = str->iso2022.current_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3638
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3639 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3640 /* flags for handling composite chars. We do a little switcharoo
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3641 on the source while we're outputting the composite char. */
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3642 Bytecount saved_n = 0;
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
3643 const unsigned char *saved_src = NULL;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3644 int in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3645
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3646 back_to_square_n:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3647 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3648
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3649 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3650 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3651 unsigned char c = *src++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3652
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3653 if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3654 { /* Processing ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3655 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3656 encode_ucs4 (Vcharset_ascii, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3657 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3658 }
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3659 else if (INTBYTE_LEADING_BYTE_P (c) || INTBYTE_LEADING_BYTE_P (ch))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3660 { /* Processing Leading Byte */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3661 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3662 charset = CHARSET_BY_LEADING_BYTE (c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3663 if (LEADING_BYTE_PREFIX_P(c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3664 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3665 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3666 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3667 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3668 { /* Processing Non-ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3669 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3670 if (EQ (charset, Vcharset_control_1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3671 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3672 encode_ucs4 (Vcharset_control_1, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3673 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3674 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3675 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3676 switch (XCHARSET_REP_BYTES (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3677 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3678 case 2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3679 encode_ucs4 (charset, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3680 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3681 case 3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3682 if (XCHARSET_PRIVATE_P (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3683 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3684 encode_ucs4 (charset, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3685 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3686 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3687 else if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3688 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3689 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3690 if (EQ (charset, Vcharset_composite))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3691 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3692 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3693 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3694 /* #### Bother! We don't know how to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3695 handle this yet. */
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3696 Dynarr_add (dst, '\0');
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3697 Dynarr_add (dst, '\0');
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3698 Dynarr_add (dst, '\0');
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3699 Dynarr_add (dst, '~');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3700 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3701 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3702 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3703 Emchar emch = MAKE_CHAR (Vcharset_composite,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3704 ch & 0x7F, c & 0x7F);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3705 Lisp_Object lstr = composite_char_string (emch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3706 saved_n = n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3707 saved_src = src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3708 in_composite = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3709 src = XSTRING_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3710 n = XSTRING_LENGTH (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3711 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3712 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3713 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3714 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3715 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3716 encode_ucs4(charset, ch, c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3717 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3718 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3719 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3720 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3721 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3722 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3723 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3724 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3725 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3726 case 4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3727 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3728 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3729 encode_ucs4 (charset, ch, c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3730 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3731 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3732 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3733 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3734 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3735 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3736 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3737 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3738 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3739 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3740 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3741 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3742 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3743 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3744
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3745 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3746 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3747 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3748 n = saved_n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3749 src = saved_src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3750 in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3751 goto back_to_square_n; /* Wheeeeeeeee ..... */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3752 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3753 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3754
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3755 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3756 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3757 str->iso2022.current_char_boundary = char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3758 str->iso2022.current_charset = charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3759
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3760 /* Verbum caro factum est! */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3761 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3762
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3763
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3764 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3765 /* UTF-8 methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3766 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3767
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3768 static int
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3769 detect_coding_utf8 (struct detection_state *st, const Extbyte *src, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3770 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3771 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3772 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3773 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3774 switch (st->utf8.in_byte)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3775 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3776 case 0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3777 if (c == ISO_CODE_ESC || c == ISO_CODE_SI || c == ISO_CODE_SO)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3778 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3779 else if (c >= 0xfc)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3780 st->utf8.in_byte = 5;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3781 else if (c >= 0xf8)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3782 st->utf8.in_byte = 4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3783 else if (c >= 0xf0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3784 st->utf8.in_byte = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3785 else if (c >= 0xe0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3786 st->utf8.in_byte = 2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3787 else if (c >= 0xc0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3788 st->utf8.in_byte = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3789 else if (c >= 0x80)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3790 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3791 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3792 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3793 if ((c & 0xc0) != 0x80)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3794 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3795 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3796 st->utf8.in_byte--;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3797 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3798 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3799 return CODING_CATEGORY_UTF8_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3800 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3801
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3802 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3803 decode_coding_utf8 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3804 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3805 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3806 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3807 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3808 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3809 eol_type_t eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3810 unsigned char counter = str->counter;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3811
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3812 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3813 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
3814 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3815 switch (counter)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3816 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3817 case 0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3818 if ( c >= 0xfc )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3819 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3820 ch = c & 0x01;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3821 counter = 5;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3822 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3823 else if ( c >= 0xf8 )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3824 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3825 ch = c & 0x03;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3826 counter = 4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3827 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3828 else if ( c >= 0xf0 )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3829 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3830 ch = c & 0x07;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3831 counter = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3832 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3833 else if ( c >= 0xe0 )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3834 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3835 ch = c & 0x0f;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3836 counter = 2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3837 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3838 else if ( c >= 0xc0 )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3839 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3840 ch = c & 0x1f;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3841 counter = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3842 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3843 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3844 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3845 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3846 decode_ucs4 (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3847 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3848 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3849 case 1:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3850 ch = ( ch << 6 ) | ( c & 0x3f );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3851 decode_ucs4 (ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3852 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3853 counter = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3854 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3855 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3856 ch = ( ch << 6 ) | ( c & 0x3f );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3857 counter--;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3858 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3859 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3860 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3861
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3862 if (flags & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3863 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3864
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3865 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3866 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3867 str->counter = counter;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3868 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3869
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3870 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3871 encode_utf8 (Lisp_Object charset,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3872 unsigned char h, unsigned char l, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3873 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3874 unsigned long code = mule_char_to_ucs4 (charset, h, l);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3875 if ( code <= 0x7f )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3876 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3877 Dynarr_add (dst, code);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3878 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3879 else if ( code <= 0x7ff )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3880 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3881 Dynarr_add (dst, (code >> 6) | 0xc0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3882 Dynarr_add (dst, (code & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3883 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3884 else if ( code <= 0xffff )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3885 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3886 Dynarr_add (dst, (code >> 12) | 0xe0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3887 Dynarr_add (dst, ((code >> 6) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3888 Dynarr_add (dst, (code & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3889 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3890 else if ( code <= 0x1fffff )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3891 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3892 Dynarr_add (dst, (code >> 18) | 0xf0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3893 Dynarr_add (dst, ((code >> 12) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3894 Dynarr_add (dst, ((code >> 6) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3895 Dynarr_add (dst, (code & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3896 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3897 else if ( code <= 0x3ffffff )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3898 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3899 Dynarr_add (dst, (code >> 24) | 0xf8);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3900 Dynarr_add (dst, ((code >> 18) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3901 Dynarr_add (dst, ((code >> 12) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3902 Dynarr_add (dst, ((code >> 6) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3903 Dynarr_add (dst, (code & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3904 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3905 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3906 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3907 Dynarr_add (dst, (code >> 30) | 0xfc);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3908 Dynarr_add (dst, ((code >> 24) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3909 Dynarr_add (dst, ((code >> 18) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3910 Dynarr_add (dst, ((code >> 12) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3911 Dynarr_add (dst, ((code >> 6) & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3912 Dynarr_add (dst, (code & 0x3f) | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3913 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3914 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3915
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3916 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3917 encode_coding_utf8 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3918 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3919 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3920 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3921 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3922 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3923 eol_type_t eol_type = CODING_SYSTEM_EOL_TYPE (str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3924 unsigned char char_boundary = str->iso2022.current_char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3925 Lisp_Object charset = str->iso2022.current_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3926
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3927 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3928 /* flags for handling composite chars. We do a little switcharoo
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3929 on the source while we're outputting the composite char. */
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3930 Bytecount saved_n = 0;
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
3931 const unsigned char *saved_src = NULL;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3932 int in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3933
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3934 back_to_square_n:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3935 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3936
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3937 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3938 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3939 unsigned char c = *src++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3940
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3941 if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3942 { /* Processing ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3943 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3944 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3945 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3946 if (eol_type != EOL_LF && eol_type != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3947 Dynarr_add (dst, '\r');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3948 if (eol_type != EOL_CR)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3949 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3950 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3951 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3952 encode_utf8 (Vcharset_ascii, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3953 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3954 }
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
3955 else if (INTBYTE_LEADING_BYTE_P (c) || INTBYTE_LEADING_BYTE_P (ch))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3956 { /* Processing Leading Byte */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3957 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3958 charset = CHARSET_BY_LEADING_BYTE (c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3959 if (LEADING_BYTE_PREFIX_P(c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3960 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3961 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3962 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3963 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3964 { /* Processing Non-ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3965 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3966 if (EQ (charset, Vcharset_control_1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3967 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3968 encode_utf8 (Vcharset_control_1, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3969 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3970 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3971 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3972 switch (XCHARSET_REP_BYTES (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3973 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3974 case 2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3975 encode_utf8 (charset, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3976 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3977 case 3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3978 if (XCHARSET_PRIVATE_P (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3979 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3980 encode_utf8 (charset, c, 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3981 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3982 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3983 else if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3984 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3985 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3986 if (EQ (charset, Vcharset_composite))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3987 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3988 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3989 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3990 /* #### Bother! We don't know how to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3991 handle this yet. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3992 encode_utf8 (Vcharset_ascii, '~', 0, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3993 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3994 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3995 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3996 Emchar emch = MAKE_CHAR (Vcharset_composite,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3997 ch & 0x7F, c & 0x7F);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3998 Lisp_Object lstr = composite_char_string (emch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
3999 saved_n = n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4000 saved_src = src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4001 in_composite = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4002 src = XSTRING_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4003 n = XSTRING_LENGTH (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4004 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4005 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4006 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4007 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4008 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4009 encode_utf8 (charset, ch, c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4010 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4011 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4012 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4013 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4014 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4015 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4016 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4017 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4018 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4019 case 4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4020 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4021 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4022 encode_utf8 (charset, ch, c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4023 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4024 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4025 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4026 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4027 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4028 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4029 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4030 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4031 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4032 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4033 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4034 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4035 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4036 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4037
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4038 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4039 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4040 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4041 n = saved_n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4042 src = saved_src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4043 in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4044 goto back_to_square_n; /* Wheeeeeeeee ..... */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4045 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4046 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4047
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4048 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4049 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4050 str->iso2022.current_char_boundary = char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4051 str->iso2022.current_charset = charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4052
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4053 /* Verbum caro factum est! */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4054 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4055
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4056
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4057 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4058 /* ISO2022 methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4059 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4060
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4061 /* The following note describes the coding system ISO2022 briefly.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4062 Since the intention of this note is to help understand the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4063 functions in this file, some parts are NOT ACCURATE or OVERLY
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4064 SIMPLIFIED. For thorough understanding, please refer to the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4065 original document of ISO2022.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4066
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4067 ISO2022 provides many mechanisms to encode several character sets
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4068 in 7-bit and 8-bit environments. For 7-bit environments, all text
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4069 is encoded using bytes less than 128. This may make the encoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4070 text a little bit longer, but the text passes more easily through
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4071 several gateways, some of which strip off MSB (Most Signigant Bit).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4072
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4073 There are two kinds of character sets: control character set and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4074 graphic character set. The former contains control characters such
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4075 as `newline' and `escape' to provide control functions (control
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4076 functions are also provided by escape sequences). The latter
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4077 contains graphic characters such as 'A' and '-'. Emacs recognizes
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4078 two control character sets and many graphic character sets.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4079
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4080 Graphic character sets are classified into one of the following
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4081 four classes, according to the number of bytes (DIMENSION) and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4082 number of characters in one dimension (CHARS) of the set:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4083 - DIMENSION1_CHARS94
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4084 - DIMENSION1_CHARS96
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4085 - DIMENSION2_CHARS94
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4086 - DIMENSION2_CHARS96
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4087
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4088 In addition, each character set is assigned an identification tag,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4089 unique for each set, called "final character" (denoted as <F>
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4090 hereafter). The <F> of each character set is decided by ECMA(*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4091 when it is registered in ISO. The code range of <F> is 0x30..0x7F
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4092 (0x30..0x3F are for private use only).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4093
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4094 Note (*): ECMA = European Computer Manufacturers Association
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4095
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4096 Here are examples of graphic character set [NAME(<F>)]:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4097 o DIMENSION1_CHARS94 -- ASCII('B'), right-half-of-JISX0201('I'), ...
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4098 o DIMENSION1_CHARS96 -- right-half-of-ISO8859-1('A'), ...
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4099 o DIMENSION2_CHARS94 -- GB2312('A'), JISX0208('B'), ...
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4100 o DIMENSION2_CHARS96 -- none for the moment
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4101
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4102 A code area (1 byte = 8 bits) is divided into 4 areas, C0, GL, C1, and GR.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4103 C0 [0x00..0x1F] -- control character plane 0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4104 GL [0x20..0x7F] -- graphic character plane 0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4105 C1 [0x80..0x9F] -- control character plane 1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4106 GR [0xA0..0xFF] -- graphic character plane 1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4107
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4108 A control character set is directly designated and invoked to C0 or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4109 C1 by an escape sequence. The most common case is that:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4110 - ISO646's control character set is designated/invoked to C0, and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4111 - ISO6429's control character set is designated/invoked to C1,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4112 and usually these designations/invocations are omitted in encoded
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4113 text. In a 7-bit environment, only C0 can be used, and a control
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4114 character for C1 is encoded by an appropriate escape sequence to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4115 fit into the environment. All control characters for C1 are
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4116 defined to have corresponding escape sequences.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4117
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4118 A graphic character set is at first designated to one of four
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4119 graphic registers (G0 through G3), then these graphic registers are
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4120 invoked to GL or GR. These designations and invocations can be
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4121 done independently. The most common case is that G0 is invoked to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4122 GL, G1 is invoked to GR, and ASCII is designated to G0. Usually
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4123 these invocations and designations are omitted in encoded text.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4124 In a 7-bit environment, only GL can be used.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4125
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4126 When a graphic character set of CHARS94 is invoked to GL, codes
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4127 0x20 and 0x7F of the GL area work as control characters SPACE and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4128 DEL respectively, and codes 0xA0 and 0xFF of the GR area should not
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4129 be used.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4130
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4131 There are two ways of invocation: locking-shift and single-shift.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4132 With locking-shift, the invocation lasts until the next different
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4133 invocation, whereas with single-shift, the invocation affects the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4134 following character only and doesn't affect the locking-shift
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4135 state. Invocations are done by the following control characters or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4136 escape sequences:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4137
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4138 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4139 abbrev function cntrl escape seq description
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4140 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4141 SI/LS0 (shift-in) 0x0F none invoke G0 into GL
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4142 SO/LS1 (shift-out) 0x0E none invoke G1 into GL
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4143 LS2 (locking-shift-2) none ESC 'n' invoke G2 into GL
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4144 LS3 (locking-shift-3) none ESC 'o' invoke G3 into GL
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4145 LS1R (locking-shift-1 right) none ESC '~' invoke G1 into GR (*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4146 LS2R (locking-shift-2 right) none ESC '}' invoke G2 into GR (*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4147 LS3R (locking-shift 3 right) none ESC '|' invoke G3 into GR (*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4148 SS2 (single-shift-2) 0x8E ESC 'N' invoke G2 for one char
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4149 SS3 (single-shift-3) 0x8F ESC 'O' invoke G3 for one char
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4150 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4151 (*) These are not used by any known coding system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4152
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4153 Control characters for these functions are defined by macros
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4154 ISO_CODE_XXX in `coding.h'.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4155
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4156 Designations are done by the following escape sequences:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4157 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4158 escape sequence description
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4159 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4160 ESC '(' <F> designate DIMENSION1_CHARS94<F> to G0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4161 ESC ')' <F> designate DIMENSION1_CHARS94<F> to G1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4162 ESC '*' <F> designate DIMENSION1_CHARS94<F> to G2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4163 ESC '+' <F> designate DIMENSION1_CHARS94<F> to G3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4164 ESC ',' <F> designate DIMENSION1_CHARS96<F> to G0 (*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4165 ESC '-' <F> designate DIMENSION1_CHARS96<F> to G1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4166 ESC '.' <F> designate DIMENSION1_CHARS96<F> to G2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4167 ESC '/' <F> designate DIMENSION1_CHARS96<F> to G3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4168 ESC '$' '(' <F> designate DIMENSION2_CHARS94<F> to G0 (**)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4169 ESC '$' ')' <F> designate DIMENSION2_CHARS94<F> to G1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4170 ESC '$' '*' <F> designate DIMENSION2_CHARS94<F> to G2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4171 ESC '$' '+' <F> designate DIMENSION2_CHARS94<F> to G3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4172 ESC '$' ',' <F> designate DIMENSION2_CHARS96<F> to G0 (*)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4173 ESC '$' '-' <F> designate DIMENSION2_CHARS96<F> to G1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4174 ESC '$' '.' <F> designate DIMENSION2_CHARS96<F> to G2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4175 ESC '$' '/' <F> designate DIMENSION2_CHARS96<F> to G3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4176 ----------------------------------------------------------------------
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4177
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4178 In this list, "DIMENSION1_CHARS94<F>" means a graphic character set
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4179 of dimension 1, chars 94, and final character <F>, etc...
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4180
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4181 Note (*): Although these designations are not allowed in ISO2022,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4182 Emacs accepts them on decoding, and produces them on encoding
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4183 CHARS96 character sets in a coding system which is characterized as
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4184 7-bit environment, non-locking-shift, and non-single-shift.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4185
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4186 Note (**): If <F> is '@', 'A', or 'B', the intermediate character
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4187 '(' can be omitted. We refer to this as "short-form" hereafter.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4188
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4189 Now you may notice that there are a lot of ways for encoding the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4190 same multilingual text in ISO2022. Actually, there exist many
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4191 coding systems such as Compound Text (used in X11's inter client
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4192 communication, ISO-2022-JP (used in Japanese internet), ISO-2022-KR
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4193 (used in Korean internet), EUC (Extended UNIX Code, used in Asian
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4194 localized platforms), and all of these are variants of ISO2022.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4195
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4196 In addition to the above, Emacs handles two more kinds of escape
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4197 sequences: ISO6429's direction specification and Emacs' private
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4198 sequence for specifying character composition.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4199
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4200 ISO6429's direction specification takes the following form:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4201 o CSI ']' -- end of the current direction
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4202 o CSI '0' ']' -- end of the current direction
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4203 o CSI '1' ']' -- start of left-to-right text
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4204 o CSI '2' ']' -- start of right-to-left text
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4205 The control character CSI (0x9B: control sequence introducer) is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4206 abbreviated to the escape sequence ESC '[' in a 7-bit environment.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4207
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4208 Character composition specification takes the following form:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4209 o ESC '0' -- start character composition
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4210 o ESC '1' -- end character composition
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4211 Since these are not standard escape sequences of any ISO standard,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4212 their use with these meanings is restricted to Emacs only. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4213
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4214 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4215 reset_iso2022 (Lisp_Object coding_system, struct iso2022_decoder *iso)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4216 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4217 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4218
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4219 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4220 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4221 if (!NILP (coding_system))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4222 iso->charset[i] =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4223 XCODING_SYSTEM_ISO2022_INITIAL_CHARSET (coding_system, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4224 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4225 iso->charset[i] = Qt;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4226 iso->invalid_designated[i] = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4227 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4228 iso->esc = ISO_ESC_NOTHING;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4229 iso->esc_bytes_index = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4230 iso->register_left = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4231 iso->register_right = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4232 iso->switched_dir_and_no_valid_charset_yet = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4233 iso->invalid_switch_dir = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4234 iso->output_direction_sequence = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4235 iso->output_literally = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4236 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4237 if (iso->composite_chars)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4238 Dynarr_reset (iso->composite_chars);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4239 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4240 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4241
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4242 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4243 fit_to_be_escape_quoted (unsigned char c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4244 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4245 switch (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4246 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4247 case ISO_CODE_ESC:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4248 case ISO_CODE_CSI:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4249 case ISO_CODE_SS2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4250 case ISO_CODE_SS3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4251 case ISO_CODE_SO:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4252 case ISO_CODE_SI:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4253 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4254
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4255 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4256 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4257 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4258 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4259
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4260 /* Parse one byte of an ISO2022 escape sequence.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4261 If the result is an invalid escape sequence, return 0 and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4262 do not change anything in STR. Otherwise, if the result is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4263 an incomplete escape sequence, update ISO2022.ESC and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4264 ISO2022.ESC_BYTES and return -1. Otherwise, update
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4265 all the state variables (but not ISO2022.ESC_BYTES) and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4266 return 1.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4267
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4268 If CHECK_INVALID_CHARSETS is non-zero, check for designation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4269 or invocation of an invalid character set and treat that as
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4270 an unrecognized escape sequence.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4271
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4272 ********************************************************************
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4273
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4274 #### Strategies for error annotation and coding orthogonalization
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4275
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4276 We really want to separate out a number of things. Conceptually,
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4277 there is a nested syntax.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4278
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4279 At the top level is the ISO 2022 extension syntax, including charset
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4280 designation and invocation, and certain auxiliary controls such as the
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4281 ISO 6429 direction specification. These are octet-oriented, with the
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4282 single exception (AFAIK) of the "exit Unicode" sequence which uses the
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4283 UTF's natural width (1 byte for UTF-7 and UTF-8, 2 bytes for UCS-2 and
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4284 UTF-16, and 4 bytes for UCS-4 and UTF-32). This will be treated as a
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4285 (deprecated) special case in Unicode processing.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4286
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4287 The middle layer is ISO 2022 character interpretation. This will depend
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4288 on the current state of the ISO 2022 registers, and assembles octets
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4289 into the character's internal representation.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4290
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4291 The lowest level is translating system control conventions. At present
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4292 this is restricted to newline translation, but one could imagine doing
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4293 tab conversion or line wrapping here. "Escape from Unicode" processing
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4294 would be done at this level.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4295
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4296 At each level the parser will verify the syntax. In the case of a
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4297 syntax error or warning (such as a redundant escape sequence that affects
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4298 no characters), the parser will take some action, typically inserting the
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4299 erroneous octets directly into the output and creating an annotation
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4300 which can be used by higher level I/O to mark the affected region.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4301
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4302 This should make it possible to do something sensible about separating
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4303 newline convention processing from character construction, and about
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4304 preventing ISO 2022 escape sequences from being recognized
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4305 inappropriately.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4306
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4307 The basic strategy will be to have octet classification tables, and
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4308 switch processing according to the table entry.
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4309
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4310 It's possible that, by doing the processing with tables of functions or
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
4311 the like, the parser can be used for both detection and translation. */
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4312
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4313 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4314 parse_iso2022_esc (Lisp_Object codesys, struct iso2022_decoder *iso,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4315 unsigned char c, unsigned int *flags,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4316 int check_invalid_charsets)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4317 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4318 /* (1) If we're at the end of a designation sequence, CS is the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4319 charset being designated and REG is the register to designate
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4320 it to.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4321
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4322 (2) If we're at the end of a locking-shift sequence, REG is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4323 the register to invoke and HALF (0 == left, 1 == right) is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4324 the half to invoke it into.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4325
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4326 (3) If we're at the end of a single-shift sequence, REG is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4327 the register to invoke. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4328 Lisp_Object cs = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4329 int reg, half;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4330
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4331 /* NOTE: This code does goto's all over the fucking place.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4332 The reason for this is that we're basically implementing
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4333 a state machine here, and hierarchical languages like C
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4334 don't really provide a clean way of doing this. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4335
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4336 if (! (*flags & CODING_STATE_ESCAPE))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4337 /* At beginning of escape sequence; we need to reset our
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4338 escape-state variables. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4339 iso->esc = ISO_ESC_NOTHING;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4340
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4341 iso->output_literally = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4342 iso->output_direction_sequence = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4343
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4344 switch (iso->esc)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4345 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4346 case ISO_ESC_NOTHING:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4347 iso->esc_bytes_index = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4348 switch (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4349 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4350 case ISO_CODE_ESC: /* Start escape sequence */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4351 *flags |= CODING_STATE_ESCAPE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4352 iso->esc = ISO_ESC;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4353 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4354
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4355 case ISO_CODE_CSI: /* ISO6429 (specifying directionality) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4356 *flags |= CODING_STATE_ESCAPE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4357 iso->esc = ISO_ESC_5_11;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4358 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4359
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4360 case ISO_CODE_SO: /* locking shift 1 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4361 reg = 1; half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4362 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4363 case ISO_CODE_SI: /* locking shift 0 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4364 reg = 0; half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4365 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4366
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4367 case ISO_CODE_SS2: /* single shift */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4368 reg = 2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4369 goto single_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4370 case ISO_CODE_SS3: /* single shift */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4371 reg = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4372 goto single_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4373
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4374 default: /* Other control characters */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4375 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4376 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4377
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4378 case ISO_ESC:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4379 switch (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4380 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4381 /**** single shift ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4382
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4383 case 'N': /* single shift 2 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4384 reg = 2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4385 goto single_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4386 case 'O': /* single shift 3 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4387 reg = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4388 goto single_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4389
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4390 /**** locking shift ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4391
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4392 case '~': /* locking shift 1 right */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4393 reg = 1; half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4394 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4395 case 'n': /* locking shift 2 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4396 reg = 2; half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4397 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4398 case '}': /* locking shift 2 right */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4399 reg = 2; half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4400 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4401 case 'o': /* locking shift 3 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4402 reg = 3; half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4403 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4404 case '|': /* locking shift 3 right */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4405 reg = 3; half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4406 goto locking_shift;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4407
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4408 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4409 /**** composite ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4410
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4411 case '0':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4412 iso->esc = ISO_ESC_START_COMPOSITE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4413 *flags = (*flags & CODING_STATE_ISO2022_LOCK) |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4414 CODING_STATE_COMPOSITE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4415 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4416
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4417 case '1':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4418 iso->esc = ISO_ESC_END_COMPOSITE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4419 *flags = (*flags & CODING_STATE_ISO2022_LOCK) &
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4420 ~CODING_STATE_COMPOSITE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4421 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4422 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4423
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4424 /**** directionality ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4425
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4426 case '[':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4427 iso->esc = ISO_ESC_5_11;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4428 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4429
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4430 /**** designation ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4431
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4432 case '$': /* multibyte charset prefix */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4433 iso->esc = ISO_ESC_2_4;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4434 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4435
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4436 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4437 if (0x28 <= c && c <= 0x2F)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4438 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4439 iso->esc = (enum iso_esc_flag) (c - 0x28 + ISO_ESC_2_8);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4440 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4441 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4442
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4443 /* This function is called with CODESYS equal to nil when
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4444 doing coding-system detection. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4445 if (!NILP (codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4446 && XCODING_SYSTEM_ISO2022_ESCAPE_QUOTED (codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4447 && fit_to_be_escape_quoted (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4448 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4449 iso->esc = ISO_ESC_LITERAL;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4450 *flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4451 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4452 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4453
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4454 /* bzzzt! */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4455 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4456 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4457
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4458
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4459
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4460 /**** directionality ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4461
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4462 case ISO_ESC_5_11: /* ISO6429 direction control */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4463 if (c == ']')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4464 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4465 *flags &= (CODING_STATE_ISO2022_LOCK & ~CODING_STATE_R2L);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4466 goto directionality;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4467 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4468 if (c == '0') iso->esc = ISO_ESC_5_11_0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4469 else if (c == '1') iso->esc = ISO_ESC_5_11_1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4470 else if (c == '2') iso->esc = ISO_ESC_5_11_2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4471 else return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4472 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4473
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4474 case ISO_ESC_5_11_0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4475 if (c == ']')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4476 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4477 *flags &= (CODING_STATE_ISO2022_LOCK & ~CODING_STATE_R2L);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4478 goto directionality;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4479 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4480 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4481
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4482 case ISO_ESC_5_11_1:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4483 if (c == ']')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4484 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4485 *flags = (CODING_STATE_ISO2022_LOCK & ~CODING_STATE_R2L);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4486 goto directionality;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4487 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4488 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4489
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4490 case ISO_ESC_5_11_2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4491 if (c == ']')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4492 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4493 *flags = (*flags & CODING_STATE_ISO2022_LOCK) | CODING_STATE_R2L;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4494 goto directionality;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4495 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4496 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4497
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4498 directionality:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4499 iso->esc = ISO_ESC_DIRECTIONALITY;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4500 /* Various junk here to attempt to preserve the direction sequences
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4501 literally in the text if they would otherwise be swallowed due
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4502 to invalid designations that don't show up as actual charset
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4503 changes in the text. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4504 if (iso->invalid_switch_dir)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4505 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4506 /* We already inserted a direction switch literally into the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4507 text. We assume (#### this may not be right) that the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4508 next direction switch is the one going the other way,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4509 and we need to output that literally as well. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4510 iso->output_literally = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4511 iso->invalid_switch_dir = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4512 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4513 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4514 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4515 int jj;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4516
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4517 /* If we are in the thrall of an invalid designation,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4518 then stick the directionality sequence literally into the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4519 output stream so it ends up in the original text again. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4520 for (jj = 0; jj < 4; jj++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4521 if (iso->invalid_designated[jj])
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4522 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4523 if (jj < 4)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4524 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4525 iso->output_literally = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4526 iso->invalid_switch_dir = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4527 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4528 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4529 /* Indicate that we haven't yet seen a valid designation,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4530 so that if a switch-dir is directly followed by an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4531 invalid designation, both get inserted literally. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4532 iso->switched_dir_and_no_valid_charset_yet = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4533 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4534 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4535
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4536
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4537 /**** designation ****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4538
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4539 case ISO_ESC_2_4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4540 if (0x28 <= c && c <= 0x2F)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4541 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4542 iso->esc = (enum iso_esc_flag) (c - 0x28 + ISO_ESC_2_4_8);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4543 goto not_done;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4544 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4545 if (0x40 <= c && c <= 0x42)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4546 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4547 cs = CHARSET_BY_ATTRIBUTES (CHARSET_TYPE_94X94, c,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4548 *flags & CODING_STATE_R2L ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4549 CHARSET_RIGHT_TO_LEFT :
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4550 CHARSET_LEFT_TO_RIGHT);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4551 reg = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4552 goto designated;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4553 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4554 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4555
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4556 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4557 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4558 int type =-1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4559
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4560 if (c < '0' || c > '~')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4561 return 0; /* bad final byte */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4562
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4563 if (iso->esc >= ISO_ESC_2_8 &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4564 iso->esc <= ISO_ESC_2_15)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4565 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4566 type = ((iso->esc >= ISO_ESC_2_12) ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4567 CHARSET_TYPE_96 : CHARSET_TYPE_94);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4568 reg = (iso->esc - ISO_ESC_2_8) & 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4569 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4570 else if (iso->esc >= ISO_ESC_2_4_8 &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4571 iso->esc <= ISO_ESC_2_4_15)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4572 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4573 type = ((iso->esc >= ISO_ESC_2_4_12) ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4574 CHARSET_TYPE_96X96 : CHARSET_TYPE_94X94);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4575 reg = (iso->esc - ISO_ESC_2_4_8) & 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4576 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4577 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4578 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4579 /* Can this ever be reached? -slb */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4580 abort();
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
4581 return 0;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4582 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4583
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4584 cs = CHARSET_BY_ATTRIBUTES (type, c,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4585 *flags & CODING_STATE_R2L ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4586 CHARSET_RIGHT_TO_LEFT :
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4587 CHARSET_LEFT_TO_RIGHT);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4588 goto designated;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4589 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4590 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4591
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4592 not_done:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4593 iso->esc_bytes[iso->esc_bytes_index++] = (unsigned char) c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4594 return -1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4595
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4596 single_shift:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4597 if (check_invalid_charsets && !CHARSETP (iso->charset[reg]))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4598 /* can't invoke something that ain't there. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4599 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4600 iso->esc = ISO_ESC_SINGLE_SHIFT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4601 *flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4602 if (reg == 2)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4603 *flags |= CODING_STATE_SS2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4604 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4605 *flags |= CODING_STATE_SS3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4606 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4607
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4608 locking_shift:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4609 if (check_invalid_charsets &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4610 !CHARSETP (iso->charset[reg]))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4611 /* can't invoke something that ain't there. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4612 return 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4613 if (half)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4614 iso->register_right = reg;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4615 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4616 iso->register_left = reg;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4617 *flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4618 iso->esc = ISO_ESC_LOCKING_SHIFT;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4619 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4620
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4621 designated:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4622 if (NILP (cs) && check_invalid_charsets)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4623 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4624 iso->invalid_designated[reg] = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4625 iso->charset[reg] = Vcharset_ascii;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4626 iso->esc = ISO_ESC_DESIGNATE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4627 *flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4628 iso->output_literally = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4629 if (iso->switched_dir_and_no_valid_charset_yet)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4630 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4631 /* We encountered a switch-direction followed by an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4632 invalid designation. Ensure that the switch-direction
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4633 gets outputted; otherwise it will probably get eaten
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4634 when the text is written out again. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4635 iso->switched_dir_and_no_valid_charset_yet = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4636 iso->output_direction_sequence = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4637 /* And make sure that the switch-dir going the other
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4638 way gets outputted, as well. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4639 iso->invalid_switch_dir = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4640 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4641 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4642 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4643 /* This function is called with CODESYS equal to nil when
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4644 doing coding-system detection. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4645 if (!NILP (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4646 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4647 charset_conversion_spec_dynarr *dyn =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4648 XCODING_SYSTEM (codesys)->iso2022.input_conv;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4649
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4650 if (dyn)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4651 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4652 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4653
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4654 for (i = 0; i < Dynarr_length (dyn); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4655 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4656 struct charset_conversion_spec *spec = Dynarr_atp (dyn, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4657 if (EQ (cs, spec->from_charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4658 cs = spec->to_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4659 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4660 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4661 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4662
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4663 iso->charset[reg] = cs;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4664 iso->esc = ISO_ESC_DESIGNATE;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4665 *flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4666 if (iso->invalid_designated[reg])
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4667 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4668 iso->invalid_designated[reg] = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4669 iso->output_literally = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4670 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4671 if (iso->switched_dir_and_no_valid_charset_yet)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4672 iso->switched_dir_and_no_valid_charset_yet = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4673 return 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4674 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4675
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4676 static int
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
4677 detect_coding_iso2022 (struct detection_state *st, const Extbyte *src, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4678 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4679 int mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4680
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4681 /* #### There are serious deficiencies in the recognition mechanism
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4682 here. This needs to be much smarter if it's going to cut it.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4683 The sequence "\xff\x0f" is currently detected as LOCK_SHIFT while
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4684 it should be detected as Latin-1.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4685 All the ISO2022 stuff in this file should be synced up with the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4686 code from FSF Emacs-20.4, in which Mule should be more or less stable.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4687 Perhaps we should wait till R2L works in FSF Emacs? */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4688
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4689 if (!st->iso2022.initted)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4690 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4691 reset_iso2022 (Qnil, &st->iso2022.iso);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4692 st->iso2022.mask = (CODING_CATEGORY_ISO_7_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4693 CODING_CATEGORY_ISO_8_DESIGNATE_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4694 CODING_CATEGORY_ISO_8_1_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4695 CODING_CATEGORY_ISO_8_2_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4696 CODING_CATEGORY_ISO_LOCK_SHIFT_MASK);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4697 st->iso2022.flags = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4698 st->iso2022.high_byte_count = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4699 st->iso2022.saw_single_shift = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4700 st->iso2022.initted = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4701 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4702
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4703 mask = st->iso2022.mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4704
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4705 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4706 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
4707 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4708 if (c >= 0xA0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4709 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4710 mask &= ~CODING_CATEGORY_ISO_7_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4711 st->iso2022.high_byte_count++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4712 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4713 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4714 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4715 if (st->iso2022.high_byte_count && !st->iso2022.saw_single_shift)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4716 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4717 if (st->iso2022.high_byte_count & 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4718 /* odd number of high bytes; assume not iso-8-2 */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4719 mask &= ~CODING_CATEGORY_ISO_8_2_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4720 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4721 st->iso2022.high_byte_count = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4722 st->iso2022.saw_single_shift = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4723 if (c > 0x80)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4724 mask &= ~CODING_CATEGORY_ISO_7_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4725 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4726 if (!(st->iso2022.flags & CODING_STATE_ESCAPE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4727 && (BYTE_C0_P (c) || BYTE_C1_P (c)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4728 { /* control chars */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4729 switch (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4730 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4731 /* Allow and ignore control characters that you might
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4732 reasonably see in a text file */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4733 case '\r':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4734 case '\n':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4735 case '\t':
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4736 case 7: /* bell */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4737 case 8: /* backspace */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4738 case 11: /* vertical tab */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4739 case 12: /* form feed */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4740 case 26: /* MS-DOS C-z junk */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4741 case 31: /* '^_' -- for info */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4742 goto label_continue_loop;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4743
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4744 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4745 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4746 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4747 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4748
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4749 if ((st->iso2022.flags & CODING_STATE_ESCAPE) || BYTE_C0_P (c)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4750 || BYTE_C1_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4751 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4752 if (parse_iso2022_esc (Qnil, &st->iso2022.iso, c,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4753 &st->iso2022.flags, 0))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4754 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4755 switch (st->iso2022.iso.esc)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4756 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4757 case ISO_ESC_DESIGNATE:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4758 mask &= ~CODING_CATEGORY_ISO_8_1_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4759 mask &= ~CODING_CATEGORY_ISO_8_2_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4760 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4761 case ISO_ESC_LOCKING_SHIFT:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4762 mask = CODING_CATEGORY_ISO_LOCK_SHIFT_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4763 goto ran_out_of_chars;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4764 case ISO_ESC_SINGLE_SHIFT:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4765 mask &= ~CODING_CATEGORY_ISO_8_DESIGNATE_MASK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4766 st->iso2022.saw_single_shift = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4767 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4768 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4769 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4770 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4771 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4772 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4773 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4774 mask = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4775 goto ran_out_of_chars;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4776 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4777 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4778 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4779 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4780
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4781 ran_out_of_chars:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4782
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4783 return mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4784 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4785
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4786 static int
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4787 postprocess_iso2022_mask (int mask)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4788 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4789 /* #### kind of cheesy */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4790 /* If seven-bit ISO is allowed, then assume that the encoding is
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4791 entirely seven-bit and turn off the eight-bit ones. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4792 if (mask & CODING_CATEGORY_ISO_7_MASK)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4793 mask &= ~ (CODING_CATEGORY_ISO_8_DESIGNATE_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4794 CODING_CATEGORY_ISO_8_1_MASK |
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4795 CODING_CATEGORY_ISO_8_2_MASK);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4796 return mask;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4797 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4798
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4799 /* If FLAGS is a null pointer or specifies right-to-left motion,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4800 output a switch-dir-to-left-to-right sequence to DST.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4801 Also update FLAGS if it is not a null pointer.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4802 If INTERNAL_P is set, we are outputting in internal format and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4803 need to handle the CSI differently. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4804
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4805 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4806 restore_left_to_right_direction (Lisp_Coding_System *codesys,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4807 unsigned_char_dynarr *dst,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4808 unsigned int *flags,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4809 int internal_p)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4810 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4811 if (!flags || (*flags & CODING_STATE_R2L))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4812 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4813 if (CODING_SYSTEM_ISO2022_SEVEN (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4814 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4815 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4816 Dynarr_add (dst, '[');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4817 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4818 else if (internal_p)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4819 DECODE_ADD_BINARY_CHAR (ISO_CODE_CSI, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4820 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4821 Dynarr_add (dst, ISO_CODE_CSI);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4822 Dynarr_add (dst, '0');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4823 Dynarr_add (dst, ']');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4824 if (flags)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4825 *flags &= ~CODING_STATE_R2L;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4826 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4827 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4828
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4829 /* If FLAGS is a null pointer or specifies a direction different from
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4830 DIRECTION (which should be either CHARSET_RIGHT_TO_LEFT or
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4831 CHARSET_LEFT_TO_RIGHT), output the appropriate switch-dir escape
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4832 sequence to DST. Also update FLAGS if it is not a null pointer.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4833 If INTERNAL_P is set, we are outputting in internal format and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4834 need to handle the CSI differently. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4835
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4836 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4837 ensure_correct_direction (int direction, Lisp_Coding_System *codesys,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4838 unsigned_char_dynarr *dst, unsigned int *flags,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4839 int internal_p)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4840 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4841 if ((!flags || (*flags & CODING_STATE_R2L)) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4842 direction == CHARSET_LEFT_TO_RIGHT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4843 restore_left_to_right_direction (codesys, dst, flags, internal_p);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4844 else if (!CODING_SYSTEM_ISO2022_NO_ISO6429 (codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4845 && (!flags || !(*flags & CODING_STATE_R2L)) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4846 direction == CHARSET_RIGHT_TO_LEFT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4847 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4848 if (CODING_SYSTEM_ISO2022_SEVEN (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4849 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4850 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4851 Dynarr_add (dst, '[');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4852 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4853 else if (internal_p)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4854 DECODE_ADD_BINARY_CHAR (ISO_CODE_CSI, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4855 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4856 Dynarr_add (dst, ISO_CODE_CSI);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4857 Dynarr_add (dst, '2');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4858 Dynarr_add (dst, ']');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4859 if (flags)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4860 *flags |= CODING_STATE_R2L;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4861 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4862 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4863
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4864 /* Convert ISO2022-format data to internal format. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4865
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4866 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
4867 decode_coding_iso2022 (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
4868 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4869 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4870 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4871 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4872 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4873 eol_type_t eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4874 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4875 unsigned_char_dynarr *real_dst = dst;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4876 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4877 Lisp_Object coding_system;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4878
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4879 XSETCODING_SYSTEM (coding_system, str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4880
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4881 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4882 if (flags & CODING_STATE_COMPOSITE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4883 dst = str->iso2022.composite_chars;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4884 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4885
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4886 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4887 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
4888 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4889 if (flags & CODING_STATE_ESCAPE)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4890 { /* Within ESC sequence */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4891 int retval = parse_iso2022_esc (coding_system, &str->iso2022,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4892 c, &flags, 1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4893
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4894 if (retval)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4895 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4896 switch (str->iso2022.esc)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4897 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4898 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4899 case ISO_ESC_START_COMPOSITE:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4900 if (str->iso2022.composite_chars)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4901 Dynarr_reset (str->iso2022.composite_chars);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4902 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4903 str->iso2022.composite_chars = Dynarr_new (unsigned_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4904 dst = str->iso2022.composite_chars;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4905 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4906 case ISO_ESC_END_COMPOSITE:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4907 {
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
4908 Intbyte comstr[MAX_EMCHAR_LEN];
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4909 Bytecount len;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4910 Emchar emch = lookup_composite_char (Dynarr_atp (dst, 0),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4911 Dynarr_length (dst));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4912 dst = real_dst;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4913 len = set_charptr_emchar (comstr, emch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4914 Dynarr_add_many (dst, comstr, len);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4915 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4916 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4917 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4918
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4919 case ISO_ESC_LITERAL:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4920 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4921 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4922
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4923 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4924 /* Everything else handled already */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4925 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4926 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4927 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4928
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4929 /* Attempted error recovery. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4930 if (str->iso2022.output_direction_sequence)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4931 ensure_correct_direction (flags & CODING_STATE_R2L ?
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4932 CHARSET_RIGHT_TO_LEFT :
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4933 CHARSET_LEFT_TO_RIGHT,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4934 str->codesys, dst, 0, 1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4935 /* More error recovery. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4936 if (!retval || str->iso2022.output_literally)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4937 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4938 /* Output the (possibly invalid) sequence */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4939 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4940 for (i = 0; i < str->iso2022.esc_bytes_index; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4941 DECODE_ADD_BINARY_CHAR (str->iso2022.esc_bytes[i], dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4942 flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4943 if (!retval)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4944 n++, src--;/* Repeat the loop with the same character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4945 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4946 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4947 /* No sense in reprocessing the final byte of the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4948 escape sequence; it could mess things up anyway.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4949 Just add it now. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4950 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4951 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4952 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4953 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4954 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4955 else if (BYTE_C0_P (c) || BYTE_C1_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4956 { /* Control characters */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4957
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4958 /***** Error-handling *****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4959
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4960 /* If we were in the middle of a character, dump out the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4961 partial character. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4962 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4963
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4964 /* If we just saw a single-shift character, dump it out.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4965 This may dump out the wrong sort of single-shift character,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4966 but least it will give an indication that something went
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4967 wrong. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4968 if (flags & CODING_STATE_SS2)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4969 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4970 DECODE_ADD_BINARY_CHAR (ISO_CODE_SS2, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4971 flags &= ~CODING_STATE_SS2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4972 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4973 if (flags & CODING_STATE_SS3)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4974 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4975 DECODE_ADD_BINARY_CHAR (ISO_CODE_SS3, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4976 flags &= ~CODING_STATE_SS3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4977 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4978
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4979 /***** Now handle the control characters. *****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4980
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4981 /* Handle CR/LF */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4982 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4983
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4984 flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4985
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4986 if (!parse_iso2022_esc (coding_system, &str->iso2022, c, &flags, 1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4987 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4988 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4989 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4990 { /* Graphic characters */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4991 Lisp_Object charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4992 int lb;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4993 int reg;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4994
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4995 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4996
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4997 /* Now determine the charset. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4998 reg = ((flags & CODING_STATE_SS2) ? 2
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
4999 : (flags & CODING_STATE_SS3) ? 3
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5000 : !BYTE_ASCII_P (c) ? str->iso2022.register_right
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5001 : str->iso2022.register_left);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5002 charset = str->iso2022.charset[reg];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5003
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5004 /* Error checking: */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5005 if (! CHARSETP (charset)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5006 || str->iso2022.invalid_designated[reg]
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5007 || (((c & 0x7F) == ' ' || (c & 0x7F) == ISO_CODE_DEL)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5008 && XCHARSET_CHARS (charset) == 94))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5009 /* Mrmph. We are trying to invoke a register that has no
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5010 or an invalid charset in it, or trying to add a character
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5011 outside the range of the charset. Insert that char literally
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5012 to preserve it for the output. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5013 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5014 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5015 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5016 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5017
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5018 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5019 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5020 /* Things are probably hunky-dorey. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5021
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5022 /* Fetch reverse charset, maybe. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5023 if (((flags & CODING_STATE_R2L) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5024 XCHARSET_DIRECTION (charset) == CHARSET_LEFT_TO_RIGHT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5025 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5026 (!(flags & CODING_STATE_R2L) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5027 XCHARSET_DIRECTION (charset) == CHARSET_RIGHT_TO_LEFT))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5028 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5029 Lisp_Object new_charset =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5030 XCHARSET_REVERSE_DIRECTION_CHARSET (charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5031 if (!NILP (new_charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5032 charset = new_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5033 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5034
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5035 lb = XCHARSET_LEADING_BYTE (charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5036 switch (XCHARSET_REP_BYTES (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5037 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5038 case 1: /* ASCII */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5039 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5040 Dynarr_add (dst, c & 0x7F);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5041 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5042
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5043 case 2: /* one-byte official */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5044 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5045 Dynarr_add (dst, lb);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5046 Dynarr_add (dst, c | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5047 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5048
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5049 case 3: /* one-byte private or two-byte official */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5050 if (XCHARSET_PRIVATE_P (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5051 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5052 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5053 Dynarr_add (dst, PRE_LEADING_BYTE_PRIVATE_1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5054 Dynarr_add (dst, lb);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5055 Dynarr_add (dst, c | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5056 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5057 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5058 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5059 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5060 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5061 Dynarr_add (dst, lb);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5062 Dynarr_add (dst, ch | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5063 Dynarr_add (dst, c | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5064 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5065 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5066 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5067 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5068 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5069 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5070
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5071 default: /* two-byte private */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5072 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5073 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5074 Dynarr_add (dst, PRE_LEADING_BYTE_PRIVATE_2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5075 Dynarr_add (dst, lb);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5076 Dynarr_add (dst, ch | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5077 Dynarr_add (dst, c | 0x80);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5078 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5079 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5080 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5081 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5082 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5083 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5084
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5085 if (!ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5086 flags &= CODING_STATE_ISO2022_LOCK;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5087 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5088
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5089 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5090 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5091
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5092 if (flags & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5093 DECODE_OUTPUT_PARTIAL_CHAR (ch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5094
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5095 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5096 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5097 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5098
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5099
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5100 /***** ISO2022 encoder *****/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5101
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5102 /* Designate CHARSET into register REG. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5103
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5104 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5105 iso2022_designate (Lisp_Object charset, unsigned char reg,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5106 struct encoding_stream *str, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5107 {
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5108 static const char inter94[] = "()*+";
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5109 static const char inter96[] = ",-./";
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
5110 int type;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5111 unsigned char final;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5112 Lisp_Object old_charset = str->iso2022.charset[reg];
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5113
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5114 str->iso2022.charset[reg] = charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5115 if (!CHARSETP (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5116 /* charset might be an initial nil or t. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5117 return;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5118 type = XCHARSET_TYPE (charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5119 final = XCHARSET_FINAL (charset);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5120 if (!str->iso2022.force_charset_on_output[reg] &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5121 CHARSETP (old_charset) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5122 XCHARSET_TYPE (old_charset) == type &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5123 XCHARSET_FINAL (old_charset) == final)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5124 return;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5125
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5126 str->iso2022.force_charset_on_output[reg] = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5127
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5128 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5129 charset_conversion_spec_dynarr *dyn =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5130 str->codesys->iso2022.output_conv;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5131
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5132 if (dyn)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5133 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5134 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5135
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5136 for (i = 0; i < Dynarr_length (dyn); i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5137 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5138 struct charset_conversion_spec *spec = Dynarr_atp (dyn, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5139 if (EQ (charset, spec->from_charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5140 charset = spec->to_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5141 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5142 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5143 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5144
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5145 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5146 switch (type)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5147 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5148 case CHARSET_TYPE_94:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5149 Dynarr_add (dst, inter94[reg]);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5150 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5151 case CHARSET_TYPE_96:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5152 Dynarr_add (dst, inter96[reg]);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5153 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5154 case CHARSET_TYPE_94X94:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5155 Dynarr_add (dst, '$');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5156 if (reg != 0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5157 || !(CODING_SYSTEM_ISO2022_SHORT (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5158 || final < '@'
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5159 || final > 'B')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5160 Dynarr_add (dst, inter94[reg]);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5161 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5162 case CHARSET_TYPE_96X96:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5163 Dynarr_add (dst, '$');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5164 Dynarr_add (dst, inter96[reg]);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5165 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5166 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5167 Dynarr_add (dst, final);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5168 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5169
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5170 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5171 ensure_normal_shift (struct encoding_stream *str, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5172 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5173 if (str->iso2022.register_left != 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5174 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5175 Dynarr_add (dst, ISO_CODE_SI);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5176 str->iso2022.register_left = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5177 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5178 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5179
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5180 static void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5181 ensure_shift_out (struct encoding_stream *str, unsigned_char_dynarr *dst)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5182 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5183 if (str->iso2022.register_left != 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5184 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5185 Dynarr_add (dst, ISO_CODE_SO);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5186 str->iso2022.register_left = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5187 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5188 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5189
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5190 /* Convert internally-formatted data to ISO2022 format. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5191
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5192 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5193 encode_coding_iso2022 (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5194 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5195 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5196 unsigned char charmask, c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5197 unsigned char char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5198 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5199 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5200 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5201 Lisp_Coding_System *codesys = str->codesys;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5202 eol_type_t eol_type = CODING_SYSTEM_EOL_TYPE (str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5203 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5204 Lisp_Object charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5205 int half;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5206
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5207 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5208 /* flags for handling composite chars. We do a little switcharoo
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5209 on the source while we're outputting the composite char. */
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5210 Bytecount saved_n = 0;
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5211 const unsigned char *saved_src = NULL;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5212 int in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5213 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5214
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5215 char_boundary = str->iso2022.current_char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5216 charset = str->iso2022.current_charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5217 half = str->iso2022.current_half;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5218
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5219 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5220 back_to_square_n:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5221 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5222 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5223 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5224 c = *src++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5225
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5226 if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5227 { /* Processing ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5228 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5229
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5230 restore_left_to_right_direction (codesys, dst, &flags, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5231
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5232 /* Make sure G0 contains ASCII */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5233 if ((c > ' ' && c < ISO_CODE_DEL) ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5234 !CODING_SYSTEM_ISO2022_NO_ASCII_CNTL (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5235 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5236 ensure_normal_shift (str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5237 iso2022_designate (Vcharset_ascii, 0, str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5238 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5239
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5240 /* If necessary, restore everything to the default state
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5241 at end-of-line */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5242 if (c == '\n' &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5243 !(CODING_SYSTEM_ISO2022_NO_ASCII_EOL (codesys)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5244 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5245 restore_left_to_right_direction (codesys, dst, &flags, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5246
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5247 ensure_normal_shift (str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5248
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5249 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5250 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5251 Lisp_Object initial_charset =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5252 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5253 iso2022_designate (initial_charset, i, str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5254 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5255 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5256 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5257 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5258 if (eol_type != EOL_LF && eol_type != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5259 Dynarr_add (dst, '\r');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5260 if (eol_type != EOL_CR)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5261 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5262 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5263 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5264 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5265 if (CODING_SYSTEM_ISO2022_ESCAPE_QUOTED (codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5266 && fit_to_be_escape_quoted (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5267 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5268 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5269 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5270 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5271 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5272
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5273 else if (INTBYTE_LEADING_BYTE_P (c) || INTBYTE_LEADING_BYTE_P (ch))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5274 { /* Processing Leading Byte */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5275 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5276 charset = CHARSET_BY_LEADING_BYTE (c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5277 if (LEADING_BYTE_PREFIX_P(c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5278 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5279 else if (!EQ (charset, Vcharset_control_1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5280 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5281 && !EQ (charset, Vcharset_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5282 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5283 )
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5284 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5285 int reg;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5286
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5287 ensure_correct_direction (XCHARSET_DIRECTION (charset),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5288 codesys, dst, &flags, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5289
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5290 /* Now determine which register to use. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5291 reg = -1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5292 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5293 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5294 if (EQ (charset, str->iso2022.charset[i]) ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5295 EQ (charset,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5296 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, i)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5297 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5298 reg = i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5299 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5300 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5301 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5302
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5303 if (reg == -1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5304 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5305 if (XCHARSET_GRAPHIC (charset) != 0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5306 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5307 if (!NILP (str->iso2022.charset[1]) &&
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5308 (!CODING_SYSTEM_ISO2022_SEVEN (codesys) ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5309 CODING_SYSTEM_ISO2022_LOCK_SHIFT (codesys)))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5310 reg = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5311 else if (!NILP (str->iso2022.charset[2]))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5312 reg = 2;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5313 else if (!NILP (str->iso2022.charset[3]))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5314 reg = 3;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5315 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5316 reg = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5317 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5318 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5319 reg = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5320 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5321
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5322 iso2022_designate (charset, reg, str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5323
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5324 /* Now invoke that register. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5325 switch (reg)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5326 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5327 case 0:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5328 ensure_normal_shift (str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5329 half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5330 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5331
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5332 case 1:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5333 if (CODING_SYSTEM_ISO2022_SEVEN (codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5334 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5335 ensure_shift_out (str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5336 half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5337 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5338 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5339 half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5340 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5341
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5342 case 2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5343 if (CODING_SYSTEM_ISO2022_SEVEN (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5344 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5345 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5346 Dynarr_add (dst, 'N');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5347 half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5348 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5349 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5350 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5351 Dynarr_add (dst, ISO_CODE_SS2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5352 half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5353 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5354 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5355
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5356 case 3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5357 if (CODING_SYSTEM_ISO2022_SEVEN (str->codesys))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5358 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5359 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5360 Dynarr_add (dst, 'O');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5361 half = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5362 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5363 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5364 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5365 Dynarr_add (dst, ISO_CODE_SS3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5366 half = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5367 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5368 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5369
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5370 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5371 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5372 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5373 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5374 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5375 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5376 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5377 { /* Processing Non-ASCII character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5378 charmask = (half == 0 ? 0x7F : 0xFF);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5379 char_boundary = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5380 if (EQ (charset, Vcharset_control_1))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5381 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5382 if (CODING_SYSTEM_ISO2022_ESCAPE_QUOTED (codesys)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5383 && fit_to_be_escape_quoted (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5384 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5385 /* you asked for it ... */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5386 Dynarr_add (dst, c - 0x20);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5387 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5388 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5389 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5390 switch (XCHARSET_REP_BYTES (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5391 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5392 case 2:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5393 Dynarr_add (dst, c & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5394 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5395 case 3:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5396 if (XCHARSET_PRIVATE_P (charset))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5397 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5398 Dynarr_add (dst, c & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5399 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5400 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5401 else if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5402 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5403 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5404 if (EQ (charset, Vcharset_composite))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5405 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5406 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5407 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5408 /* #### Bother! We don't know how to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5409 handle this yet. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5410 Dynarr_add (dst, '~');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5411 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5412 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5413 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5414 Emchar emch = MAKE_CHAR (Vcharset_composite,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5415 ch & 0x7F, c & 0x7F);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5416 Lisp_Object lstr = composite_char_string (emch);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5417 saved_n = n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5418 saved_src = src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5419 in_composite = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5420 src = XSTRING_DATA (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5421 n = XSTRING_LENGTH (lstr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5422 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5423 Dynarr_add (dst, '0'); /* start composing */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5424 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5425 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5426 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5427 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5428 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5429 Dynarr_add (dst, ch & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5430 Dynarr_add (dst, c & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5431 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5432 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5433 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5434 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5435 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5436 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5437 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5438 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5439 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5440 case 4:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5441 if (ch)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5442 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5443 Dynarr_add (dst, ch & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5444 Dynarr_add (dst, c & charmask);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5445 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5446 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5447 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5448 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5449 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5450 char_boundary = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5451 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5452 break;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5453 default:
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5454 abort ();
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5455 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5456 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5457 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5458 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5459
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5460 #ifdef ENABLE_COMPOSITE_CHARS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5461 if (in_composite)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5462 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5463 n = saved_n;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5464 src = saved_src;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5465 in_composite = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5466 Dynarr_add (dst, ISO_CODE_ESC);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5467 Dynarr_add (dst, '1'); /* end composing */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5468 goto back_to_square_n; /* Wheeeeeeeee ..... */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5469 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5470 #endif /* ENABLE_COMPOSITE_CHARS */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5471
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5472 if (char_boundary && flags & CODING_STATE_END)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5473 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5474 restore_left_to_right_direction (codesys, dst, &flags, 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5475 ensure_normal_shift (str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5476 for (i = 0; i < 4; i++)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5477 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5478 Lisp_Object initial_charset =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5479 CODING_SYSTEM_ISO2022_INITIAL_CHARSET (codesys, i);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5480 iso2022_designate (initial_charset, i, str, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5481 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5482 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5483
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5484 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5485 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5486 str->iso2022.current_char_boundary = char_boundary;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5487 str->iso2022.current_charset = charset;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5488 str->iso2022.current_half = half;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5489
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5490 /* Verbum caro factum est! */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5491 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5492 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5493
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5494 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5495 /* No-conversion methods */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5496 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5497
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5498 /* This is used when reading in "binary" files -- i.e. files that may
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5499 contain all 256 possible byte values and that are not to be
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5500 interpreted as being in any particular decoding. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5501 static void
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
5502 decode_coding_no_conversion (Lstream *decoding, const Extbyte *src,
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5503 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5504 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5505 struct decoding_stream *str = DECODING_STREAM_DATA (decoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5506 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5507 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5508 eol_type_t eol_type = str->eol_type;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5509
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5510 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5511 {
444
576fb035e263 Import from CVS: tag r21-2-37
cvs
parents: 442
diff changeset
5512 unsigned char c = *(unsigned char *)src++;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5513
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5514 DECODE_HANDLE_EOL_TYPE (eol_type, c, flags, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5515 DECODE_ADD_BINARY_CHAR (c, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5516 label_continue_loop:;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5517 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5518
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5519 DECODE_HANDLE_END_OF_CONVERSION (flags, ch, dst);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5520
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5521 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5522 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5523 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5524
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5525 static void
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5526 encode_coding_no_conversion (Lstream *encoding, const Intbyte *src,
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5527 unsigned_char_dynarr *dst, Bytecount n)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5528 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5529 unsigned char c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5530 struct encoding_stream *str = ENCODING_STREAM_DATA (encoding);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5531 unsigned int flags = str->flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5532 unsigned int ch = str->ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5533 eol_type_t eol_type = CODING_SYSTEM_EOL_TYPE (str->codesys);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5534
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5535 while (n--)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5536 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5537 c = *src++;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5538 if (c == '\n')
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5539 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5540 if (eol_type != EOL_LF && eol_type != EOL_AUTODETECT)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5541 Dynarr_add (dst, '\r');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5542 if (eol_type != EOL_CR)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5543 Dynarr_add (dst, '\n');
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5544 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5545 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5546 else if (BYTE_ASCII_P (c))
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5547 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5548 assert (ch == 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5549 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5550 }
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
5551 else if (INTBYTE_LEADING_BYTE_P (c))
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5552 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5553 assert (ch == 0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5554 if (c == LEADING_BYTE_LATIN_ISO8859_1 ||
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5555 c == LEADING_BYTE_CONTROL_1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5556 ch = c;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5557 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5558 Dynarr_add (dst, '~'); /* untranslatable character */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5559 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5560 else
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5561 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5562 if (ch == LEADING_BYTE_LATIN_ISO8859_1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5563 Dynarr_add (dst, c);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5564 else if (ch == LEADING_BYTE_CONTROL_1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5565 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5566 assert (c < 0xC0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5567 Dynarr_add (dst, c - 0x20);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5568 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5569 /* else it should be the second or third byte of an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5570 untranslatable character, so ignore it */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5571 ch = 0;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5572 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5573 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5574
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5575 str->flags = flags;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5576 str->ch = ch;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5577 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5578
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5579
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5580
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5581 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5582 /* Initialization */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5583 /************************************************************************/
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5584
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5585 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5586 syms_of_file_coding (void)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5587 {
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5588 INIT_LRECORD_IMPLEMENTATION (coding_system);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5589
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5590 DEFSUBR (Fcoding_system_p);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5591 DEFSUBR (Ffind_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5592 DEFSUBR (Fget_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5593 DEFSUBR (Fcoding_system_list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5594 DEFSUBR (Fcoding_system_name);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5595 DEFSUBR (Fmake_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5596 DEFSUBR (Fcopy_coding_system);
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5597 DEFSUBR (Fcoding_system_canonical_name_p);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5598 DEFSUBR (Fcoding_system_alias_p);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5599 DEFSUBR (Fcoding_system_aliasee);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5600 DEFSUBR (Fdefine_coding_system_alias);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5601 DEFSUBR (Fsubsidiary_coding_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5602
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5603 DEFSUBR (Fcoding_system_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5604 DEFSUBR (Fcoding_system_doc_string);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5605 DEFSUBR (Fcoding_system_property);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5606
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5607 DEFSUBR (Fcoding_category_list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5608 DEFSUBR (Fset_coding_priority_list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5609 DEFSUBR (Fcoding_priority_list);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5610 DEFSUBR (Fset_coding_category_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5611 DEFSUBR (Fcoding_category_system);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5612
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5613 DEFSUBR (Fdetect_coding_region);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5614 DEFSUBR (Fdecode_coding_region);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5615 DEFSUBR (Fencode_coding_region);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5616 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5617 DEFSUBR (Fdecode_shift_jis_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5618 DEFSUBR (Fencode_shift_jis_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5619 DEFSUBR (Fdecode_big5_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5620 DEFSUBR (Fencode_big5_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5621 DEFSUBR (Fset_ucs_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5622 DEFSUBR (Fucs_char);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5623 DEFSUBR (Fset_char_ucs);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5624 DEFSUBR (Fchar_ucs);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5625 #endif /* MULE */
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5626 DEFSYMBOL_MULTIWORD_PREDICATE (Qcoding_systemp);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5627 DEFSYMBOL (Qno_conversion);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5628 DEFSYMBOL (Qraw_text);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5629 #ifdef MULE
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5630 DEFSYMBOL (Qbig5);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5631 DEFSYMBOL (Qshift_jis);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5632 defsymbol (&Qucs4, "ucs-4");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5633 defsymbol (&Qutf8, "utf-8");
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5634 DEFSYMBOL (Qccl);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5635 DEFSYMBOL (Qiso2022);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5636 #endif /* MULE */
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5637 DEFSYMBOL (Qmnemonic);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5638 DEFSYMBOL (Qeol_type);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5639 DEFSYMBOL (Qpost_read_conversion);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5640 DEFSYMBOL (Qpre_write_conversion);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5641
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5642 DEFSYMBOL (Qcr);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5643 DEFSYMBOL (Qlf);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5644 DEFSYMBOL (Qcrlf);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5645 DEFSYMBOL (Qeol_cr);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5646 DEFSYMBOL (Qeol_lf);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5647 DEFSYMBOL (Qeol_crlf);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5648 #ifdef MULE
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5649 DEFSYMBOL (Qcharset_g0);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5650 DEFSYMBOL (Qcharset_g1);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5651 DEFSYMBOL (Qcharset_g2);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5652 DEFSYMBOL (Qcharset_g3);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5653 DEFSYMBOL (Qforce_g0_on_output);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5654 DEFSYMBOL (Qforce_g1_on_output);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5655 DEFSYMBOL (Qforce_g2_on_output);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5656 DEFSYMBOL (Qforce_g3_on_output);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5657 DEFSYMBOL (Qno_iso6429);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5658 DEFSYMBOL (Qinput_charset_conversion);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5659 DEFSYMBOL (Qoutput_charset_conversion);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5660
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5661 DEFSYMBOL (Qshort);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5662 DEFSYMBOL (Qno_ascii_eol);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5663 DEFSYMBOL (Qno_ascii_cntl);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5664 DEFSYMBOL (Qseven);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5665 DEFSYMBOL (Qlock_shift);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5666 DEFSYMBOL (Qescape_quoted);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5667 #endif /* MULE */
563
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5668 DEFSYMBOL (Qencode);
183866b06e0b [xemacs-hg @ 2001-05-24 07:50:48 by ben]
ben
parents: 528
diff changeset
5669 DEFSYMBOL (Qdecode);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5670
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5671 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5672 defsymbol (&coding_category_symbol[CODING_CATEGORY_SHIFT_JIS],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5673 "shift-jis");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5674 defsymbol (&coding_category_symbol[CODING_CATEGORY_BIG5],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5675 "big5");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5676 defsymbol (&coding_category_symbol[CODING_CATEGORY_UCS4],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5677 "ucs-4");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5678 defsymbol (&coding_category_symbol[CODING_CATEGORY_UTF8],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5679 "utf-8");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5680 defsymbol (&coding_category_symbol[CODING_CATEGORY_ISO_7],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5681 "iso-7");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5682 defsymbol (&coding_category_symbol[CODING_CATEGORY_ISO_8_DESIGNATE],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5683 "iso-8-designate");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5684 defsymbol (&coding_category_symbol[CODING_CATEGORY_ISO_8_1],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5685 "iso-8-1");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5686 defsymbol (&coding_category_symbol[CODING_CATEGORY_ISO_8_2],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5687 "iso-8-2");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5688 defsymbol (&coding_category_symbol[CODING_CATEGORY_ISO_LOCK_SHIFT],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5689 "iso-lock-shift");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5690 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5691 defsymbol (&coding_category_symbol[CODING_CATEGORY_NO_CONVERSION],
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5692 "no-conversion");
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5693 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5694
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5695 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5696 lstream_type_create_file_coding (void)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5697 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5698 LSTREAM_HAS_METHOD (decoding, reader);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5699 LSTREAM_HAS_METHOD (decoding, writer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5700 LSTREAM_HAS_METHOD (decoding, rewinder);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5701 LSTREAM_HAS_METHOD (decoding, seekable_p);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5702 LSTREAM_HAS_METHOD (decoding, flusher);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5703 LSTREAM_HAS_METHOD (decoding, closer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5704 LSTREAM_HAS_METHOD (decoding, marker);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5705
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5706 LSTREAM_HAS_METHOD (encoding, reader);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5707 LSTREAM_HAS_METHOD (encoding, writer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5708 LSTREAM_HAS_METHOD (encoding, rewinder);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5709 LSTREAM_HAS_METHOD (encoding, seekable_p);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5710 LSTREAM_HAS_METHOD (encoding, flusher);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5711 LSTREAM_HAS_METHOD (encoding, closer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5712 LSTREAM_HAS_METHOD (encoding, marker);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5713 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5714
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5715 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5716 vars_of_file_coding (void)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5717 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5718 int i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5719
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5720 fcd = xnew (struct file_coding_dump);
452
3d3049ae1304 Import from CVS: tag r21-2-41
cvs
parents: 448
diff changeset
5721 dump_add_root_struct_ptr (&fcd, &fcd_description);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5722
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5723 /* Initialize to something reasonable ... */
448
3078fd1074e8 Import from CVS: tag r21-2-39
cvs
parents: 444
diff changeset
5724 for (i = 0; i < CODING_CATEGORY_LAST; i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5725 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5726 fcd->coding_category_system[i] = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5727 fcd->coding_category_by_priority[i] = i;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5728 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5729
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5730 Fprovide (intern ("file-coding"));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5731
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5732 DEFVAR_LISP ("keyboard-coding-system", &Vkeyboard_coding_system /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5733 Coding system used for TTY keyboard input.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5734 Not used under a windowing system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5735 */ );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5736 Vkeyboard_coding_system = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5737
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5738 DEFVAR_LISP ("terminal-coding-system", &Vterminal_coding_system /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5739 Coding system used for TTY display output.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5740 Not used under a windowing system.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5741 */ );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5742 Vterminal_coding_system = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5743
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5744 DEFVAR_LISP ("coding-system-for-read", &Vcoding_system_for_read /*
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5745 Overriding coding system used when reading from a file or process.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5746 You should bind this variable with `let', but do not set it globally.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5747 If this is non-nil, it specifies the coding system that will be used
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5748 to decode input on read operations, such as from a file or process.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5749 It overrides `buffer-file-coding-system-for-read',
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5750 `insert-file-contents-pre-hook', etc. Use those variables instead of
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5751 this one for permanent changes to the environment. */ );
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5752 Vcoding_system_for_read = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5753
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5754 DEFVAR_LISP ("coding-system-for-write",
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5755 &Vcoding_system_for_write /*
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5756 Overriding coding system used when writing to a file or process.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5757 You should bind this variable with `let', but do not set it globally.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5758 If this is non-nil, it specifies the coding system that will be used
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5759 to encode output for write operations, such as to a file or process.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5760 It overrides `buffer-file-coding-system', `write-region-pre-hook', etc.
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5761 Use those variables instead of this one for permanent changes to the
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5762 environment. */ );
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5763 Vcoding_system_for_write = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5764
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5765 DEFVAR_LISP ("file-name-coding-system", &Vfile_name_coding_system /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5766 Coding system used to convert pathnames when accessing files.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5767 */ );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5768 Vfile_name_coding_system = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5769
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5770 DEFVAR_BOOL ("enable-multibyte-characters", &enable_multibyte_characters /*
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5771 Non-nil means the buffer contents are regarded as multi-byte form
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5772 of characters, not a binary code. This affects the display, file I/O,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5773 and behaviors of various editing commands.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5774
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5775 Setting this to nil does not do anything.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5776 */ );
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5777 enable_multibyte_characters = 1;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5778 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5779
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5780 void
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5781 complex_vars_of_file_coding (void)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5782 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5783 staticpro (&Vcoding_system_hash_table);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5784 Vcoding_system_hash_table =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5785 make_lisp_hash_table (50, HASH_TABLE_NON_WEAK, HASH_TABLE_EQ);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5786
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5787 the_codesys_prop_dynarr = Dynarr_new (codesys_prop);
452
3d3049ae1304 Import from CVS: tag r21-2-41
cvs
parents: 448
diff changeset
5788 dump_add_root_struct_ptr (&the_codesys_prop_dynarr, &codesys_prop_dynarr_description);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5789
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5790 #define DEFINE_CODESYS_PROP(Prop_Type, Sym) do \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5791 { \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5792 struct codesys_prop csp; \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5793 csp.sym = (Sym); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5794 csp.prop_type = (Prop_Type); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5795 Dynarr_add (the_codesys_prop_dynarr, csp); \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5796 } while (0)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5797
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5798 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qmnemonic);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5799 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qeol_type);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5800 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qeol_cr);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5801 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qeol_crlf);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5802 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qeol_lf);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5803 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qpost_read_conversion);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5804 DEFINE_CODESYS_PROP (CODESYS_PROP_ALL_OK, Qpre_write_conversion);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5805 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5806 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qcharset_g0);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5807 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qcharset_g1);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5808 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qcharset_g2);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5809 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qcharset_g3);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5810 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qforce_g0_on_output);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5811 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qforce_g1_on_output);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5812 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qforce_g2_on_output);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5813 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qforce_g3_on_output);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5814 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qshort);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5815 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qno_ascii_eol);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5816 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qno_ascii_cntl);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5817 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qseven);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5818 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qlock_shift);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5819 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qno_iso6429);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5820 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qescape_quoted);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5821 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qinput_charset_conversion);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5822 DEFINE_CODESYS_PROP (CODESYS_PROP_ISO2022, Qoutput_charset_conversion);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5823
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5824 DEFINE_CODESYS_PROP (CODESYS_PROP_CCL, Qencode);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5825 DEFINE_CODESYS_PROP (CODESYS_PROP_CCL, Qdecode);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5826 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5827 /* Need to create this here or we're really screwed. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5828 Fmake_coding_system
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5829 (Qraw_text, Qno_conversion,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5830 build_string ("Raw text, which means it converts only line-break-codes."),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5831 list2 (Qmnemonic, build_string ("Raw")));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5832
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5833 Fmake_coding_system
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5834 (Qbinary, Qno_conversion,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5835 build_string ("Binary, which means it does not convert anything."),
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5836 list4 (Qeol_type, Qlf,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5837 Qmnemonic, build_string ("Binary")));
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5838
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5839 Fdefine_coding_system_alias (Qno_conversion, Qraw_text);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5840
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5841 Fdefine_coding_system_alias (Qfile_name, Qbinary);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5842
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5843 Fdefine_coding_system_alias (Qterminal, Qbinary);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5844 Fdefine_coding_system_alias (Qkeyboard, Qbinary);
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 438
diff changeset
5845
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5846 /* Need this for bootstrapping */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5847 fcd->coding_category_system[CODING_CATEGORY_NO_CONVERSION] =
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5848 Fget_coding_system (Qraw_text);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5849
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5850 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5851 {
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
5852 int i;
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5853
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
5854 for (i = 0; i < countof (fcd->ucs_to_mule_table); i++)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5855 fcd->ucs_to_mule_table[i] = Qnil;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5856 }
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5857 staticpro (&mule_to_ucs_table);
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 564
diff changeset
5858 mule_to_ucs_table = Fmake_char_table (Qgeneric);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5859 #endif /* MULE */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
5860 }