annotate src/dumper.h @ 5648:3f4a234f4672

Support non-ASCII correctly in character classes, test this.

src/ChangeLog addition:

2012-04-21  Aidan Kehoe  <kehoea@parhasard.net>

Support non-ASCII correctly in character classes ([:alnum:] and
friends).
* regex.c:
* regex.c (ISBLANK, ISUNIBYTE): New.
Make these and friends independent of the locale, since we want
them to be consistent in XEmacs.
* regex.c (print_partial_compiled_pattern):
Print the flags for charset_mule; don't print non-ASCII as the
character values in ranges, this breaks with locales.
* regex.c (enum):
Define various flags the charset_mule and charset_mule_not opcodes
can now take.
* regex.c (CHAR_CLASS_MAX_LENGTH): Update this.
* regex.c (re_iswctype, re_wctype): New, from GNU.
* regex.c (re_wctype_can_match_non_ascii):
New; used when deciding on whether to use charset_mule or the
ASCII-only regex character set opcode.
* regex.c (regex_compile):
Error correctly on long, non-existent character class names.
Break out the handling of charsets that can match non-ASCII into a
separate clause. Use compile_char_class when compiling character
classes.
* regex.c (compile_char_class):
New. Used in regex_compile when compiling character sets that may
match non-ASCII.
* regex.c (re_compile_fastmap):
If there are flags set for charset_mule or charset_mule_not, we
can't use the fastmap (since we need to check syntax table values
that aren't available there).
* regex.c (re_match_2_internal):
Check the new flags passed to the charset_mule{,_not} opcode,
observe them if appropriate.
* regex.h:
* regex.h (enum):
Expose re_wctype_t here, imported from GNU.

tests/ChangeLog addition:

2012-04-21  Aidan Kehoe  <kehoea@parhasard.net>

* automated/regexp-tests.el:
* automated/regexp-tests.el (Assert-char-class):
Check that #'string-match errors correctly with an over-long
character class name.
Add tests for character class functionality that supports
non-ASCII characters. These tests expose bugs in GNU Emacs
24.0.94.2, but pass under current XEmacs.
author Aidan Kehoe <kehoea@parhasard.net>
date Sat, 21 Apr 2012 18:58:28 +0100
parents 308d34e9f07d
children
rev   changeset     author                          parents  description
2587  1e2a3710564c  ben                                      [xemacs-hg @ 2005-02-15 03:17:07 by ben]
3263  d674024a8674  crestani                        2720     [xemacs-hg @ 2006-02-27 16:29:00 by crestani]
5402  308d34e9f07d  Mats Lidell <matsl@xemacs.org>  3263     Changed bulk of GPLv2 or later files identified by script

rev   line source
2587     1 /* Portable data dumper for XEmacs.
2587     2 Copyright (C) 1999-2000 Olivier Galibert
2587     3
2587     4 This file is part of XEmacs.
2587     5
5402     6 XEmacs is free software: you can redistribute it and/or modify it
2587     7 under the terms of the GNU General Public License as published by the
5402     8 Free Software Foundation, either version 3 of the License, or (at your
5402     9 option) any later version.
2587    10
2587    11 XEmacs is distributed in the hope that it will be useful, but WITHOUT
2587    12 ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
2587    13 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
2587    14 for more details.
2587    15
2587    16 You should have received a copy of the GNU General Public License
5402    17 along with XEmacs. If not, see <http://www.gnu.org/licenses/>. */
2587    18
2587    19 /* Synched up with: Not in FSF. */
2587    20
2587    21 /* This file has been Mule-ized, Ben Wing, 10-7-04. */
2587    22
2587    23 #ifndef INCLUDED_dumper_h
2587    24 #define INCLUDED_dumper_h
2587    25
2587    26 BEGIN_C_DECLS
2587    27
3263    28 #ifndef NEW_GC
2587    29 void pdump_objects_unmark (void);
3263    30 #endif /* not NEW_GC */
2587    31 void pdump (void);
2587    32 int pdump_load (const Wexttext *argv0);
2587    33 void pdump_backtrace (void);
2587    34 extern unsigned int dump_id;
2587    35 extern Rawbyte *pdump_start, *pdump_end;
2587    36
2587    37 END_C_DECLS
2587    38
2587    39 #endif /* INCLUDED_dumper_h */