annotate etc/ETAGS.README @ 5648:3f4a234f4672

Support non-ASCII correctly in character classes, test this. src/ChangeLog addition: 2012-04-21 Aidan Kehoe <kehoea@parhasard.net> Support non-ASCII correctly in character classes ([:alnum:] and friends). * regex.c: * regex.c (ISBLANK, ISUNIBYTE): New. Make these and friends independent of the locale, since we want them to be consistent in XEmacs. * regex.c (print_partial_compiled_pattern): Print the flags for charset_mule; don't print non-ASCII as the character values in ranges, this breaks with locales. * regex.c (enum): Define various flags the charset_mule and charset_mule_not opcodes can now take. * regex.c (CHAR_CLASS_MAX_LENGTH): Update this. * regex.c (re_iswctype, re_wctype): New, from GNU. * regex.c (re_wctype_can_match_non_ascii): New; used when deciding on whether to use charset_mule or the ASCII-only regex character set opcode. * regex.c (regex_compile): Error correctly on long, non-existent character class names. Break out the handling of charsets that can match non-ASCII into a separate clause. Use compile_char_class when compiling character classes. * regex.c (compile_char_class): New. Used in regex_compile when compiling character sets that may match non-ASCII. * regex.c (re_compile_fastmap): If there are flags set for charset_mule or charset_mule_not, we can't use the fastmap (since we need to check syntax table values that aren't available there). * regex.c (re_match_2_internal): Check the new flags passed to the charset_mule{,_not} opcode, observe them if appropriate. * regex.h: * regex.h (enum): Expose re_wctype_t here, imported from GNU. tests/ChangeLog addition: 2012-04-21 Aidan Kehoe <kehoea@parhasard.net> * automated/regexp-tests.el: * automated/regexp-tests.el (Assert-char-class): Check that #'string-match errors correctly with an over-long character class name. Add tests for character class functionality that supports non-ASCII characters. These tests expose bugs in GNU Emacs 24.0.94.2, but pass under current XEmacs.
author Aidan Kehoe <kehoea@parhasard.net>
date Sat, 21 Apr 2012 18:58:28 +0100
parents 308d34e9f07d
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3972
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
1 etags is distributed under the following conditions:
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
2
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
3
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
4 Copyright (C) 1984 The Regents of the University of California
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
5
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
6 Redistribution and use in source and binary forms, with or without
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
7 modification, are permitted provided that the following conditions are
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
8 met:
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
9 1. Redistributions of source code must retain the above copyright
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
10 notice, this list of conditions and the following disclaimer.
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
11 2. Redistributions in binary form must reproduce the above copyright
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
12 notice, this list of conditions and the following disclaimer in the
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
13 documentation and/or other materials provided with the
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
14 distribution.
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
15 3. Neither the name of the University nor the names of its
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
16 contributors may be used to endorse or promote products derived
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
17 from this software without specific prior written permission.
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
18
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
19 THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND ANY
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
20 EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
21 WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
22 DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE FOR ANY
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
23 DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
24 (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES;
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
25 LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
26 ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
27 (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF THIS
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
28 SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
29
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
30
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
31 Copyright (C) 1984, 1987, 1988, 1989, 1993, 1994, 1995, 1998, 1999,
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
32 2000, 2001, 2002, 2003, 2004, 2005, 2006, 2007
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
33 Free Software Foundation, Inc.
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
34
5402
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
35 This file is part of XEmacs.
3972
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
36
5402
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
37 XEmacs is free software: you can redistribute it and/or modify it
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
38 under the terms of the GNU General Public License as published by the
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
39 Free Software Foundation, either version 3 of the License, or (at your
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
40 option) any later version.
3972
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
41
5402
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
42 XEmacs is distributed in the hope that it will be useful, but WITHOUT
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
43 ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
44 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
45 for more details.
3972
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
46
6cd3955b7e4d [xemacs-hg @ 2007-05-21 03:50:13 by stephent]
stephent
parents:
diff changeset
47 You should have received a copy of the GNU General Public License
5402
308d34e9f07d Changed bulk of GPLv2 or later files identified by script
Mats Lidell <matsl@xemacs.org>
parents: 3972
diff changeset
48 along with XEmacs. If not, see <http://www.gnu.org/licenses/>.