Mercurial > hg > xemacs-beta
view src/s/aix4-2.h @ 5648:3f4a234f4672
Support non-ASCII correctly in character classes, test this.
src/ChangeLog addition:
2012-04-21 Aidan Kehoe <kehoea@parhasard.net>
Support non-ASCII correctly in character classes ([:alnum:] and
friends).
* regex.c:
* regex.c (ISBLANK, ISUNIBYTE): New. Make these and friends
independent of the locale, since we want them to be consistent in
XEmacs.
* regex.c (print_partial_compiled_pattern): Print the flags for
charset_mule; don't print non-ASCII as the character values in
ranges, this breaks with locales.
* regex.c (enum):
Define various flags the charset_mule and charset_mule_not opcodes
can now take.
* regex.c (CHAR_CLASS_MAX_LENGTH): Update this.
* regex.c (re_iswctype, re_wctype): New, from GNU.
* regex.c (re_wctype_can_match_non_ascii): New; used when deciding
on whether to use charset_mule or the ASCII-only regex character
set opcode.
* regex.c (regex_compile):
Error correctly on long, non-existent character class names.
Break out the handling of charsets that can match non-ASCII into a
separate clause. Use compile_char_class when compiling character
classes.
* regex.c (compile_char_class): New. Used in regex_compile when
compiling character sets that may match non-ASCII.
* regex.c (re_compile_fastmap):
If there are flags set for charset_mule or charset_mule_not, we
can't use the fastmap (since we need to check syntax table values
that aren't available there).
* regex.c (re_match_2_internal):
Check the new flags passed to the charset_mule{,_not} opcode,
observe them if appropriate.
* regex.h:
* regex.h (enum):
Expose re_wctype_t here, imported from GNU.
tests/ChangeLog addition:
2012-04-21 Aidan Kehoe <kehoea@parhasard.net>
* automated/regexp-tests.el:
* automated/regexp-tests.el (Assert-char-class):
Check that #'string-match errors correctly with an over-long
character class name.
Add tests for character class functionality that supports
non-ASCII characters. These tests expose bugs in GNU Emacs
24.0.94.2, but pass under current XEmacs.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Sat, 21 Apr 2012 18:58:28 +0100 |
parents | 1f0b15040456 |
children |
line wrap: on
line source
/* Definitions file for XEmacs running on IBM AIX version 4.2 * Copyright (C) 1985, 1986, 1990 Free Software Foundation, Inc. * Copyright (C) 2010 Ben Wing. * * This file is part of XEmacs. * * XEmacs is free software: you can redistribute it and/or modify it * under the terms of the GNU General Public License as published by the * Free Software Foundation, either version 3 of the License, or (at your * option) any later version. * * XEmacs is distributed in the hope that it will be useful, but WITHOUT * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License * for more details. * * You should have received a copy of the GNU General Public License * along with XEmacs. If not, see <http://www.gnu.org/licenses/>. */ /* Synched up with: FSF 19.31. */ /* * Define symbols to identify the version of Unix this is. * Define all the symbols that apply correctly. */ #define USG /* System III, System V, etc */ #define USG5 /* Specify IBM AIX version of system */ #ifndef AIX #define AIX #define AIX4 #define AIX4_1 #endif /* This symbol should be defined on AIX Version 3 ??????? */ #ifndef _AIX #define _AIX #endif /* SYSTEM_TYPE should indicate the kind of system you are using. * It sets the Lisp variable system-type. */ #define SYSTEM_TYPE "aix" /* Special itemss needed to make Emacs run on this system. */ #ifndef __GNUC__ #define LINKER "cc" #endif /* Deleted LIBS_DEBUG */ /* No need to specify -lc when linking. */ #define LIB_STANDARD /* AIX doesn't define this. */ #define unix 1 #ifndef __GNUC__ /* Otherwise, XEmacs is just too big ... */ #define C_SWITCH_SYSTEM "-mminimal-toc" #endif #define HAVE_ALLOCA /* FIXME: This was needed for AIX 4.1. Is it for AIX 4.2? */ /* The character-composition stuff is broken in X11R5. * Even with XIMStatusNothing aliased to XIMStatusNone, * tranle@intellicorp.com (Minh Tran-Le) reports that enabling * the internationalization code causes the modifier keys C, M and Shift * to beep after a mouse click. */ #define X11R5_INHIBIT_I18N #define MAIL_USE_LOCKF /* XEmacs: from dkeller@VNET.IBM.COM */ #define BROKEN_SIGIO #ifndef NOT_C_CODE #define _XFUNCS_H_ 1 /* #### we don't use either any more. some AIX user should delete the include * and see if anything breaks. --ben */ /* AIX is happier when bzero and strcasecmp are declared */ #include "strings.h" /* Forward declarations for xlc warning suppressions */ struct ether_addr; struct sockaddr_dl; #ifdef __xlC__ /* "eXceLlent C compiler" ?! */ #if __xlC__ >= 1200 /* IBM's C compiler option `-O3' is too aggressive. * We recommend instead the combination `-O3 -qstrict', which seems safe. * * cc -O3 miscompiles at least two functions. From IBM's docs: * * IBM> -qstrict turns off the following optimizations: * * IBM> Performing code motion and scheduling on computations such as loads * IBM> and floating-point computations that may trigger an exception. * * Nevertheless, we try to work with these compiler options. */ #pragma option_override (bytecount_to_charcount, "opt(strict,yes)") #pragma option_override (Fexpand_file_name, "opt(strict,yes)") #endif /* recent IBM C compiler */ #endif /* IBM's C compiler */ #endif /* C code */ /* getaddrinfo is broken in AIX 4.3 as per IY04165. * At this time (2/21/2000), there's no PTF available. * -- Mike Sperber <mike@xemacs.org> */ #undef HAVE_GETADDRINFO #ifdef __GNUC__ #undef _NO_PROTO #endif /* XEmacs: C_DEBUG_SWITCH, C_OPTIMIZE_SWITCH no longer used */