Mercurial > hg > xemacs-beta
annotate src/strcat.c @ 5648:3f4a234f4672
Support non-ASCII correctly in character classes, test this.
src/ChangeLog addition:
2012-04-21 Aidan Kehoe <kehoea@parhasard.net>
Support non-ASCII correctly in character classes ([:alnum:] and
friends).
* regex.c:
* regex.c (ISBLANK, ISUNIBYTE): New. Make these and friends
independent of the locale, since we want them to be consistent in
XEmacs.
* regex.c (print_partial_compiled_pattern): Print the flags for
charset_mule; don't print non-ASCII as the character values in
ranges, this breaks with locales.
* regex.c (enum):
Define various flags the charset_mule and charset_mule_not opcodes
can now take.
* regex.c (CHAR_CLASS_MAX_LENGTH): Update this.
* regex.c (re_iswctype, re_wctype): New, from GNU.
* regex.c (re_wctype_can_match_non_ascii): New; used when deciding
on whether to use charset_mule or the ASCII-only regex character
set opcode.
* regex.c (regex_compile):
Error correctly on long, non-existent character class names.
Break out the handling of charsets that can match non-ASCII into a
separate clause. Use compile_char_class when compiling character
classes.
* regex.c (compile_char_class): New. Used in regex_compile when
compiling character sets that may match non-ASCII.
* regex.c (re_compile_fastmap):
If there are flags set for charset_mule or charset_mule_not, we
can't use the fastmap (since we need to check syntax table values
that aren't available there).
* regex.c (re_match_2_internal):
Check the new flags passed to the charset_mule{,_not} opcode,
observe them if appropriate.
* regex.h:
* regex.h (enum):
Expose re_wctype_t here, imported from GNU.
tests/ChangeLog addition:
2012-04-21 Aidan Kehoe <kehoea@parhasard.net>
* automated/regexp-tests.el:
* automated/regexp-tests.el (Assert-char-class):
Check that #'string-match errors correctly with an over-long
character class name.
Add tests for character class functionality that supports
non-ASCII characters. These tests expose bugs in GNU Emacs
24.0.94.2, but pass under current XEmacs.
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Sat, 21 Apr 2012 18:58:28 +0100 |
parents | 2aa9cd456ae7 |
children |
rev | line source |
---|---|
428 | 1 /* Copyright (C) 1991 Free Software Foundation, Inc. |
2 This file is part of the GNU C Library. | |
3 | |
5405 | 4 The GNU C Library is free software: you can redistribute it and/or modify it |
5 under the terms of the GNU General Public License as published by the | |
6 Free Software Foundation, either version 3 of the License, or (at your | |
7 option) any later version. | |
428 | 8 |
5405 | 9 The GNU C Library is distributed in the hope that it will be useful, but WITHOUT |
10 ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or | |
11 FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License | |
12 for more details. | |
428 | 13 |
5405 | 14 You should have received a copy of the GNU General Public License |
15 along with the GNU C Library. If not, see <http://www.gnu.org/licenses/>. */ | |
428 | 16 |
17 /* Synched up with: Not in FSF. */ | |
18 | |
19 # include <config.h> | |
20 # ifndef REGISTER /* Strictly enforced in 20.3 */ | |
21 # define REGISTER | |
22 # endif | |
23 | |
24 /* In HPUX 10 the strcat function references memory past the last byte of | |
25 the string! This will core dump if the memory following the last byte is | |
26 not mapped. | |
27 | |
442 | 28 Here is a correct version from, glibc 1.09. |
428 | 29 */ |
30 | |
31 char *strcat (char *dest, const char *src); | |
32 | |
33 /* Append SRC on the end of DEST. */ | |
34 char * | |
35 strcat (char *dest, const char *src) | |
36 { | |
37 REGISTER char *s1 = dest; | |
442 | 38 REGISTER const char *s2 = src; |
428 | 39 char c; |
40 | |
41 /* Find the end of the string. */ | |
42 do | |
43 c = *s1++; | |
44 while (c != '\0'); | |
45 | |
46 /* Make S1 point before the next character, so we can increment | |
47 it while memory is read (wins on pipelined cpus). */ | |
48 s1 -= 2; | |
49 | |
50 do | |
51 { | |
52 c = *s2++; | |
53 *++s1 = c; | |
54 } | |
55 while (c != '\0'); | |
56 | |
57 return dest; | |
58 } |