annotate src/regex.h @ 665:fdefd0186b75

[xemacs-hg @ 2001-09-20 06:28:42 by ben]

The great integral types renaming.

The purpose of this is to rationalize the names used for various integral types, so that they match their intended uses and follow consistent conventions, and eliminate types that were not semantically different from each other.

The conventions are:

-- All integral types that measure quantities of anything are signed. Some people disagree vociferously with this, but their arguments are mostly theoretical, and are vastly outweighed by the practical headaches of mixing signed and unsigned values, and more importantly by the greatly increased likelihood of inadvertent bugs: Because of the broken "viral" nature of unsigned quantities in C (operations involving mixed signed/unsigned are done unsigned, when exactly the opposite is nearly always wanted), even a single error in declaring a quantity unsigned that should be signed, or the even more subtle error of comparing signed and unsigned values and forgetting the necessary cast, can be catastrophic, as comparisons will yield wrong results. -Wsign-compare is turned on specifically to catch this, but this tends to result in a great number of warnings when mixing signed and unsigned, and the casts are annoying. More has been written on this elsewhere.

-- All such quantity types just mentioned boil down to EMACS_INT, which is 32 bits on 32-bit machines and 64 bits on 64-bit machines. This is guaranteed to be the same size as Lisp objects of type `int', and (as far as I can tell) of size_t (unsigned!) and ssize_t. The only type below that is not an EMACS_INT is Hashcode, which is an unsigned value of the same size as EMACS_INT.

-- Type names should be relatively short (no more than 10 characters or so), with the first letter capitalized and no underscores if they can at all be avoided.

-- "count" == a zero-based measurement of some quantity. Includes sizes, offsets, and indexes.

-- "bpos" == a one-based measurement of a position in a buffer. "Charbpos" and "Bytebpos" count text in the buffer, rather than bytes in memory; thus Bytebpos does not directly correspond to the memory representation. Use "Membpos" for this.

-- "Char" refers to internal-format characters, not to the C type "char", which is really a byte.

-- For the actual name changes, see the script below.

I ran the following script to do the conversion. (NOTE: This script is idempotent. You can safely run it multiple times and it will not screw up previous results -- in fact, it will do nothing if nothing has changed. Thus, it can be run repeatedly as necessary to handle patches coming in from old workspaces, or old branches.)

There are two tags, just before and just after the change: `pre-integral-type-rename' and `post-integral-type-rename'. When merging code from the main trunk into a branch, the best thing to do is first merge up to `pre-integral-type-rename', then apply the script and associated changes, then merge from `post-integral-type-rename' to the present. (Alternatively, just do the merging in one operation; but you may then have a lot of conflicts needing to be resolved by hand.)

Script `fixtypes.sh' follows:

----------------------------------- cut ------------------------------------
files="*.[ch] s/*.h m/*.h config.h.in ../configure.in Makefile.in.in ../lib-src/*.[ch] ../lwlib/*.[ch]"
gr Memory_Count Bytecount $files
gr Lstream_Data_Count Bytecount $files
gr Element_Count Elemcount $files
gr Hash_Code Hashcode $files
gr extcount bytecount $files
gr bufpos charbpos $files
gr bytind bytebpos $files
gr memind membpos $files
gr bufbyte intbyte $files
gr Extcount Bytecount $files
gr Bufpos Charbpos $files
gr Bytind Bytebpos $files
gr Memind Membpos $files
gr Bufbyte Intbyte $files
gr EXTCOUNT BYTECOUNT $files
gr BUFPOS CHARBPOS $files
gr BYTIND BYTEBPOS $files
gr MEMIND MEMBPOS $files
gr BUFBYTE INTBYTE $files
gr MEMORY_COUNT BYTECOUNT $files
gr LSTREAM_DATA_COUNT BYTECOUNT $files
gr ELEMENT_COUNT ELEMCOUNT $files
gr HASH_CODE HASHCODE $files
----------------------------------- cut ------------------------------------

`fixtypes.sh' is a Bourne-shell script; it uses 'gr':

----------------------------------- cut ------------------------------------
#!/bin/sh

# Usage is like this:

# gr FROM TO FILES ...

# globally replace FROM with TO in FILES. FROM and TO are regular expressions.
# backup files are stored in the `backup' directory.
from="$1"
to="$2"
shift 2
echo ${1+"$@"} | xargs global-replace "s/$from/$to/g"
----------------------------------- cut ------------------------------------

`gr' in turn uses a Perl script to do its real work, `global-replace', which follows:

----------------------------------- cut ------------------------------------
: #-*- Perl -*-

### global-modify --- modify the contents of a file by a Perl expression

## Copyright (C) 1999 Martin Buchholz.
## Copyright (C) 2001 Ben Wing.

## Authors: Martin Buchholz <martin@xemacs.org>, Ben Wing <ben@xemacs.org>
## Maintainer: Ben Wing <ben@xemacs.org>
## Current Version: 1.0, May 5, 2001

# This program is free software; you can redistribute it and/or modify
# it under the terms of the GNU General Public License as published by
# the Free Software Foundation; either version 2, or (at your option)
# any later version.
#
# This program is distributed in the hope that it will be useful, but
# WITHOUT ANY WARRANTY; without even the implied warranty of
# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
# General Public License for more details.
#
# You should have received a copy of the GNU General Public License
# along with XEmacs; see the file COPYING. If not, write to the Free
# Software Foundation, Inc., 59 Temple Place - Suite 330, Boston, MA
# 02111-1307, USA.

eval 'exec perl -w -S $0 ${1+"$@"}'
    if 0;

use strict;
use FileHandle;
use Carp;
use Getopt::Long;
use File::Basename;

(my $myName = $0) =~ s@.*/@@; my $usage="
Usage: $myName [--help] [--backup-dir=DIR] [--line-mode] [--hunk-mode]
       PERLEXPR FILE ...

Globally modify a file, either line by line or in one big hunk.

Typical usage is like this:

[with GNU print, GNU xargs: guaranteed to handle spaces, quotes, etc.
 in file names]
find . -name '*.[ch]' -print0 | xargs -0 $0 's/\bCONST\b/const/g'\n
[with non-GNU print, xargs]
find . -name '*.[ch]' -print | xargs $0 's/\bCONST\b/const/g'\n

The file is read in, either line by line (with --line-mode specified)
or in one big hunk (with --hunk-mode specified; it's the default), and
the Perl expression is then evalled with \$_ set to the line or hunk of
text, including the terminating newline if there is one. It should
destructively modify the value there, storing the changed result in \$_.

Files in which any modifications are made are backed up to the directory
specified using --backup-dir, or to `backup' by default. To disable this,
use --backup-dir= with no argument.

Hunk mode is the default because it is MUCH MUCH faster than line-by-line.
Use line-by-line only when it matters, e.g. you want to do a replacement
only once per line (the default without the `g' argument). Conversely,
when using hunk mode, *ALWAYS* use `g'; otherwise, you will only make one
replacement in the entire file!
";

my %options = ();
$Getopt::Long::ignorecase = 0;
&GetOptions (
  \%options,
  'help', 'backup-dir=s', 'line-mode', 'hunk-mode',
);

die $usage if $options{"help"} or @ARGV <= 1;
my $code = shift;

die $usage if grep (-d || ! -w, @ARGV);

sub SafeOpen {
  open ((my $fh = new FileHandle), $_[0]);
  confess "Can't open $_[0]: $!" if ! defined $fh;
  return $fh;
}

sub SafeClose {
  close $_[0] or confess "Can't close $_[0]: $!";
}

sub FileContents {
  my $fh = SafeOpen ("< $_[0]");
  my $olddollarslash = $/;
  local $/ = undef;
  my $contents = <$fh>;
  $/ = $olddollarslash;
  return $contents;
}

sub WriteStringToFile {
  my $fh = SafeOpen ("> $_[0]");
  binmode $fh;
  print $fh $_[1] or confess "$_[0]: $!\n";
  SafeClose $fh;
}

foreach my $file (@ARGV) {
  my $changed_p = 0;
  my $new_contents = "";
  if ($options{"line-mode"}) {
    my $fh = SafeOpen $file;
    while (<$fh>) {
      my $save_line = $_;
      eval $code;
      $changed_p = 1 if $save_line ne $_;
      $new_contents .= $_;
    }
  } else {
    my $orig_contents = $_ = FileContents $file;
    eval $code;
    if ($_ ne $orig_contents) {
      $changed_p = 1;
      $new_contents = $_;
    }
  }

  if ($changed_p) {
    my $backdir = $options{"backup-dir"};
    $backdir = "backup" if !defined ($backdir);
    if ($backdir) {
      my ($name, $path, $suffix) = fileparse ($file, "");
      my $backfulldir = $path . $backdir;
      my $backfile = "$backfulldir/$name";
      mkdir $backfulldir, 0755 unless -d $backfulldir;
      print "modifying $file (original saved in $backfile)\n";
      rename $file, $backfile;
    }
    WriteStringToFile ($file, $new_contents);
  }
}
----------------------------------- cut ------------------------------------

In addition to those programs, I needed to fix up a few other things, particularly relating to the duplicate definitions of types, now that some types merged with others. Specifically:

1. in lisp.h, removed duplicate declarations of Bytecount. The changed code should now look like this: (In each code snippet below, the first and last lines are the same as the original, as are all lines outside of those lines. That allows you to locate the section to be replaced, and replace the stuff in that section, verifying that there isn't anything new added that would need to be kept.)

--------------------------------- snip -------------------------------------
/* Counts of bytes or chars */
typedef EMACS_INT Bytecount;
typedef EMACS_INT Charcount;

/* Counts of elements */
typedef EMACS_INT Elemcount;

/* Hash codes */
typedef unsigned long Hashcode;

/* ------------------------ dynamic arrays ------------------- */
--------------------------------- snip -------------------------------------

2. in lstream.h, removed duplicate declaration of Bytecount. Rewrote the comment about this type. The changed code should now look like this:

--------------------------------- snip -------------------------------------
#endif

/* There have been some arguments over what the type should be that
   specifies a count of bytes in a data block to be written out or read in,
   using Lstream_read(), Lstream_write(), and related functions.
   Originally it was long, which worked fine; Martin "corrected" these to
   size_t and ssize_t on the grounds that this is theoretically cleaner and
   is in keeping with the C standards.
   Unfortunately, this practice is horribly error-prone due to design
   flaws in the way that mixed signed/unsigned arithmetic happens. In
   fact, by doing this change, Martin introduced a subtle but fatal error
   that caused the operation of sending large mail messages to the SMTP
   server under Windows to fail. By putting all values back to be signed,
   avoiding any signed/unsigned mixing, the bug immediately went away.
   The type then in use was Lstream_Data_Count, so that it could be
   reverted cleanly if a vote came to that. Now it is Bytecount.

   Some earlier comments about why the type must be signed: This MUST BE
   SIGNED, since it also is used in functions that return the number of
   bytes actually read from or written to in an operation, and these
   functions can return -1 to signal error.

   Note that the standard Unix read() and write() functions define the
   count going in as a size_t, which is UNSIGNED, and the count going out
   as an ssize_t, which is SIGNED. This is a horrible design flaw. Not
   only is it highly likely to lead to logic errors when a -1 gets
   interpreted as a large positive number, but operations are bound to
   fail in all sorts of horrible ways when a number in the upper-half of
   the size_t range is passed in -- this number is unrepresentable as an
   ssize_t, so code that checks to see how many bytes are actually written
   (which is mandatory if you are dealing with certain types of devices)
   will get completely screwed up.

   --ben
*/

typedef enum lstream_buffering
--------------------------------- snip -------------------------------------

3. in dumper.c, there are four places, all inside of switch() statements, where XD_BYTECOUNT appears twice as a case tag. In each case, the two case blocks contain identical code, and you should *REMOVE THE SECOND* and leave the first.
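The hazard the log message describes (a -1 error return being read as a huge positive count once it meets an unsigned value) can be sketched in a few lines of C. This is only an illustration under stated assumptions: `enough_mixed' and `enough_signed' are hypothetical helpers that appear nowhere in XEmacs, and the `Bytecount' typedef here is a stand-in based on ptrdiff_t, whereas the real one is a typedef of EMACS_INT.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for the signed count type; in XEmacs the real
   Bytecount is a typedef of EMACS_INT. */
typedef ptrdiff_t Bytecount;

/* Mixed signed/unsigned check (hypothetical helper).  The explicit cast
   spells out the conversion C would otherwise apply implicitly when a
   signed value is compared with a size_t: -1 becomes SIZE_MAX, so an
   error return looks like "plenty of bytes were transferred". */
static int
enough_mixed (ptrdiff_t result, size_t wanted)
{
  return (size_t) result >= wanted;
}

/* All-signed check (hypothetical helper).  -1 stays negative, so the
   error return is never mistaken for a large count. */
static int
enough_signed (Bytecount result, Bytecount wanted)
{
  assert (wanted >= 0);   /* counts of bytes are never negative */
  return result >= wanted;
}
```

With a failed transfer reported as -1 and 10 bytes wanted, enough_mixed (-1, 10) reports success while enough_signed (-1, 10) reports failure, which is the kind of wrong comparison result the conventions above are meant to rule out.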
author ben
date Thu, 20 Sep 2001 06:31:11 +0000
parents b39c14581166
children 6728e641994e
rev   line source
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
1 /* Definitions for data structures and routines for the regular
2 expression library, version 0.12.
3
4 Copyright (C) 1985, 89, 90, 91, 92, 93, 95 Free Software Foundation, Inc.
5
6 This program is free software; you can redistribute it and/or modify
7 it under the terms of the GNU General Public License as published by
8 the Free Software Foundation; either version 2, or (at your option)
9 any later version.
10
11 This program is distributed in the hope that it will be useful,
12 but WITHOUT ANY WARRANTY; without even the implied warranty of
13 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
14 GNU General Public License for more details.
15
16 You should have received a copy of the GNU General Public License
17 along with this program; see the file COPYING. If not, write to
18 the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
19 Boston, MA 02111-1307, USA. */
20
21 /* Synched up with: FSF 19.29. */
22
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 428
diff changeset
23 #ifndef INCLUDED_regex_h_
24 #define INCLUDED_regex_h_
428
25
446
1ccc32a20af4 Import from CVS: tag r21-2-38
cvs
parents: 442
diff changeset
26 #ifdef emacs
27 #define RE_TRANSLATE_TYPE Lisp_Object
28 #else
29 #define RE_TRANSLATE_TYPE char *
665
fdefd0186b75 [xemacs-hg @ 2001-09-20 06:28:42 by ben]
ben
parents: 647
diff changeset
30 #define Elemcount ssize_t
31 #define Bytecount ssize_t
446
32 #endif /* emacs */
33
428
34 /* POSIX says that <sys/types.h> must be included (by the caller) before
35 <regex.h>. */
36
37
38 /* The following bits are used to determine the regexp syntax we
39 recognize. The not-set meaning typically corresponds to the syntax
40 used by Emacs (the exception is RE_INTERVAL, made for historical
41 reasons). The bits are given in alphabetical order, and the
42 definitions shifted by one from the previous bit; thus, when we add or
43 remove a bit, only one other definition need change. */
44 typedef unsigned reg_syntax_t;
45
46 /* If this bit is not set, then \ inside a bracket expression is literal.
47 If set, then such a \ quotes the following character. */
48 #define RE_BACKSLASH_ESCAPE_IN_LISTS (1)
49
50 /* If this bit is not set, then + and ? are operators, and \+ and \? are
51 literals.
52 If set, then \+ and \? are operators and + and ? are literals. */
53 #define RE_BK_PLUS_QM (RE_BACKSLASH_ESCAPE_IN_LISTS << 1)
54
55 /* If this bit is set, then character classes are supported. They are:
56 [:alpha:], [:upper:], [:lower:], [:digit:], [:alnum:], [:xdigit:],
57 [:space:], [:print:], [:punct:], [:graph:], and [:cntrl:].
58 If not set, then character classes are not supported. */
59 #define RE_CHAR_CLASSES (RE_BK_PLUS_QM << 1)
60
61 /* If this bit is set, then ^ and $ are always anchors (outside bracket
62 expressions, of course).
63 If this bit is not set, then it depends:
64 ^ is an anchor if it is at the beginning of a regular
65 expression or after an open-group or an alternation operator;
66 $ is an anchor if it is at the end of a regular expression, or
67 before a close-group or an alternation operator.
68
69 This bit could be (re)combined with RE_CONTEXT_INDEP_OPS, because
70 POSIX draft 11.2 says that * etc. in leading positions is undefined.
71 We already implemented a previous draft which made those constructs
72 invalid, though, so we haven't changed the code back. */
73 #define RE_CONTEXT_INDEP_ANCHORS (RE_CHAR_CLASSES << 1)
74
75 /* If this bit is set, then special characters are always special
76 regardless of where they are in the pattern.
77 If this bit is not set, then special characters are special only in
78 some contexts; otherwise they are ordinary. Specifically,
79 * + ? and intervals are only special when not after the beginning,
80 open-group, or alternation operator. */
81 #define RE_CONTEXT_INDEP_OPS (RE_CONTEXT_INDEP_ANCHORS << 1)
82
83 /* If this bit is set, then *, +, ?, and { cannot be first in an re or
84 immediately after an alternation or begin-group operator. */
85 #define RE_CONTEXT_INVALID_OPS (RE_CONTEXT_INDEP_OPS << 1)
86
87 /* If this bit is set, then . matches newline.
88 If not set, then it doesn't. */
89 #define RE_DOT_NEWLINE (RE_CONTEXT_INVALID_OPS << 1)
90
91 /* If this bit is set, then . doesn't match NUL.
92 If not set, then it does. */
93 #define RE_DOT_NOT_NULL (RE_DOT_NEWLINE << 1)
94
95 /* If this bit is set, nonmatching lists [^...] do not match newline.
96 If not set, they do. */
97 #define RE_HAT_LISTS_NOT_NEWLINE (RE_DOT_NOT_NULL << 1)
98
99 /* If this bit is set, either \{...\} or {...} defines an
100 interval, depending on RE_NO_BK_BRACES.
101 If not set, \{, \}, {, and } are literals. */
102 #define RE_INTERVALS (RE_HAT_LISTS_NOT_NEWLINE << 1)
103
104 /* If this bit is set, +, ? and | aren't recognized as operators.
105 If not set, they are. */
106 #define RE_LIMITED_OPS (RE_INTERVALS << 1)
107
108 /* If this bit is set, newline is an alternation operator.
109 If not set, newline is literal. */
110 #define RE_NEWLINE_ALT (RE_LIMITED_OPS << 1)
111
112 /* If this bit is set, then `{...}' defines an interval, and \{ and \}
113 are literals.
114 If not set, then `\{...\}' defines an interval. */
115 #define RE_NO_BK_BRACES (RE_NEWLINE_ALT << 1)
116
117 /* If this bit is set, (...) defines a group, and \( and \) are literals.
118 If not set, \(...\) defines a group, and ( and ) are literals. */
119 #define RE_NO_BK_PARENS (RE_NO_BK_BRACES << 1)
120
121 /* If this bit is set, then \<digit> matches <digit>.
122 If not set, then \<digit> is a back-reference. */
123 #define RE_NO_BK_REFS (RE_NO_BK_PARENS << 1)
124
125 /* If this bit is set, then | is an alternation operator, and \| is literal.
126 If not set, then \| is an alternation operator, and | is literal. */
127 #define RE_NO_BK_VBAR (RE_NO_BK_REFS << 1)
128
129 /* If this bit is set, then an ending range point collating higher
130 than the starting range point, as in [z-a], is invalid.
131 If not set, then when ending range point collates higher than the
132 starting range point, the range is ignored. */
133 #define RE_NO_EMPTY_RANGES (RE_NO_BK_VBAR << 1)
134
135 /* If this bit is not set, allow minimal matching:
136 - a*? and a+? and a?? perform shortest-possible matching (compare with a*
137 and a+ and a?, respectively, which perform longest-possible matching)
138 - other juxtaposing of * + and ? is rejected.
139 If this bit is set, consecutive * + and ?'s are collapsed in a logical
140 manner:
141 - a*? and a+? are the same as a*
142 - a?? is the same as a?
143 */
144 #define RE_NO_MINIMAL_MATCHING (RE_NO_EMPTY_RANGES << 1)
145
146 /* If this bit is set, succeed as soon as we match the whole pattern,
147 without further backtracking. */
148 #define RE_NO_POSIX_BACKTRACKING (RE_NO_MINIMAL_MATCHING << 1)
149
150 /* If this bit is not set, (?:re) behaves like (re) (or \(?:re\) behaves like
151 \(re\)) except that the matched string is not registered. */
152 #define RE_NO_SHY_GROUPS (RE_NO_POSIX_BACKTRACKING << 1)
153
154 /* If this bit is set, then an unmatched ) is ordinary.
155 If not set, then an unmatched ) is invalid. */
156 #define RE_UNMATCHED_RIGHT_PAREN_ORD (RE_NO_SHY_GROUPS << 1)
157
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
158 /* If this bit is set, then \22 will read as a back reference,
159 provided at least 22 non-shy groups have been seen so far. In all
160 other cases (bit not set, not 22 non-shy groups seen so far), it
161 reads as a back reference \2 followed by a digit 2. */
162 #define RE_NO_MULTI_DIGIT_BK_REFS (RE_UNMATCHED_RIGHT_PAREN_ORD << 1)
163
428
164 /* This global variable defines the particular regexp syntax to use (for
165 some interfaces). When a regexp is compiled, the syntax used is
166 stored in the pattern buffer, so changing this does not affect
167 already-compiled regexps. */
168 extern reg_syntax_t re_syntax_options;
169
170 /* Define combinations of the above bits for the standard possibilities.
171 (The [[[ comments delimit what gets put into the Texinfo file, so
172 don't delete them!) */
173 /* [[[begin syntaxes]]] */
174 #define RE_SYNTAX_EMACS RE_INTERVALS
175
176 #define RE_SYNTAX_AWK \
177 (RE_BACKSLASH_ESCAPE_IN_LISTS | RE_DOT_NOT_NULL \
178 | RE_NO_BK_PARENS | RE_NO_BK_REFS \
179 | RE_NO_BK_VBAR | RE_NO_EMPTY_RANGES \
180 | RE_UNMATCHED_RIGHT_PAREN_ORD | RE_NO_SHY_GROUPS \
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
181 | RE_NO_MINIMAL_MATCHING | RE_NO_MULTI_DIGIT_BK_REFS)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
182
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
183 #define RE_SYNTAX_POSIX_AWK \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
184 (RE_SYNTAX_POSIX_EXTENDED | RE_BACKSLASH_ESCAPE_IN_LISTS)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
185
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
186 #define RE_SYNTAX_GREP \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
187 (RE_BK_PLUS_QM | RE_CHAR_CLASSES \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
188 | RE_HAT_LISTS_NOT_NEWLINE | RE_INTERVALS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
189 | RE_NEWLINE_ALT | RE_NO_SHY_GROUPS \
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
190 | RE_NO_MINIMAL_MATCHING | RE_NO_MULTI_DIGIT_BK_REFS)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
191
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
192 #define RE_SYNTAX_EGREP \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
193 (RE_CHAR_CLASSES | RE_CONTEXT_INDEP_ANCHORS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
194 | RE_CONTEXT_INDEP_OPS | RE_HAT_LISTS_NOT_NEWLINE \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
195 | RE_NEWLINE_ALT | RE_NO_BK_PARENS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
196 | RE_NO_BK_VBAR | RE_NO_SHY_GROUPS \
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
197 | RE_NO_MINIMAL_MATCHING | RE_NO_MULTI_DIGIT_BK_REFS)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
198
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
199 #define RE_SYNTAX_POSIX_EGREP \
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
200 (RE_SYNTAX_EGREP | RE_INTERVALS | RE_NO_BK_BRACES | \
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
201 RE_NO_MULTI_DIGIT_BK_REFS)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
202
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
203 /* P1003.2/D11.2, section 4.20.7.1, lines 5078ff. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
204 #define RE_SYNTAX_ED RE_SYNTAX_POSIX_BASIC
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
205
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
206 #define RE_SYNTAX_SED RE_SYNTAX_POSIX_BASIC
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
207
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
208 /* Syntax bits common to both basic and extended POSIX regex syntax. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
209 #define _RE_SYNTAX_POSIX_COMMON \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
210 (RE_CHAR_CLASSES | RE_DOT_NEWLINE | RE_DOT_NOT_NULL \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
211 | RE_INTERVALS | RE_NO_EMPTY_RANGES | RE_NO_SHY_GROUPS \
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
212 | RE_NO_MINIMAL_MATCHING | RE_NO_MULTI_DIGIT_BK_REFS)
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
213
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
214 #define RE_SYNTAX_POSIX_BASIC \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
215 (_RE_SYNTAX_POSIX_COMMON | RE_BK_PLUS_QM)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
216
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
217 /* Differs from ..._POSIX_BASIC only in that RE_BK_PLUS_QM becomes
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
218 RE_LIMITED_OPS, i.e., \? \+ \| are not recognized. Actually, this
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
219 isn't minimal, since other operators, such as \`, aren't disabled. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
220 #define RE_SYNTAX_POSIX_MINIMAL_BASIC \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
221 (_RE_SYNTAX_POSIX_COMMON | RE_LIMITED_OPS)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
222
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
223 #define RE_SYNTAX_POSIX_EXTENDED \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
224 (_RE_SYNTAX_POSIX_COMMON | RE_CONTEXT_INDEP_ANCHORS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
225 | RE_CONTEXT_INDEP_OPS | RE_NO_BK_BRACES \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
226 | RE_NO_BK_PARENS | RE_NO_BK_VBAR \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
227 | RE_UNMATCHED_RIGHT_PAREN_ORD)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
228
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
229 /* Differs from ..._POSIX_EXTENDED in that RE_CONTEXT_INVALID_OPS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
230 replaces RE_CONTEXT_INDEP_OPS and RE_NO_BK_REFS is added. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
231 #define RE_SYNTAX_POSIX_MINIMAL_EXTENDED \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
232 (_RE_SYNTAX_POSIX_COMMON | RE_CONTEXT_INDEP_ANCHORS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
233 | RE_CONTEXT_INVALID_OPS | RE_NO_BK_BRACES \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
234 | RE_NO_BK_PARENS | RE_NO_BK_REFS \
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
235 | RE_NO_BK_VBAR | RE_UNMATCHED_RIGHT_PAREN_ORD)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
236 /* [[[end syntaxes]]] */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
237
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
238 /* Maximum number of duplicates an interval can allow. Some systems
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
239 (erroneously) define this in other header files, but we want our
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
240 value, so remove any previous define. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
241 #ifdef RE_DUP_MAX
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
242 #undef RE_DUP_MAX
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
243 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
244 #define RE_DUP_MAX ((1 << 15) - 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
245
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
246
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
247 /* POSIX `cflags' bits (i.e., information for `regcomp'). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
248
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
249 /* If this bit is set, then use extended regular expression syntax.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
250 If not set, then use basic regular expression syntax. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
251 #define REG_EXTENDED 1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
252
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
253 /* If this bit is set, then ignore case when matching.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
254 If not set, then case is significant. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
255 #define REG_ICASE (REG_EXTENDED << 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
256
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
257 /* If this bit is set, then anchors do not match at newline
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
258 characters in the string.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
259 If not set, then anchors do match at newlines. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
260 #define REG_NEWLINE (REG_ICASE << 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
261
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
262 /* If this bit is set, then report only success or fail in regexec.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
263 If not set, then returns differ between not matching and errors. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
264 #define REG_NOSUB (REG_NEWLINE << 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
265
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
266
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
267 /* POSIX `eflags' bits (i.e., information for regexec). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
268
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
269 /* If this bit is set, then the beginning-of-line operator doesn't match
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
270 the beginning of the string (presumably because it's not the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
271 beginning of a line).
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
272 If not set, then the beginning-of-line operator does match the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
273 beginning of the string. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
274 #define REG_NOTBOL 1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
275
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
276 /* Like REG_NOTBOL, except for the end-of-line. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
277 #define REG_NOTEOL (1 << 1)
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
278
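The `cflags' and `eflags' bits described above can be exercised through the standard POSIX entry points. The following is a minimal sketch, assuming the system's POSIX <regex.h> is used instead of this header; `posix_match' is a hypothetical helper name, not something this file declares.

```c
#include <assert.h>
#include <regex.h>
#include <stddef.h>

/* Hypothetical helper (not declared in this header): return 1 if PATTERN
   matches somewhere in TEXT, 0 if it does not, and -1 on a compile or
   internal error.  CFLAGS is passed to regcomp, EFLAGS to regexec.  */
int
posix_match (const char *pattern, const char *text, int cflags, int eflags)
{
  regex_t re;
  int rc;

  assert (pattern != NULL && text != NULL);

  /* Only a yes/no answer is wanted, so REG_NOSUB is always added.  */
  if (regcomp (&re, pattern, cflags | REG_NOSUB) != 0)
    return -1;

  /* With REG_NOSUB, no match array is needed.  */
  rc = regexec (&re, text, 0, NULL, eflags);
  regfree (&re);

  if (rc == 0)
    return 1;
  return rc == REG_NOMATCH ? 0 : -1;
}
```

For instance, posix_match ("^abc", "abc", 0, REG_NOTBOL) is expected to report no match, because REG_NOTBOL tells regexec that the start of the string is not the beginning of a line.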
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
279
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
280 /* If any error codes are removed, changed, or added, update the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
281 `re_error_msg' table in regex.c. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
282 typedef enum
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
283 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
284 REG_NOERROR = 0, /* Success. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
285 REG_NOMATCH, /* Didn't find a match (for regexec). */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
286
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
287 /* POSIX regcomp return error codes. (In the order listed in the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
288 standard.) */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
289 REG_BADPAT, /* Invalid pattern. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
290 REG_ECOLLATE, /* Not implemented. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
291 REG_ECTYPE, /* Invalid character class name. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
292 REG_EESCAPE, /* Trailing backslash. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
293 REG_ESUBREG, /* Invalid back reference. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
294 REG_EBRACK, /* Unmatched left bracket. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
295 REG_EPAREN, /* Parenthesis imbalance. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
296 REG_EBRACE, /* Unmatched \{. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
297 REG_BADBR, /* Invalid contents of \{\}. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
298 REG_ERANGE, /* Invalid range end. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
299 REG_ESPACE, /* Ran out of memory. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
300 REG_BADRPT, /* No preceding re for repetition op. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
301
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
302 /* Error codes we've added. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
303 REG_EEND, /* Premature end. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
304 REG_ESIZE, /* Compiled pattern bigger than 2^16 bytes. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
305 REG_ERPAREN /* Unmatched ) or \); not returned from regcomp. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
306 #ifdef emacs
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
307 ,REG_ESYNTAX /* Invalid syntax designator. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
308 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
309 #ifdef MULE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
310 ,REG_ERANGESPAN /* Ranges may not span charsets. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
311 ,REG_ECATEGORY /* Invalid category designator. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
312 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
313 } reg_errcode_t;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
314
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
315 /* This data structure represents a compiled pattern. Before calling
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
316 the pattern compiler, the fields `buffer', `allocated', `fastmap',
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
317 `translate', and `no_sub' can be set. After the pattern has been
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
318 compiled, the `re_nsub' field is available. All other fields are
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
319 private to the regex routines. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
320
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
321 struct re_pattern_buffer
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
322 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
323 /* [[[begin pattern_buffer]]] */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
324 /* Space that holds the compiled pattern. It is declared as
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
325 `unsigned char *' because its elements are
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
326 sometimes used as array indexes. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
327 unsigned char *buffer;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
328
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
329 /* Number of bytes to which `buffer' points. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
330 long allocated;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
331
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
332 /* Number of bytes actually used in `buffer'. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
333 long used;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
334
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
335 /* Syntax setting with which the pattern was compiled. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
336 reg_syntax_t syntax;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
337
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
338 /* Pointer to a fastmap, if any, otherwise zero. re_search uses
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
339 the fastmap, if there is one, to skip over impossible
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
340 starting points for matches. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
341 char *fastmap;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
342
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
343 /* Either a translate table to apply to all characters before
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
344 comparing them, or zero for no translation. The translation
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
345 is applied to a pattern when it is compiled and to a string
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
346 when it is matched. */
446
1ccc32a20af4 Import from CVS: tag r21-2-38
cvs
parents: 442
diff changeset
347 RE_TRANSLATE_TYPE translate;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
348
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
349 /* Number of returnable groups found by the compiler. (This does
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
350 not count shy groups.) */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
351 int re_nsub;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
352
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
353 /* Total number of groups found by the compiler. (Including
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
354 shy ones.) */
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
355 int re_ngroups;
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
356
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
357 /* Zero if this pattern cannot match the empty string, one else.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
358 Well, in truth it's used only in `re_search_2', to see
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
359 whether or not we should use the fastmap, so we don't set
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
360 this absolutely perfectly; see `re_compile_fastmap' (the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
361 `duplicate' case). */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
362 unsigned int can_be_null : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
363
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
364 /* If REGS_UNALLOCATED, allocate space in the `regs' structure
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
365 for `max (RE_NREGS, re_nsub + 1)' groups.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
366 If REGS_REALLOCATE, reallocate space if necessary.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
367 If REGS_FIXED, use what's there. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
368 #define REGS_UNALLOCATED 0
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
369 #define REGS_REALLOCATE 1
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
370 #define REGS_FIXED 2
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
371 unsigned int regs_allocated : 2;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
372
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
373 /* Set to zero when `regex_compile' compiles a pattern; set to one
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
374 by `re_compile_fastmap' if it updates the fastmap. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
375 unsigned int fastmap_accurate : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
376
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
377 /* If set, `re_match_2' does not return information about
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
378 subexpressions. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
379 unsigned int no_sub : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
380
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
381 /* If set, a beginning-of-line anchor doesn't match at the
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
382 beginning of the string. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
383 unsigned int not_bol : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
384
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
385 /* Similarly for an end-of-line anchor. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
386 unsigned int not_eol : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
387
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
388 /* If true, an anchor at a newline matches. */
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
389 unsigned int newline_anchor : 1;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
390
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
391 unsigned int warned_about_incompatible_back_references : 1;
502
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
392
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
393 /* Mapping between back references and groups (may not be
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
394 equivalent with shy groups). */
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
395 int *external_to_internal_register;
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
396
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
397 int external_to_internal_register_size;
7039e6323819 [xemacs-hg @ 2001-05-04 22:41:46 by ben]
ben
parents: 446
diff changeset
398
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
399 /* [[[end pattern_buffer]]] */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
400 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
401
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
402 typedef struct re_pattern_buffer regex_t;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
403
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
404 /* Type for byte offsets within the string. POSIX mandates this. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
405 typedef int regoff_t;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
406
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
407
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
408 /* This is the structure we store register match data in. See
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
409 regex.texinfo for a full description of what registers match. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
410 struct re_registers
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
411 {
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
412 int num_regs;
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
413 regoff_t *start;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
414 regoff_t *end;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
415 };
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
416
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
417
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
418 /* If `regs_allocated' is REGS_UNALLOCATED in the pattern buffer,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
419 `re_match_2' returns information about at least this many registers
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
420 the first time a `regs' structure is passed. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
421 #ifndef RE_NREGS
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
422 #define RE_NREGS 30
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
423 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
424
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
425
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
426 /* POSIX specification for registers. Aside from using different names than
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
427 `re_registers', POSIX uses an array of structures, instead of a
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
428 structure of arrays. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
429 typedef struct
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
430 {
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
431 regoff_t rm_so; /* Byte offset from string's start to substring's start. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
432 regoff_t rm_eo; /* Byte offset from string's start to substring's end. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
433 } regmatch_t;
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
434
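The array-of-structures layout of `regmatch_t' can be illustrated as follows. This is a minimal sketch, assuming the system's POSIX <regex.h> is used instead of this header; `group_span' and the fixed limit of 10 groups are hypothetical choices made for the example.

```c
#include <assert.h>
#include <regex.h>
#include <stddef.h>

/* Hypothetical helper (not declared in this header): match the extended
   regexp PATTERN against TEXT and store the byte offsets of group GROUP
   in *SO and *EO.  Return 0 on success, 1 if there is no match or the
   group did not participate, and -1 if PATTERN fails to compile.  */
int
group_span (const char *pattern, const char *text, size_t group,
            int *so, int *eo)
{
  regex_t re;
  regmatch_t m[10];   /* One structure per group; m[0] is the whole match.  */
  int rc;

  assert (group < 10);

  if (regcomp (&re, pattern, REG_EXTENDED) != 0)
    return -1;

  rc = regexec (&re, text, 10, m, 0);
  regfree (&re);

  /* Unused entries have their offsets set to -1 by regexec.  */
  if (rc != 0 || m[group].rm_so == -1)
    return 1;

  *so = (int) m[group].rm_so;   /* Offset of the first matched byte.  */
  *eo = (int) m[group].rm_eo;   /* Offset just past the last matched byte.  */
  return 0;
}
```

Each element of the array holds both ends of one group, whereas `struct re_registers' keeps all the starts in one array and all the ends in another.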
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
435 /* Declarations for routines. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
436
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
437 /* Set the current default syntax to SYNTAX, and return the old syntax.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
438 You can also simply assign to the `re_syntax_options' variable. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
439 reg_syntax_t re_set_syntax (reg_syntax_t syntax);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
440
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
441 /* Compile the regular expression PATTERN, with length LENGTH
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
442 and syntax given by the global `re_syntax_options', into the buffer
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
443 BUFFER. Return NULL if successful, and an error string if not. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
444 const char *re_compile_pattern (const char *pattern, int length,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
445 struct re_pattern_buffer *buffer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
446
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
447
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
448 /* Compile a fastmap for the compiled pattern in BUFFER; used to
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
449 accelerate searches. Return 0 if successful and -2 if there was an
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
450 internal error. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
451 int re_compile_fastmap (struct re_pattern_buffer *buffer);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
452
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
453
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
454 /* Search in the string STRING (with length LENGTH) for the pattern
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
455 compiled into BUFFER. Start searching at position START, for RANGE
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
456 characters. Return the starting position of the match, -1 for no
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
457 match, or -2 for an internal error. Also return register
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
458 information in REGS (if REGS is nonzero and BUFFER->no_sub is zero). */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
459 int re_search (struct re_pattern_buffer *buffer, const char *string,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
460 int length, int start, int range,
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
461 struct re_registers *regs);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
462
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
463
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
464 /* Like `re_search', but search in the concatenation of STRING1 and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
465 STRING2. Also, stop searching at index START + STOP. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
466 int re_search_2 (struct re_pattern_buffer *buffer, const char *string1,
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
467 int length1, const char *string2, int length2, int start,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
468 int range, struct re_registers *regs, int stop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
469
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
470
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
471 /* Like `re_search', but return how many characters in STRING the regexp
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
472 in BUFFER matched, starting at position START. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
473 int re_match (struct re_pattern_buffer *buffer, const char *string,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
474 int length, int start, struct re_registers *regs);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
475
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
476
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
477 /* Relates to `re_match' as `re_search_2' relates to `re_search'. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
478 int re_match_2 (struct re_pattern_buffer *buffer, const char *string1,
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
479 int length1, const char *string2, int length2,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
480 int start, struct re_registers *regs, int stop);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
481
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
482
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
483 /* Set REGS to hold NUM_REGS registers, storing them in STARTS and
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
484 ENDS. Subsequent matches using BUFFER and REGS will use this memory
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
485 for recording register information. STARTS and ENDS must be
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
486 allocated with malloc, and must each be at least `NUM_REGS * sizeof
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
487 (regoff_t)' bytes long.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
488
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
489 If NUM_REGS == 0, then subsequent matches should allocate their own
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
490 register data.
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
491
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
492 Unless this function is called, the first search or match using
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
493 BUFFER will allocate its own register data, without
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
494 freeing the old data. */
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
495 void re_set_registers (struct re_pattern_buffer *buffer,
647
b39c14581166 [xemacs-hg @ 2001-08-13 04:45:47 by ben]
ben
parents: 502
diff changeset
496 struct re_registers *regs, int num_regs,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
497 regoff_t *starts, regoff_t *ends);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
498
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
499 #ifdef _REGEX_RE_COMP
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
500 /* 4.2 BSD compatibility. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
501 char *re_comp (const char *);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
502 int re_exec (const char *);
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
503 #endif
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
504
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
505 /* POSIX compatibility. */
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
506 int regcomp (regex_t *preg, const char *pattern, int cflags);
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
507 int regexec (const regex_t *preg, const char *string, size_t nmatch,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
508 regmatch_t pmatch[], int eflags);
442
abe6d1db359e Import from CVS: tag r21-2-36
cvs
parents: 440
diff changeset
509 size_t regerror (int errcode, const regex_t *preg, char *errbuf,
428
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
510 size_t errbuf_size);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
511 void regfree (regex_t *preg);
3ecd8885ac67 Import from CVS: tag r21-2-22
cvs
parents:
diff changeset
512
440
8de8e3f6228a Import from CVS: tag r21-2-28
cvs
parents: 428
diff changeset
513 #endif /* INCLUDED_regex_h_ */
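
The POSIX compatibility entry points declared above (regcomp, regexec, regerror, regfree) are normally used together in a compile / execute / free cycle. The following is a minimal sketch of that cycle, assuming a system <regex.h> that provides the standard POSIX interface. The helper name `first_match` and its out-parameters are hypothetical and are not part of this header.

```c
#include <regex.h>
#include <stdio.h>

/* Hypothetical helper (not declared in regex.h): find the first match
   of PATTERN in TEXT using the POSIX entry points.  On success, store
   the match bounds in *START and *END and return 0.  Return REG_NOMATCH
   if nothing matched, or the nonzero regcomp code if PATTERN is bad.  */
static int
first_match (const char *pattern, const char *text, int *start, int *end)
{
  regex_t preg;
  regmatch_t pmatch[1];   /* pmatch[0] describes the whole match */
  char errbuf[128];
  int rc;

  rc = regcomp (&preg, pattern, REG_EXTENDED);
  if (rc != 0)
    {
      /* regerror turns the error code into a readable message.  */
      regerror (rc, &preg, errbuf, sizeof errbuf);
      fprintf (stderr, "regcomp: %s\n", errbuf);
      /* PREG is not valid after a failed regcomp, so it is not freed.  */
      return rc;
    }

  rc = regexec (&preg, text, 1, pmatch, 0);
  if (rc == 0)
    {
      *start = (int) pmatch[0].rm_so;
      *end = (int) pmatch[0].rm_eo;
    }

  /* Release whatever regcomp allocated for the compiled pattern.  */
  regfree (&preg);
  return rc;
}
```

A caller would pass the addresses of two ints, for example `first_match ("[0-9]+", "abc 123 def", &s, &e)`, and read `s` and `e` only when the return value is 0. The pattern is compiled on every call to keep the sketch short. Code that matches repeatedly would keep the `regex_t` around and call regfree once at the end.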