annotate lib-src/ad2c @ 4407:4ee73bbe4f8e

Always use boyer_moore in ASCII or Latin-1 buffers with ASCII search strings. 2007-12-26 Aidan Kehoe <kehoea@parhasard.net> * casetab.c: Extend and correct some case table documentation. * search.c (search_buffer): Correct a bug where only the first entry for a character in the case equivalence table was examined in determining if the Boyer-Moore search algorithm is appropriate. If there are case mappings outside of the charset and row of the characters specified in the search string, those case mappings can be safely ignored (and Boyer-Moore search can be used) if we know from the buffer statistics that the corresponding characters cannot occur. * search.c (boyer_moore): Assert that we haven't been passed a string with varying characters sets or rows within character sets. That's what simple_search is for. In the very rare event that a character in the search string has a canonical case mapping that is not in the same character set and row, don't try to search for the canonical character, search for some other character that is in the the desired character set and row. Assert that the case table isn't corrupt. Do not search for any character case mappings that cannot possibly occur in the buffer, given the buffer metadata about its contents.
author Aidan Kehoe <kehoea@parhasard.net>
date Wed, 26 Dec 2007 17:30:16 +0100
parents 376386a54a3c
children ac2d302a0011 26a007fa2f4c
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
0
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
1 #!/bin/sh
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
2 #
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
3 # ad2c : Convert app-defaults file to C strings decls.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
4 #
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
5 # George Ferguson, ferguson@cs.rcohester.edu, 12 Nov 1990.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
6 # 19 Mar 1991 : gf
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
7 # Made it self-contained.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
8 # 6 Jan 1992 : mycroft@gnu.ai.mit.edu (Charles Hannum)
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
9 # Removed use of "-n" and ":read" label since Gnu and
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
10 # IBM sed print pattern space on "n" command. Still works
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
11 # with Sun sed, of course.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
12 # 7 Jan 1992: matthew@sunpix.East.Sun.COM (Matthew Stier)
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
13 # Escape quotes after escaping backslashes.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
14 #
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
15 # Synched up with: Not in FSF.
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
16
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
17 sed '
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
18 /^!/d
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
19 /^$/d
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
20 s/\\/\\\\/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
21 s/\\$//g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
22 s/"/\\"/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
23 s/^/"/
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
24 : test
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
25 /\\$/b slash
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
26 s/$/",/
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
27 p
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
28 d
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
29 : slash
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
30 n
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
31 /^!/d
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
32 /^$/d
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
33 s/"/\\"/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
34 s/\\\\/\\/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
35 s/\\n/\\\\n/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
36 s/\\t/\\\\t/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
37 s/\\f/\\\\f/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
38 s/\\b/\\\\b/g
376386a54a3c Import from CVS: tag r19-14
cvs
parents:
diff changeset
39 b test' "$@"