comparison src/ChangeLog @ 4407:4ee73bbe4f8e

Always use boyer_moore in ASCII or Latin-1 buffers with ASCII search strings. 2007-12-26 Aidan Kehoe <kehoea@parhasard.net> * casetab.c: Extend and correct some case table documentation. * search.c (search_buffer): Correct a bug where only the first entry for a character in the case equivalence table was examined in determining if the Boyer-Moore search algorithm is appropriate. If there are case mappings outside of the charset and row of the characters specified in the search string, those case mappings can be safely ignored (and Boyer-Moore search can be used) if we know from the buffer statistics that the corresponding characters cannot occur. * search.c (boyer_moore): Assert that we haven't been passed a string with varying characters sets or rows within character sets. That's what simple_search is for. In the very rare event that a character in the search string has a canonical case mapping that is not in the same character set and row, don't try to search for the canonical character, search for some other character that is in the the desired character set and row. Assert that the case table isn't corrupt. Do not search for any character case mappings that cannot possibly occur in the buffer, given the buffer metadata about its contents.
author Aidan Kehoe <kehoea@parhasard.net>
date Wed, 26 Dec 2007 17:30:16 +0100
parents a2af1ff1761f
children 8bbabcab2c42
comparison
equal deleted inserted replaced
4356:cc293ef846d2 4407:4ee73bbe4f8e
1 2007-12-26 Aidan Kehoe <kehoea@parhasard.net>
2
3 * casetab.c:
4 Extend and correct some case table documentation.
5 * search.c (search_buffer):
6 Correct a bug where only the first entry for a character in the
7 case equivalence table was examined in determining if the
8 Boyer-Moore search algorithm is appropriate.
9
10 If there are case mappings outside of the charset and row of the
11 characters specified in the search string, those case mappings can
12 be safely ignored (and Boyer-Moore search can be used) if we know
13 from the buffer statistics that the corresponding characters cannot
14 occur.
15
16 * search.c (boyer_moore):
17 Assert that we haven't been passed a string with varying
18 characters sets or rows within character sets. That's what
19 simple_search is for.
20
21 In the very rare event that a character in the search string has a
22 canonical case mapping that is not in the same character set and
23 row, don't try to search for the canonical character, search for
24 some other character that is in the the desired character set and
25 row. Assert that the case table isn't corrupt.
26
27 Do not search for any character case mappings that cannot possibly
28 occur in the buffer, given the buffer metadata about its
29 contents.
30
1 2007-12-24 Aidan Kehoe <kehoea@parhasard.net> 31 2007-12-24 Aidan Kehoe <kehoea@parhasard.net>
2 32
3 * symbols.c (Fintern_soft): 33 * symbols.c (Fintern_soft):
4 Provide a new optional third argument, DEFAULT, for those who want 34 Provide a new optional third argument, DEFAULT, for those who want
5 to check if "nil" is a symbol or not. (More realistically, general 35 to check if "nil" is a symbol or not. (More realistically, general