annotate src/README.kkcc @ 5776:65d65b52d608
Pass character count from coding systems to buffer insertion code.
src/ChangeLog addition:
2014-01-16 Aidan Kehoe <kehoea@parhasard.net>
Pass character count information from the no-conversion and
unicode coding systems to the buffer insertion code, making
#'find-file on large buffers a little snappier (if
ERROR_CHECK_TEXT is not defined).
* file-coding.c:
* file-coding.c (coding_character_tell): New.
* file-coding.c (conversion_coding_stream_description): New.
* file-coding.c (no_conversion_convert):
Update characters_seen when decoding.
* file-coding.c (no_conversion_character_tell): New.
* file-coding.c (lstream_type_create_file_coding): Create the
no_conversion type with data.
* file-coding.c (coding_system_type_create):
Make the character_tell method available here.
* file-coding.h:
* file-coding.h (struct coding_system_methods):
Add a new character_tell() method, passing charcount information
from the coding systems to the buffer code, avoiding duplicate
bytecount-to-charcount work especially with large buffers.
* fileio.c (Finsert_file_contents_internal):
Update this to pass charcount information to
buffer_insert_string_1(), if that is available from the lstream code.
* insdel.c:
* insdel.c (buffer_insert_string_1):
Add a new CCLEN argument, giving the character count of the string
to insert. It can be -1 to indicate that the function should work
it out itself using bytecount_to_charcount(), as it used to.
* insdel.c (buffer_insert_raw_string_1):
* insdel.c (buffer_insert_lisp_string_1):
* insdel.c (buffer_insert_ascstring_1):
* insdel.c (buffer_insert_emacs_char_1):
* insdel.c (buffer_insert_from_buffer_1):
* insdel.c (buffer_replace_char):
Update these functions to use the new calling convention.
* insdel.h:
* insdel.h (buffer_insert_string):
Update this header to reflect the new buffer_insert_string_1()
argument.
* lstream.c (Lstream_character_tell): New.
Return the number of characters *read* and seen by the consumer so
far, taking into account the unget buffer, and buffered reading.
* lstream.c (Lstream_unread):
Update unget_character_count here as appropriate.
* lstream.c (Lstream_rewind):
Reset unget_character_count here too.
* lstream.h:
* lstream.h (struct lstream):
Provide the character_tell method, add a new field,
unget_character_count, giving the number of characters ever passed
to Lstream_unread().
Declare Lstream_character_tell().
Make Lstream_ungetc(), which happens to be unused, an inline
function rather than a macro, in the course of updating it to
modify unget_character_count.
* print.c (output_string):
Use the new argument to buffer_insert_string_1().
* tests.c:
* tests.c (Ftest_character_tell):
New test function.
* tests.c (syms_of_tests):
Make it available.
* unicode.c:
* unicode.c (struct unicode_coding_stream):
* unicode.c (unicode_character_tell):
New method.
* unicode.c (unicode_convert):
Update the character counter as appropriate.
* unicode.c (coding_system_type_create_unicode):
Make the character_tell method available.
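The CCLEN convention described for buffer_insert_string_1() above can be illustrated with a minimal sketch. Everything named sketch_* is hypothetical and is not the XEmacs code: the counting function merely stands in for bytecount_to_charcount(), and it assumes UTF-8 text purely for illustration, whereas XEmacs's internal text format differs.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for bytecount_to_charcount().  Assumes UTF-8:
   every byte that is not a continuation byte (10xxxxxx) starts a
   character.  This is the linear scan the changeset tries to avoid. */
long
sketch_bytecount_to_charcount (const unsigned char *data, size_t nbytes)
{
  long count = 0;
  size_t i;

  for (i = 0; i < nbytes; i++)
    if ((data[i] & 0xC0) != 0x80)
      count++;
  return count;
}

/* Hypothetical analogue of the CCLEN argument: a caller that already
   knows the character count passes it in; -1 asks for it to be
   worked out from the bytes, as before. */
long
sketch_resolve_cclen (const unsigned char *data, size_t nbytes, long cclen)
{
  if (cclen < 0)
    cclen = sketch_bytecount_to_charcount (data, nbytes);
  /* A character occupies at least one byte. */
  assert ((size_t) cclen <= nbytes);
  return cclen;
}
```

A caller such as a decoding stream that has already counted characters would pass its count as the last argument and skip the scan; any other caller passes -1.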
author | Aidan Kehoe <kehoea@parhasard.net> |
---|---|
date | Thu, 16 Jan 2014 16:27:52 +0000 |
parents | 3889ef128488 |
children |
rev | line source |
---|---|
992 | 1 2002-07-17 Marcus Crestani <crestani@informatik.uni-tuebingen.de> |
2 Markus Kaltenbach <makalten@informatik.uni-tuebingen.de> | |
3 Mike Sperber <mike@xemacs.org> | |
4 | |
1598 | 5 updated 2003-07-29 |
6 | |
992 | 7 New KKCC-GC mark algorithm: |
8 configure flag : --use-kkcc | |
9 | |
10 For better understanding, first a few words about the mark algorithm | |
11 up to now: | |
12 Every Lisp_Object has its own mark method, which calls mark_object | |
13 with the stuff to be marked. | |
1598 | 14 Also, many Lisp_Objects have pdump descriptions (memory_descriptions), |
15 which are used by the portable dumper. The dumper gets all the | |
16 information it needs about the Lisp_Object from the descriptions. | |
992 | 17 |
18 Also the garbage collector can use the information in the pdump | |
19 descriptions, so we can get rid of the mark methods. | |
20 That is what we have been doing. | |
21 | |
22 | |
23 DUMPABLE FLAG | |
24 ------------- | |
25 First we added a dumpable flag to lrecord_implementation. It indicates | |
26 whether the object is dumpable and should be processed by the dumper. | |
27 The dumpable flag is the third argument of an lrecord_implementation | |
28 definition (DEFINE_LRECORD_IMPLEMENTATION). | |
29 If it is set to 1, the dumper processes the descriptions and dumps | |
30 the object; if it is set to 0, the dumper ignores the object. | |
31 | |
32 | |
1598 | 33 KKCC MARKING |
34 ------------ | |
35 All Lisp_Objects have memory_descriptions now, so we could get | |
36 rid of the mark_object calls. | |
37 The KKCC algorithm manages its own stack. Instead of calling | |
38 mark_object, all live Lisp_Objects are pushed onto the | |
39 kkcc_gc_stack. The elements on the stack are then processed | |
40 according to their descriptions. | |
41 | |
42 | |
992 | 43 TODO |
44 ---- | |
1598 | 45 - For weakness use weak datatypes instead of XD_FLAG_NO_KKCC. |
46 XD_FLAG_NO_KKCC occurs in: | |
47 * elhash.c: htentry | |
48 * extents.c: lispobject_gap_array, extent_list, extent_info | |
49 * marker.c: marker | |
50 Not everything has to be rewritten. See Ben's comment in lrecord.h. | |
51 - Clean up special case marking (weak_hash_tables, weak_lists, | |
52 ephemerons). | |
53 - Stack optimization (keep one stack for the whole runtime instead of | |
54 malloc/free-ing it on every garbage collection) | |
992 | 55 |
5384 | 56 There are a few Lisp_Objects for which differences and |
1204 | 57 inexactness occurred between the mark method and the pdump description. All |
58 these Lisp_Objects get dumped (except image instances), so their | |
59 descriptions had already been written before we started our work: | |
992 | 60 * alloc.c: string |
1598 | 61 description: size_, data_, and plist are described |
62 mark: only plist is marked, but flush_cached_extent_info is called. | |
63 flush_cached_extent_info -> | |
64 free_soe -> | |
65 free_extent_list -> | |
66 free_gap_array -> | |
67 gap_array_delete_all_markers -> | |
68 Add gap_array to the gap_array_marker_freelist | |
992 | 69 |
1204 | 70 * glyphs.c: image_instance |
1598 | 71 description: device is not set to nil |
1204 | 72 mark: mark method sets device to nil if dead |
1598 | 73 See comment above the description. |
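
The stack-based marking described under KKCC MARKING can be sketched roughly as follows. This is a simplified illustration, not the XEmacs implementation: all sketch_* names are hypothetical, and a "description" is reduced here to a list of byte offsets of pointer fields, which is far less than a real memory_description expresses.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical, reduced description: where the pointer fields live. */
struct sketch_description
{
  size_t nfields;
  const size_t *offsets;	/* byte offsets from the object start */
};

/* Hypothetical object header: description plus mark bit. */
struct sketch_header
{
  const struct sketch_description *desc;
  int marked;
};

/* Example object type with two pointer fields. */
struct sketch_cons
{
  struct sketch_header header;
  struct sketch_header *car;
  struct sketch_header *cdr;
};

static const size_t sketch_cons_offsets[] = {
  offsetof (struct sketch_cons, car),
  offsetof (struct sketch_cons, cdr)
};

const struct sketch_description sketch_cons_description =
  { 2, sketch_cons_offsets };

/* Explicit mark stack; its buffer survives between collections. */
struct sketch_stack
{
  struct sketch_header **items;
  size_t size, capacity;
};

/* Mark OBJ and push it, unless it is NULL or already marked.
   Returns 0 only if the stack could not grow. */
int
sketch_push (struct sketch_stack *s, struct sketch_header *obj)
{
  if (obj == NULL || obj->marked)
    return 1;
  if (s->size == s->capacity)
    {
      size_t ncap = s->capacity ? 2 * s->capacity : 16;
      struct sketch_header **n = realloc (s->items, ncap * sizeof *n);
      if (n == NULL)
	return 0;
      s->items = n;
      s->capacity = ncap;
    }
  obj->marked = 1;
  s->items[s->size++] = obj;
  return 1;
}

/* Mark everything reachable from ROOT without recursion: pop an
   object, then push the children its description points at. */
int
sketch_mark_from (struct sketch_stack *s, struct sketch_header *root)
{
  if (!sketch_push (s, root))
    return 0;
  while (s->size > 0)
    {
      struct sketch_header *obj = s->items[--s->size];
      size_t i;

      assert (obj->desc != NULL);
      for (i = 0; i < obj->desc->nfields; i++)
	{
	  struct sketch_header *child =
	    *(struct sketch_header **) ((char *) obj + obj->desc->offsets[i]);
	  if (!sketch_push (s, child))
	    return 0;
	}
    }
  return 1;
}
```

Because the stack's buffer is kept in the caller's struct sketch_stack rather than allocated inside sketch_mark_from(), the same buffer can be reused across collections, which is one possible reading of the "Stack optimization" TODO item. Marking an object before pushing it keeps cyclic structures from being pushed twice.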