comparison notes.txt @ 30:16eff0d30d4d

tidied dereferencing, added simple (no recursion) coverage for variables in ranges
author Henry S. Thompson <ht@markup.co.uk>
date Tue, 11 Apr 2017 17:03:39 +0100
parents c56a2e6990bd
children e500d7c18aad
comparison
equal deleted inserted replaced
29:87ed04a0fde2 30:16eff0d30d4d
58 there was content before. DONE 58 there was content before. DONE
59 ----------- 59 -----------
60 Using attributes to hold space-separated lists is risky, as in 60 Using attributes to hold space-separated lists is risky, as in
61 refs.xsl output, is risky! Fixed, see below. 61 refs.xsl output, is risky! Fixed, see below.
62 ----------- 62 -----------
63 Not handling variables as references. Not catching external 63 Not handling variables as references FIXED. Not catching external
64 references to variables. Not catching naked [n]! as external 64 references to variables FIXED (as externals). Not catching naked [n]! as external
65 references. 65 references FIXED
66 Fixed, but not dereferenced vars 66 Solo local vars are recursively dereferenced
67 The definition table is in workbook.xml definedNames/definedName[@name=$name]/. 67 The definition table is in workbook.xml definedNames/definedName[@name=$name]/.
68 Sheet name to filename mapping for locals is in workbook.xml sheets/sheet[@name=$sname]/@sheetId 68 Sheet name to filename mapping for locals is in workbook.xml sheets/sheet[@name=$sname]/@sheetId
69 Variables on l or r of ranges are just looked up: if they are complex
70 no recursion is done: the _semantics_ of this case are not clear to
71 me, need a real-life example...
69 ----------- 72 -----------
70 Switch to default namespace in order to reduce size and improve 73 Switch to default namespace in order to reduce size and improve
71 readability, and to elements instead of attributes DONE 74 readability, and to elements instead of attributes DONE
72 ----------- 75 -----------
73 Should put another step after refs.xsl to compute a map from 76 Should put another step after refs.xsl to compute a map from
74 distinct-values of all targets to all the cells which use them 77 distinct-values of all targets to all the cells which use them
75 (likewise ranges) DONE. That really does mean we should move to elts for 78 (likewise ranges) DONE. That really does mean we should move to elts for
76 each ref or range, since at this point we want to compute vector 79 each ref or range, since at this point we want to compute vector
77 representation as well, so we can identify projections 80 representation as well DONE, so we can identify projections
78 81
79 Slightly irritating that we'll have to serialise this as XML and then 82 Slightly irritating that we'll have to serialise this as XML and then
80 re-build it later... 83 re-build it later...
81 ----------- 84 -----------
82 Overgenerating in kenneth_lay__19506: e.g. <e:ref c="E9" er="[1]!'.SPX' '.SPX'!"/> 85 Overgenerating in kenneth_lay__19506: e.g. <e:ref c="E9" er="[1]!'.SPX' '.SPX'!"/>
100 <val>1264.96</val> 103 <val>1264.96</val>
101 </value> 104 </value>
102 </values> 105 </values>
103 </ddeItem> 106 </ddeItem>
104 Whew! 107 Whew!
108 FIXED
105 ---------- 109 ----------
106 http://upcommons.upc.edu/bitstream/handle/2117/100584/KDIR_2016_47_CR.pdf 110 http://upcommons.upc.edu/bitstream/handle/2117/100584/KDIR_2016_47_CR.pdf
107 [downloaded] 111 [downloaded]
108 uses appearance a lot. That needs to be harvested from styles.xml 112 uses appearance a lot. That needs to be harvested from styles.xml
109 The kenneth_lay enron sample has _403_ numbered formats... 113 The kenneth_lay enron sample has _403_ numbered formats...
110 114 ----------
111 Tried the largest sheet from the largest .xlsx I could find: 115 Tried the largest sheet from the largest .xlsx I could find:
112 fuse1k/'benjamin_rogers__1002__NYISO Price Information version 2'.xlsx 116 fuse1k/'benjamin_rogers__1002__NYISO Price Information version 2'.xlsx
113 -rw-r--r-- 1 ht None 6273325 Apr 3 16:22 '../benjamin_rogers__1002__NYISO Price Information version 2.xlsx' 117 -rw-r--r-- 1 ht None 6273325 Apr 3 16:22 '../benjamin_rogers__1002__NYISO Price Information version 2.xlsx'
114 -rw-r--r-- 1 ht None 23221149 Jan 1 1980 xl/worksheets/sheet3.xml 118 -rw-r--r-- 1 ht None 23221149 Jan 1 1980 xl/worksheets/sheet3.xml
115 119