- 17 May, 2002 25 commits
-
-
Tim Peters authored
It's possible to get OOV wids here due to words the lexicon knows about that the index has no current instances of.
-
Tim Peters authored
in the reindexing text.
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
*any* words in common across the versions. Helped Will along by adding a pragmatic comment to his "knocking indeed" rant. Reworked to use the inscrutable magic of dict.setdefault.
-
Tim Peters authored
-
Tim Peters authored
take longer to construct now; both indexers' _get_frequencies routines were fiddled to return the same kind of stuff again, and I had previously fiddled the cosine indexer's _get_frequencies to do something weirder but (probably) faster than this.
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
code left in it!
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
get_words, and in calling contexts nothing but a list of wids could possibly make sense.
-
Tim Peters authored
I need a break.
-
Tim Peters authored
indexers. CAUTION: I'm sure I don't understand how persistency needs to be spelled. Is it enough to say just that the base class derives from Persistent, or does that need to be duplicated (or done instead exclusively) in the derived classes? Is there a point to keeping "import ZODB" in the derived-class files? Is there a point to keeping it anywhere <wink>?
-
Tim Peters authored
logic to deal with all cases. All the tests pass again.
-
Tim Peters authored
globToWordIds(): This was building a list of words and then throwing it away without referencing it. Deleted the code.
-
- 16 May, 2002 15 commits
-
-
Jeremy Hylton authored
If we update a document and reindex it, ZCTextIndex is currently broken. The test passes py virtue of calling unindex_object() after each update, then calling index_object() again. We need to fix our code, and then remove the calls to unindex_object() from the test. XXX This code causes OkapiIndex to fail because it doesn't expect to have no wordinfo for a wid. I tried to fix this in CosineIndex, but I want to Tim think more about it and try to fix OkapiIndex.
-
Jeremy Hylton authored
-
Jeremy Hylton authored
-
Jeremy Hylton authored
This case can arise when the last occurence of a word is removed, or when a lexicon is shared across multiple indexes. XXX Not sure this code is correct, but it might be and the tests pass. If it's wrong, we need more tests.
-
Guido van Rossum authored
-
Guido van Rossum authored
-
Guido van Rossum authored
-
Guido van Rossum authored
-
Jeremy Hylton authored
-
Tim Peters authored
internals.
-
Tim Peters authored
testStopWords(). That isn't the real point. The real point is to fiddle the scaffolding enough to make it possible to run a white box test of the Okapi indexer too. That isn't here yet.
-
Tim Peters authored
-
Tim Peters authored
-
Tim Peters authored
with the docid is greater than 0 too.
-
Tim Peters authored
and the Okapi indexer. testZCTextIndex was, and remains, strongly tied to the cosine indexer.
-