CatgirlIntelligenceAgency/code/features-index/index-reverse
Viktor Lofgren dcf6218cdb Fix bugs related to search result selection in the case with multiple search terms.
* A deduplication filter step ran too early, and removed many good results on the basis that they partially, but did not fully fit another set of search terms.

* Altered the query creation process to prefer documents where multiple terms appear in the priority index.
2023-03-29 15:18:52 +02:00
..
src Fix bugs related to search result selection in the case with multiple search terms. 2023-03-29 15:18:52 +02:00
build.gradle The refactoring will continue until morale improves. 2023-03-12 10:50:31 +01:00
readme.md Update readme.md 2023-03-22 17:09:48 +01:00

Reverse Index

The reverse index contains a mapping from word to document id.

There are two tiers of this index.

  • A priority index which only indexes terms that are flagged with priority flags1.
  • A full index that indexes all terms.

The full index also provides access to term-level metadata, while the priority index is a binary index that only offers information about which documents has a specific word.

[1] See WordFlags in common/model and KeywordMetadata in features-convert/keyword-extraction.

Central Classes