CatgirlIntelligenceAgency/code/features-index/index-reverse
Viktor Lofgren 9894f37412 (index) Implement new URL ID coding scheme.
Also refactor along the way.  Really needs an additional pass, these tests are very hairy.
2023-08-24 16:44:27 +02:00
..
src (index) Implement new URL ID coding scheme. 2023-08-24 16:44:27 +02:00
build.gradle Upgrade antique lombok plugin 2023-08-23 14:34:32 +00:00
readme.md Update readme.md 2023-03-22 17:09:48 +01:00

Reverse Index

The reverse index contains a mapping from word to document id.

There are two tiers of this index.

  • A priority index which only indexes terms that are flagged with priority flags1.
  • A full index that indexes all terms.

The full index also provides access to term-level metadata, while the priority index is a binary index that only offers information about which documents has a specific word.

[1] See WordFlags in common/model and KeywordMetadata in features-convert/keyword-extraction.

Central Classes