955 B
955 B
Lexicon
The lexicon contains a mapping for words to identifiers. This lexicon is populated from a journal. The actual word data isn't mapped, but rather a 64 bit hash.
The lexicon is written by crawl-processes/loading-process and read when services-core/index-service interprets queries.
Central Classes
- KeywordLexicon
- KeywordLexiconJournal
- DictionaryMap comes in two versions
-
- OnHeapDictionaryMap - basically just a fastutil Long2IntOpenHashMap
-
- OffHeapDictionaryHashMap - a heavily modified trove TLongIntHashMap that uses off heap memory