Look, this will make the git history look funny, but trimming unnecessary depth from the source tree is a very necessary sanity-preserving measure when dealing with a super-modularized codebase like this one. While it makes the project configuration a bit less conventional, it will save you several clicks every time you jump between modules. Which you'll do a lot, because it's *modul*ar. The src/main/java convention makes a lot of sense for a non-modular project though. This ain't that.

| File |
|---|
| java/nu/marginalia/index/journal |
| test/nu/marginalia/index/journal |
| build.gradle |
| readme.md |
Index Journal
The index journal contains a list of entries with keywords and keyword metadata per document.
This journal is written by processes/loading-process and read when constructing the forward and reverse indices.
A journal file consists of a file header followed by a zstd-compressed list of entries. Each entry contains a header with document-level data and a data section with keyword-level data.
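To make the layout concrete, here is a minimal sketch of a header-then-entries file, assuming a made-up field set. The class name, the record fields, and the choice of an entry count as the file header are all hypothetical and are not taken from the project's code. The zstd compression step is left out because the JDK has no zstd codec; in the real format everything after the file header would pass through a zstd stream.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

final class JournalLayoutSketch {
    // Hypothetical entry: document-level header fields plus parallel keyword arrays.
    record Entry(long documentId, long documentMeta, long[] keywordIds, long[] keywordMeta) {}

    static byte[] write(List<Entry> entries) {
        ByteArrayOutputStream bytes = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(bytes)) {
            // File header (assumed): just the number of entries.
            out.writeInt(entries.size());
            // The real journal compresses the entry list with zstd; omitted here.
            for (Entry e : entries) {
                // Entry header: document-level data and the keyword count.
                out.writeLong(e.documentId());
                out.writeLong(e.documentMeta());
                out.writeInt(e.keywordIds().length);
                // Entry data: one (keyword id, keyword metadata) pair per keyword.
                for (int i = 0; i < e.keywordIds().length; i++) {
                    out.writeLong(e.keywordIds()[i]);
                    out.writeLong(e.keywordMeta()[i]);
                }
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return bytes.toByteArray();
    }

    static List<Entry> read(byte[] data) {
        List<Entry> entries = new ArrayList<>();
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            int count = in.readInt();
            for (int n = 0; n < count; n++) {
                long documentId = in.readLong();
                long documentMeta = in.readLong();
                int keywords = in.readInt();
                long[] ids = new long[keywords];
                long[] meta = new long[keywords];
                for (int i = 0; i < keywords; i++) {
                    ids[i] = in.readLong();
                    meta[i] = in.readLong();
                }
                entries.add(new Entry(documentId, documentMeta, ids, meta));
            }
        } catch (IOException ex) {
            throw new UncheckedIOException(ex);
        }
        return entries;
    }
}
```

Storing the keyword count in the entry header lets a reader size its arrays before touching the data section, which is one plausible reason to split an entry into header and data.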
The journal data may be split into multiple files, and the journal writers and readers are designed to handle this transparently via their Paging implementation.
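The idea of reading several files as one stream can be sketched with a chaining iterator. This is not the project's Paging implementation; `PagedIteratorSketch` is a hypothetical name, and each "page" here is an in-memory `Iterable` standing in for one journal file.

```java
import java.util.Collections;
import java.util.Iterator;
import java.util.List;
import java.util.NoSuchElementException;

final class PagedIteratorSketch<T> implements Iterator<T> {
    private final Iterator<? extends Iterable<T>> pages;
    private Iterator<T> current = Collections.emptyIterator();

    PagedIteratorSketch(List<? extends Iterable<T>> pages) {
        this.pages = pages.iterator();
    }

    @Override
    public boolean hasNext() {
        // Advance to the next page until one has data or no pages remain.
        while (!current.hasNext() && pages.hasNext()) {
            current = pages.next().iterator();
        }
        return current.hasNext();
    }

    @Override
    public T next() {
        if (!hasNext()) {
            throw new NoSuchElementException();
        }
        return current.next();
    }
}
```

A caller iterates as if there were a single source; page boundaries, including empty pages, never surface in the loop.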