40c9d2050f
Removed the need to have to run an external tool to pre-process the data in order to load stackexchange-style data into the search engine. Removed the tool itself. This stirred up some issues with the dependencies, that were due to both third-party:ing xz and importing it as a dependency. This has been fixed, and :third-party:xz was removed. |
||
---|---|---|
.. | ||
diagram | ||
images | ||
crawling.md | ||
module-taxonomy.md | ||
parquet-howto.md | ||
readme.md | ||
sideloading-howto.md | ||
system-properties.md | ||
useful-resources.md |
Documentation
A lot of the architectural description is sprinkled into the code repository closer to the code. Start in 📁 ../code/ and poke around.
Operations
- System Properties - JVM property flags
How-To
- Sideloading How-To - How to sideload various data sets
- Parquet How-To - Useful tips in working with Parquet files
Set-up
- 📁 ../run/ - Set-up instructions
Documents
- Module Taxonomy - What is a 'service' and what is a 'library'.
- Useful Resources - Links to resources on IR and search engines.