91c7960800
This commit extracts several previously hardcoded configuration properties and makes them available through system.properties. The documentation is updated to reflect the change, and dead code was removed in the process. CrawlSpecGenerator still feels a bit over-engineered, since it was built for a more general case and every implementation except the current one has been removed, but it is left as-is for now because it is still fairly readable.
diagram
images
crawling.md
module-taxonomy.md
parquet-howto.md
readme.md
sideloading-howto.md
system-properties.md
useful-resources.md
Documentation
Much of the architectural description lives in the code repository itself, next to the code it describes. Start in 📁 ../code/ and poke around.
Operations
- System Properties - JVM property flags
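
As a rough illustration of what a JVM property flag looks like from the code's side, here is a minimal sketch of reading an integer property with a fallback default. The class name, the helper `intProperty`, and the key `example.crawler.pool-size` are hypothetical and are not taken from this repository; the actual flag names are the ones listed in System Properties.

```java
import java.util.Properties;

public class SystemPropertySketch {
    // Hypothetical property name, used only for illustration.
    static final String EXAMPLE_KEY = "example.crawler.pool-size";

    /**
     * Reads an integer property, falling back to defaultValue when the
     * property is missing, blank, or not a valid integer.
     */
    static int intProperty(Properties props, String key, int defaultValue) {
        String raw = props.getProperty(key);
        if (raw == null || raw.isBlank()) {
            return defaultValue;
        }
        try {
            return Integer.parseInt(raw.trim());
        } catch (NumberFormatException e) {
            // A malformed value should not crash start-up in this sketch.
            return defaultValue;
        }
    }

    public static void main(String[] args) {
        // System.getProperties() includes anything passed as -Dkey=value.
        int poolSize = intProperty(System.getProperties(), EXAMPLE_KEY, 256);
        System.out.println(EXAMPLE_KEY + " = " + poolSize);
    }
}
```

With this sketch, running the class with `-Dexample.crawler.pool-size=64` on the `java` command line would override the default of 256. Falling back silently on a malformed value is a design choice of the sketch; failing fast may be preferable for settings where a typo should not go unnoticed.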
How-To
- Sideloading How-To - How to sideload various data sets
- Parquet How-To - Useful tips in working with Parquet files
Set-up
- 📁 ../run/ - Set-up instructions
Documents
- Module Taxonomy - What is a 'service' and what is a 'library'.
- Useful Resources - Links to resources on IR and search engines.