CatgirlIntelligenceAgency/code/common/config
Viktor Lofgren 0caef1b307 (warc) Toggle for saving WARC data
Add a toggle for saving the WARC data generated by the search engine's crawler.  Normally this is discarded, but for debugging or archival purposes, retaining it may be of interest.

The warc files are concatenated into larger archives, up to about 1 GB each.
An index is also created containing filenames, domain names, offsets and sizes
to help navigate these larger archives.

The warc data is saved in a directory warc/ under the crawl data storage.
2024-01-12 13:45:14 +01:00
..
src (warc) Toggle for saving WARC data 2024-01-12 13:45:14 +01:00
build.gradle (*) WIP Control GUI redesign, executor-service, multi-node mq 2023-10-14 12:08:43 +02:00
readme.md Move all code to a code directory. 2023-03-07 17:14:32 +01:00

Config

This package contains configuration injectables used by the services.