CatgirlIntelligenceAgency/code/processes/crawling-process
2023-08-01 17:28:29 +02:00
..
src (crawler) Fix rare ConcurrentModificationError due to HashSet 2023-08-01 17:28:29 +02:00
build.gradle (crawler) Reduce long term memory allocation in DomainCrawlFrontier 2023-07-29 19:18:12 +02:00
readme.md More restructuring, big bug fixes in keyword extraction. 2023-03-13 17:39:53 +01:00

Crawling Process

The crawling process downloads HTML and saves them into per-domain snapshots.

Central Classes

See Also