2023-03-12 10:50:31 +01:00
|
|
|
# Crawl Features
|
|
|
|
|
|
|
|
These are bits of search-engine related code that are relatively isolated pieces of business logic,
|
|
|
|
that benefit from the clarity of being kept separate from the rest of the crawling code.
|
|
|
|
|
2023-03-12 11:42:07 +01:00
|
|
|
* [crawl-blocklist](crawl-blocklist/) - IP and URL blocklists
|
|
|
|
* [link-parser](link-parser/) - Code for parsing and normalizing links
|