CatgirlIntelligenceAgency/code/common
Viktor Lofgren b74a3ebd85 (crawler) WIP integration of WARC files into the crawler process.
At this stage, the crawler will use the WARCs to resume a crawl if it terminates incorrectly.

This is a WIP commit, since the warc files are not fully incorporated into the work flow, they are deleted after the domain is crawled.

The commit also includes fairly invasive refactoring of the crawler classes, to accomplish better separation of concerns.
2023-12-11 19:32:58 +01:00
..
config Initial Commit Anchor Tags 2023-11-04 14:24:17 +01:00
db (db) Fix migrations, bump flyway to 10.0.1 2023-11-21 20:04:35 +01:00
linkdb (*) WIP Control GUI redesign, executor-service, multi-node mq 2023-10-14 12:08:43 +02:00
model (crawler) WIP integration of WARC files into the crawler process. 2023-12-11 19:32:58 +01:00
process (*) WIP Add node affinity to EC_DOMAIN 2023-10-19 17:48:34 +02:00
renderer (search) Fix script loading for mobile support 2023-12-02 17:06:40 +01:00
service (mqapi/control) Repair repartition endpoint, deprecate notify endpoints. 2023-11-27 16:01:12 +01:00
service-client (search) Move site information out of the search service and into assistant. 2023-12-09 16:30:06 +01:00
service-discovery (mqapi/control) Repair repartition endpoint, deprecate notify endpoints. 2023-11-27 16:01:12 +01:00
readme.md Update readme.md 2023-03-25 15:27:11 +01:00

Common

These are packages containing the basic building blocks for running a service as well as shared models.

  • db contains SQL code and some database-related utilities.
  • config contains some @Injectables.
  • renderer contains utility code for rendering website templates.
  • service is the shared base classes for main methods and web services.
  • service-client is the shared base class for RPC.
  • service-discovery contains tools that lets the services find each other.
  • process contains boiler plate for batch processes.