CatgirlIntelligenceAgency/code
Viktor Lofgren bf92c270dc (language) Rollback language filter change a bit.
It appears to lead to too much junk in the lexicon.
2023-08-23 10:16:57 +02:00
..
api (forward-index, valuator) HTML features in valuator 2023-08-18 11:54:56 +02:00
common (control-service) Basic GUI for deleting bad links from exploration mode 2023-08-21 18:35:26 +02:00
features-convert (summary) Reduce the chance of expensive operations 2023-08-16 15:48:34 +02:00
features-crawl (crawler) Update URL blocklist 2023-07-10 18:58:43 +02:00
features-index (forward-index, valuator) HTML features in valuator 2023-08-18 11:54:56 +02:00
features-search (search) Add refresh link to explore mode. 2023-08-22 12:43:44 +02:00
libraries (mq) Down-tune polling intervals in MQ 2023-08-22 11:49:30 +02:00
process-models (forward-index, valuator) HTML features in valuator 2023-08-18 11:54:56 +02:00
processes (language) Rollback language filter change a bit. 2023-08-23 10:16:57 +02:00
services-core (search) Add refresh link to explore mode. 2023-08-22 12:43:44 +02:00
services-satellite (api-service) Mq endpoint for flushing caches 2023-08-05 14:42:16 +02:00
tools (language) fasttext based language filter 2023-08-16 15:48:12 +02:00
readme.md (docs) Add control-service to the main readme.md 2023-08-01 22:52:41 +02:00

Code

This is a pretty large and diverse project with many moving parts.

You'll find a short description in each module of what it does and how it relates to other modules. The modules each have names like "library" or "process" or "feature". These have specific meanings. See doc/module-taxonomy.md.

Overview

A map of the most important components and how they relate can be found below.

image

Services

Processes

Processes are batch jobs that deal with data retrieval, processing and loading.

Tools

Features

Features are relatively stand-alone components that serve some part of the domain. They aren't domain-independent, but isolated.

Libraries and primitives

Libraries are stand-alone code that is independent of the domain logic.

  • common elements for creating a service, a client etc.
  • libraries containing non-search specific code.
    • array - large memory mapped area library
    • btree - static btree library