CatgirlIntelligenceAgency/code
2023-09-21 15:19:50 +02:00
..
api (converter) Make it possible to sideload documents from a directory tree 2023-09-17 14:35:06 +02:00
common (doc) Remove confusingly outdated ER-diagrams 2023-09-21 15:08:27 +02:00
features-convert (converter, stackexchange-xml) Add the ability to sideload stackexchange data 2023-09-21 12:48:33 +02:00
features-crawl Upgrade antique lombok plugin 2023-08-23 14:34:32 +00:00
features-index (reverse-index) Force() final docs after being written 2023-09-01 15:43:53 +02:00
features-search (index) Move to a lexicon-free index design 2023-08-28 14:02:23 +02:00
libraries (blocking-thread-pool) Add isTerminated convenience function 2023-09-21 12:47:41 +02:00
process-models (crawl-spec) Parquetify crawl spec 2023-09-17 09:41:34 +02:00
processes (converter) Bugfix: Don't try to Path.of() on optional field 2023-09-21 13:27:09 +02:00
services-core (converter, stackexchange-xml) Add the ability to sideload stackexchange data 2023-09-21 12:48:33 +02:00
services-satellite (dating) Implementing @samstorment's fantastic design polish 2023-09-21 15:19:50 +02:00
tools (stackexchange-converter) Create tool for converting stackexchange 7z-files to digestible sqlite db:s 2023-09-20 15:15:13 +02:00
readme.md (crawl-spec) Parquetify crawl spec 2023-09-17 09:41:34 +02:00

Code

This is a pretty large and diverse project with many moving parts.

You'll find a short description in each module of what it does and how it relates to other modules. The modules each have names like "library" or "process" or "feature". These have specific meanings. See doc/module-taxonomy.md.

Overview

A map of the most important components and how they relate can be found below.

image

Services

Processes

Processes are batch jobs that deal with data retrieval, processing and loading.

Tools

Features

Features are relatively stand-alone components that serve some part of the domain. They aren't domain-independent, but isolated.

Libraries and primitives

Libraries are stand-alone code that is independent of the domain logic.

  • common elements for creating a service, a client etc.
  • libraries containing non-search specific code.
    • array - large memory mapped area library
    • btree - static btree library