CatgirlIntelligenceAgency/third-party
Viktor Lofgren 27ffb8fa8a (converter) Integrate zim->db conversion into automatic encyclopedia processing workflow
Previously, in order to load encyclopedia data into the search engine, it was necessary to use the encyclopedia.marginalia.nu converter to first create a .db-file. This isn't very ergonomic, so parts of that codebase were lifted in as a third-party library, and conversion from .zim to .db is now done automatically.

The output file name is based on the original filename, plus a crc32 hash and a .db-ending, to ensure we can recycle the data on repeat loads.
2024-01-19 13:59:03 +01:00
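The naming scheme described in the commit message can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the class and method names (`DbFileName`, `crc32Hex`, `dbPathFor`) are hypothetical, and it assumes the CRC32 is computed over the contents of the .zim file and appended in hexadecimal, which the commit message does not spell out.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.zip.CRC32;

// Hypothetical helper; names and exact file name layout are assumptions.
class DbFileName {

    // Streams the file through java.util.zip.CRC32 and returns the checksum as lowercase hex.
    static String crc32Hex(Path file) throws IOException {
        CRC32 crc = new CRC32();
        byte[] buffer = new byte[8192];
        try (InputStream in = Files.newInputStream(file)) {
            int n;
            while ((n = in.read(buffer)) != -1) {
                crc.update(buffer, 0, n);
            }
        }
        return Long.toHexString(crc.getValue());
    }

    // Derives "<original name>.<crc32>.db" next to the input file (assumed layout).
    static Path dbPathFor(Path zimFile) throws IOException {
        String name = zimFile.getFileName().toString();
        return zimFile.resolveSibling(name + "." + crc32Hex(zimFile) + ".db");
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: DbFileName <file.zim>");
            return;
        }
        Path db = dbPathFor(Path.of(args[0]));
        // If the derived file already exists, a repeat load could reuse it instead of converting again.
        System.out.println(db + (Files.exists(db) ? " (reusable)" : " (needs conversion)"));
    }
}
```

Because the derived name depends only on the input, loading the same .zim file again yields the same .db path, so an existing conversion result can be detected and recycled.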
..
commons-codec (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
count-min-sketch (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
encyclopedia-marginalia-nu (converter) Integrate zim->db conversion into automatic encyclopedia processing workflow 2024-01-19 13:59:03 +01:00
monkey-patch-gson (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
monkey-patch-opennlp (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
openzim (converter) Integrate zim->db conversion into automatic encyclopedia processing workflow 2024-01-19 13:59:03 +01:00
parquet-floor (crawling-model) Implement a parquet format for crawl data 2023-12-13 16:22:19 +01:00
porterstemmer (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
rdrpostagger (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
symspell (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
uppend (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
xz (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
README.md (converter) Integrate zim->db conversion into automatic encyclopedia processing workflow 2024-01-19 13:59:03 +01:00

Third Party Code

This is a mix of code from other projects that has either been aggressively modified to suit the needs of the project, lacks an artifact, or is included to override some default that is inappropriate for the type of data Marginalia throws at the library.

Sources and Licenses

Modified

Repackaged

Monkey Patched