CatgirlIntelligenceAgency/third-party
Viktor Lofgren 787a20cbaa (crawling-model) Implement a parquet format for crawl data
This is not hooked into anything yet.  The change also makes modifications to the parquet-floor library to support reading and writing of byte[] arrays.  This is desirable since we may in the future want to support inputs that are not text-based, and codifying the assumption that each document is a string will definitely cause us grief down the line.
2023-12-13 16:22:19 +01:00
commons-codec (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
count-min-sketch (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
monkey-patch-gson (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
monkey-patch-opennlp (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
openzim (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
parquet-floor (crawling-model) Implement a parquet format for crawl data 2023-12-13 16:22:19 +01:00
porterstemmer (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
rdrpostagger (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
symspell (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
uppend (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
xz (*) Upgrade to JDK21 with preview enabled. 2023-09-24 10:38:59 +02:00
README.md (hash) Modified version of common codec's Murmur3 hash 2023-08-01 14:57:40 +02:00

Third Party Code

This is a mix of code from other projects that has either been aggressively modified to suit the needs of the project, lacks an artifact, or has been included to override some default that is inappropriate for the type of data Marginalia throws at the library.

Sources and Licenses

Modified

Repackaged

Monkey Patched