40c9d2050f
Removed the need to have to run an external tool to pre-process the data in order to load stackexchange-style data into the search engine. Removed the tool itself. This stirred up some issues with the dependencies, that were due to both third-party:ing xz and importing it as a dependency. This has been fixed, and :third-party:xz was removed. |
||
---|---|---|
.. | ||
commons-codec | ||
count-min-sketch | ||
encyclopedia-marginalia-nu | ||
monkey-patch-gson | ||
monkey-patch-opennlp | ||
openzim | ||
parquet-floor | ||
porterstemmer | ||
rdrpostagger | ||
symspell | ||
README.md |
Third Party Code
This is a mix of code from other projects, that has either been aggressively modified to suite the needs of the project, or lack an artifact, or to override some default that is inappropriate for the type of data Marginalia throws at the library.
Sources and Licenses
Modified
- RDRPosTagger - GPL3
- PorterStemmer - LGPL3
- OpenZIM - GPL-2.0+
- Commons Codec - Apache 2.0
- encylopedia.marginalia.nu - GPL 2.0+
Repackaged
- SymSpell - LGPL-3.0
- Count-Min-Sketch - Apache 2.0
Monkey Patched
- Stanford OpenNLP - Apache-2.0
- GSON - Apache-2.0