40c9d2050f
Removed the need to have to run an external tool to pre-process the data in order to load stackexchange-style data into the search engine. Removed the tool itself. This stirred up some issues with the dependencies, that were due to both third-party:ing xz and importing it as a dependency. This has been fixed, and :third-party:xz was removed. |
||
---|---|---|
.. | ||
adblock | ||
anchor-keywords | ||
data-extractors | ||
keyword-extraction | ||
pubdate | ||
stackexchange-xml | ||
summary-extraction | ||
topic-detection | ||
readme.md |
Converter Features
Major features
- keyword-extraction - Identifies keywords to index in a document
- summary-extraction - Generate an excerpt/quote from a website to display on the search results page.
Smaller features:
- adblock - Simulates Adblock
- pubdate - Determines when a document was published
- topic-detection - Tries to identify the topic of a website