1b8b97b8ec
Tar files will reject entries with filenames over 100b, so we need a limit there. Also added a maximum size limit to keep the file sizes reasonable. |
||
---|---|---|
.. | ||
adblock | ||
anchor-keywords | ||
data-extractors | ||
keyword-extraction | ||
pubdate | ||
stackexchange-xml | ||
summary-extraction | ||
topic-detection | ||
readme.md |
Converter Features
Major features
- keyword-extraction - Identifies keywords to index in a document
- summary-extraction - Generate an excerpt/quote from a website to display on the search results page.
Smaller features:
- adblock - Simulates Adblock
- pubdate - Determines when a document was published
- topic-detection - Tries to identify the topic of a website