Viktor Lofgren
|
266ad2e4de
|
Re-introduce monkey patched GSON to make converter run better.
|
2023-06-19 17:58:19 +02:00 |
|
Viktor Lofgren
|
44b1fe0e6d
|
Move list-conversion into getDescription method.
|
2023-06-19 17:58:19 +02:00 |
|
Viktor Lofgren
|
88399e30e2
|
Consider keyword relevance signals when creating the document summary using the DOM walker.
|
2023-06-19 17:58:19 +02:00 |
|
Viktor Lofgren
|
a9f7b4c457
|
Add synthetic keywords for same-site files linked from a document (e.g. file:png). Also add category keywords, like file:image or file:document.
|
2023-04-30 19:29:13 +02:00 |
|
Viktor Lofgren
|
2ab26f37b8
|
Bug fix for document metadata encoding that breaks year based queries.
|
2023-04-14 16:56:49 +02:00 |
|
Viktor Lofgren
|
cc4e089a5d
|
Consider average sentence length when selecting search results. This promotes prose over code listings, tabular data, etc.
|
2023-03-30 15:46:15 +02:00 |
|
Viktor Lofgren
|
4d05be4095
|
Refactor InternalLinkGraph
|
2023-03-30 15:44:23 +02:00 |
|
Viktor Lofgren
|
03bd892b95
|
Improve document processing in conversion.
* Add flags for long and short documents.
* Break out common length logic from plugins.
* Clean up related code.
|
2023-03-28 16:38:00 +02:00 |
|
Viktor Lofgren
|
ca22c287a5
|
Make use of DocumentFlags' flags
|
2023-03-21 16:03:15 +01:00 |
|
Viktor Lofgren
|
2eb972dea1
|
Remove unrelated code, break tools into their own directory.
|
2023-03-17 16:03:11 +01:00 |
|
Viktor Lofgren
|
449471a076
|
Yet more restructuring. Improved search result ranking.
|
2023-03-16 21:35:54 +01:00 |
|
Viktor Lofgren
|
d82532b7f1
|
More restructuring, big bug fixes in keyword extraction.
|
2023-03-13 17:39:53 +01:00 |
|