
Converting Process

The converting process reads crawl data and extracts the information that is fed into the index, such as keywords, metadata, URLs, and descriptions.
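As a rough illustration of the kind of extraction described above, the sketch below turns one crawled HTML page into a title, a description, and a set of keywords. It is not this module's actual code: `ConverterSketch`, `ProcessedDocument`, and `process` are hypothetical names invented for the example, and the regex-based parsing is a deliberate simplification that a real converter would replace with a proper HTML parser.

```java
import java.util.LinkedHashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; none of these names come from the converting process itself.
public class ConverterSketch {

    /** Hypothetical container for what is extracted from one crawled page. */
    public static final class ProcessedDocument {
        public final String url;
        public final String title;
        public final String description;
        public final Set<String> keywords;

        ProcessedDocument(String url, String title, String description, Set<String> keywords) {
            this.url = url;
            this.title = title;
            this.description = description;
            this.keywords = keywords;
        }
    }

    // Simplified patterns: assumes well-formed tags and name-before-content attribute order.
    private static final Pattern TITLE = Pattern.compile(
            "<title[^>]*>(.*?)</title>", Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    private static final Pattern META_DESCRIPTION = Pattern.compile(
            "<meta\\s+name=[\"']description[\"']\\s+content=[\"'](.*?)[\"']",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);
    private static final Pattern TAG = Pattern.compile("<[^>]+>");
    // A "keyword" here is any run of three or more letters or digits.
    private static final Pattern WORD = Pattern.compile("[\\p{L}\\p{N}]{3,}");

    /** Extracts title, description and lower-cased keywords from raw HTML. */
    public static ProcessedDocument process(String url, String html) {
        String title = firstGroup(TITLE, html);
        String description = firstGroup(META_DESCRIPTION, html);

        // Drop the markup, then collect distinct words in order of first appearance.
        String text = TAG.matcher(html).replaceAll(" ");
        Set<String> keywords = new LinkedHashSet<>();
        Matcher words = WORD.matcher(text);
        while (words.find()) {
            keywords.add(words.group().toLowerCase(Locale.ROOT));
        }
        return new ProcessedDocument(url, title, description, keywords);
    }

    // Returns the first capture group of the first match, or "" if there is none.
    private static String firstGroup(Pattern pattern, String input) {
        Matcher m = pattern.matcher(input);
        return m.find() ? m.group(1).trim() : "";
    }

    public static void main(String[] args) {
        ProcessedDocument doc = process("https://example.com/",
                "<html><head><title>Plato</title></head><body><p>Plato wrote dialogues</p></body></html>");
        System.out.println(doc.title + " " + doc.keywords);
    }
}
```

The output of a step like this is what would then be handed on for indexing; how the real process represents and stores that data is not covered by this sketch.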

Central Classes

See Also