Commit Graph

269 Commits

Author SHA1 Message Date
Viktor Lofgren
a3a6b40cc3 Changes to crawler (#28)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/28
2022-06-15 16:54:27 +02:00
vlofgren
8100bd4879 conflict 2022-06-15 16:53:19 +02:00
vlofgren
81c77e7fcb Revert "Merge branch 'experimental' into master"
This reverts commit c3a432fdd4, reversing
changes made to 1de63f225d.
2022-06-15 16:49:18 +02:00
Viktor Lofgren
c3a432fdd4 Merge branch 'experimental' into master 2022-06-15 16:44:23 +02:00
vlofgren
88908c203d Refactoring conversion 2022-06-15 16:34:03 +02:00
vlofgren
8ba80931a9 Restructuring index code: Move dictionary 2022-06-15 12:59:56 +02:00
vlofgren
89f894eae2 Merge branch 'master' into experimental 2022-06-14 17:55:36 +02:00
vlofgren
1de63f225d Added support for <base href>-style tags. 2022-06-14 17:55:14 +02:00
vlofgren
3e64003252 Re-add quality property to URLs 2022-06-09 22:19:29 +02:00
vlofgren
1ee0c2b572 Merge branch 'master' into experimental 2022-06-09 21:48:19 +02:00
vlofgren
389818c6c3 Make website url configurable for search engine redirects 2022-06-09 21:47:59 +02:00
vlofgren
65aee9419d Tidy up 2022-06-09 21:25:31 +02:00
vlofgren
495e6a1639 Use 64 bit path hash for EC_URL 2022-06-08 16:52:46 +02:00
vlofgren
2faaed3393 Fixed conversion bug SQL->EdgeDomainIndexingState 2022-06-08 16:52:33 +02:00
vlofgren
5e472fe121 WIP: Refactored ranking algorithms to separate database code from ranking code 2022-06-08 16:18:00 +02:00
vlofgren
026ba714b5 WIP: Database refactoring 2022-06-08 15:32:03 +02:00
vlofgren
c915664fcc WIP: Database refactoring 2022-06-07 22:34:53 +02:00
vlofgren
0e65384781 Make WMSA_HOME configurable through an environment variable. 2022-06-03 13:32:08 +02:00
Viktor Lofgren
43fed18063 Update 'README.md' 2022-06-02 02:30:29 +02:00
Viktor Lofgren
dbbef2d119 Update 'README.md' 2022-06-01 18:00:30 +02:00
vlofgren
d8d0c0e5b2 Make User-agent configurable. 2022-06-01 14:46:51 +02:00
Viktor Lofgren
5c2f2d558f Update index.html for search engine (#25)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/25
2022-05-31 14:38:13 +02:00
vlofgren
80dad31753 Merge branch 'release'
# Conflicts:
#	marginalia_nu/src/main/java/nu/marginalia/wmsa/edge/index/service/query/IndexSearchBudget.java
2022-05-31 14:37:49 +02:00
vlofgren
c0e0579c8e Updated index.html for search engine to reflect changes in project status. 2022-05-31 14:35:05 +02:00
vlofgren
046b92e0bb Cleaning up index code 2022-05-31 14:35:05 +02:00
Viktor Lofgren
3679d433d9 Update 'README.md' 2022-05-31 14:15:20 +02:00
vlofgren
ab97044302 Fix deprecation warning for Bucket4J 2022-05-31 13:40:21 +02:00
Viktor Lofgren
9474f39225 Add time-based timeout to queries (#24)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/24
2022-05-31 13:38:26 +02:00
Viktor Lofgren
e5df033030 Merge branch 'release' into master 2022-05-31 13:38:11 +02:00
vlofgren
ec87c0689f Added timeout to queries 2022-05-31 13:37:24 +02:00
Viktor Lofgren
fcd2708fe3 Memory alignment tweaks for better performance (#22)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/22
2022-05-30 23:42:40 +02:00
Viktor Lofgren
18b1153136 Merge branch 'release' into master 2022-05-30 23:42:19 +02:00
vlofgren
fc070f2e0e Fixed memory alignment for MMFL 2022-05-30 23:41:16 +02:00
Viktor Lofgren
c7a095e497 Madvise tweaks (#21)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/21
2022-05-30 23:22:05 +02:00
Viktor Lofgren
b97a7fffcc Merge branch 'release' into master 2022-05-30 23:21:57 +02:00
vlofgren
6894121859 Tweaked madvise for index to be faster 2022-05-30 23:19:55 +02:00
Viktor Lofgren
44bee371e6 Actually add the commit with the previously mentioned instrumentation (#18)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/18
2022-05-30 21:12:15 +02:00
vlofgren
dc963d3e44 Added instrumentation for search queries 2022-05-30 21:11:19 +02:00
Viktor Lofgren
c201201c2d Instrumentation for search + index madvise tweaks (#17)
Co-authored-by: vlofgren <vlofgren@gmail.com>
Reviewed-on: https://git.marginalia.nu/marginalia/marginalia.nu/pulls/17
2022-05-30 21:02:53 +02:00
vlofgren
730e964475 Tweaked madvise for index to be faster 2022-05-30 21:01:58 +02:00
vlofgren
275e42197c Added rudimentary !bang-support 2022-05-30 17:26:51 +02:00
Viktor Lofgren
cfd01c7dbe Update 'README.md' 2022-05-30 16:48:11 +02:00
vlofgren
25776a9718 Refactored EdgeSearchService and broke functions like define:, browse:, site: etc. into separate classes. 2022-05-30 16:40:59 +02:00
vlofgren
41b686955f API-service was accidentally moved into a subdirectory of Auth 2022-05-30 12:46:30 +02:00
Viktor Lofgren
75c4986532 Merge branch 'encyclopedia-service' into master 2022-05-30 12:41:30 +02:00
vlofgren
5a1ec53a84 WIP: Encyclopedia service 2022-05-28 14:35:32 +02:00
vlofgren
0acdd5b660 Switch to beefier docker image to fix 'Could not initialize class sun.awt.X11FontManager' for math rendering in Encyclopedia test. 2022-05-28 13:59:50 +02:00
vlofgren
ac9064096d Rewrote Encyclopedia loader, added functioning E2E test for new encyclopedia service 2022-05-28 13:51:29 +02:00
vlofgren
ad4521da9e WIP: Killing off Archive service, adding new Encyclopedia service consisting largely of what Archive was and a few features from Assistant. 2022-05-28 00:16:31 +02:00
vlofgren
e7b4ac0d34 WIP: Killing off Archive service, adding new Encyclopedia service consisting largely of what Archive was and a few features from Assistant. 2022-05-27 23:45:29 +02:00