Viktor Lofgren
38d20022ad
(search) Fix script loading for mobile support
2023-12-02 17:06:40 +01:00
Viktor Lofgren
280132dad0
(search) Fix script loading for mobile support
2023-12-02 17:06:40 +01:00
Viktor Lofgren
61de4e2789
(search) Retain filter options when performing a new search from the input field
2023-12-02 17:06:40 +01:00
Viktor Lofgren
f9d3455320
(search) Reduce visual weight of search results
2023-12-02 17:06:40 +01:00
Viktor Lofgren
2ff64c3c12
(search) New toggle for reducing tracking
2023-12-02 17:06:40 +01:00
Viktor Lofgren
902f235b5b
(search) Integrate 'similar' tab in site info.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
97d43a6fa2
(search) Revamp browse results with new look.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
9bc65ff0ca
(search) Desaturate search result titles according to rank
2023-12-02 17:06:40 +01:00
Viktor Lofgren
6cd6a615fd
(search) Add data-filter to body as a data attribute
...
For future shenanigans ;D
2023-12-02 17:06:40 +01:00
Viktor Lofgren
5639f0653d
(search) Rename SearchProfile.name into filterId
...
Avoid foot-gun caused by name clash with the Enumeration method name(), which returns the Java name of the enumeration value.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
251174c9a2
(search) Update front page with new look
2023-12-02 17:06:40 +01:00
Viktor Lofgren
42ea87d637
(search) Update conversion results, error page, and dictionary results with new CSS.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
7c8a60b8cf
(search) Site info view is mostly done
...
Also optimize the rendering a bit to avoid having to allocate huge string buffers, writing directly to Spark's response instead.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
2f4500be5a
(search) New frontend look
2023-12-02 17:06:40 +01:00
Viktor Lofgren
fa7534a362
(search) Remove dead code
2023-12-02 17:06:40 +01:00
Viktor Lofgren
a258f0af7a
(search) Refactor search parameters to include query
2023-12-02 17:06:40 +01:00
Viktor Lofgren
01621c6344
(renderer) Make helpers configurable on a by-service basis.
2023-12-02 17:06:40 +01:00
Viktor Lofgren
c7934342a6
(control) Automatic recrawl
2023-12-02 17:06:24 +01:00
Viktor Lofgren
f5c324c06b
(minor) Fix broken test
2023-12-01 17:44:39 +01:00
Viktor Lofgren
f615cf2391
(convert) Loosen up the rules enforcement for documents that have external links.
2023-12-01 17:44:29 +01:00
Viktor Lofgren
c984a97262
(docs) Update crawling.md
2023-11-30 21:53:56 +01:00
Viktor Lofgren
a02c06a837
(docs) Update sideloading-howto.md
2023-11-30 21:51:03 +01:00
Viktor Lofgren
21d6aa421c
(docs) Update setup instructions
2023-11-30 21:44:29 +01:00
Viktor Lofgren
e5d274fe1c
(docs) Improve architectural documentation
2023-11-30 21:38:57 +01:00
Viktor Lofgren
166a391eae
(docs) Improve architectural documentation for the crawler.
2023-11-30 21:30:57 +01:00
Viktor Lofgren
5fb24bb27f
(docs) Improve architectural documentation for the converter.
2023-11-30 20:43:22 +01:00
Viktor Lofgren
5a5430b383
(convert) Wiki specialization that should do a better job at removing junk keywords and providing a useful summary.
2023-11-30 20:04:46 +01:00
Viktor Lofgren
67a1e1c874
(control) GUI for triggering control-side actors
2023-11-29 15:31:14 +01:00
Viktor Lofgren
4155fbe94c
(control) Reprocess-all actor
2023-11-28 17:58:48 +01:00
Viktor Lofgren
347fe6b7be
(control) Reindex-all actor
2023-11-28 16:41:09 +01:00
Viktor Lofgren
ff3ceb981e
(control) Button for removing a stale 'NEW' status
...
If a process is violently terminated, the associated file storage may get stuck in the ephemeral 'NEW' state, preventing future operations on the associated data.
To remedy this without having to dig through the database, a button was added to reset the state. It's a band-aid, but the situation is rare enough that I think it's fine.
2023-11-28 15:18:24 +01:00
Viktor Lofgren
1dafa0c74d
(mqapi/control) Repair repartition endpoint, deprecate notify endpoints.
...
The repartition endpoint was mis-addressing its mqapi notifications, omitting the proper nodeId. In fixing this, it became apparent that having both @MqRequest and @MqNotification is a serious footgun, and the two should be unified into a single API where the caller isn't burdened with knowledge of the remote end's implementation specifics.
2023-11-27 16:01:12 +01:00
Viktor Lofgren
09917837d0
(process) Ensure construction exceptions are logged
...
Wrapping these exceptions in a try-catch and logging them with slf4j will ensure they end up in the process logs.
The way it worked using the default exception handler, they'd print on console (which nothing captures!), leading to a very annoying debugging experience.
2023-11-22 18:32:06 +01:00
Viktor Lofgren
dd507a3808
(db) Fix migrations, bump flyway to 10.0.1
...
Tricky problem, creating a procedure apparently needs delimiter shenanigans in Flyway, otherwise it will truncate the END statement and mariadb will be sad.
2023-11-21 20:04:35 +01:00
Viktor Lofgren
e67dcf4d68
(docker) Fix image tagging
...
Should now be possible to push a tagged image with e.g.
./gradlew --info dockerPush -Pdocker-registry=registry.marginalia.nu -Pdocker-tag=test2
2023-11-18 13:27:12 +01:00
Viktor Lofgren
dd9406d0ac
(control) Make storage type tabs consistent
...
This had fallen off in the Create New Specification view, it lacked Exports.
2023-11-17 11:26:45 +01:00
Viktor Lofgren
6a80ac62a5
(doc) Amend crawling documentation
2023-11-17 11:16:06 +01:00
Viktor Lofgren
98efb08e17
(gradle) Make docker image registry and tag configurable
...
This is to enable running an external repository for production and test.
Use the ./gradle -Pdocker-registry=registry.foo.bar -Pdocker-tag=my-tag while building to accomplish this. By default, use 'marginalia' for repository and 'latest' as tag.
2023-11-16 21:12:55 +01:00
Viktor Lofgren
f58a9f46be
(loader) Don't truncate the entire links table on load
...
This behavior is an old vestige from the days of only having a single loader process. We'd truncate the links table because doing inserts/updates was too slow. This was also important because we had 32 bit ID, and there's a lot of links between domains to go around...
Instead we delete the rows associated with the current node with a stored procedure PURGE_LINKS_TABLE.
We also update the PRIMARY KEY to a BIGINT. We'll need to load the data in excess of billion times to hit an ID rollover, so it'll be fine.
2023-11-16 10:30:12 +01:00
Viktor Lofgren
fd77e62a13
set registry to registry.marginalia.nu
2023-11-15 14:04:44 +01:00
Viktor Lofgren
376228e199
(docker) set registry to registry.marginalia.nu
2023-11-15 14:03:22 +01:00
Viktor Lofgren
8a5b853fae
Fix experiment runner
2023-11-15 14:03:17 +01:00
Viktor Lofgren
1cbf23e7e7
(test) Don't fail test if atags.parquet is not in ~vlofgren
2023-11-15 09:11:38 +01:00
Viktor Lofgren
63554ba171
(explore2) Add robots.txt
2023-11-14 09:15:32 +01:00
Viktor Lofgren
5de37cb820
(converter) Set feature flags appropriately on stackexchange posts
2023-11-12 15:48:08 +01:00
Viktor Lofgren
e5cee1f46d
(sideload) Fix sideloading so that it doesn't get disproportionately good rankings
...
Also add type flags so that e.g. wikipedia shows up in the wikis filter.
2023-11-12 14:57:57 +01:00
Viktor Lofgren
e9a01caa5c
(index) Fix broken metrics
2023-11-11 12:53:47 +01:00
Viktor Lofgren
858357a246
(metrics) Get prometheus up out of disrepair
...
* Fix bad labels
* Add nodeId where appropriate
* Hopefully fix histogram buckets for index query times
2023-11-08 14:01:28 +01:00
Viktor Lofgren
ef16502159
(doc) Update readme
2023-11-07 16:00:18 +01:00
Viktor Lofgren
29e2c43e01
(gradle) Up to gradle 8.4 since it has better Java 21 compatibility
2023-11-07 16:00:08 +01:00