Commit Graph

  • 2515993536 (search) Fix issue where searchTitle setting gets lost when searching again Viktor Lofgren 2024-02-15 13:52:11 +0100
  • 66b3e71e56 (search) Expose more search options Viktor Lofgren 2024-02-15 13:39:51 +0100
  • 652d151373 (process-models) Improve documentation Viktor Lofgren 2024-02-15 12:21:12 +0100
  • 300b1a1b84 (index-query) Add some tests for the QueryFilter code Viktor Lofgren 2024-02-15 12:03:30 +0100
  • 6c3b49417f (index-query) Improve documentation and code quality Viktor Lofgren 2024-02-15 11:33:50 +0100
  • dcc5cfb7c0 (index-journal) Improve documentation and code quality Viktor Lofgren 2024-02-15 10:51:49 +0100
  • d970836605 Merge pull request #79 from MarginaliaSearch/reddit Viktor 2024-02-15 09:17:56 +0100
  • 8021bd0aae (control) Sort upload listing results Viktor Lofgren 2024-02-15 09:13:40 +0100
  • 8f91156d80 (control) Improve sideload UX Viktor Lofgren 2024-02-14 18:38:20 +0100
  • fab36d6e63 (converter) Loader for reddit data Viktor Lofgren 2024-02-14 11:11:23 +0100
  • 3d54879c14 (API, minor) Clean up comments. Viktor Lofgren 2024-02-14 12:09:16 +0100
  • e17fcde865 (API, minor) Remove unnecessary inject. Viktor Lofgren 2024-02-14 12:05:50 +0100
  • 6950dffcb4 (API) Fix result order in API results Viktor Lofgren 2024-02-14 11:47:14 +0100
  • 02dd5c5853 (converter) Look at properties when deciding pool size v24.01.1 Viktor Lofgren 2024-02-12 16:24:19 +0100
  • 5a1087dbf9 (qs-gui) Update documentation, add param for domain limit Viktor Lofgren 2024-02-12 16:13:48 +0100
  • 7564dfeb7a (minor) Correct link in documentation for app services Viktor Lofgren 2024-02-12 15:55:06 +0100
  • 10bad635a8 (search) Experimental support for clustering search results Viktor Lofgren 2024-02-11 20:00:11 +0100
  • 7cc8b0fed5 (search) Experimental support for clustering search results Viktor Lofgren 2024-02-11 19:58:55 +0100
  • a77846373b (search) Experimental support for clustering search results Viktor Lofgren 2024-02-11 19:48:48 +0100
  • bcd0dabb92 (search) Experimental support for clustering search results Viktor Lofgren 2024-02-11 17:31:38 +0100
  • 9d68062553 (converter) Make processing pool size configurable Viktor Lofgren 2024-02-10 20:59:08 +0100
  • e66d0b7431 (warc) Minor code clean-up. Viktor Lofgren 2024-02-10 18:30:33 +0100
  • ba26f6ce84 (doc) Documentation corrections Viktor Lofgren 2024-02-10 14:16:01 +0100
  • 929caed0b9 (warc) Improve WARC standard adherence Viktor Lofgren 2024-02-09 20:07:01 +0100
  • 8340aa2b6c (warc) Improve WARC standard adherence Viktor Lofgren 2024-02-09 17:29:21 +0100
  • 1188fe3bf0 (conf) Improve naming consistency Viktor Lofgren 2024-02-09 14:43:08 +0100
  • b15f47d80e (db) Retire the EC_DOMAIN_LINK table Viktor Lofgren 2024-02-08 15:52:30 +0100
  • ef261cbbd7 (search) Remove stray spaces in bang commands Viktor Lofgren 2024-02-08 14:46:12 +0100
  • 06997ff255 Merge pull request #78 from conor-f/patch-1 Viktor 2024-02-08 13:45:38 +0100
  • 9d7df87886 (search) Fix broken !ddg handling Conor Flynn 2024-02-08 13:28:02 +0100
  • a4b2323ca3 (search) Change default search profile to No Filter Viktor Lofgren 2024-02-08 13:03:59 +0100
  • e8de468b0b Make executor API talk GRPC (#75) Viktor 2024-02-08 13:01:12 +0100
  • 08466780c4 (*) Fix copy-paste issues with GRPC client Viktor Lofgren 2024-02-08 13:00:33 +0100
  • d83a3bf4e2 (search) Fix broken !w handling Viktor Lofgren 2024-02-08 12:11:33 +0100
  • f2b39ad055 (search) Fix broken !bang handling Viktor Lofgren 2024-02-08 12:05:09 +0100
  • 83b7e84a7f (executor-client) Clean up API Viktor Lofgren 2024-02-08 11:37:04 +0100
  • c9e4796fd6 (*) Don't use jakarta inject/sinleton annotations Viktor Lofgren 2024-02-08 11:27:24 +0100
  • ede690b407 (executor-api) Make executor API talk GRPC Viktor Lofgren 2024-02-07 17:50:38 +0100
  • 95d1bd98e4 (array) Update documentation, make unsafe configurable Viktor Lofgren 2024-02-07 12:26:47 +0100
  • 8acbc6a6b4 (index-construction) Split repartition into two actions cont'd Viktor Lofgren 2024-02-06 19:54:17 +0100
  • 467ba5be20 (index-construction) Split repartition into two actions Viktor Lofgren 2024-02-06 17:20:07 +0100
  • 29ddf9e61d (doc) Update docs Viktor Lofgren 2024-02-06 16:29:55 +0100
  • 92e119cab3 (doc) Update docs Viktor Lofgren 2024-02-06 12:43:42 +0100
  • 92049ba8e4 (doc) Update docs Viktor Lofgren 2024-02-06 12:41:28 +0100
  • 54330b9921 (*) Remove dead code Viktor Lofgren 2024-02-06 12:41:13 +0100
  • d1aeb030f2 (doc) Update RandomWriteFunnel documentation Viktor Lofgren 2024-02-06 12:35:24 +0100
  • f89274d1ea (minor) Fix broken test Viktor Lofgren 2024-02-06 12:12:26 +0100
  • 7286596fb4 (deps) Remove monkey patched GSON Viktor Lofgren 2024-02-06 12:11:39 +0100
  • a2fc83d94e (control) Add configurable border styling Viktor Lofgren 2024-02-06 12:05:02 +0100
  • 2161799cc3 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:18:00 +0100
  • c88f132057 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:10:03 +0100
  • c6313a5906 (sideload) Fix filename error in dealing with stackoverflow files Viktor Lofgren 2024-02-06 11:06:36 +0100
  • eadcdb5bed (minor) Improve error handling, naming logging in IndexResultDecorator Viktor Lofgren 2024-02-05 21:05:44 +0100
  • 6e7649b5f7 (loader) Mitigate fragile paging behavior Viktor Lofgren 2024-02-05 21:05:03 +0100
  • d986f90074 (index) Fix consistency between RandomFileAssembler implementations Viktor Lofgren 2024-02-05 21:01:32 +0100
  • 53c575db3f (index-construction) Make random-write file strategy configurable Viktor Lofgren 2024-02-05 12:31:15 +0100
  • 6dcc20038c (index-journal) Make index journal page size configurable Viktor Lofgren 2024-02-05 11:26:05 +0100
  • 885cd00aee Added implementation in wmsa home / setup.sh to grab suffix list. howdycat 2024-02-04 14:38:17 -0500
  • fa145f632b (sideload) Add special handling for sideloaded wiki documents Viktor Lofgren 2024-02-02 21:22:07 +0100
  • 785d8deadd (crawler) Improve meta-tag redirect handling, add tests for redirects. Viktor Lofgren 2024-02-01 20:30:43 +0100
  • 93a2d5afbf (*) Fix poorly named test Viktor Lofgren 2024-02-01 20:08:15 +0100
  • d60c6b18d4 (doc) Update the readme's the crawler, as they've grown stale. Viktor Lofgren 2024-02-01 18:10:55 +0100
  • d1e02569f4 (language-processing) Add a system property for configuring which language detection model to use Viktor Lofgren 2024-01-31 13:02:33 +0100
  • 9ce67029ca (language-processing) Add a system property for configuring which language detection model to use Viktor Lofgren 2024-01-31 13:02:16 +0100
  • 98f3382cea (minor) Fix test and improve error message Viktor Lofgren 2024-01-31 11:53:41 +0100
  • 52a0255814 (*) Add flag for disabling ASCII flattening Viktor Lofgren 2024-01-31 11:50:59 +0100
  • eb59ac8535 (index-ranking) Adjust the BM25P factors a bit Viktor Lofgren 2024-01-30 21:27:29 +0100
  • acc2b4e10f (*) Update the readme with a link to the demo video Viktor Lofgren 2024-01-26 13:49:41 +0100
  • 6f830f0e08 (*) Update the readme with a link to the demo video Viktor Lofgren 2024-01-26 13:48:47 +0100
  • 6edc318597 (control) Fix typo in URL linking to new-crawl-specs v24.01.0 Viktor Lofgren 2024-01-26 10:43:10 +0100
  • 182c0cf28e (control) Add warnings about domain data contamination Viktor Lofgren 2024-01-25 18:26:15 +0100
  • 0b105b5986 (converter) Update hyperlink text for new crawl spec creation. Viktor Lofgren 2024-01-25 18:05:11 +0100
  • e91d5dc339 Added getTld method howdycat 2024-01-25 11:36:04 -0500
  • 081c7d22bc Fix typo in install.sh Viktor Lofgren 2024-01-25 17:08:18 +0100
  • 6aee896657 (*) Add single-node barebones configuration Viktor Lofgren 2024-01-25 16:40:28 +0100
  • cae1bad274 (*) Add download-sample action, refactor file storage Viktor Lofgren 2024-01-25 13:36:30 +0100
  • 1b8b97b8ec (sample-exporter) Add some limits on sizes and lengths Viktor Lofgren 2024-01-25 11:51:53 +0100
  • 0846606b12 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:39:33 +0100
  • 245ebcdfc6 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:37:58 +0100
  • 1b1e711c93 (doc) Add ide quick-start guide Viktor Lofgren 2024-01-24 14:36:44 +0100
  • c088c25b09 (*) Fix broken test, clean up code Viktor Lofgren 2024-01-24 12:50:41 +0100
  • 958d64720e (control) Add a view for restarting aborted processes Viktor Lofgren 2024-01-24 12:47:10 +0100
  • 2f648d2bb7 initial tld parser howdycat 2024-01-23 21:21:07 -0500
  • 805afad4fe (control) New GUI for exporting crawl data samples Viktor Lofgren 2024-01-23 17:07:45 +0100
  • 400f4840ad (*) Fix broken code in jmh Viktor Lofgren 2024-01-23 17:07:57 +0100
  • ee7792596d (*) Fix broken test Viktor Lofgren 2024-01-23 12:03:47 +0100
  • 0081328aca (converter) Adjust which flags are set by anchor text keywords Viktor Lofgren 2024-01-23 11:54:00 +0100
  • 3fff7f6878 (converter) Fix issue where quality limits were no longer enforced Viktor Lofgren 2024-01-23 11:42:17 +0100
  • f15dd06473 (index) Delayed close() of SearchIndexReader Viktor Lofgren 2024-01-23 11:08:41 +0100
  • dd26819d66 (actor) Try to rare data race where a finished job is considered dead. Viktor Lofgren 2024-01-22 21:22:38 +0100
  • 562012fb22 (doc) Migrate documentation https://docs.marginalia.nu/ Viktor Lofgren 2024-01-22 19:40:08 +0100
  • a6d257df5b (converter) Update Stackexchange sideload instruction Viktor Lofgren 2024-01-22 18:29:20 +0100
  • 41d896ba3e (converter) Refactor content type check in PlainTextDocumentProcessorPlugin Viktor Lofgren 2024-01-22 17:52:14 +0100
  • 51cdf46645 (control) Improve accessibility in search-to-ban template Viktor Lofgren 2024-01-22 15:01:00 +0100
  • 1eb0adf6d3 (array) Add sun.misc.Unsafe variant of LongArray Viktor Lofgren 2024-01-22 13:38:42 +0100
  • 40c9d2050f (control) Fully automatic conversion Viktor Lofgren 2024-01-22 13:01:09 +0100
  • 3a325845c7 (mq) Add better error handling in fsm and mq Viktor Lofgren 2024-01-22 12:58:33 +0100
  • 6a1bfd6270 (array) Remove unused 'madvise' code and 3rd party dependency on 'uppend' Viktor Lofgren 2024-01-22 12:56:45 +0100
  • b91ea1d7ca (control) Re-add gui for sideloading dirtrees Viktor Lofgren 2024-01-20 18:09:40 +0100
  • c5760cd535 (test) Fix broken test Viktor Lofgren 2024-01-20 13:39:40 +0100