40 Commits

Author SHA1 Message Date
Viktor Lofgren
0846606b12 (doc) Add ide quick-start guide 2024-01-24 14:39:33 +01:00
Viktor Lofgren
245ebcdfc6 (doc) Add ide quick-start guide 2024-01-24 14:37:58 +01:00
Viktor Lofgren
1b1e711c93 (doc) Add ide quick-start guide 2024-01-24 14:36:44 +01:00
Viktor Lofgren
562012fb22 (doc) Migrate documentation https://docs.marginalia.nu/ 2024-01-22 19:40:08 +01:00
Viktor Lofgren
40c9d2050f (control) Fully automatic conversion
Removed the need to run an external tool to pre-process the data before loading stackexchange-style data into the search engine.

Removed the tool itself.

This stirred up some dependency issues, caused by xz being both included under third-party and imported as a dependency. This has been fixed, and :third-party:xz was removed.
2024-01-22 13:03:24 +01:00
Viktor Lofgren
91c7960800 (crawler) Extract additional configuration properties
This commit extracts several previously hardcoded configuration properties and makes them available through system.properties.

The documentation is updated to reflect the change.

Dead code was also removed in the process. CrawlSpecGenerator still feels a bit over-engineered, since it was built for a more general case and every implementation but the current one has been removed, but we'll leave it as is for now since it's still fairly readable.
2024-01-20 10:36:04 +01:00
Viktor Lofgren
ec8fe9f031 (doc) Add screenshot to conversion step in crawling doc 2024-01-15 16:31:33 +01:00
Viktor Lofgren
ce5ae1931d (doc) Update Crawling Docs
Still a WIP, but now more accurately reflects the new GUI, with screenshots to boot!
2024-01-15 16:08:01 +01:00
Viktor Lofgren
b9445d4f62 (doc) Update Crawling Docs
Still a WIP, but now more accurately reflects the new GUI, with screenshots to boot!
2024-01-15 16:06:59 +01:00
Viktor Lofgren
7c6e18f7a7 (*) Overhaul settings and properties
Use a system.properties file to configure the system. This is loaded statically by MainClass or ProcessMainClass. Update the property names to be more consistent, and update the documentation to reflect the changes.
2024-01-13 17:12:18 +01:00
Viktor Lofgren
c984a97262 (docs) Update crawling.md 2023-11-30 21:53:56 +01:00
Viktor Lofgren
a02c06a837 (docs) Update sideloading-howto.md 2023-11-30 21:51:03 +01:00
Viktor Lofgren
6a80ac62a5 (doc) Amend crawling documentation 2023-11-17 11:16:06 +01:00
Viktor Lofgren
e97259aca3 (docs) Update documentation 2023-10-27 13:22:11 +02:00
Viktor Lofgren
c0930ead0f (doc) Update conceptual-overview.svg 2023-10-19 17:48:34 +02:00
Viktor Lofgren
c899f1cb85 (docs) Update documentation to reflect new query service 2023-10-09 14:56:59 +02:00
Viktor Lofgren
0a579814a2 (docs) Parquet How-to 2023-09-24 19:40:45 +02:00
Viktor Lofgren
9338f35cd8 (doc) Remove confusingly outdated ER-diagrams 2023-09-21 15:08:27 +02:00
Viktor Lofgren
ead6fa9daa (doc) Update conceptual-overview.svg to reflect the removal of the lexicon 2023-09-21 13:47:05 +02:00
Viktor Lofgren
70aa04c047 (converter, stackexchange-xml) Add the ability to sideload stackexchange data 2023-09-21 12:48:33 +02:00
Viktor
dd380a5fb3 (doc) Add control-service to conceptual overview
Not adding every interaction as it would turn into a rat king.
2023-08-20 13:28:32 +02:00
Viktor Lofgren
e088eb9ec8 (scripts|docs) Update scripts and documentation for the new operator's gui and file storage workflows. 2023-08-01 22:50:33 +02:00
Viktor Lofgren
19402772fc (scripts|docs) Update scripts and documentation for the new operator's gui and file storage workflows. 2023-08-01 22:50:05 +02:00
Viktor Lofgren
ba724bc1b2 (scripts|docs) Update scripts and documentation for the new operator's gui and file storage workflows. 2023-08-01 22:47:37 +02:00
Viktor Lofgren
995657c6ce (big-string) Make big-string disable:able 2023-07-21 19:50:35 +02:00
Viktor Lofgren
4c627d0e1d Improvements to crawling.md 2023-06-22 18:01:43 +02:00
Viktor Lofgren
c8dd45e37d First draft for crawling documentation. 2023-06-22 17:44:24 +02:00
Viktor
a57ab427b3 Update useful-resources.md 2023-05-27 12:01:45 +02:00
Viktor Lofgren
1e65ac3940 Improve useful-resources.md 2023-03-28 16:35:58 +02:00
Viktor Lofgren
b60fcd0918 Documentation improvements 2023-03-27 17:25:27 +02:00
Viktor Lofgren
c5f4cb34bf Documentation for DB 2023-03-25 16:14:16 +01:00
Viktor
2e69179f12 Update readme.md 2023-03-25 15:47:45 +01:00
Viktor
19000ab339 Create readme.md 2023-03-25 15:46:19 +01:00
Viktor
5edc0c8d52 Add files via upload 2023-03-22 17:00:01 +01:00
Viktor
d9c456d772 Create module-taxonomy.md 2023-03-21 17:24:39 +01:00
Viktor
b2599a6d33 Make colors less eye-grating on dark theme. 2023-03-21 17:18:47 +01:00
Viktor
85fea2ecaa Add files via upload 2023-03-21 17:08:34 +01:00
vlofgren
8e2225e346 Create useful-resources.md 2023-03-20 16:44:02 +01:00
Viktor Lofgren
4fdaaa16ba Restructuring the git repo 2023-03-04 13:19:01 +01:00
vlofgren
74ae97f8f4 Added test util for the tests to remove hard coding of LanguageModels. 2022-05-19 18:05:10 +02:00