CatgirlIntelligenceAgency/code/libraries/language-processing
Viktor Lofgren 52a0255814 (*) Add flag for disabling ASCII flattening
The production configuration assumes all content of interest is 7-bit ASCII and makes a series of optimizations based on that assumption, which holds poorly in the wild.

This commit adds an **experimental** system property, 'system.noFlattenUnicode', which disables this behavior when set to TRUE.

IMPORTANT!! The index must be reconstructed whenever this flag is changed, because different hash functions are selected for the keyword->identifier mappings.
2024-01-31 11:50:59 +01:00
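
As a rough illustration of how a flag like this can gate ASCII flattening, here is a minimal Java sketch. Only the property name 'system.noFlattenUnicode' comes from the commit message; the class `FlattenSketch`, its methods, and the flattening strategy (NFD decomposition, then dropping everything outside 7-bit ASCII) are assumptions made for the example and are not taken from this library's code.

```java
import java.text.Normalizer;

public class FlattenSketch {
    // Property name taken from the commit message.
    static final String NO_FLATTEN_PROPERTY = "system.noFlattenUnicode";

    // Assumption: the flag is read as a boolean system property.
    // Boolean.getBoolean returns true only if the property is set to "true" (case-insensitive).
    static boolean noFlattenUnicode() {
        return Boolean.getBoolean(NO_FLATTEN_PROPERTY);
    }

    // Hypothetical flattening: decompose accented characters (NFD),
    // then keep only characters in the 7-bit ASCII range.
    static String flattenToAscii(String s) {
        String decomposed = Normalizer.normalize(s, Normalizer.Form.NFD);
        StringBuilder sb = new StringBuilder(decomposed.length());
        for (int i = 0; i < decomposed.length(); i++) {
            char c = decomposed.charAt(i);
            if (c < 128) {
                sb.append(c);
            }
        }
        return sb.toString();
    }

    // Flatten keywords unless the flag disables it.
    static String normalizeKeyword(String keyword) {
        return noFlattenUnicode() ? keyword : flattenToAscii(keyword);
    }

    public static void main(String[] args) {
        // "caf\u00e9" is "cafe" with an acute accent on the final letter.
        System.out.println(normalizeKeyword("caf\u00e9"));
    }
}
```

Launching the JVM with `-Dsystem.noFlattenUnicode=true` would make `normalizeKeyword` return its input unchanged in this sketch. Because the two modes produce different keyword strings, anything keyed on those strings differs between them, which is consistent with the commit's warning that the index has to be rebuilt after the flag changes.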
src (*) Add flag for disabling ASCII flattening 2024-01-31 11:50:59 +01:00
build.gradle (build) Move unit test configuration to root build.gradle 2023-10-04 12:46:22 +02:00
readme.md (refactor) Remove features-search and update documentation 2023-10-09 15:12:30 +02:00

Language Processing

This library contains various tools used in language processing.

Central Classes

See Also

features-convert/keyword-extraction uses this code to identify which keywords are important.

features-qs/query-parser also does some language processing.