# Language Processing
This library contains various tools used in language processing.
## Central Classes
* [SentenceExtractor](src/main/java/nu/marginalia/language/sentence/SentenceExtractor.java) -
  Creates a [DocumentLanguageData](src/main/java/nu/marginalia/language/model/DocumentLanguageData.java) from a text, containing
  its words, their stems, POS tags, and so on.
## See Also
[features-convert/keyword-extraction](../../features-convert/keyword-extraction) uses this code to identify which keywords
are important.

[features-qs/query-parser](../../features-qs/query-parser) also does some language processing.