16 lines
372 B
Markdown
16 lines
372 B
Markdown
# Term Frequency Extractor
|
|
|
|
Generates a term frequency dictionary file from a batch of crawl data.
|
|
|
|
Usage:
|
|
|
|
```shell
|
|
PATH_TO_SAMPLES=run/samples/crawl-s
|
|
export JAVA_OPTS=-Dcrawl.rootDirRewrite=/crawl:${PATH_TO_SAMPLES}
|
|
|
|
term-frequency-extractor ${PATH_TO_SAMPLES}/plan.yaml out.dat
|
|
```
|
|
|
|
## See Also
|
|
|
|
* [libraries/term-frequency-dict](../../libraries/term-frequency-dict) |