corpus (0.9.0)

Text Corpus Analysis.

Text corpus data analysis, with full support for Unicode. Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies (including n-grams).

Maintainer: Patrick O. Perry
Author(s): Patrick O. Perry [aut, cre], Martin Porter and Richard Boulton [ctb, cph] (Snowball), Carlo Strapparava and Alessandro Valitutti [ctb, cph] (WordNet-Affect), Unicode, Inc. [ctb, cph] (Unicode Character Database)

License: Apache License (== 2.0) | file LICENSE

Uses: Matrix, testthat, knitr
Reverse depends: crqanlp
Reverse suggests: utf8

Released over 2 years ago.