corpus (0.9.4)

Text Corpus Analysis.

Text corpus data analysis, with full support for international text (Unicode). Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies, including n-grams.

Maintainer: Patrick O. Perry
Author(s): Patrick O. Perry [aut, cph, cre], Finn rup Nielsen [cph, dtc] (AFINN Sentiment Lexicon), Martin Porter and Richard Boulton [ctb, cph, dtc] (Snowball Stemmer and Stopword Lists), Carlo Strapparava and Alessandro Valitutti [cph, dtc] (WordNet-Affect Lexicon), Unicode, Inc. [cph, dtc] (Unicode Character Database)

License: Apache License (== 2.0) | file LICENSE

Uses: Matrix, testthat, knitr
Enhances: tm, quanteda
Reverse depends: crqanlp
Reverse suggests: utf8

Released about 2 years ago.