corpus (0.6.0)
Text Corpus Analysis.
https://github.com/patperry/r-corpus
http://cran.r-project.org/web/packages/corpus
Text corpus data analysis, with full support for Unicode. Functions for reading data from newline-delimited JSON files, for normalizing and tokenizing text, and for computing term occurrence frequencies (including n-grams).
Maintainer: Patrick O. Perry
Author(s): Patrick O. Perry [aut, cre], Martin Porter and Richard Boulton [ctb, cph] (Snowball), Unicode, Inc. [ctb, cph] (Unicode Character Database)
License: Apache License (== 2.0) | file LICENSE
Uses: Matrix, testthat
Reverse depends: crqanlp
Reverse suggests: utf8
Released over 2 years ago.