corpus (0.10.0)

Text Corpus Analysis.

Text corpus data analysis, with full support for international text (Unicode). Functions for reading data from newline-delimited 'JSON' files, for normalizing and tokenizing text, for searching for term occurrences, and for computing term occurrence frequencies, including n-grams.
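
The tokenizing, searching, and n-gram counting described above might be used roughly as follows. This is a minimal sketch, not an excerpt from the package documentation: it assumes the corpus package is installed, the sample sentences and variable names are made up for illustration, and argument defaults may differ between versions. Reading newline-delimited JSON is not shown.

```r
# Minimal sketch; assumes the 'corpus' package is installed from CRAN.
library(corpus)

# Two tiny in-memory documents (hypothetical sample text).
docs <- c("A rose is a rose.", "Is a rose a flower?")

# A token filter that drops punctuation; case folding is left at its default.
f <- text_filter(drop_punct = TRUE)

# Tokenize each document into a character vector of normalized tokens.
toks <- text_tokens(docs, filter = f)

# Term occurrence frequencies for unigrams and bigrams across both documents.
stats <- term_stats(docs, filter = f, ngrams = 1:2)

# Locate every occurrence of a term, together with its surrounding context.
hits <- text_locate(docs, "rose", filter = f)

print(toks[[1]])
print(head(stats))
print(hits)
```

Building the filter once and passing it to each call keeps tokenization consistent between the counting and searching steps.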

Maintainer: Patrick O. Perry
Author(s): Patrick O. Perry [aut, cph, cre], Finn Årup Nielsen [cph, dtc] (AFINN Sentiment Lexicon), Martin Porter and Richard Boulton [ctb, cph, dtc] (Snowball Stemmer and Stopword Lists), The Regents of the University of California [ctb, cph] (Strtod Library Procedure), Carlo Strapparava and Alessandro Valitutti [cph, dtc] (WordNet-Affect Lexicon), Unicode, Inc. [cph, dtc] (Unicode Character Database)

License: Apache License (== 2.0) | file LICENSE

Uses: utf8, Matrix, testthat, knitr
Enhances: tm, quanteda
Reverse depends: crqanlp
Reverse suggests: utf8

Released about 2 years ago.

14 previous versions



