textTinyR (1.0.3)

0 users

Text Processing for Small or Big Data Files.

https://github.com/mlampros/textTinyR
http://cran.r-project.org/web/packages/textTinyR

Processes big text data files in batches efficiently. For this purpose, it offers functions for splitting, parsing, tokenizing and creating a vocabulary. Moreover, it includes functions for building either a document-term matrix or a term-document matrix and extracting information from those (term-associations, most frequent terms). Lastly, it embodies functions for calculating token statistics (collocations, look-up tables, string dissimilarities) and functions to work with sparse matrices. The source code is based on 'C++11' and exported in R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.

Maintainer: Lampros Mouselimis
Author(s): Lampros Mouselimis <mouselimislampros@gmail.com>

License: GPL-3

Uses: data.table, Matrix, R6, Rcpp, testthat, knitr, rmarkdown, covr

Released about 1 month ago.


3 previous versions

Ratings

Overall:

  (0 votes)

Documentation:

  (0 votes)

Log in to vote.

Reviews

No one has written a review of textTinyR yet. Want to be the first? Write one now.


Related packages:(20 best matches, based on common tags.)


Search for textTinyR on google, google scholar, r-help, r-devel.

Visit textTinyR on R Graphical Manual.