textTinyR (1.1.1)

Text Processing for Small or Big Data Files.

https://github.com/mlampros/textTinyR
http://cran.r-project.org/web/packages/textTinyR

textTinyR offers functions for splitting, parsing, and tokenizing big text data files and for building a vocabulary from them. It also includes functions for building a document-term matrix and extracting information from it (term associations, most frequent terms), for calculating token statistics (collocations, look-up tables, string dissimilarities), and for working with sparse matrices. Lastly, it includes functions for word vector representations (i.e. 'GloVe', 'fasttext') and for calculating (pairwise) text document dissimilarities. The source code is written in 'C++11' and exported to R through the 'Rcpp', 'RcppArmadillo' and 'BH' packages.
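For readers unfamiliar with the terms above, the following is a minimal, language-neutral sketch of what a document-term matrix and its "most frequent terms" look like. It is written in plain Python and does not use or reproduce textTinyR's API; the function names (`tokenize`, `document_term_matrix`, `most_frequent_terms`) and the whitespace tokenizer are assumptions made purely for illustration.

```python
from collections import Counter


def tokenize(text):
    """Lowercase the text and split on whitespace (a deliberately naive tokenizer)."""
    return text.lower().split()


def document_term_matrix(docs):
    """Return (vocab, matrix), where matrix[i][j] counts term vocab[j] in document i."""
    tokenized = [tokenize(d) for d in docs]
    # The vocabulary is the sorted set of all distinct tokens.
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {term: j for j, term in enumerate(vocab)}
    matrix = [[0] * len(vocab) for _ in docs]
    for i, toks in enumerate(tokenized):
        for term, count in Counter(toks).items():
            matrix[i][index[term]] = count
    return vocab, matrix


def most_frequent_terms(vocab, matrix, n=3):
    """Sum each column of the matrix and return the n terms with the highest totals."""
    totals = [sum(col) for col in zip(*matrix)]
    # Sort by descending count, breaking ties alphabetically.
    ranked = sorted(zip(vocab, totals), key=lambda pair: (-pair[1], pair[0]))
    return ranked[:n]


docs = ["the cat sat on the mat", "the dog sat"]
vocab, dtm = document_term_matrix(docs)
top = most_frequent_terms(vocab, dtm, n=2)
print(vocab)
print(dtm)
print(top)
```

A dense list-of-lists is used here only for readability; for large corpora most entries are zero, which is why the package description mentions sparse matrices.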

Maintainer: Lampros Mouselimis
Author(s): Lampros Mouselimis <mouselimislampros@gmail.com>

License: GPL-3

Uses: data.table, Matrix, R6, Rcpp, testthat, knitr, rmarkdown, covr

Released 2 months ago.


11 previous versions
