mlvocab (0.1)

0 users

Vocabulary and Corpus Preprocessing for Natural Language Pipelines.

https://github.com/vspinu/mlvocab/
http://cran.r-project.org/web/packages/mlvocab

Utilities for preprocessing of text corpora into data structures suitable for natural language models: integer sequences or matrices, vocabulary embedding matrices, term-doc, doc-term, term co-occurrence matrices etc. All functions allow for full or partial hashing of the terms in the vocabulary.

Maintainer: Vitalie Spinu
Author(s): Vitalie Spinu [aut, cre]

License: GPL-3

Uses: digest, Matrix, Rcpp, sparsepp, testthat, knitr

Released 5 months ago.


1 previous version

Ratings

Overall:

  (0 votes)

Documentation:

  (0 votes)

Log in to vote.

Reviews

No one has written a review of mlvocab yet. Want to be the first? Write one now.


Related packages:(20 best matches, based on common tags.)


Search for mlvocab on google, google scholar, r-help, r-devel.

Visit mlvocab on R Graphical Manual.