mlvocab (0.1)

0 users

Vocabulary and Corpus Preprocessing for Natural Language Pipelines.

Utilities for preprocessing of text corpora into data structures suitable for natural language models: integer sequences or matrices, vocabulary embedding matrices, term-doc, doc-term, term co-occurrence matrices etc. All functions allow for full or partial hashing of the terms in the vocabulary.

Maintainer: Vitalie Spinu
Author(s): Vitalie Spinu [aut, cre]

License: GPL-3

Uses: digest, Matrix, Rcpp, sparsepp, testthat, knitr

Released over 1 year ago.

1 previous version



  (0 votes)


  (0 votes)

Log in to vote.


No one has written a review of mlvocab yet. Want to be the first? Write one now.

Related packages:(20 best matches, based on common tags.)

Search for mlvocab on google, google scholar, r-help, r-devel.

Visit mlvocab on R Graphical Manual.