PGRdup (

0 users

Discover Probable Duplicates in Plant Genetic Resources Collections.

Provides functions to aid the identification of probable/possible duplicates in Plant Genetic Resources (PGR) collections using 'passport databases' comprising of information records of each constituent sample. These include methods for cleaning the data, creation of a searchable Key Word in Context (KWIC) index of keywords associated with sample records and the identification of nearly identical records with similar information by fuzzy, phonetic and semantic matching of keywords.

Maintainer: J. Aravind
Author(s): J. Aravind [aut, cre] (<>), J. Radhamani [aut], Kalyani Srinivasan [aut], B. Ananda Subhash [aut], Rishi Kumar Tyagi [aut], ICAR-NBGPR [cph] (, Maurice Aubrey [ctb] (Double Metaphone), Kevin Atkinson [ctb] (Double Metaphone), Lawrence Philips [ctb] (Double Metaphone)

License: GPL-2 | GPL-3

Uses: data.table, ggplot2, gridExtra, igraph, stringdist, stringi, XML, diagram, microbenchmark, wordcloud, knitr, pander, rmarkdown

Released 10 days ago.

9 previous versions



  (0 votes)


  (0 votes)

Log in to vote.


No one has written a review of PGRdup yet. Want to be the first? Write one now.

Related packages:(20 best matches, based on common tags.)

Search for PGRdup on google, google scholar, r-help, r-devel.

Visit PGRdup on R Graphical Manual.