Description
N-Gram Analysis of Biological Sequences.
Description
Tools for extraction and analysis of various n-grams (k-mers) derived from biological sequences (proteins or nucleic acids). Contains QuiPT (quick permutation test) for fast feature-filtering of the n-gram data.
README.md
biogram package
This package contains tools for extraction and analysis of various n-grams (sequences of n items) derived from biological sequences (proteins or nucleic acids). To deal with the curse of dimensionality of the n-grams, biogram uses Quick Permutation Test (QuiPT) for fast feature filtering.
Installation
biogram is available on CRAN, so installation is as simple as:
install.packages("biogram")
You can install the latest development version of the code using the devtools
R package.
# Install devtools, if you haven't already.
install.packages("devtools")
library(devtools)
install_github("michbur/biogram")
For citation type:
citation("biogram")
or use: Michal Burdukiewicz, Piotr Sobczyk and Chris Lauber (2016). biogram: N-Gram Analysis of Biological Sequences. R package version 1.3. https://cran.r-project.org/package=biogram.