Efficient RNA/DNA/Protein Primary/Secondary Structure.
This is a base library for bioinformatics with emphasis on RNA and DNA primary structure as well as amino acid sequences.
Provided are efficient encodings for short (limited by the amount of RAM) sequences, as required by RNA folding tools. Extended RNA secondary structures can be represented as well.
Contains data from:
Frequency and isostericity of RNA base pairs
Jesse Stombaugh, Craig L. Zirbel, Eric Westhof, and Neocles B. Leontis
Nucl. Acids Res. (2009)
BiobaseXNA
Efficient encoding of (short) biological sequences. This package ist designed to deal with in-memory snippets of DNA, RNA, and amino acids. The encoding is geared toward time-efficiency, not necessarily space efficiency (we use Int's for encoding characters, not the smallest type possible).
Additional modules provide conversion capabilities between different types of characters according to biological laws, and some biochemical constraint information. The latter includes canonical and non-canonical pairing information for RNA.
Actual energy parameters for pairings are provided by other packages, for example BiobaseTurner for the loop energy model with measured parameters.
Contact
Christian Hoener zu Siederdissen
Leipzig University, Leipzig, Germany
[email protected]
http://www.bioinf.uni-leipzig.de/~choener/