Description

Chemical Metrics for Microbial Communities.

Combines taxonomic classifications of high-throughput 16S rRNA gene sequences with reference proteomes of archaeal and bacterial taxa to generate amino acid compositions of community reference proteomes. Calculates chemical metrics including carbon oxidation state ('Zc'), stoichiometric oxidation and hydration state ('nO2' and 'nH2O'), H/C, N/C, O/C, and S/C ratios, grand average of hydropathicity ('GRAVY'), isoelectric point ('pI'), protein length, and average molecular weight of amino acid residues. Uses precomputed reference proteomes for archaea and bacteria derived from the Genome Taxonomy Database ('GTDB'). Also includes reference proteomes derived from the NCBI Reference Sequence ('RefSeq') database and manual mapping from the 'RDP Classifier' training set to 'RefSeq' taxonomy as described by Dick and Tan (2023) <doi:10.1007/s00248-022-01988-9>. Processes taxonomic classifications in 'RDP Classifier' format or OTU tables in 'phyloseq-class' objects from the Bioconductor package 'phyloseq'.

chem16S calculates chemical metrics for microbial communities by combining taxonomic abundances with genomic reference sequences for proteins. The chemical representation of communities has applications ranging from human microbiomes to Earth-life coevolution.

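Supported input formats: taxonomic classifications in RDP Classifier format and OTU tables in phyloseq-class objects.

Supported reference databases: GTDB and NCBI RefSeq.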

Description

The chem16S R package combines taxonomic classifications of high-throughput 16S rRNA gene sequences with precomputed amino acid compositions of reference proteomes for archaea and bacteria to obtain the amino acid compositions of community reference proteomes. Chemical metrics of community reference proteomes such as carbon oxidation state (Zc) and stoichiometric hydration state (nH2O) reveal new types of adaptations of microbial genomes to environmental conditions. For instance, an association of lower nH2O with higher salinity in the Baltic Sea suggests a genomically encoded dehydration trend:

Baltic Sea nH2O-Zc plot (example from chem16S::plot_metrics)

PSU stands for practical salinity units. The sequence data analyzed for this plot were taken from Herlemann et al. (2016), and the code to make the plot is available in the help page for chem16S::plot_metrics.
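
To make the idea of a community reference proteome more concrete, here is a minimal base R sketch of an abundance-weighted amino acid composition and its carbon oxidation state. It does not call chem16S and is not the package's implementation. The names calc_Zc, taxon_AA, and abundance are hypothetical, and the per-taxon compositions and read counts are made-up values for illustration only. The formula Zc = (-H + 3N + 2O + 2S) / C applies to an uncharged C-H-N-O-S formula; because removing H2O during peptide bond formation does not change Zc, free amino acid formulas are used here.

```r
# Elemental composition (C, H, N, O, S) of the 20 standard free amino acids
aa_formula <- rbind(
  Ala = c(C = 3,  H = 7,  N = 1, O = 2, S = 0),
  Arg = c(C = 6,  H = 14, N = 4, O = 2, S = 0),
  Asn = c(C = 4,  H = 8,  N = 2, O = 3, S = 0),
  Asp = c(C = 4,  H = 7,  N = 1, O = 4, S = 0),
  Cys = c(C = 3,  H = 7,  N = 1, O = 2, S = 1),
  Gln = c(C = 5,  H = 10, N = 2, O = 3, S = 0),
  Glu = c(C = 5,  H = 9,  N = 1, O = 4, S = 0),
  Gly = c(C = 2,  H = 5,  N = 1, O = 2, S = 0),
  His = c(C = 6,  H = 9,  N = 3, O = 2, S = 0),
  Ile = c(C = 6,  H = 13, N = 1, O = 2, S = 0),
  Leu = c(C = 6,  H = 13, N = 1, O = 2, S = 0),
  Lys = c(C = 6,  H = 14, N = 2, O = 2, S = 0),
  Met = c(C = 5,  H = 11, N = 1, O = 2, S = 1),
  Phe = c(C = 9,  H = 11, N = 1, O = 2, S = 0),
  Pro = c(C = 5,  H = 9,  N = 1, O = 2, S = 0),
  Ser = c(C = 3,  H = 7,  N = 1, O = 3, S = 0),
  Thr = c(C = 4,  H = 9,  N = 1, O = 3, S = 0),
  Trp = c(C = 11, H = 12, N = 2, O = 2, S = 0),
  Tyr = c(C = 9,  H = 11, N = 1, O = 3, S = 0),
  Val = c(C = 5,  H = 11, N = 1, O = 2, S = 0)
)

# Carbon oxidation state from a named vector of amino acid counts
# (hypothetical helper, not a chem16S function)
calc_Zc <- function(aa_counts) {
  # Scale each amino acid's formula by its count, then sum the elements
  elements <- colSums(aa_formula[names(aa_counts), , drop = FALSE] * aa_counts)
  unname((-elements["H"] + 3 * elements["N"] + 2 * elements["O"] +
            2 * elements["S"]) / elements["C"])
}

# Made-up amino acid compositions for two hypothetical taxa
taxon_AA <- rbind(
  taxon_A = c(Ala = 30, Gly = 25, Leu = 20, Asp = 15, Lys = 10),
  taxon_B = c(Ala = 20, Gly = 20, Leu = 30, Asp = 10, Lys = 20)
)

# Made-up read counts for the same taxa
abundance <- c(taxon_A = 300, taxon_B = 100)

# Community composition: weight each taxon's composition by its abundance
community_AA <- colSums(taxon_AA * abundance[rownames(taxon_AA)])
community_Zc <- calc_Zc(community_AA)
print(community_Zc)
```

In this sketch the community value always falls between the values of the individual taxa, so shifts in relative abundance move the community metric toward the more abundant taxa.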

Methods

  • Scripts in the GTDB_214 and RefSeq_206 directories were used to generate reference proteomes for genus- and higher-level archaeal and bacterial taxa (and viruses for RefSeq).

  • It is recommended to use 16S rRNA sequences from GTDB for taxonomic classification (files are available for DADA2 and the RDP Classifier) so that taxonomic assignments can be automatically matched to GTDB reference proteomes available in chem16S.

  • For taxonomic classifications made using the RDP training set (No. 18 07/2020, used in RDP Classifier version 2.13), chem16S includes manual mappings to the NCBI taxonomy described by Dick and Tan (2023).

Installation

First install phyloseq from Bioconductor:

if(!require("BiocManager", quietly = TRUE)) install.packages("BiocManager")
BiocManager::install("phyloseq")

Then install the release version of chem16S from CRAN:

install.packages("chem16S")

Or use install_github from remotes or devtools to install the development version of chem16S from GitHub:

if(!require("remotes", quietly = TRUE)) install.packages("remotes")
remotes::install_github("jedick/chem16S", build_vignettes = TRUE)

Metadata

Version

1.1.0

License

Unknown

Platforms (75)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
  • aarch64-darwin
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-darwin
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-darwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows