Description

Codon Usage Bias Analysis.

A suite of functions for rapid and flexible analysis of codon usage bias. It provides in-depth analysis at the codon level, including relative synonymous codon usage (RSCU), tRNA weight calculations, machine learning predictions for optimal or preferred codons, and visualization of codon-anticodon pairing. Additionally, it can calculate various gene-specific codon indices such as codon adaptation index (CAI), effective number of codons (ENC), fraction of optimal codons (Fop), tRNA adaptation index (tAI), mean codon stabilization coefficients (CSCg), and GC contents (GC/GC3s/GC4d). It also supports both standard and non-standard genetic code tables found in NCBI, as well as custom genetic code tables.

cubar

Lifecycle: experimental

Overview

cubar is a package for codon usage bias analysis in R. Main features are as follows:

  • Codon level analyses
    • Calculate tRNA weights;
    • Calculate relative synonymous codon usage (RSCU);
    • Machine learning-based inference of optimal codons;
    • Visualize codon-anticodon pairing relationships;
  • Gene level analyses
    • Tabulate codon frequency of each coding sequence;
    • Measure codon usage similarity to highly expressed genes with Codon Adaptation Index (CAI);
    • Quantify the influence of codon usage on mRNA stability with Mean Codon Stabilization Coefficients (CSCg);
    • Measure codon usage bias with the nonparametric index Effective Number of Codons (ENC);
    • Measure the fraction of pre-determined optimal codons (Fop) in each sequence;
    • Calculate overall GC content (GC) or that of 3rd synonymous positions (GC3s) or 4-fold degenerate sites (GC4d);
    • Quantify whether codon usage matches tRNA availability using tRNA Adaptation Index (tAI);
  • Utilities
    • Sliding window analysis of codon usage within a coding sequence;
    • Optimize codon usage based on optimal codons for heterologous expression;
    • Test differential usage of codons between two sets of sequences;
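
To make the codon-level measures above more concrete, the following is a minimal base R sketch of how RSCU can be computed for a single amino acid family. It deliberately does not call any cubar function; the toy sequence `cds`, the helper `split_codons`, and the variable names are hypothetical and chosen only for illustration. RSCU is taken here as the observed count of a codon divided by the mean count across its synonymous family.

```r
# Base R sketch of relative synonymous codon usage (RSCU) for one
# amino acid family. Illustrative only; this is not cubar's API.

# Hypothetical toy coding sequence (length is a multiple of 3).
cds <- "ATGGCTGCTGCCGCAGCTGCGGCTTAA"

# Split a sequence into consecutive, non-overlapping codons.
split_codons <- function(x) {
  starts <- seq(1, nchar(x) - 2, by = 3)
  substring(x, starts, starts + 2)
}

codons <- split_codons(cds)

# The four synonymous codons for alanine in the standard genetic code.
ala_codons <- c("GCT", "GCC", "GCA", "GCG")

# Count each alanine codon, keeping zero counts for unused codons.
ala_counts <- as.vector(
  table(factor(codons[codons %in% ala_codons], levels = ala_codons))
)

# RSCU = observed count / mean count within the synonymous family.
rscu <- ala_counts / mean(ala_counts)
names(rscu) <- ala_codons

print(round(rscu, 2))
```

A value above 1 indicates a codon used more often than expected under uniform synonymous usage, and the values within a family sum to the number of synonymous codons (4 for alanine). For real analyses, the package's own functions and documentation should be preferred over this sketch.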

Main advantages of cubar are as follows:

  • Process large datasets (>100,000 sequences) efficiently using the Biostrings and data.table backends;
  • Support genetic codes cataloged by NCBI as well as custom ones;
  • Integrate with other data analysis or bioinformatic packages in the R ecosystem;

Dependencies

Depends

  • R (>= 4.1.0)

Imports

  • Biostrings (>= 2.60.0),
  • IRanges (>= 2.34.0),
  • data.table (>= 1.14.0),
  • ggplot2 (>= 3.3.5),
  • rlang (>= 0.4.11)

Installation

The latest release of cubar can be installed with:

install.packages("cubar")

The latest developmental version of cubar can be installed with:

devtools::install_github("mt1022/cubar", dependencies = TRUE)

Usage

Documentation can be found within R (by typing ?function_name). The following tutorials are available from our website:

Getting help

Please use GitHub issues for bug reports, questions, and feature requests.

Suggests

  • Biostrings for sequence input/output and manipulation;
  • Peptides for peptide- or protein-related indices;

Acknowledgements

GitHub Copilot was used to suggest code snippets in the development of this package. Thanks to the GitHub Education teacher program for providing free access to GitHub Copilot.

Metadata

Version

1.0.0

License

Unknown

Platforms (75)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
  • aarch64-darwin
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-darwin
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-darwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows