MyNixOS website logo
Description

Genetic Analysis of Pooled Samples.

Analyzing genetic data obtained from pooled samples. This package can read in Fragment Analysis output files, process the data, and score peaks, as well as facilitate various analyses, including cluster analysis, calculation of genetic distances and diversity indices, as well as bootstrap resampling for statistical inference. Specifically tailored to handle genetic data efficiently, researchers can explore population structure, genetic differentiation, and genetic relatedness among samples. We updated some functions from Covarrubias-Pazaran et al. (2016) <doi:10.1186/s12863-016-0365-6> to allow for the use of new file formats and referenced the following to write our genetic analysis functions: Long et al. (2022) <doi:10.1038/s41598-022-04776-0>, Jost (2008) <doi:10.1111/j.1365-294x.2008.03887.x>, Nei (1973) <doi:10.1073/pnas.70.12.3321>, Foulley et al. (2006) <doi:10.1016/j.livprodsci.2005.10.021>, Chao et al. (2008) <doi:10.1111/j.1541-0420.2008.01010.x>.

R-CMD-check R-CMD-check

Package Overview

pooledpeaks is designed for analyzing genetic data obtained from Fragment Analysis output files (.fsa) of pooled biological samples. It provides functions for a comprehensive analysis pipeline from processing .fsa files, to cleaning the peak data, and conducting population genetic analyses. Some features are listed below and a usage example of the entire pipeline is included as a vignette. Please check out the Contributing Guidelines for information on how to add to this package.

Installation Instructions

You can install the package directly from GitHub using the following instructions:

Open R and copy the following code into your console

Install devtools and pooledpeaks from GitHub

install.packages("devtools")
devtools::install_github("kmkuesters/pooledpeaks")

Install pooledpeaks directly from CRAN

install.packages("pooledpeaks")

Features

For a detailed example of how to apply the functions contained in this package please see the Introduction to Using the pooledpeaks Workflow. Example data can be found on GitHub under the inst/extdata folder including .fsa files and a formatted "Multiplex_frequencies.txt" file for the Genetic Analysis portion.

  • Peak Scoring: Process .fsa files and score peaks contained therein.
check_fsa_v_batch()
fsa_metadata()
fsa_batch_imp()
associate_dyes()
score_markers_rev3()
  • Data Manipulation: Clean and prepare peak data for downstream analyses.
clean_scores()
lf_to_tdf()
data_manipulation()
Rep_check()
PCDM()
LoadData()
  • Population Genetics Analysis:

    • Calculate Gene Identity Matrix and Genetic Distance Matrix

    • Calculate diversity indices

    • Calculate differentiation indices

    • Perform cluster analysis

TypedLoci()
GeneIdentityMatrix()
GeneticDistanceMatrix()
GST()
JostD()
cluster()
  • Visualization: Visualize the peak scoring and genetic analysis results.
MDSplot()

Sample Data

The sample .fsa files included in this package are provided for demonstration purposes and originate from two sources:

  • Schistosoma haematobium laboratory isolates, used for preliminary testing of the pooledpeaks workflow. These data contain no identifiable or human subject information.
  • De-identified Schistosoma mansoni samples from a three studies conducted in Brazil, extracted from discarded human waste. These files were originally used for genetic analysis and are shared here in anonymized form to illustrate compatibility with additional species and data sources.These studies are described in detail by Long et al. (2022), available at https://www.nature.com/articles/s41598-022-04776-0:

These files are intended solely to demonstrate the functionality of the pooledpeaks package and are not for diagnostic or clinical use.To access the example .fsa files included with the package, use the following path within R:

system.file("extdata", package = "pooledpeaks")

The pooledpeaks package was developed by the Blanton Lab as part of Kathleen Kuesters' dissertation.

References:

Metadata

Version

1.2.2

License

Unknown

Platforms (75)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-freebsd
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64-windows
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows