MyNixOS website logo
Description

Machine Learning for Integrating Partially Overlapped Genetic Datasets.

Tools to simulate genetic distance matrices, align and compare them via multidimensional scaling (MDS) and Procrustes, and evaluate imputation with the Bootstrapping Evaluation for Structural Missingness Imputation (BESMI) framework. Methods align with Zhu et al. (2025) <doi:10.3389/fpls.2025.1543956> and the associated software resource Zhu (2025) <doi:10.26188/28602953>.

DataFusionGDM

DOI (software) DOI (paper)

Machine Learning Solutions for Integrating Partially Overlapped Genetic Datasets.

How to cite

If you use DataFusion-GDM, please cite:

  • Paper: Zhu J., Malmberg M.M., Shinozuka M., Retegan R.M., Cogan N.O., Jacobs J.L., Giri K., Smith K.F. (2025). Machine learning solutions for integrating partially overlapping genetic datasets and modelling host–endophyte effects in ryegrass (Lolium) dry matter yield estimation. Frontiers in Plant Science. https://doi.org/10.3389/fpls.2025.1543956
  • Software: Zhu, J. (2025). DataFusion-GDM. The University of Melbourne. Software. https://doi.org/10.26188/28602953

Author ORCID: https://orcid.org/0000-0002-9916-9732

Install

In R:

if (!requireNamespace("remotes", quietly = TRUE)) install.packages("remotes")
remotes::install_github("jiashuaiz/DataFusion-GDM")

Usage

library(DataFusionGDM)

# Simulate a GDM in memory and visualize
res <- run_genetic_scenario("island", n_pops = 40)
res$plots$heatmap()
res$plots$mds()

# Optionally export to CSV if needed (defaults to tempdir)
tmp <- export_simulated_gdm(scenario = "default", n_pops = 40, verbose = FALSE)
# unlink(tmp)  # clean up when finished

# Simulate and visualize
source(system.file("examples/simulate_gdm_quick.R", package = "DataFusionGDM"), echo = TRUE)

# MDS + Procrustes
source(system.file("examples/mds_procrustes_demo.R", package = "DataFusionGDM"), echo = TRUE)

# BESMI batch (small demo)
source(system.file("examples/besmi_batch_quick.R", package = "DataFusionGDM"), echo = TRUE)

Vignettes

See the package vignettes for end-to-end guides:

  • Getting started
  • MDS + Procrustes sensitivity
  • BESMI batch imputation

Open vignettes in R:

browseVignettes("DataFusionGDM")
vignette("getting-started", package = "DataFusionGDM")

Contents

  • Simulation and visualization APIs in R/simulate_gdm.R
  • MDS & Procrustes APIs in R/mds_procrustes.R
  • BESMI preparation and imputation APIs in R/besmi*.R
  • Vignettes under vignettes/ (no bundled data; examples use in-memory/temp files)

License

GPL-3.0

Metadata

Version

1.3.2

License

Unknown

Platforms (78)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    uefi
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-freebsd
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64-uefi
  • aarch64-windows
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-linux
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-uefi
  • x86_64-windows