MyNixOS website logo
Description

Weighted Subspace Random Forest for Classification.

A parallel implementation of Weighted Subspace Random Forest. The Weighted Subspace Random Forest algorithm was proposed in the International Journal of Data Warehousing and Mining by Baoxun Xu, Joshua Zhexue Huang, Graham Williams, Qiang Wang, and Yunming Ye (2012) <DOI:10.4018/jdwm.2012040103>. The algorithm can classify very high-dimensional data with random forests built using small subspaces. A novel variable weighting method is used for variable subspace selection in place of the traditional random variable sampling.This new approach is particularly useful in building models from high-dimensional data.

wsrf: An R Package for Scalable Weighted Subspace Random Forests

License Version on CRAN Number of downloads from RStudio CRAN mirror

The wsrf is a parallel implementation of the Weighted Subspace Random Forest algorithm (wsrf) of Xu et al. A novel variable weighting method is used for variable subspace selection in place of the traditional approach of random variable sampling. This new approach is particularly useful in building models for high dimensional data---often consisting of thousands of variables. Parallel computation is used to take advantage of multi-core machines and clusters of machines to build random forest models from high dimensional data with reduced elapsed times.

Documentation & Examples

The package ships with a html vignette including more details and a few examples.

Installation

Currently, wsrf requires R (>= 3.3.0), Rcpp (>= 0.10.2). For the use of multi-threading, a C++ compiler with C++11 standard support of threads (for example, GCC 4.8.1) is required. Since the latest version of R has added support for C++11 on all operating systems, we do not provide support for the old version of R and C++ compiler without C++11 support. To install the latest version of the package, from within R run:

R> install.packages("wsrf")

NOTE

Previous version of wsrf provide support on systems without C++11 or using Boost for multithreading. Though we do not provide support for these options anymore, but still leave the usage here for someone with needs of previous version of wsrf. The choice is available at installation time depending on what is available to the user:

# To install previous version of wsrf without C++11
R> install.packages("wsrf", type = "source", configure.args = "--enable-c11=no")

# To install previous version of wsrf with Boost for multithreading
R> install.packages("wsrf",
+                   type = "source",
+                   configure.args = "--with-boost-include=<Boost include path>
                                      --with-boost-lib=<Boost lib path>")

After installation, one can use the built-in function wsrfParallelInfo to query whether the version installed is what they really want (distributed or multi-threaded).

R> wsrfParallelInfo()

License

GPL (>= 2)

Metadata

Version

1.7.30

License

Unknown

Platforms (75)

    Darwin
    FreeBSD 13
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-darwin
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-darwin
  • i686-freebsd13
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd13
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows