Description
Bayesian Additive Regression Trees.
Description
An advanced implementation of Bayesian Additive Regression Trees with expanded features for data analysis and visualization.
README.md
Benchmark Suite: bartMachine vs randomForest
This benchmark suite runs repeated K-fold cross-validation and compares out-of-sample performance between bartMachine and randomForest across datasets drawn from standard benchmark libraries (e.g., datasets, MASS, mlbench, ISLR).
Quick start
- Install the package and optional dataset libraries:
install.packages(c("bartMachine", "randomForest", "mlbench", "ISLR", "pROC"))
- Run the suite from the package root:
Rscript inst/benchmarks/run_benchmark_suite.R --folds=5 --repeats=3
- Outputs are written to
inst/benchmarks/results:benchmark_folds.csv: per-fold metricsbenchmark_summary.csv: mean and standard deviation by dataset/modelbenchmark_skipped.csv: datasets that were skipped and whybenchmark_results.rds: all results + configsessionInfo.txt: session metadata
Notes
- Set Java memory and other parameters before loading
bartMachine. You can do this via:options(java.parameters = c("-Xmx20g", "--add-modules=jdk.incubator.vector", "-XX:+UseZGC"))
- Use
--listto view all dataset definitions and--dry-runto list the filtered selection without running. - Large or slow datasets can be skipped with
--skip-tags=large. pROCis optional; if installed, AUC is computed for classification tasks.
Example filters
Rscript inst/benchmarks/run_benchmark_suite.R --packages=datasets,MASS --folds=3
Rscript inst/benchmarks/run_benchmark_suite.R --datasets=Boston,BostonHousing
Rscript inst/benchmarks/run_benchmark_suite.R --skip-tags=large --repeats=2