Ordered Correlation Forest.
R package to implement ordered correlation forests (OCF), a nonparametric estimator specifically optimized for handling ordered non-numeric outcomes.
OCF modifies a standard random forest splitting criterion to build a collection of forests, each estimating the conditional probabilities of a single class. Under an "honesty" condition, the estimator inherits the asymptotic properties of random forests, namely the consistency and asymptotic normality of their predictions. The particular honesty implementation used by OCF allows us to obtain standard errors for the covariates' marginal effects. The estimated standard errors can then be used to construct conventional confidence intervals.
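To make the two ideas above concrete, here is a minimal base R sketch. It does not call any function from this package. It replaces the forests with simple cell averages on simulated data, and the marginal effect estimate and standard error at the end are made-up numbers used only to show how a conventional confidence interval is formed under asymptotic normality. All object names (`class_prob`, `effect_hat`, `se_hat`) are hypothetical.

```r
# Illustration only: base R, no ocf functions are used.
set.seed(1)
n <- 500
x <- rbinom(n, 1, 0.5)          # one binary covariate
latent <- x + rnorm(n)          # latent index driving the outcome
y <- cut(latent, breaks = c(-Inf, 0, 1, Inf), labels = FALSE)  # ordered classes 1, 2, 3

# One estimator per class: P(Y = m | X = x_value) is the conditional mean
# of the indicator 1(Y = m). A cell average stands in for a forest here.
class_prob <- function(m, x_value) {
  mean(as.numeric(y[x == x_value] == m))
}
probs_x0 <- sapply(1:3, class_prob, x_value = 0)
probs_x1 <- sapply(1:3, class_prob, x_value = 1)

# Conventional 95% confidence interval from an estimate and its standard error.
# Both values below are assumed for illustration, not package output.
effect_hat <- 0.12
se_hat <- 0.04
ci <- effect_hat + c(-1, 1) * qnorm(0.975) * se_hat

print(rbind(x0 = probs_x0, x1 = probs_x1))
print(ci)
```

Each row of the printed matrix holds the estimated class probabilities for one value of the covariate, and the interval is the usual estimate plus or minus a normal quantile times the standard error.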
To get started, please check the online vignette for a short tutorial.
Installation
The current development version of the package can be installed using the devtools package:
devtools::install_github("riccardo-df/ocf") # run install.packages("devtools") if needed.
References
Athey, S., Tibshirani, J., & Wager, S. (2019). Generalized Random Forests. Annals of Statistics, 47(2).
Lechner, M., & Mareckova, J. (2022). Modified Causal Forest. arXiv preprint arXiv:2209.03744.
Lechner, M., & Okasa, G. (2019). Random Forest Estimation of the Ordered Choice Model. arXiv preprint arXiv:1907.02436.
Wager, S., & Athey, S. (2018). Estimation and Inference of Heterogeneous Treatment Effects using Random Forests. Journal of the American Statistical Association, 113(523).
Wright, M. N., & Ziegler, A. (2017). ranger: A fast implementation of random forests for high dimensional data in C++ and R. Journal of Statistical Software, 77(1).