Description
Get the Category of Content Hosted by a Domain.
Description
Get the category of content hosted by a domain. Use Shallalist <http://shalla.de/>, Virustotal (which provides access to lots of services) <https://www.virustotal.com/>, Alexa <https://aws.amazon.com/awis/>, DMOZ <https://curlie.org/>, University Domain list <https://github.com/Hipo/university-domains-list> or validated machine learning classifiers based on Shallalist data to learn about the kind of content hosted by a domain.
README.md
rdomains: Classify Domains Based on Their Content
The package provides a few ways to classify domains based on their content. You can either get the categorizations from shallalist (which has stopped its service --- the latest you will get is from 1/14/22), trusted (McAfee), DMOZ (the service has ended; available at curlie), Alexa API, which uses the DMOZ Data (now hosted at https://curlie.org), or virustotal API, or use validated machine learning models based off the shallalist data.
Installation
To get the current release version from CRAN:
install.packages("rdomains")
To get the current development version from GitHub:
# install.packages("devtools")
devtools::install_github("themains/rdomains", build_vignettes = TRUE)
Usage
To learn how to use rdomains, launch the vignette within R:
vignette("rdomains", package = "rdomains")
License
Scripts are released under the MIT License.