Description
Frictionless Standards.
A "tabular-data-resource" (<https://specs.frictionlessdata.io/tabular-data-resource/>) is a simple format to describe a single tabular data resource, such as a CSV file. It supports both metadata, such as author and title, and a schema that describes the data, for example the types of the fields/columns. Create a tabular-data-resource by providing a data.frame and specifying metadata. Write and read tabular-data-resources to and from disk.
README.md
fr
fr provides fr_tdr, fr_schema, and fr_field objects for implementing frictionless tabular-data-resource standards in R.
Installation
Install {fr} from CRAN:
install.packages("fr")
You can install the development version of {fr} from GitHub with:
# install.packages("pak")
pak::pak("cole-brokamp/fr")
Usage
- See vignette("read_fr_tdr") to read a Frictionless Tabular Data Resource from disk or the web, access metadata, and manipulate it as a data.frame
- See vignette("creating_a_tabular-data-resource") to create a Frictionless Tabular Data Resource by converting a data.frame or tibble, adding metadata, and writing to disk
Frictionless Standards
Developed by the Open Knowledge Foundation, the Frictionless Standards are a set of patterns for describing data, including datasets (Data Package), files (Data Resource), and tables (Table Schema). A Data Package is a simple container format used to describe and package a collection of data and metadata, including schemas. These metadata are contained in a specific file (separate from the data file), usually written in JSON or YAML, that describes something specific to each Frictionless Standard:
- Table Schema: describes a tabular file by providing its dimensions, field data types, relations, and constraints
- Data Resource: describes a specific tabular file by providing a path to the file and details such as title and description
- Tabular Data Resource = Data Resource + Table Schema
- CSV dialect: describes the formatting specific to the various dialects of CSV files
- Data Package & Tabular Data Package: describes a collection of tabular files providing data resource information from above along with general information about the package itself, a license, authors, and other metadata.
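To make the relationship "Tabular Data Resource = Data Resource + Table Schema" concrete, here is a minimal base R sketch that assembles a descriptor as a nested list from a data.frame. It does not use {fr}: the helpers frictionless_type(), table_schema(), and tabular_data_resource() are hypothetical names invented for this illustration, and the class-to-type mapping is a simplified assumption that covers only a few column classes.

```r
# Hypothetical helper (not part of {fr}): map an R column to a
# Frictionless Table Schema field type. Simplified on purpose.
frictionless_type <- function(x) {
  if (is.factor(x)) return("string")
  if (inherits(x, "Date")) return("date")
  if (is.logical(x)) return("boolean")
  if (is.integer(x)) return("integer")
  if (is.numeric(x)) return("number")
  "string"
}

# Table Schema: one entry per column, each with a name and a type.
table_schema <- function(d) {
  list(fields = lapply(names(d), function(nm) {
    list(name = nm, type = frictionless_type(d[[nm]]))
  }))
}

# Tabular Data Resource: Data Resource properties (name, path, title, ...)
# combined with a Table Schema describing the columns.
tabular_data_resource <- function(d, name, path, title = NULL, description = NULL) {
  resource <- list(name = name, path = path, title = title, description = description)
  resource <- Filter(Negate(is.null), resource) # drop metadata that was not supplied
  c(resource, list(profile = "tabular-data-resource", schema = table_schema(d)))
}

d <- data.frame(
  id = 1:3,
  score = c(0.5, 1.2, 3.4),
  passed = c(FALSE, TRUE, TRUE),
  label = c("a", "b", "c"),
  stringsAsFactors = FALSE
)

tdr <- tabular_data_resource(d, name = "scores", path = "scores.csv", title = "Example scores")
str(tdr)
```

Serializing a list like this to JSON or YAML would give the separate metadata file described above. In practice the fr_tdr, fr_schema, and fr_field objects take the place of these hand-built lists.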