Description
Asynchronous Disk-Based Representation of Massive Data.
Stores very large data objects on a local drive while still allowing the data to be manipulated efficiently.
README.md
Andromeda
Andromeda is part of HADES.
Introduction
AsynchroNous Disk-based Representation of MassivE DAta (ANDROMEDA): An R package for storing large data objects. Andromeda allows data objects to be stored on a local drive while still making it possible to manipulate the data efficiently.
Features
- Allows storage of data objects much larger than what can fit in memory.
- Integrates with the dplyr package for data manipulation.
- Objects are stored in a temporary location on the local file system.
- Objects can be saved to, and loaded from, a compressed file in a permanent location on the local file system.
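As a rough illustration of working with a table without pulling it into memory all at once, the sketch below appends rows to an existing table and then walks through it in batches. It assumes the package's `appendToTable()` and `batchApply()` functions behave as described in the vignette; the object name `andr`, the helper `countRows`, and the batch size of 20 are chosen purely for illustration.

```r
library(Andromeda)

# Create an Andromeda object holding one table (the built-in 'cars' data).
andr <- andromeda(cars = cars)

# Append more rows to the on-disk table; here the same data is reused.
appendToTable(andr$cars, cars)

# Process the table in batches, so only one batch is in memory at a time.
# 'countRows' is a hypothetical helper that accumulates the row count.
totalRows <- 0
countRows <- function(batch) {
  totalRows <<- totalRows + nrow(batch)
}
batchApply(andr$cars, countRows, batchSize = 20)

# Release the temporary file backing the object.
close(andr)
```

With real data the batch size would normally be much larger; a small value is used here only so that the toy table is split into several batches.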
Examples
library(Andromeda)
bigData <- andromeda()
# Add some 'big' data:
bigData$cars <- cars
# Manipulate using dplyr:
bigData$cars %>% filter(speed > 10) %>% count() %>% collect()
# # A tibble: 1 x 1
#       n
#   <int>
# 1    41
saveAndromeda(bigData, "bigData.zip")
close(bigData)
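A saved object can later be restored with `loadAndromeda()`. The following is a minimal sketch of that round trip, assuming the file is written to the session's temporary directory; the names `fileName`, `restored`, `tableNames`, and `fastCars` are illustrative only.

```r
library(Andromeda)

# Create an object with one table and save it to a compressed file.
bigData <- andromeda(cars = cars)
fileName <- file.path(tempdir(), "bigData.zip")  # illustrative location
saveAndromeda(bigData, fileName)
close(bigData)

# Load the saved file into a new Andromeda object.
restored <- loadAndromeda(fileName)

# List the tables that were restored.
tableNames <- names(restored)

# Query the restored table with dplyr and pull the result into memory.
fastCars <- restored$cars %>% filter(speed > 20) %>% collect()

# Clean up the temporary copy and the saved file.
close(restored)
unlink(fileName)
```

Loading creates a fresh temporary copy, so changes made to `restored` do not affect the saved file unless it is saved again.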
Technology
The Andromeda package is an R package wrapped around RSQLite.
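To give a feel for what wrapping RSQLite makes possible, the sketch below performs the same kind of query directly against an SQLite file using DBI, RSQLite, and dplyr (with dbplyr installed). It only illustrates the general idea of a disk-backed table queried lazily; it is not a description of Andromeda's internals, and the names `dbFile`, `con`, and `result` are assumptions made for this example.

```r
library(DBI)
library(dplyr)

# Create an SQLite database in a temporary file on the local drive.
dbFile <- tempfile(fileext = ".sqlite")
con <- dbConnect(RSQLite::SQLite(), dbFile)

# Write a data frame to disk as a table.
dbWriteTable(con, "cars", cars)

# Build a lazy dplyr query; it runs in SQLite, and only the final
# one-row result is brought into memory by collect().
result <- tbl(con, "cars") %>%
  filter(speed > 10) %>%
  count() %>%
  collect()

# Disconnect and remove the temporary database file.
dbDisconnect(con)
unlink(dbFile)
```

Andromeda handles the temporary file, the connection, and the cleanup on the user's behalf, which is why the example in this README needs only `andromeda()`, `$`, and `close()`.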
System Requirements
Running the package requires R.
Installation
To install the latest development version, install from GitHub:
install.packages("devtools")
devtools::install_github("ohdsi/Andromeda")
User Documentation
Documentation can be found on the package website.
- Vignette: Using Andromeda
- Package manual: Andromeda manual
Support
- Developer questions/comments/feedback: OHDSI Forum
- We use the GitHub issue tracker for all bugs/issues/enhancements
Contributing
Read here how you can contribute to this package.
License
Andromeda is licensed under the Apache License 2.0.
Development
Andromeda is being developed in RStudio.
Development status
Beta. Use at your own risk.