MyNixOS website logo
Description

Fast, Flexible, and User-Friendly Record Linkage Methods.

Provides a flexible set of tools for matching two un-linked data sets. 'fedmatch' allows for three ways to match data: exact matches, fuzzy matches, and multi-variable matches. It also allows an easy combination of these three matches via the tier matching function.

fedmatch

CRANstatus

Any views expressed here do not reflect those of the Federal Reserve Board or Federal Reserve System.

The goal of fedmatch is to match un-linked datasets. It provides a variety of tools that allow a user to build a custom matching algorithm for their specific application. To get started, see the “Introduction to fedmatch” vignette.

You can view all the vignettes, and the rest of the documentation, on the fedmatch website.

Features

  • String cleaning tools
  • Fuzzy matching with standard string distance metrics from the package stringdist
  • A new fuzzy matching method which we call a Weighted Jaccard metric
  • Numeric matching using a trained logit model
  • A system to sequentially execute many different types of match algorithms
  • A system for evaluating matches post-hoc

Installation

You can install ‘fedmatch’ from CRAN with

install.packages("fedmatch")

You can install the development version from GitHub with:

install.packages("devtools")
devtools::install_github("seunglee98/fedmatch", build_vignettes = TRUE)

Or, you can download the folder from github, either by cloning it or downloading it manually and unzipping it, then running:

devtools::install("path_to_fedmatch", build_vignettes = TRUE)

Citation

This package is licensed under the terms of the MIT license. See the LICENSE.md file for details.

If you use this package for your research, please cite the technical paper:

Gregory J. Cohen, Jacob Dice, Melanie Friedrichs, Kamran Gupta, William Hayes, Isabel Kitschelt, Seung Jung Lee, W. Blake Marsh, Nathan Mislang, Maya Shaton, Martin Sicilian, Chris Webster. “The U.S. Syndicated Loan Market: Matching Data.” Journal of Financial Research, 2021.

Metadata

Version

2.0.6

License

Unknown

Platforms (75)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-darwin
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-darwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-windows