MyNixOS website logo
Description

Pubmed Unified REtrieval for Multi-Output Exploration.

Access a variety of 'PubMed' data through a single, user-friendly interface, including abstracts, bibliometrics from 'iCite', pubtations from 'PubTator3', and full-text records from 'PMC'.

puremoe

PubMed Unified REtrieval for Multi-Output Exploration. An R package that provides a single interface for accessing a range of NLM/PubMed databases, including:

This unified interface simplifies the data retrieval process, allowing users to interact with multiple PubMed services/APIs/output formats through a single R function.

The package also includes MeSH thesaurus resources as simple data frames, including Descriptor Terms, Descriptor Tree Structures, and Supplementary Concept Terms. Via the mesh-resources library.

Installation

Get the released version from CRAN:

install.packages('puremoe')

Or the development version from GitHub with:

remotes::install_github("jaytimm/puremoe")

Usage

PubMed search

The package has two basic functions: search_pubmed and get_records. The former fetches PMIDs from the PubMed API based on user search; the latter scrapes PMID records from a user-specified PubMed endpoint – pubmed_abstracts, pubmed_affiliations, pubtations, icites, or pmc_fulltext.

Search syntax is the same as that implemented in standard PubMed search.

pmids <- puremoe::search_pubmed('("political ideology"[TiAb])',
                                 use_pub_years = F)

# pmids <- puremoe::search_pubmed('immunity', 
#                                  use_pub_years = T,
#                                  start_year = 2022,
#                                  end_year = 2024) 

Get record-level data

pubmed <- pmids |> 
  puremoe::get_records(endpoint = 'pubmed_abstracts', 
                       cores = 3, 
                       sleep = 1) 

affiliations <- pmids |> 
  puremoe::get_records(endpoint = 'pubmed_affiliations', 
                       cores = 1, 
                       sleep = 0.5)

icites <- pmids |>
  puremoe::get_records(endpoint = 'icites',
                       cores = 3,
                       sleep = 0.25)

pubtations <- pmids |> 
  puremoe::get_records(endpoint = 'pubtations',
                       cores = 2)

# For PMC full text, first convert PMIDs to PMC IDs
pmc_ids <- puremoe::pmid_to_pmc(pmids, sleep = 0.5)
pmc_fulltext <- pmc_ids$url[!is.na(pmc_ids$url)] |> 
  puremoe::get_records(endpoint = 'pmc_fulltext', 
                       cores = 1, 
                       sleep = 1)

For the pmc_fulltext endpoint, first use pmid_to_pmc() to convert PMIDs to PMC IDs and full URLs, then pass the url column to get_records().

Summary

OutputColnameDescription
pubmed_abstractspmidPMID
pubmed_abstractsyearPublication year
pubmed_abstractsjournalJournal name
pubmed_abstractsarticletitleArticle title
pubmed_abstractsabstractArticle abstract
pubmed_abstractsannotationsMesh/Chem/Keywords annotations
pubmed_affiliationspmidPMID
pubmed_affiliationsAuthorAuthor name
pubmed_affiliationsaffiliationAuthor affiliation
pubtationspmidPMID
pubtationstiabTitle or abstract
pubtationsidEntity ID
pubtationsentityExtracted entity
pubtationsidentifierKnowledge base link (KB link)
pubtationstypeEntity type
pubtationsstartStart position (char)
pubtationsendEnd position (char)
pmc_fulltextpmidPMID
pmc_fulltextsectionFull text section
pmc_fulltexttextFull text content
icitespmidPMID
icitesis_research_articleResearch article indicator
icitesnih_percentileNIH percentile rank
icitesis_clinicalClinical article indicator
icitescitation_countCitation count
icitesref_countReference count
icitescitation_netCitation network (to/from edgelist)
Metadata

Version

1.0.3

License

Unknown

Platforms (78)

    Darwin
    FreeBSD
    Genode
    GHCJS
    Linux
    MMIXware
    NetBSD
    none
    OpenBSD
    Redox
    Solaris
    uefi
    WASI
    Windows
Show all
  • aarch64-darwin
  • aarch64-freebsd
  • aarch64-genode
  • aarch64-linux
  • aarch64-netbsd
  • aarch64-none
  • aarch64-uefi
  • aarch64-windows
  • aarch64_be-none
  • arm-none
  • armv5tel-linux
  • armv6l-linux
  • armv6l-netbsd
  • armv6l-none
  • armv7a-linux
  • armv7a-netbsd
  • armv7l-linux
  • armv7l-netbsd
  • avr-none
  • i686-cygwin
  • i686-freebsd
  • i686-genode
  • i686-linux
  • i686-netbsd
  • i686-none
  • i686-openbsd
  • i686-windows
  • javascript-ghcjs
  • loongarch64-linux
  • m68k-linux
  • m68k-netbsd
  • m68k-none
  • microblaze-linux
  • microblaze-none
  • microblazeel-linux
  • microblazeel-none
  • mips-linux
  • mips-none
  • mips64-linux
  • mips64-none
  • mips64el-linux
  • mipsel-linux
  • mipsel-netbsd
  • mmix-mmixware
  • msp430-none
  • or1k-none
  • powerpc-linux
  • powerpc-netbsd
  • powerpc-none
  • powerpc64-linux
  • powerpc64le-linux
  • powerpcle-none
  • riscv32-linux
  • riscv32-netbsd
  • riscv32-none
  • riscv64-linux
  • riscv64-netbsd
  • riscv64-none
  • rx-none
  • s390-linux
  • s390-none
  • s390x-linux
  • s390x-none
  • vc4-none
  • wasm32-wasi
  • wasm64-wasi
  • x86_64-cygwin
  • x86_64-darwin
  • x86_64-freebsd
  • x86_64-genode
  • x86_64-linux
  • x86_64-netbsd
  • x86_64-none
  • x86_64-openbsd
  • x86_64-redox
  • x86_64-solaris
  • x86_64-uefi
  • x86_64-windows