Download and Analyze Crash Data.
rfars
The goal of rfars
is to facilitate transportation safety analysis by simplifying the process of extracting data from official crash databases. The National Highway Traffic Safety Administration collects and publishes a census of fatal crashes in the Fatality Analysis Reporting System and a sample of fatal and non-fatal crashes in the Crash Report Sampling System (an evolution of the General Estimates System). The Fatality and Injury Reporting System Tool allows users to query these databases, and can produce simple tables and graphs. This suffices for simple analysis, but often leaves researchers wanting more. Digging any deeper, however, involves a time-consuming process of downloading annual ZIP files and attempting to stitch them together - after first combing through immense data dictionaries to determine the required variables and table names.
rfars
allows users to download FARS and GES/CRSS data back to 2011 with just one line of code. The result is a full, rich dataset ready for mapping, modeling, and other downstream analysis. Helper functions are also provided to produce common counts and comparisons.
Installation
You can install the latest version of rfars
from GitHub with:
# install.packages("devtools")
devtools::install_github("s87jackson/rfars")
or the CRAN stable release with:
install.packages("rfars")
Then load rfars and some helpful packages:
library(rfars)
library(dplyr)
Getting and Using Data
The get_fars()
and get_gescrss()
are the primary functions of the rfars
package. These functions either download and process data files directly from NHTSA’s FTP Site, or pull the prepared file stored on your local machine. They take the functions years
and states
(FARS) or regions
(GES/CRSS). As the source data files follow an annual structure, years
determines how many file sets are downloaded, and states
/regions
filters the resulting dataset. Downloading and processing these files can take several minutes. Before downloading rfars
will inform you that it’s about to download files and asks your permission to do so. To skip this dialog, set proceed = TRUE
. You can use the dir
and cache
parameters to save an RDS file to your local machine. The dir
parameter specifices the directory, and cache
names the file (be sure to include the .rds file extension).
Here we get one year of FARS data for Virginia:
myFARS <- get_fars(years = 2022, states = "VA", proceed = TRUE)
#> ✓ 2022 data downloaded
#> Preparing raw data files...
#> ✓ Accident file processed
#> ✓ Vehicle file processed
#> ✓ Person file processed
#> ✓ Weather file(s) processed
#> ✓ Crash risk factors file processed
#> ✓ Vehicle-level files processed
#> ✓ PBtype file processed
#> ✓ SafetyEq file processed
#> ✓ Person-level files processed
#> ✓ Flat file constructed
#> ✓ Multi_acc file constructed
#> ✓ Multi_veh file constructed
#> ✓ Multi_per file constructed
#> ✓ SOE file constructed
#> ✓ Prepared files saved in C:/Users/s87ja/AppData/Local/Temp/RtmpOI3o4P/FARS data/prepd/2022
#> ✓ Codebook file saved in C:/Users/s87ja/AppData/Local/Temp/RtmpOI3o4P/FARS data/prepd/
We could have saved that file locally with:
myFARS <- get_fars(years=2022, states = "VA", proceed = TRUE, dir = getwd(), cache = "myFARS.rds")
Note that you can assign and save this data with one function call.
We could similarly get one year of CRSS data for the south (MD, DE, DC, WV, VA, KY, TN, NC, SC, GA, FL, AL, MS, LA, AR, OK, TX):
myCRSS <- get_gescrss(years = 2022, regions = "s", proceed = TRUE)
myCRSS <- get_gescrss(years = 2022, regions = "s", proceed = TRUE, dir = getwd(), cache = "myCRSS.rds")
The data returned by get_fars()
and get_gescrss()
adhere to the same structure: a list with six tibbles: flat
, multi_acc
, multi_veh
, multi_per
, events
, and codebook
. FARS and GES/CRSS share many but not all data elements. See the FARS Analytical User’s Manual and CRSS Analytical User’s Manual for more information.
The following section decribes the results from get_fars()
but largely applies to get_gescrss()
as well.
The flat
tibble contains all variables for which there is just one value per crash (“accident”), vehicle, or person (e.g., intersection type, travel speed, age). Each row corresponds to a person involved in a crash. As there may be multiple people and/or vehicles involved in one crash, some variable-values are repeated within a crash or vehicle. Each crash is uniquely identified with id
, which is a combination of year
and st_case
. Note that st_case
is not unique across years, for example, st_case
510001 will appear in each year. The id
variable attempts to avoid this issue.
glimpse(myFARS$flat, width = 100)
#> Rows: 2,107
#> Columns: 196
#> $ year <dbl> 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022, 2022…
#> $ state <chr> "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Vir…
#> $ st_case <dbl> 510001, 510001, 510001, 510001, 510001, 510001, 510002, 510002, 510002, 5100…
#> $ id <dbl> 2022510001, 2022510001, 2022510001, 2022510001, 2022510001, 2022510001, 2022…
#> $ veh_no <dbl> 0, 1, 1, 1, 1, 1, 1, 2, 2, 0, 1, 2, 1, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 2, 1, 2…
#> $ per_no <dbl> 1, 1, 2, 3, 4, 5, 1, 1, 2, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 1, 2, 1, 1, 1…
#> $ county <dbl> 117, 117, 117, 117, 117, 117, 177, 177, 177, 73, 73, 73, 153, 155, 171, 171,…
#> $ city <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2370, 2370, 0, 0, 468, 0, 0, 440, …
#> $ lon <dbl> -78.40886, -78.40886, -78.40886, -78.40886, -78.40886, -78.40886, -77.63212,…
#> $ lat <dbl> 36.66222, 36.66222, 36.66222, 36.66222, 36.66222, 36.66222, 38.17428, 38.174…
#> $ acc_type <chr> NA, "C13-Single Driver-Forward Impact-Pedestrian/ Animal", "C13-Single Drive…
#> $ age <chr> "59 Years", "17 Years", "Not Reported", "Not Reported", "Not Reported", "Not…
#> $ air_bag <chr> "Not a Motor Vehicle Occupant", "Not Deployed", "Not Reported", "Not Reporte…
#> $ alc_res <chr> "0.197 % BAC", "Test Not Given", "Test Not Given", "Test Not Given", "Test N…
#> $ alc_status <chr> "Test Given", "Test Not Given", "Test Not Given", "Test Not Given", "Test No…
#> $ arr_hour <chr> "Unknown EMS Scene Arrival Hour", "Unknown EMS Scene Arrival Hour", "Unknown…
#> $ arr_min <chr> "Unknown if Arrived", "Unknown if Arrived", "Unknown if Arrived", "Unknown i…
#> $ atst_typ <chr> "Vitreous", "Test Not Given", "Test Not Given", "Test Not Given", "Test Not …
#> $ bikecgp <chr> "Bicyclist Failed to Yield - Sign-Controlled Intersection", NA, NA, NA, NA, …
#> $ bikectype <chr> "Bicyclist Ride Through - Sign-Controlled Intersection", NA, NA, NA, NA, NA,…
#> $ bikedir <chr> "Facing Traffic", NA, NA, NA, NA, NA, NA, NA, NA, "Not a Cyclist", NA, NA, N…
#> $ bikeloc <chr> "At Intersection", NA, NA, NA, NA, NA, NA, NA, NA, "Not a Cyclist", NA, NA, …
#> $ bikepos <chr> "Travel Lane", NA, NA, NA, NA, NA, NA, NA, NA, "Not a Cyclist", NA, NA, NA, …
#> $ body_typ <chr> NA, "Large utility (ANSI D16.1 Utility Vehicle Categories and \"Full Size\" …
#> $ bus_use <chr> NA, "Not a Bus", "Not a Bus", "Not a Bus", "Not a Bus", "Not a Bus", "Not a …
#> $ cargo_bt <chr> NA, "Not Applicable (N/A)", "Not Applicable (N/A)", "Not Applicable (N/A)", …
#> $ cdl_stat <chr> NA, "No (CDL)", "No (CDL)", "No (CDL)", "No (CDL)", "No (CDL)", "No (CDL)", …
#> $ cityname <chr> "NOT APPLICABLE", "NOT APPLICABLE", "NOT APPLICABLE", "NOT APPLICABLE", "NOT…
#> $ countyname <chr> "MECKLENBURG (117)", "MECKLENBURG (117)", "MECKLENBURG (117)", "MECKLENBURG …
#> $ day <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 5, 3, 3, 5, 8, 2, 8, 8, 1, 1, 1, 7, 7…
#> $ day_week <chr> "Saturday", "Saturday", "Saturday", "Saturday", "Saturday", "Saturday", "Sat…
#> $ death_da <chr> "1", "Not Applicable (Non-Fatal)", "Not Applicable (Non-Fatal)", "Not Applic…
#> $ death_hr <chr> "16:00-16:59", "Not Applicable (Non-fatal)", "Not Applicable (Non-fatal)", "…
#> $ death_mn <chr> "28", "Not Applicable (Non-fatal)", "Not Applicable (Non-fatal)", "Not Appli…
#> $ death_mo <chr> "January", "Not Applicable (Non-Fatal)", "Not Applicable (Non-Fatal)", "Not …
#> $ death_tm <chr> "1628", "Not Applicable (Non-fatal)", "Not Applicable (Non-fatal)", "Not App…
#> $ death_yr <chr> "2022", "Not Applicable (Non-fatal)", "Not Applicable (Non-fatal)", "Not App…
#> $ deaths <dbl> NA, 0, 0, 0, 0, 0, 0, 1, 1, NA, 0, 0, 1, 1, 1, 0, 1, 1, 1, 1, 1, 1, 1, 0, 1,…
#> $ deformed <chr> NA, "7", "7", "7", "7", "7", "Disabling Damage", "Disabling Damage", "Disabl…
#> $ devmotor <dbl> 3, NA, NA, NA, NA, NA, NA, NA, NA, 0, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
#> $ devtype <dbl> 3, NA, NA, NA, NA, NA, NA, NA, NA, 0, NA, NA, NA, NA, NA, NA, NA, NA, NA, NA…
#> $ doa <chr> "Died at Scene", "Not Applicable", "Not Applicable", "Not Applicable", "Not …
#> $ dr_drink <chr> NA, "No", "No", "No", "No", "No", "No", "No", "No", NA, "No", "No", "Yes", "…
#> $ dr_hgt <chr> NA, "69", "69", "69", "69", "69", "62", "63", "63", NA, "67", "69", "67", "7…
#> $ dr_pres <chr> NA, "Yes", "Yes", "Yes", "Yes", "Yes", "Yes", "Yes", "Yes", NA, "Yes", "Yes"…
#> $ dr_wgt <chr> NA, "130 lbs.", "130 lbs.", "130 lbs.", "130 lbs.", "130 lbs.", "140 lbs.", …
#> $ dr_zip <chr> NA, "23893", "23893", "23893", "23893", "23893", "23024", "22551", "22551", …
#> $ drinking <chr> "Reported as Unknown", "No (Alcohol Not Involved)", "Not Reported", "Not Rep…
#> $ drugs <chr> "Reported as Unknown", "No (drugs not involved)", "Not Reported", "Not Repor…
#> $ dstatus <chr> "Test Given", "Test Not Given", "Test Not Given", "Test Not Given", "Test No…
#> $ ej_path <chr> "Ejection Path Not Applicable", "Ejection Path Not Applicable", "Ejection Pa…
#> $ ejection <chr> "Not Applicable", "Not Ejected", "Not Reported", "Not Reported", "Not Report…
#> $ emer_use <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ extricat <chr> "Not Extricated or Not Applicable", "Not Extricated or Not Applicable", "Not…
#> $ fatals <dbl> 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1…
#> $ fire_exp <chr> NA, "No or Not Reported", "No or Not Reported", "No or Not Reported", "No or…
#> $ first_mo <chr> NA, "November", "November", "November", "November", "November", "No Record",…
#> $ first_yr <chr> NA, "2021", "2021", "2021", "2021", "2021", "No Record", "No Record", "No Re…
#> $ func_sys <chr> "Principal Arterial - Other", "Principal Arterial - Other", "Principal Arter…
#> $ gvwr_from <chr> NA, "Class 2: 6,001 - 10,000 lbs. (2,722 - 4,536 kg)", "Class 2: 6,001 - 10,…
#> $ gvwr_to <chr> NA, "Class 2: 6,001 - 10,000 lbs. (2,722 - 4,536 kg)", "Class 2: 6,001 - 10,…
#> $ harm_ev <chr> "Pedalcyclist", "Pedalcyclist", "Pedalcyclist", "Pedalcyclist", "Pedalcyclis…
#> $ haz_cno <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ haz_id <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ haz_inv <chr> NA, "No", "No", "No", "No", "No", "No", "No", "No", NA, "No", "No", "No", "N…
#> $ haz_plac <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ haz_rel <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ helm_mis <chr> "Not a Motor Vehicle Occupant", "None Used/Not Applicable", "None Used/Not A…
#> $ helm_use <chr> "Not a Motor Vehicle Occupant", "Not Applicable", "Not Applicable", "Not App…
#> $ hispanic <chr> "Non-Hispanic", "Not A Fatality (not Applicable)", "Not A Fatality (not Appl…
#> $ hit_run <chr> NA, "No", "No", "No", "No", "No", "No", "No", "No", NA, "No", "No", "No", "N…
#> $ hosp_hr <chr> "Not Applicable (Not Transported)", "Not Applicable (Not Transported)", "Not…
#> $ hosp_mn <chr> "Not Applicable (Not Transported)", "Not Applicable (Not Transported)", "Not…
#> $ hospital <chr> "Not Transported for Treatment", "Not Transported for Treatment", "Not Trans…
#> $ hour <chr> "4:00pm-4:59pm", "4:00pm-4:59pm", "4:00pm-4:59pm", "4:00pm-4:59pm", "4:00pm-…
#> $ icfinalbody <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ impact1 <chr> NA, "12 Clock Point", "12 Clock Point", "12 Clock Point", "12 Clock Point", …
#> $ inj_sev <chr> "Fatal Injury (K)", "No Apparent Injury (O)", "No Apparent Injury (O)", "No …
#> $ j_knife <chr> NA, "Not an Articulated Vehicle", "Not an Articulated Vehicle", "Not an Arti…
#> $ l_compl <chr> NA, "Valid license for this class vehicle", "Valid license for this class ve…
#> $ l_endors <chr> NA, "No Endorsements required for this vehicle", "No Endorsements required f…
#> $ l_restri <chr> NA, "Restrictions, Compliance Unknown", "Restrictions, Compliance Unknown", …
#> $ l_state <chr> NA, "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", …
#> $ l_status <chr> NA, "Valid", "Valid", "Valid", "Valid", "Valid", "Valid", "Valid", "Valid", …
#> $ l_type <chr> NA, "Full Driver License", "Full Driver License", "Full Driver License", "Fu…
#> $ lag_hrs <chr> "0", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unkn…
#> $ lag_mins <chr> "6", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unkn…
#> $ last_mo <chr> NA, "November", "November", "November", "November", "November", "No Record",…
#> $ last_yr <chr> NA, "2021", "2021", "2021", "2021", "2021", "No Record", "No Record", "No Re…
#> $ lgt_cond <chr> "Daylight", "Daylight", "Daylight", "Daylight", "Daylight", "Daylight", "Day…
#> $ location <chr> "At Intersection - Not In Crosswalk", "Occupant of a Motor Vehicle", "Occupa…
#> $ m_harm <chr> NA, "Pedalcyclist", "Pedalcyclist", "Pedalcyclist", "Pedalcyclist", "Pedalcy…
#> $ mak_mod <chr> NA, "GMC Fullsize Jimmy/Yukon", "GMC Fullsize Jimmy/Yukon", "GMC Fullsize Ji…
#> $ make <chr> NA, "GMC", "GMC", "GMC", "GMC", "GMC", "Chevrolet", "Ford", "Ford", NA, "Int…
#> $ man_coll <chr> "The First Harmful Event was Not a Collision with a Motor Vehicle in Transpo…
#> $ mcarr_i1 <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ mcarr_i2 <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ mcarr_id <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ milept <chr> "3586", "3586", "3586", "3586", "3586", "3586", "370", "370", "370", "101", …
#> $ minute <chr> "22", "22", "22", "22", "22", "22", "55", "55", "55", "35", "35", "35", "11"…
#> $ mod_year <chr> NA, "2016", "2016", "2016", "2016", "2016", "2019", "2008", "2008", NA, "201…
#> $ model <dbl> NA, 421, 421, 421, 421, 421, 422, 481, 481, NA, 881, 37, 43, 404, 32, 881, 4…
#> $ month <chr> "January", "January", "January", "January", "January", "January", "January",…
#> $ motdir <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not Applicable", NA, NA…
#> $ motman <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not Applicable", NA, NA…
#> $ nhs <chr> "This section IS ON the NHS", "This section IS ON the NHS", "This section IS…
#> $ nmhelmet <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ nmlight <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ nmothpre <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ nmothpro <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ nmpropad <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ nmrefclo <chr> "Not Reported", NA, NA, NA, NA, NA, NA, NA, NA, "Not Reported", NA, NA, NA, …
#> $ not_hour <chr> "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown", "Unknown",…
#> $ not_min <chr> "Unknown if Notified", "Unknown if Notified", "Unknown if Notified", "Unknow…
#> $ numoccs <chr> NA, "05", "05", "05", "05", "05", "01", "02", "02", NA, "01", "01", "01", "0…
#> $ owner <chr> NA, "Driver (in this crash) Not Registered Owner (Other Private Owner Listed…
#> $ p_crash1 <chr> NA, "Negotiating a Curve", "Negotiating a Curve", "Negotiating a Curve", "Ne…
#> $ p_crash2 <chr> NA, "Pedalcyclist or other non-motorist in road", "Pedalcyclist or other non…
#> $ p_crash3 <chr> NA, "Unknown/Not Reported", "Unknown/Not Reported", "Unknown/Not Reported", …
#> $ pbcwalk <chr> "None Noted", NA, NA, NA, NA, NA, NA, NA, NA, "None Noted", NA, NA, NA, NA, …
#> $ pbswalk <chr> "None Noted", NA, NA, NA, NA, NA, NA, NA, NA, "None Noted", NA, NA, NA, NA, …
#> $ pbszone <chr> "None Noted", NA, NA, NA, NA, NA, NA, NA, NA, "None Noted", NA, NA, NA, NA, …
#> $ pcrash4 <chr> NA, "Tracking", "Tracking", "Tracking", "Tracking", "Tracking", "Tracking", …
#> $ pcrash5 <chr> NA, "Stayed in original travel lane", "Stayed in original travel lane", "Sta…
#> $ pedcgp <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Unusual Circumstances",…
#> $ pedctype <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Disabled Vehicle-Relate…
#> $ peddir <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not Applicable", NA, NA…
#> $ pedleg <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not Applicable", NA, NA…
#> $ pedloc <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not At Intersection", N…
#> $ pedpos <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Other/Unknown", NA, NA,…
#> $ peds <dbl> 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
#> $ pedsnr <chr> "Not a Pedestrian", NA, NA, NA, NA, NA, NA, NA, NA, "Not Applicable", NA, NA…
#> $ per_typ <chr> "Bicyclist", "Driver of a Motor Vehicle In-Transport", "Passenger of a Motor…
#> $ permvit <dbl> 5, 5, 5, 5, 5, 5, 3, 3, 3, 2, 2, 2, 1, 1, 2, 2, 1, 1, 1, 2, 2, 3, 3, 3, 3, 3…
#> $ pernotmvit <dbl> 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
#> $ persons <dbl> 5, 5, 5, 5, 5, 5, 3, 3, 3, 2, 2, 2, 1, 1, 2, 2, 1, 1, 1, 2, 2, 3, 3, 3, 3, 3…
#> $ prev_acc <chr> NA, "1", "1", "1", "1", "1", "None", "None", "None", NA, "3", "None", "None"…
#> $ prev_dwi <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "Non…
#> $ prev_oth <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "4",…
#> $ prev_spd <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "2",…
#> $ prev_sus1 <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "Non…
#> $ prev_sus2 <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "Non…
#> $ prev_sus3 <chr> NA, "None", "None", "None", "None", "None", "None", "None", "None", NA, "4",…
#> $ pvh_invl <dbl> 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
#> $ rail <chr> "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", "Not…
#> $ rd_owner <chr> "State Highway Agency", "State Highway Agency", "State Highway Agency", "Sta…
#> $ reg_stat <chr> NA, "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", …
#> $ rel_road <chr> "On Roadway", "On Roadway", "On Roadway", "On Roadway", "On Roadway", "On Ro…
#> $ reljct1 <chr> "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "Yes…
#> $ reljct2 <chr> "Intersection-Related", "Intersection-Related", "Intersection-Related", "Int…
#> $ rest_mis <chr> "Not a Motor Vehicle Occupant", "No Indication of Mis-Use", "None Used/Not A…
#> $ rest_use <chr> "Not a Motor Vehicle Occupant", "Shoulder and Lap Belt Used", "Not Reported"…
#> $ rolinloc <chr> NA, "No Rollover", "No Rollover", "No Rollover", "No Rollover", "No Rollover…
#> $ rollover <chr> NA, "No Rollover", "No Rollover", "No Rollover", "No Rollover", "No Rollover…
#> $ route <chr> "U.S. Highway", "U.S. Highway", "U.S. Highway", "U.S. Highway", "U.S. Highwa…
#> $ rur_urb <chr> "Rural", "Rural", "Rural", "Rural", "Rural", "Rural", "Rural", "Rural", "Rur…
#> $ sch_bus <chr> "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "No", "No"…
#> $ seat_pos <chr> "Not a Motor Vehicle Occupant", "Front Seat, Left Side", "Not Reported", "No…
#> $ sex <chr> "Male", "Male", "Not Reported", "Not Reported", "Not Reported", "Not Reporte…
#> $ sp_jur <chr> "No Special Jurisdiction", "No Special Jurisdiction", "No Special Jurisdicti…
#> $ spec_use <chr> NA, "No Special Use", "No Special Use", "No Special Use", "No Special Use", …
#> $ speedrel <chr> NA, "Yes, Exceeded Speed Limit", "Yes, Exceeded Speed Limit", "Yes, Exceeded…
#> $ statename <chr> "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Virginia", "Vir…
#> $ str_veh <dbl> 1, 0, 0, 0, 0, 0, 0, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0…
#> $ tow_veh <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ towed <chr> NA, "Not Reported", "Not Reported", "Not Reported", "Not Reported", "Not Rep…
#> $ trav_sp <chr> NA, "060 MPH", "060 MPH", "060 MPH", "060 MPH", "060 MPH", "065 MPH", "Stopp…
#> $ trlr1gvwr <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ trlr1vin <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ trlr2gvwr <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ trlr2vin <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ trlr3gvwr <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ trlr3vin <chr> NA, "No Trailing Units", "No Trailing Units", "No Trailing Units", "No Trail…
#> $ tway_id <chr> "US-58", "US-58", "US-58", "US-58", "US-58", "US-58", "SR-208/LAKE ANNA PKWY…
#> $ tway_id2 <chr> "JEFFERSON ST", "JEFFERSON ST", "JEFFERSON ST", "JEFFERSON ST", "JEFFERSON S…
#> $ typ_int <chr> "Four-Way Intersection", "Four-Way Intersection", "Four-Way Intersection", "…
#> $ underoverride <dbl> NA, 7, 7, 7, 7, 7, 0, 0, 0, NA, 0, 0, 7, 7, 0, 0, 7, 7, 7, 7, 7, 0, 0, 0, 0,…
#> $ unittype <chr> NA, "Motor Vehicle In-Transport (Inside or Outside the Trafficway)", "Motor …
#> $ v_config <chr> NA, "Not Applicable", "Not Applicable", "Not Applicable", "Not Applicable", …
#> $ valign <chr> NA, "Curve - Right", "Curve - Right", "Curve - Right", "Curve - Right", "Cur…
#> $ ve_forms <dbl> 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 1, 1, 2, 2, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2…
#> $ ve_total <dbl> 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 1, 1, 2, 2, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2…
#> $ vin <chr> NA, "1GKS2BKC8GR3", "1GKS2BKC8GR3", "1GKS2BKC8GR3", "1GKS2BKC8GR3", "1GKS2BK…
#> $ vnum_lan <chr> NA, "Five lanes", "Five lanes", "Five lanes", "Five lanes", "Five lanes", "T…
#> $ vpavetyp <chr> NA, "Blacktop, Bituminous, or Asphalt", "Blacktop, Bituminous, or Asphalt", …
#> $ vpicbodyclass <chr> NA, "Sport Utility Vehicle (SUV)/Multi-Purpose Vehicle (MPV)", "Sport Utilit…
#> $ vpicmake <chr> NA, "GMC", "GMC", "GMC", "GMC", "GMC", "Chevrolet", "Ford", "Ford", NA, "Int…
#> $ vpicmodel <chr> NA, "Yukon", "Yukon", "Yukon", "Yukon", "Yukon", "Suburban", "F-150", "F-150…
#> $ vprofile <chr> NA, "Level", "Level", "Level", "Level", "Level", "Level", "Level", "Level", …
#> $ vspd_lim <chr> NA, "55 MPH", "55 MPH", "55 MPH", "55 MPH", "55 MPH", "55 MPH", "55 MPH", "5…
#> $ vsurcond <chr> NA, "Dry", "Dry", "Dry", "Dry", "Dry", "Wet", "Wet", "Wet", NA, "Dry", "Dry"…
#> $ vtcont_f <chr> NA, "No Controls", "No Controls", "No Controls", "No Controls", "No Controls…
#> $ vtrafcon <chr> NA, "No Controls", "No Controls", "No Controls", "No Controls", "No Controls…
#> $ vtrafway <chr> NA, "Two-Way, Not Divided With a Continuous Left-Turn Lane", "Two-Way, Not D…
#> $ work_inj <chr> "No", "Not Applicable (not a fatality)", "Not Applicable (not a fatality)", …
#> $ wrk_zone <chr> "None", "None", "None", "None", "None", "None", "None", "None", "None", "Non…
The multi_
tibbles contain those variables for which there may be a varying number of values for any entity (e.g., driver impairments, vehicle events, weather conditions at time of crash). Each tibble has the requisite data elements corresponding to the entity: multi_acc
includes st_case
and year
, multi_veh
adds veh_no
(vehicle number), and multi_per
adds per_no
(person number).
The top name-value pairs of each tibble are shown below.
myFARS$multi_acc %>% filter(!is.na(value)) %>% group_by(name, value) %>% summarize(n=n(), .groups = "drop") %>% arrange(desc(n)) %>% slice(1:10) %>% select(name, value, n) %>% knitr::kable(format = "html")
name | value | n |
---|---|---|
weather | Rain | 96 |
crashrf | Motor Vehicle struck by falling cargo,or something that came loose from or something that was set in motion by a vehicle | 24 |
crashrf | Indication of a Stalled/Disabled Vehicle | 12 |
crashrf | Police Pursuit Involved | 11 |
crashrf | Recent/Previous Crash scene Nearby | 7 |
weather | Fog, Smog, Smoke | 6 |
weather | Snow | 4 |
crashrf | Regular Congestion | 3 |
crashrf | Non-occupant struck by falling cargo, or something that came loose from, or something that was set in motion by a vehicle | 2 |
crashrf | 10 | 1 |
myFARS$multi_veh %>% filter(!is.na(value)) %>% group_by(name, value) %>% summarize(n=n(), .groups = "drop") %>% arrange(desc(n)) %>% slice(1:10) %>% select(name, value, n) %>% knitr::kable(format = "html")
name | value | n |
---|---|---|
vehiclecc | None Noted | 1352 |
vision | No Obstruction Noted | 1264 |
damage | 12 Clock Value | 1093 |
drdistract | Not Distracted | 1025 |
damage | 11 Clock Value | 917 |
damage | 1 Clock Value | 884 |
drimpair | None/Apparently Normal | 736 |
damage | 10 Clock Value | 699 |
damage | 2 Clock Value | 635 |
damage | 9 Clock Value | 560 |
myFARS$multi_per %>% filter(!is.na(value)) %>% group_by(name, value) %>% summarize(n=n(), .groups = "drop") %>% arrange(desc(n)) %>% slice(1:10) %>% select(name, value, n) %>% knitr::kable(format = "html")
name | value | n |
---|---|---|
order | 1 | 2107 |
multrace | No | 2096 |
drugspec | Whole Blood | 1292 |
drugres | Test Not Given | 1236 |
drugspec | Test Not Given | 1236 |
race | Not a Fatality (not Applicable) | 1099 |
race | White | 597 |
drugres | Tested, No Drugs Found/Negative | 579 |
drugspec | Vitreous | 353 |
race | Black or African American | 254 |
The events
tibble provides a sequence of numbered events for each vehicle in each crash. See the vignette for more information.
head(myFARS$events, 10) %>% knitr::kable(format="html")
state | st_case | veh_no | aoi | soe | veventnum | year |
---|---|---|---|---|---|---|
Virginia | 510001 | 1 | 12 Clock Point | Pedalcyclist | 1 | 2022 |
Virginia | 510002 | 1 | 12 Clock Point | Motor Vehicle In-Transport | 1 | 2022 |
Virginia | 510002 | 2 | 6 Clock Point | Motor Vehicle In-Transport | 1 | 2022 |
Virginia | 510003 | 1 | Non-Harmful Event | Ran Off Roadway - Right | 1 | 2022 |
Virginia | 510003 | 1 | 9 Clock Point | Motor Vehicle In-Transport | 2 | 2022 |
Virginia | 510003 | 1 | 5 Clock Point | Motor Vehicle In-Transport | 3 | 2022 |
Virginia | 510003 | 1 | Non-Collision | Rollover/Overturn | 4 | 2022 |
Virginia | 510003 | 2 | Non-Harmful Event | Ran Off Roadway - Right | 1 | 2022 |
Virginia | 510003 | 2 | 11 Clock Point | Motor Vehicle In-Transport | 2 | 2022 |
Virginia | 510003 | 2 | 12 Clock Point | Pedestrian | 3 | 2022 |
The codebook
tibble provides a searchable codebook for the data, useful if you know what concept you’re looking for but not the variable that describes it. The rfars
package includes a codebook for FARS and GESCRSS (rfars::fars_codebook
and rfars::gescrss_codebook
). These tables span 2011-2022 whereas the codebook
object returned from get_fars()
and get_gescrss()
only include the specified years
. See the vignette for more information.
Helpful Links
- National Highway Traffic Safety Administration (NHTSA)
- Fatality Analysis Reporting System (FARS)
- Fatality and Injury Reporting System Tool (FIRST)
- FARS Analytical User’s Manual
- General Estimates System (GES)
- Crash Report Sampling System (CRSS)
- CRSS Analytical User’s Manual
- NCSA and Other Data Sources
- NHTSA FTP Site.