Diagnose, Visualize, and Aggregate Event Report Level Data.
eventreport 
The goal of the eventreport package is to diagnose, visualize, and aggregate event report level data to the event level. Users provide an event report level dataset, specify their aggregation rules, and the package produces a dataset aggregated at the event level. The package also allows the user to diagnose how sensitive their event report level data is to aggregation choices. In addition, the package includes the Modes and Agents of Election-Related Violence in Côte d’Ivoire and Kenya (MAVERICK) dataset, an event report level dataset that records all documented instances of electoral violence from the first multiparty election to 2022 in Côte d’Ivoire (1995-2022) and Kenya (1992-2022).
When using the data, please refer to the following article and codebook:
Sebastian van Baalen & Kristine Höglund (2026) Introducing the Modes and Agents of Election-Related Violence in Côte d’Ivoire (MAVERICK) datset. Journal of Peace Research, online first.
Sebastian van Baalen, David Edberg Landeström, Tor Richardson-Golinski & Kristine Höglund (2025) The MAVERICK Dataset Codebook Version 1.0. Uppsala: Department of Peace and Conflict Research, Uppsala University.
For methodological details, and when using the package, please refer to the following working paper:
Sebastian van Baalen & Kristine Höglund (2025) Trials and Triangulations: Analyzing Aggregation Sensitivity in Event Data on Political Violence. Uppsala: Department of Peace and Conflict Research, Uppsala University.
Installation
Once on CRAN, you can install the released version of eventreport from CRAN with:
#install.packages("eventreport")
You can also install the development version of eventreport from GitHub with:
# install.packages("devtools")
devtools::install_github("sebastianvanbaalen/eventreport")
What is event report level data?
Event report level data refers to data where each observation is an event that takes place on a single day and in a particular location as reported in a single source. The report level means that multiple reports about the same event constitute separate observations. For example, if both BBC and Reuters report about a violent post-election demonstration, the demonstration is the event, whereas the BBC and Reuters reports constitute the event reports. For a solid primer on event report level data, see this introduction to the method by Nils B Weidmann and Espen Geelmuyden Rød and this in-depth exploration of aggregation sensitivity by Scott J Cook and Nils B Weidmann.
The table below provides an example of event report level data from the MAVERICK dataset, and lists six unique reports about a single electoral violence event.
| event_id | city | location | actor1 | actor1_type | deaths_best | source |
|---|---|---|---|---|---|---|
| CIV-0004 | Abidjan | Abobo | Unknown security force (Côte d’Ivoire) | Security forces | 5 | Amnesty International (All Africa) (2011-01-12) Fresh Violence Erupts as Armed Groups Clash |
| CIV-0004 | Abidjan | Abobo | Unknown security force (Côte d’Ivoire) | Security forces | 1 | LEJD (2011-01-12) Nouveaux affrontements en Côte d’Ivoire |
| CIV-0004 | Abidjan | Unknown security force (Côte d’Ivoire) | Security forces | 5 | Reuters (2011-01-12) More die in Cote d’Ivoire violence | |
| CIV-0004 | Abidjan | Abobo | Police (Côte d’Ivoire) | Security forces | 6 | Xinhua News Agency (2011-01-12) Côte d’Ivoire : au total six policiers tués dans un quartier pro Ouattara à Abidjan |
| CIV-0004 | Abidjan | Police (Côte d’Ivoire) | Security forces | 6 | Al Jazeera (2011-01-13) Tensions persist in Cote d’Ivoire | |
| CIV-0004 | Abidjan | Abobo | Unknown actor (Côte d’Ivoire) | 7 | The Times (2011-01-15) Coup fears as death toll rises |
Why the eventreport package?
R already contains some functions that can be used for aggregating event report level data to the event level, such as the mean and median base R calls. However, as we detail in the package introduction article, the aggregation of event reports often demands additional functionalities, such as the use of tie-break rules or information contained in meta variables.
The eventreport package adds several functionalities not contained in existing software. Among those benefits, the package:
Handles different variable classes:
eventreporthandles a range of different variables, including character, date, numeric, and binary numeric variables. This feature makes the package ideal for working with event report datasets that include different variable classes.Enables tie-breaking rules: many vectors are multi-modal, meaning that simple functions for identifying the most frequent values will yield multiple results.
eventreporttherefore enables users to specify up to two tie-breaking rules that help adjudicate between multiple modes variables.Integrates precision scores: sometimes researchers are interested in recording the most precise value, such as more precise location estimates or more precise actor names.
eventreportallows users to specify precision score variables that help prioritize what values to select when the values cannot be ranked.Provides simple functions: aggregating event report level data is a complex coding project.
eventreportmakes this procedure more straightforward by providing simple functions that carry out complex tasks. All functions were developed in the context of a concrete event report level data collection effort, and are therefore both needs-based and well-tested.Allows easy customization: the combination of simple functions and several convenience functions allows users to stipulate a range of complex aggregation rule sets with minimal coding. Moreover, because
eventreportistidyversecompatible, users can integrate the package functions in a tidy workflow.
Examples
We provide a host of examples in our vignette and in the MAVERICK dataset codebook. Below are three basic examples of the functionalities in the eventreport package.
For aggregation diagnostics, users can use mean_dscore to visualize the mean normalized divergence score per variable (the mean number of divergent values per event divided by the total number of unique values in each variable). This diagnostic helps users assess what and to what extent variables are sensitive to aggregation choices. Simply run:
mean_dscore(
small_maverick_event_report,
group_var = "event_id",
variables = c("country", "actor1", "deaths_best", "injuries_best"),
normalize = TRUE,
plot = TRUE
)

For aggregating data, users can use calc_mode to find the mode value using two different tie-breaking rules:
calc_mode(
c("Sweden", "Sweden", "Denmark", "Denmark"),
tie_break = c(1, 1, 1, 1),
second_tie_break = c(1, 4, 1, 1)
)
#> [1] "Sweden"
For aggregating entire dataframes, users can use aggregateData to stipulate a set of aggregation rules and aggregate the full dataset (here presented using the tidytable package):
output <- small_maverick_event_report %>%
aggregateData(
group_var = "event_id",
find_mode = "city"
) %>%
utils::head(10)
tinytable::tt(output)
| event_id | city | number_of_sources | unit_of_analysis |
|---|---|---|---|
| CIV-0001 | Duékoué | 5 | Event |
| CIV-0002 | 2 | Event | |
| CIV-0003 | Abidjan | 12 | Event |
| CIV-0004 | Abidjan | 6 | Event |
| CIV-0008 | Man | 1 | Event |
| CIV-0009 | Vavoua | 2 | Event |
| CIV-0010 | Abidjan | 1 | Event |
| CIV-0011 | Yamoussoukro | 1 | Event |
| CIV-0012 | Gagnoa | 4 | Event |
| CIV-0013 | Daloa | 4 | Event. |