Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.
Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.
Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.
Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.
The Colombian Truth Commission (CEV), the Special Jurisdiction for Peace (JEP), and the Human Rights Data Analysis Group (HRDAG) have worked together to integrate data and calculate statistical estimates of the number of victims of the armed conflict, including homicides, forced disappearances, kidnapping, and the recruitment of child soldiers. Data are available through National Administrative Department of Statistics (DANE), the Truth Commission, and GitHub. Published in 2023.
The Colombian Truth Commission (CEV), the Special Jurisdiction for Peace (JEP), and the Human Rights Data Analysis Group (HRDAG) have worked together to integrate data and calculate statistical estimates of the number of victims of the armed conflict, including homicides, forced disappearances, kidnapping, and the recruitment of child soldiers. Data are available through National Administrative Department of Statistics (DANE), the Truth Commission, and GitHub.
The Sri Lankan army must explain to the families of the disappeared and missing what happened to an estimated 500 Tamils who disappeared in their custody at the war end on/around 18 May 2009, said two international NGOs who have been collating and analysing lists of names.
Sri Lanka has one of the largest numbers in the world of enforced disappearances but these 500 represent the largest number of disappearances all in one place and time in the country. For a detailed account of the process of estimating the 500 please see: “How many people disappeared on 17-19 May 2009 in Sri Lanka?” .
Solving for X documents Patrick's team as they travel to Guatemala, Kosovo, and Liberia, helping human rights supporters apply sophisticated computer analysis to human rights events.
At HRDAG, we worry about what we don't know. Specifically, we worry about how we can use statistical techniques to estimate homicides that are not observed by human rights groups. Based on what we've seen studying many conflicts over the last 25 years, what we don't know is often quite different from what we do know.
The technique we use most often to estimate what we don't know is called "multiple systems estimation." In this medium-technical post, I explain how to organize data and use three R packages to estimate unobserved events.
Click here for Computing Multiple Systems Estimation in R.
Kevin Uhrmacher of the Washington Post prepared a graph that illustrates reported deaths over time, by number of organizations reporting the deaths.
Ten data nerds gathered in a large hilltop beach house to analyze counts of killings from several war-torn countries. The time was January 16-20, 2014, the place was near San Francisco, the agenda was packed, and I was excited to be there.
Having defended my dissertation at Carnegie Mellon University just days before, I had often supposed that my thesis on a generalization of
log-linear models for capture-recapture might serve little other purpose than to fill a line on my curriculum vitae. This perception faded after a mid-2013 discussion with Patrick convinced me that HRDAG's data challenges could easily be the best match to my research ...
Kristian Lum, Chesa Boudin and Megan Price (2020). The impact of overbooking on a pre-trial risk assessment tool. FAT* '20: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. January 2020. Pages 482–491. https://doi.org/10.1145/3351095.3372846 ©ACM, Inc., 2020.
Kristian Lum, Chesa Boudin and Megan Price (2020). The impact of overbooking on a pre-trial risk assessment tool. FAT* ’20: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. January 2020. https://doi.org/10.1145/3351095.3372846 ©ACM, Inc., 2020.
Exploración y análisis de los datas para comprender la realidad. Patrick Ball y Michael Reed Hurtado. 2015. Forensis 16, no. 1 (July): 529-545. © 2015 Instituto Nacional de Medicina Legal y Ciencias Forenses (República de Colombia).
From the Guatemalan military to the South African apartheid police, code cruncher Patrick Ball singles out the perpetrators of political violence.
Today Guatemala’s former national police chief Colonel Héctor Rafael Bol de la Cruz was convicted and sentenced to 40 years in prison for his role in the 1984 kidnapping and disappearance of 27-year-old student union leader Fernando Garcia, who was last seen when officers detained him outside his home. Along with Bol de la Cruz, former senior police officer Jorge Gomez was also tried; he received a sentence of 40 years in prison. That verdict comes in part because of testimony this month by HRDAG’s Patrick Ball, who served as an expert witness and presented data analysis done with colleague Daniel Guzmán to assess the flow of thousands of ...
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
You may contact us via info @ hrdag.org or use this form.
Would you like to receive our newsletter?
Great! Please sign up here.
Find us on Mastodon
Follow HRDAG on Mastodon.
Employment with HRDAG
Please keep in touch by signing up for our newsletters and following us on Twitter @hrdag or Mastodon.
If you do not see a job listed here, please do not send your CV or résumé, as we do not file or save them, and we will only have to send you a sad “no thank you” letter.
Volunteering with HRDAG
Are you interested in volunteering your time to the Human Rights Data Analysis Group? We’re very flattered—but at this time we’re ...
The primer addresses what pretrial risk assessment is and what the research supports.
We have accomplished so much in the last 10 years at the Historical Archive of the National Police. And yet, despite the efforts, dedication, and commitment of each person who since 2006 has worked in the AHPN, we still can not say “mission accomplished.”
In 10 years the environment at the Archive has changed so much and become so full of life. Where the building once sheltered unknown stories, over time some of those stories have been revealed. But Guatemala has a long way to go in letting the world get to know more deeply about the secrets within the documents stored there.
Guatemalans and the rest of the world have a very important ...
In our work, we merge many databases to figure out how many people have been killed in violent conflict. Merging is a lot harder than you might think.
Many of the database records refer to the same people--the records are duplicated. We want to identify and link all the records that refer to the same victims so that each victim is counted only once, and so that we can use the structure of overlapping records to do multiple systems estimation.
Merging records that refer to the same person is called entity resolution, database deduplication, or record linkage. For definitive overviews of the field, see Scheuren, Herzog, and Winkler, Data Quality ...