Kristian Lum spoke about "Understanding the Context and Consequences of Pre-Trial Detention" at the Conference on Fairness, Accountability, and Transparency (FAT*).
Ten years ago, in July 2005, human rights officers stumbled upon a nondescript warehouse in a commercial zone of Guatemala City and changed history. They had discovered an archive–its existence kept secret–belonging to the Guatemalan National Police, whose officers committed human rights atrocities on behalf of the government during the civil war.
Inside the building was the bureaucratic detritus typical of a large government agency: 80 million pages detailing shifts worked, tasks assigned, assignments fulfilled, workers’ whereabouts, and who was supervising whom. The documents, which were found stacked on dirty floors, shoved into bags, ...
The Historic Archive of the Guatemalan National Police (hereafter the Archive) was discovered, quite by accident, in July 2005. Researchers immediately recognized both the importance and the fragility of the Archive's contents. As a result, in early 2006 the Archive team invited Patrick to evaluate the documents and help them answer a seemingly simple question: How can we learn about the contents of the Archive in a shorter period of time than is needed to systematically examine each individual document?
After inspecting the Archive, Patrick designed a multi-stage random sample of documents. In May 2006, Tamy Guberek, Daniel Guzmán, and ...
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
Trina Reynolds-Tyler's internship at HRDAG helped her use data science to find patterns in state-sanctioned violence.
James Johndrow, Patrick Ball, Maria Gargiulo, and Kristian Lum. (2020). Estimating the Number of SARS-CoV-2 Infections and the Impact of Mitigation Policies in the United States. Harvard Data Science Review. 24 November, 2020. © The Authors, 2020, CC BY 4.0. https://doi.org/10.1162/99608f92.7679a1ed
James Johndrow, Patrick Ball, Maria Gargiulo, and Kristian Lum. (2020). Estimating the Number of SARS-CoV-2 Infections and the Impact of Mitigation Policies in the United States. Harvard Data Science Review. 24 November, 2020. © The Authors, 2020, CC BY 4.0. https://doi.org/10.1162/99608f92.7679a1ed
I got an email from my superheroic PhD adviser in June 2006: Would I be interested in relocating to Palo Alto for six months in order to work with Patrick Ball at the Human Rights Data Analysis Group? (She'd gotten a grant and would cover my stipend.) Since I'd spent the last several months in New Haven wrestling ineffectually with giant, brain-melting methodological problems, I said yes immediately.
The plan with my adviser was simple: I'd digitize the ancient, multiply-photocopied pages of data from the United Nations Truth Commission for El Salvador, combine them with two other datasets, match across all the records, and produce reliable ...
Working at the Historic Archive of the National Police (AHPN) of Guatemala, there are many skills I learned on the job. My many years of work on the team that studies the recovered documents have been like a custom-made course in how to do quantitative research.
The Archive documents I study are the result of 36 years of creation during civil war (1960 to 1996). Many of these documents are simply administrative—but we are able to use them to understand patterns that occurred during the conflict, to get a sense of what mattered to the National Police and what didn’t. Our quantitative research shows us the Police behavior in broad strokes. ...
What follows is an elaborate criss-crossing of collaborations—retreat is a time to embrace the productivity that comes with being in the same room.
HRDAG analysis shows that the government figures are a gross underestimation of the drug-related killings in the Philippines.
Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.
Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.
HRDAG researchers and analysts at Peru's Truth and Reconciliation Commission (TRC) estimated conflict mortality due to violence using Capture-Recapture methods.
Tarak Shah (2020). How many people are infected with Covid-19? Significance. 09 April 2020. © 2020 The Royal Statistical Society.
Tarak Shah (2020). How many people are infected with Covid-19? Significance. 09 April 2020. © 2020 The Royal Statistical Society.
We aim to produce code that is clear, replicatable across machines and operating systems, and that leaves an easy-to-follow audit trail.
Last week Forensis, the Colombian National Institute of Forensic Medicine’s flagship publication, published the first of our analyses of homicide patterns in Colombia. Authored by HRDAG executive director Patrick Ball and UN colleague Michael Reed Hurtado, “Cuentas y mediciones de la criminalidad y de la violencia” (pages 529-545) explores, as the title suggests, the quality of “truth” contained within crime registries. Citing the problem of partial data, missing data, and inherent design bias, Patrick and Michael write that no register, official or unofficial, can present a true reflection of what has really happened.
This publication...
HRDAG associate Miguel Cruz has an epiphany. All those data he’s drowning in? Each datapoint is a personal tragedy, a story both dark and urgent, and he’s privileged to have access.
Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.
Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.
Romesh Silva and Patrick Ball. “The Demography of Conflict-Related Mortality in Timor-Leste (1974-1999): Empirical Quantitative Measurement of Civilian Killings, Disappearances & Famine-Related Deaths” In Statistical Methods for Human Rights, J. Asher, D. Banks and F. Scheuren, eds., Springer (New York) (2007)