Multiple systems estimation, or MSE, is a family of techniques for statistical inference. MSE uses the overlaps between several incomplete lists of human rights violations to determine the total number of violations. In this blogpost, and four more to follow, I’ll answer both conceptual and practical questions about this important method. (In posts to follow, questions that refer to specific statistical procedures or debates will be marked, "In depth.") (more…)
Issues surrounding policing in the United States are at the forefront of our national attention. Among these is the use of “predictive policing,” which is the application of statistical or machine learning models to police data, with the goal of predicting where or by whom crime will be committed in the future. Today Significance magazine published an article on this topic that I co-authored with William Isaac. Significance has kindly made this article open access (free!) for all of October. In the article we demonstrate the mechanism by which the use of predictive policing software may amplify the biases that already pervade our criminal ...
HRDAG has sampled and analyzed documents at Guatemala's AHPN and has testified against war criminals based on that analysis.
Solving for X documents Patrick's team as they travel to Guatemala, Kosovo, and Liberia, helping human rights supporters apply sophisticated computer analysis to human rights events.
Patrick Ball, César Rodríguez and Valentina Rozo (2018). Asesinatos de líderes sociales en Colombia en 2016–2017: una estimación del universo. Dejusticia and Human Rights Data Analysis Group. August 2018. © 2018 HRDAG. Creative Commons.
Patrick Ball, César Rodríguez and Valentina Rozo (2018). Asesinatos de líderes sociales en Colombia en 2016–2017: una estimación del universo. Dejusticia and Human Rights Data Analysis Group. August 2018. © 2018 HRDAG. Creative Commons.
Today Guatemala’s former national police chief Colonel Héctor Rafael Bol de la Cruz was convicted and sentenced to 40 years in prison for his role in the 1984 kidnapping and disappearance of 27-year-old student union leader Fernando Garcia, who was last seen when officers detained him outside his home. Along with Bol de la Cruz, former senior police officer Jorge Gomez was also tried; he received a sentence of 40 years in prison. That verdict comes in part because of testimony this month by HRDAG’s Patrick Ball, who served as an expert witness and presented data analysis done with colleague Daniel Guzmán to assess the flow of thousands of ...
Using multiple system estimation, we estimate the total population of social movement leaders killed in Colombia during 2018.
Patrick Ball and Frances Harrison (2018). How many people disappeared on 17–19 May 2009 in Sri Lanka? Human Rights Data Analysis Group. 12 December 2018.© 2018 HRDAG. Creative Commons.
Patrick Ball and Frances Harrison (2018). How many people disappeared on 17–19 May 2009 in Sri Lanka? Human Rights Data Analysis Group. 12 December 2018.© 2018 HRDAG. Creative Commons.
HRDAG is identifying and interpreting the best science we can find to shed light on the global crisis brought on by the novel coronavirus, about which we still know so little. Right now, most of the data on the virus SARS-CoV-2 and Covid-19, the condition caused by the virus, are incomplete and unrepresentative, which means that there is a great deal of uncertainty. But making sense of imperfect datasets is what we do. HRDAG is contributing to a better understanding with explainers, essays, and original research, and we are highlighting trustworthy resources for those who want to dig deeper.
Papers and articles by HRDAG
.ugb-bbeb275 .ugb-blo...
Kristian Lum: “The historical over-policing of minority communities has led to a disproportionate number of crimes being recorded by the police in those locations. Historical over-policing is then passed through the algorithm to justify the over-policing of those communities.”
If we could glean key missing information from those fields, we would be able to use more records.
Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab - Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents." The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.
Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab – Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents.” The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.
We work around the world
Here’s more information about How We Choose Projects.
I got an email from my superheroic PhD adviser in June 2006: Would I be interested in relocating to Palo Alto for six months in order to work with Patrick Ball at the Human Rights Data Analysis Group? (She'd gotten a grant and would cover my stipend.) Since I'd spent the last several months in New Haven wrestling ineffectually with giant, brain-melting methodological problems, I said yes immediately.
The plan with my adviser was simple: I'd digitize the ancient, multiply-photocopied pages of data from the United Nations Truth Commission for El Salvador, combine them with two other datasets, match across all the records, and produce reliable ...
Bailey’s analysis stemmed from data we had access to as part of our ongoing collaboration with the Invisible Institute.
Sloppy recordkeeping by Chicago police has compromised missing persons cases. HRDAG is working with Pulitzer Prize-winning Invisible Institute to shed light on these stories.