The modular nature of the workflow and use of Git allowed us to work on different parts of the project from across the country.
From the article: “As we seek to advance the responsible use of data for racial injustice, we encourage individuals and organizations to support and build upon efforts already underway.” HRDAG is listed in the Data Driven Activism and Advocacy category.
HRDAG contributes to the project by helping to classify, filter, extract, and standardize the records so that they can be useful in the database.
“Patrick Ball, HRDAG’s Director of Research and the statistician behind the code, explained that the Random Forest classifier was able to predict with 100% accuracy which counties that would go on to have mass graves found in them in 2014 by using the model against data from 2013. The model also predicted the counties that did not have mass hidden graves found in them, but that show a high likelihood of the possibility. This prediction aspect of the model is the part that holds the most potential for future research.”
HRDAG’s funding comes from private, international donors: the John D. and Catherine T. MacArthur Foundation, Open Society Foundations, an anonymous U.S.-based private foundation, Ford Foundation, The National Endowment for Democracy and individual donors. This funding supports both specific projects, as well as our scientific work generally in human rights data analysis.
For the entirety of its existence, HRDAG has been a project of non-profit organizations, first at the American Association for the Advancement of Science (AAAS), and then at Benetech, a non-profit Silicon Valley technology company. In February 2013, HRDAG became ...
Today Guatemala’s former national police chief Colonel Héctor Rafael Bol de la Cruz was convicted and sentenced to 40 years in prison for his role in the 1984 kidnapping and disappearance of 27-year-old student union leader Fernando Garcia, who was last seen when officers detained him outside his home. Along with Bol de la Cruz, former senior police officer Jorge Gomez was also tried; he received a sentence of 40 years in prison. That verdict comes in part because of testimony this month by HRDAG’s Patrick Ball, who served as an expert witness and presented data analysis done with colleague Daniel Guzmán to assess the flow of thousands of ...
Three months after the announcement of the momentous verdict finding former Chadian president Hissène Habré guilty of crimes against humanity, the presiding judges have released the full, written 681-page judgment of the court. Testimony given by HRDAG’s director of research, Patrick Ball, is mentioned at three points in the verdict.
The judges included in their written judgment the HRDAG analysis that the mortality rate in Habré prisons was staggeringly high—much higher than the mortality rate among the population as a whole. Here’s an excerpt from the judgment, page 358 (translated by Google):
The statistical expert, Patrick Ball, ...
Megan Price, Jeff Klingner, and Patrick Ball (2013). The Benetech Human Rights Program, commissioned by the United Nations Office of the High Commissioner for Human Rights (OHCHR). January 2, 2013. © 2013 HRDAG. Creative Commons BY-NC-SA.
The Historic Archive of the Guatemalan National Police (hereafter the Archive) was discovered, quite by accident, in July 2005. Researchers immediately recognized both the importance and the fragility of the Archive's contents. As a result, in early 2006 the Archive team invited Patrick to evaluate the documents and help them answer a seemingly simple question: How can we learn about the contents of the Archive in a shorter period of time than is needed to systematically examine each individual document?
After inspecting the Archive, Patrick designed a multi-stage random sample of documents. In May 2006, Tamy Guberek, Daniel Guzmán, and ...
The summer of 2002 in Washington, DC, was steamy and hot, which is how I remember my introduction to HRDAG. I had begun working with them, while they were still at AAAS, in the late spring, learning all about their core concepts: duplicate reporting and MSE, controlled vocabularies, inter-rater reliability, data models and more. The days were long, with a second shift more often than not running late into the evening. In addition to all the learning, I also helped with matching for the Chad project – that is, identifying multiple records of the same violation – back when matching was done by hand. But it was not long after I arrived in Washington ...
Patrick Ball won the Karl E. Peace Award for Outstanding Statistical Contributions for the Betterment of Society at the 2018 Joint Statistical Meeting.
Inaccurate statistics can damage the credibility of human rights claims—and that's why we strive to ensure that statistics about human rights violations are generated with as much rigor and are as scientifically accurate as possible.
But, what are the pitfalls leading to inaccuracy—when, where, and how do data become compromised? How are patterns biased by having only partial data? And what are the best scientific methods for collecting, managing, processing and analyzing data?
Here are the data pitfalls that HRDAG has identified, as well as some of our approaches for meeting these challenges. We believe that human rights researchers must take ...
Sarah L. Desmarais and Evan M. Lowder (2019). Pretrial Risk Assessment Tools: A Primer for Judges, Prosecutors, and Defense Attorneys. Safety and Justice Challenge, February 2019. © 2019 Safety and Justice Challenge. <<HRDAG's Kristian Lum and Tarak Shah served as Project Members and made contributions to the primer.>>
Sarah L. Desmarais and Evan M. Lowder (2019). Pretrial Risk Assessment Tools: A Primer for Judges, Prosecutors, and Defense Attorneys. Safety and Justice Challenge, February 2019. © 2019 Safety and Justice Challenge. <<HRDAG’s Kristian Lum and Tarak Shah served as Project Members and made significant contributions to the primer.>>
<< Previous post: MSE: The Basics
Q3. What are the steps in an MSE analysis?
Q4. What does data collection look like in the human rights context? What kind of data do you collect?
Q5. [In depth] Do you include unnamed or anonymous victims in the matching process?
Q6. What do you mean by "cleaning" and "canonicalization?"
Q7. [In depth] What are some of the challenges of canonicalization? (more…)
In Puerto Rico, some people are more likely to be victims of police violence than others. HRDAG processed a flood of data to illuminate the racial bias.
New paper in Biometrika, co-authored by HRDAG's Kristian Lum and James Johndrow: Theoretical limits of microclustering in record linkage.