Issues surrounding policing in the United States are at the forefront of our national attention. Among these is the use of “predictive policing,” which is the application of statistical or machine learning models to police data, with the goal of predicting where or by whom crime will be committed in the future. Today Significance magazine published an article on this topic that I co-authored with William Isaac. Significance has kindly made this article open access (free!) for all of October. In the article we demonstrate the mechanism by which the use of predictive policing software may amplify the biases that already pervade our criminal ...
In this video, Trina Reynolds-Tyler of Invisible Institute talks about her work with HRDAG on the missing persons project in Chicago and Beneath the Surface.
According to the Human Rights Data Analysis Group, at least 17,723 people were killed in government custody from the start of the uprising in March 2011 to December 2015 – an average of 300 deaths each month. There are no figures for subsequent years but there is no reason to believe the killings stopped.
Romesh Silva and Patrick Ball. “On the Use of Sample Surveys and Multiple Systems Estimations in Assessing Large-Scale Human Rights Violations: Recent Experiences from Timor-Leste.” Proceedings of the Social Statistics Section – Joint Statistical Meetings. New York, (USA). August, 2005.
Patrick Ball and Miguel Cruz (2003). “Human freedom and free software: Why choices about technology matter to human rights advocates.”
For more than 10 years, and with regularity, Mexican authorities have been discovering mass graves, known as fosas clandestinas, in which hundreds of bodies and piles of bones have been found. The casualties are attributed broadly to the country’s “drug war,” although the motivations and perpetrators behind the mass murders are often unknown.
Recently, HRDAG collaborated with two partners in Mexico—Data Cívica and Programa de Derechos Humanos of the Universidad Iberoamericana—to model the probability of identifying a hidden grave in each county (municipio). The model uses an set of independent variables and data about graves from 2013 ...
Valentina Rozo Ángel has worked with HRDAG and the Colombian Truth Commission to acknowledge victims of the 50-year conflict who are not visible or easily counted.
The data on killings in Kosovo are in four files. All of the files are comma-delimited ASCII. The fields in each file are described below.
If you use these data on Kosovo killings, please cite them with the following citation, as well as this note:
“These are convenience sample data, and as such they are not a statistically representative sample of events in this conflict. These data do not support conclusions about patterns, trends, or other substantive comparisons (such as over time, space, ethnicity, age, etc.).”
Patrick Ball, Wendy Betts, Fritz Scheuren, Jana Dudukovich, and Jana Asher. (2002). AAAS/ABA-CEELI/Human Rights Data ...
Over the last few years, we've tried to make the data organized in our projects publicly accessible. We have encouraged our partners to publish the data at the completion of the project. We continue to believe it is important to offer access to the data used in our projects for the sake of transparency as well as to encourage further research and analysis. However, we are increasingly concerned about how raw data are used. Data collected by what we can observe is what statisticians call a convenience sample, which is subject to selection bias.
We're keeping these datasets available for researchers who want to use them for simulation or estimation ...
Next week, on June 11, Oxford University Press officially puts Counting Civilian Casualties: An Introduction to Recording and Estimating Nonmilitary Deaths in Conflict on the market. This textbook, edited by Taylor B. Seybolt, Jay D. Aronson, and Baruch Fischhoff, responds to the increasing concern for civilians in conflict and aims to promote scientific dialogue by highlighting the strengths and weaknesses of the most commonly used casualty recording and estimation techniques.
HRDAG is very well represented here, as our colleagues have co-authored four chapters, and Nicholas Jewell, who sits on our Science Committee, has co-authored a fifth. ...
HRDAG contributes to the project by helping to classify, filter, extract, and standardize the records so that they can be useful in the database.
Multiple Systems Estimation
What is MSE?
What do you mean by statistical inference?
What is an overlap, and how do we know when lists overlap?
How does MSE find the total number of violations?
How was MSE originally developed?
How does the Benetech Human Rights Program use MSE?
1. What is MSE?
A: Multiple Systems Estimation, or MSE, is a family of techniques for statistical inference. MSE uses the overlaps between several incomplete lists of human rights violations to determine the total number of violations.
Return to Top
2. What do you mean by statistical inference?
A: ...
Cory Doctorow of Boing Boing writes about HRDAG director of research Patrick Ball’s article “Violence in Blue,” published March 4 in Granta. From the post: “In a must-read article in Granta, Ball explains the fundamentals of statistical estimation, and then applies these techniques to US police killings, merging data-sets from the police and the press to arrive at an estimate of the knowable US police homicides (about 1,250/year) and the true total (about 1,500/year). That means that of all the killings by strangers in the USA, one third are committed by the police.”
HRDAG analysis shows that the government figures are a gross underestimation of the drug-related killings in the Philippines.
HRDAG and our partners Data Cívica and the Iberoamericana University created a machine-learning model to predict which counties (municipios) in Mexico have the highest probability of unreported hidden graves. The predictions help advocates to bring public attention and government resources to search for the disappeared in the places where they are most likely to be found.
Context
For more than ten years, Mexican authorities have been discovering hidden graves (fosas clandestinas). The casualties are attributed broadly—and sometimes inaccurately—to the country’s “drug war,” but the motivations and perpetrators behind the mass murders ...