Issues surrounding policing in the United States are at the forefront of our national attention. Among these is the use of “predictive policing,” which is the application of statistical or machine learning models to police data, with the goal of predicting where or by whom crime will be committed in the future. Today Significance magazine published an article on this topic that I co-authored with William Isaac. Significance has kindly made this article open access (free!) for all of October. In the article we demonstrate the mechanism by which the use of predictive policing software may amplify the biases that already pervade our criminal ...
At HRDAG, 2021 was all about service and partnership.
Ball analyzed the data reporters had collected from a variety of sources – including on-the-ground interviews, police records, and human rights groups – and used a statistical technique called multiple systems estimation to estimate the number of unreported deaths in three areas of the capital city Manila.
The team discovered that the number of drug-related killings was much higher than police had reported. The journalists, who published their findings last month in The Atlantic, documented 2,320 drug-linked killings over an 18-month period, approximately 1,400 more than the official number. Ball’s statistical analysis, which estimated the number of killings the reporters hadn’t heard about, found that close to 3,000 people could have been killed – more than three times the police figure.
Ball said there are both moral and technical reasons for making sure everyone who has been killed in mass violence is counted.
“The moral reason is because everyone who has been murdered should be remembered,” he said. “A terrible thing happened to them and we have an obligation as a society to justice and to dignity to remember them.”
Access to the records contained in archives is a concern shared by many. Archives support memory and free access to them strengthens democratic processes. Everyone should be allowed to see first-hand the records contained in an archive and be free to interpret them as needed.
Access to archives can increase knowledge on various topics and open opportunities for different fields of knowledge.
In 2019, HRDAG aimed to count those who haven't been counted.
Patrick Ball is kicking himself for a decision he made almost 25 years ago. “I was clever, but I wasn’t smart,” he says ruefully, as he considers the labyrinth of tables and ASCII-encoded keystrings he used to design a database of human rights violations for the pioneering Salvadoran non-governmental Human Rights Commission (CDHES). Now I’m sitting in his office in San Francisco’s Mission District watching over his shoulder, and trying to keep up, as he bangs out code to decipher the priceless data contained in these old files. Created in 1991 and 1992, during the last days of El Salvador’s internal armed conflict, the files detail ...
Kristian Lum, Chesa Boudin and Megan Price (2020). The impact of overbooking on a pre-trial risk assessment tool. FAT* '20: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency. January 2020. Pages 482–491. https://doi.org/10.1145/3351095.3372846 ©ACM, Inc., 2020.
Assessing claims of declining lethal violence in Colombia
In March 2007, the Human Rights Data Analysis Group (HRDAG) published a study titled "Assessing Claims of Declining Lethal Violence in Colombia." The authors of that study evaluated claims that violence in Colombia declined after the demobilization of the paramilitaries. They showed that such claims rest both on an overinterpretation of unadjusted data and on unfounded causal inferences. The authors concluded that multiple...
HRDAG’s funding comes from private, international donors: the John D. and Catherine T. MacArthur Foundation, Open Society Foundations, an anonymous U.S.-based private foundation, Ford Foundation, The National Endowment for Democracy and individual donors. This funding supports both specific projects and our general scientific work in human rights data analysis.
For the entirety of its existence, HRDAG has been a project of non-profit organizations, first at the American Association for the Advancement of Science (AAAS), and then at Benetech, a non-profit Silicon Valley technology company. In February 2013, HRDAG became ...
Illuminating Data's Dark Side: Big data create conveniences, but we must consider who designs these tools, who benefits from them, and who is left out of the equation.
Principled Data Processing is a way to prove to someone, usually yourself, that what you did was right.
On the anniversary of the Universal Declaration of Human Rights, HRDAG executive director Megan Price tells us why she loves her work, and why she feels hopeful about the future.
I got an email from my superheroic PhD adviser in June 2006: Would I be interested in relocating to Palo Alto for six months in order to work with Patrick Ball at the Human Rights Data Analysis Group? (She'd gotten a grant and would cover my stipend.) Since I'd spent the last several months in New Haven wrestling ineffectually with giant, brain-melting methodological problems, I said yes immediately.
The plan with my adviser was simple: I'd digitize the ancient, multiply-photocopied pages of data from the United Nations Truth Commission for El Salvador, combine them with two other datasets, match across all the records, and produce reliable ...
Multiple Systems Estimation
What is MSE?
What do you mean by statistical inference?
What is an overlap, and how do we know when lists overlap?
How does MSE find the total number of violations?
How was MSE originally developed?
How does the Benetech Human Rights Program use MSE?
1. What is MSE?
A: Multiple Systems Estimation, or MSE, is a family of techniques for statistical inference. MSE uses the overlaps between several incomplete lists of human rights violations to determine the total number of violations.
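To make the idea of "using overlaps" concrete, here is a minimal sketch of the simplest possible case: two lists and the classic Lincoln-Petersen capture-recapture estimator. This is an illustration only, not the method used in any particular HRDAG analysis; real MSE work typically involves more than two lists and models that account for dependence between them. The function name, the identifiers, and the data are all hypothetical, and the sketch assumes the two lists are independent and that records have already been matched correctly.

```python
def lincoln_petersen(list_a, list_b):
    """Two-list estimate of the total population size.

    Assumes (for illustration) that the lists are independent samples
    of the same closed population and that matching is error-free.
    """
    a = set(list_a)
    b = set(list_b)
    overlap = len(a & b)  # records that appear on both lists
    if overlap == 0:
        raise ValueError("no overlap between lists; estimate is undefined")
    # If list A captured a fraction overlap/len(b) of list B, assume it
    # captured the same fraction of the whole population.
    return len(a) * len(b) / overlap


# Hypothetical record identifiers, invented for this example.
list_a = {"v01", "v02", "v03", "v04", "v05", "v06"}
list_b = {"v04", "v05", "v06", "v07", "v08", "v09", "v10", "v11"}

documented = len(list_a | list_b)             # distinct records on either list
estimate = lincoln_petersen(list_a, list_b)   # estimated total, incl. undocumented

print(documented, estimate)
```

In this toy example the two lists document 11 distinct records between them, while the estimate of the total is 16, so roughly 5 cases would be expected to appear on neither list. The smaller the overlap relative to the list sizes, the larger the estimated undocumented share.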
2. What do you mean by statistical inference?
A: ...
We are saddened by the passing of Steve Fienberg yesterday in Pittsburgh, at the age of 74. He is perhaps best known around the world for bringing statistics to science and public policy and was a beloved professor at Carnegie Mellon University. At HRDAG we are in awe of and grateful for the work Steve did formalizing multiple systems estimation. His work on that front blazed a trail and essentially enabled all of our most important analytical work at the intersection of human rights and statistical science.
If we are to reduce the amount of human violence in the world, the first task is to determine the scope of the violence, to know how much of ...
The International Truth and Justice Project and HRDAG have published a report on 500 Tamils who disappeared while in Army custody in Sri Lanka in 2009.
The report is titled “How many people disappeared on 17-19 May 2009 in Sri Lanka?” and Patrick Ball, director of research at HRDAG, is the lead author.
From the article: “Intentionally inconsistent tracking can also influence the final tally, notes Megan Price, a statistician at the Human Rights Data Analysis Group. During the Iraq War, for example, officials worked to conceal mortality or to cherry pick existing data to steer the political narrative. While wars are handled differently from pandemics, Price thinks the COVID-19 data could still be at risk of this kind of manipulation.”