With so many dashboards and shiny visualizations, how can an interested non-technical reader find good science among the noise?
HRDAG’s core values all have a connection to Scott Weikart, 1951–2023.
Dr. Patrick Ball recently visited the Plutopia News Network podcast for a wide-ranging, inspiring conversation about his work for the Human Rights Data Analysis Group.
Patrick spoke about how he first discovered human rights work during his time in El Salvador with the Peace Brigades International. That led to his ongoing work as a statistician and computer programmer working to assess and analyze human rights violations. He also unpacked some common statistical techniques used by researchers at Human Rights Data Analysis Group, such as multiple systems estimation, which uses multiple different datasets to gain insights into the data we don't ...
The Sri Lankan army must explain to the families of the disappeared and missing what happened to an estimated 500 Tamils who disappeared in their custody at the war end on/around 18 May 2009, said two international NGOs who have been collating and analysing lists of names.
Sri Lanka has one of the largest numbers in the world of enforced disappearances but these 500 represent the largest number of disappearances all in one place and time in the country. For a detailed account of the process of estimating the 500 please see: “How many people disappeared on 17-19 May 2009 in Sri Lanka?” .
The data science field is always changing, which means that I'll always be learning.
HRDAG associate Miguel Cruz has an epiphany. All those data he’s drowning in? Each datapoint is a personal tragedy, a story both dark and urgent, and he’s privileged to have access.
In July 2009, HRDAG concluded a three-year project with the Liberian Truth and Reconciliation Commission (TRC) to help clarify Liberia’s violent history and hold perpetrators accountable. A military coup in 1979 sparked 24 years of civil war in Liberia where warring factions subjected civilians to severe human rights abuses. The TRC sought to determine whether these violations represented a systematic pattern or policy. This chapter describes how HRDAG developed a statistical analysis of the more than 17,000 victim and witness statements collected by the TRC and applied Ball’s “Who Did What To Whom?” methodology. HRDAG scientist Kristen ...
Kristian Lum spoke about "Understanding the Context and Consequences of Pre-Trial Detention" at the Conference on Fairness, Accountability, and Transparency (FAT*).
The modular nature of the workflow and use of Git allowed us to work on different parts of the project from across the country.
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
Laurel Eckhouse, Kristian Lum, Cynthia Conti-Cook and Julie Ciccolini (2018). Layers of Bias: A Unified Approach for Understanding Problems With Risk Assessment. Criminal Justice and Behavior. November 23, 2018. © 2018 Sage Journals. All rights reserved. https://doi.org/10.1177/0093854818811379
The World According to Artificial Intelligence: Targeted by Algorithm (Part 1)
The Big Picture: The World According to AI explores how artificial intelligence is being used today, and what it means to those on its receiving end.
Patrick Ball is interviewed: “Machine learning is pretty good at finding elements out of a huge pool of non-elements… But we’ll get a lot of false positives along the way.”
Amelia Hoover Green and Patrick Ball (2019). Civilian killings and disappearances during civil war in El Salvador (1980–1992). Demographic Research, 1 October 2019. © 2019 Demographic Research. DOI: 10.4054/DemRes.2019.41.27
Amelia Hoover Green and Patrick Ball (2019). Civilian killings and disappearances during civil war in El Salvador (1980–1992). Demographic Research, 1 October 2019. © 2019 Demographic Research. DOI: 10.4054/DemRes.2019.41.27
Inaccurate statistics can damage the credibility of human rights claims—and that's why we strive to ensure that statistics about human rights violations are generated with as much rigor and are as scientifically accurate as possible.
But, what are the pitfalls leading to inaccuracy—when, where, and how do data become compromised? How are patterns biased by having only partial data? And what are the best scientific methods for collecting, managing, processing and analyzing data?
Here are the data pitfalls that HRDAG has identified, as well as some of our approaches for meeting these challenges. We believe that human rights researchers must take ...
Kristian Lum, Erwin Ma and Mike Baiocchi (2017). The causal impact of bail on case outcomes for indigent defendants in New York City. Observational Studies 3 (2017) 39-64. 31 October 2017. © 2017 Institute of Mathematical Statistics.
Kristian Lum, Erwin Ma and Mike Baiocchi (2017). The causal impact of bail on case outcomes for indigent defendants in New York City. Observational Studies 3 (2017) 39-64. 31 October 2017. © 2017 Institute of Mathematical Statistics.
Help us hold human rights violators accountable!
<< Previous post: MSE: The Basics
Q3. What are the steps in an MSE analysis?
Q4. What does data collection look like in the human rights context? What kind of data do you collect?
Q5. [In depth] Do you include unnamed or anonymous victims in the matching process?
Q6. What do you mean by "cleaning" and "canonicalization?"
Q7. [In depth] What are some of the challenges of canonicalization? (more…)
I got an email from my superheroic PhD adviser in June 2006: Would I be interested in relocating to Palo Alto for six months in order to work with Patrick Ball at the Human Rights Data Analysis Group? (She'd gotten a grant and would cover my stipend.) Since I'd spent the last several months in New Haven wrestling ineffectually with giant, brain-melting methodological problems, I said yes immediately.
The plan with my adviser was simple: I'd digitize the ancient, multiply-photocopied pages of data from the United Nations Truth Commission for El Salvador, combine them with two other datasets, match across all the records, and produce reliable ...
Much of the work we do at HRDAG involves estimating the number of undocumented deaths using a statistical technique called multiple systems estimation (MSE, described in more detail here). One of our goals is to make this class of methods more broadly available to human rights researchers. In particular, we are finding that Bayesian approaches are extremely valuable for MSE. Accordingly, we are pleased to offer a new R package called dga (“decomposable graphs approach”) that performs Bayesian model averaging for MSE.
The main function in this package implements a model created by David Madigan and Jeremy York. This model was designed to ...