705 results for search: %E3%80%8C%EC%97%94%EC%A1%B0%EC%9D%B4%ED%8F%B0%ED%8C%85%E3%80%8D%20WWW_BEX_PW%20%20%EC%82%BC%EC%84%B1%EC%A4%91%EC%95%99%EC%97%AD%EB%9E%9C%EC%B1%97%20%EC%82%BC%EC%84%B1%EC%A4%91%EC%95%99%EC%97%AD%EB%A6%AC%EC%96%BC%ED%8F%B0%ED%8C%85%E2%86%92%EC%82%BC%EC%84%B1%EC%A4%91%EC%95%99%EC%97%AD%EB%A7%8C%EB%82%A8%E2%9C%81%EC%82%BC%EC%84%B1%EC%A4%91%EC%95%99%EC%97%AD%EB%A7%8C%EB%82%A8%EA%B5%AC%ED%95%A8%E3%8A%8C%E3%81%86%E8%B9%9Eimparkation/feed/content/colombia/privacy


Innocence Discovery Lab – Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents

Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab - Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents." The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.

Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab – Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents.” The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.


HRDAG Retreat 2015

I look at the beach and then at the table surrounded by nerds, deep in thought and conversation about Dirichlet priors, matching algorithms, and armed conflicts. This peculiar (in the best way) environment catalyzes a moment of reflection: how did I get here? Four years ago, as a second-year statistics PhD student, I watched "Guatemala: The Secret Files" on PBS Frontline World. I listened to stories of family members who disappeared without answers or justice. Then the story shifted to the work being done by archivists and data experts at Guatemala's Historic Archive of the National Police. The scientists' pursuit of the truth energized me. I ...

The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool

Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.

Anjana Samant, Noam Shemtov, Kath Xu, Sophie Beiers, Marissa Gerchick, Ana Gutierrez, Aaron Horowitz, Tobi Jegede, Tarak Shah (2023). The Devil is in the Details: Interrogating Values Embedded in the Allegheny Family Screening Tool. ACLU. Summer 2023.


Happy Hacking

From my first introduction to the HRDAG community at the annual retreat it was clear to me that mentorship is an organizational priority and that the contributions of interns are valued. Much of my first couple weeks as a summer intern at HRDAG were spent familiarizing myself with Patrick’s paradigm for principled data processing. At the same time, I was learning the skills and tricks (bash, make, vim, git) that promote an effortless programming workflow, a pursuit that Patrick calls “sharpening the saw” (just like in programming, you can cut down a tree with a dull blade, but your life will be much easier if you take the time to sharpen ...

Estimated Gaza Toll May Have Missed 25,000 Deaths, Study Says

Patrick Ball, director of research at the Human Rights Data Analysis Group, and a statistician who has conducted similar estimates of violent deaths in conflicts in other regions, said the study was strong and well reasoned. But he cautioned that the authors may have underestimated the amount of uncertainty caused by the ongoing conflict.

The authors used different variations of mathematical models in their calculations, but Dr. Ball said that rather than presenting a single figure — 64,260 deaths — as the estimate, it may have been more appropriate to present the number of deaths as a range from 47,457 to 88,332 deaths, a span that encompasses all of the estimates produced by modeling the overlap among the three lists.

“It’s really hard to do this kind of thing in the middle of a conflict,” Dr. Ball said. “It takes time, and it takes access. I think you could say the range is larger, and that would be plausible.”


How many people are infected with Covid-19?

Tarak Shah (2020). How many people are infected with Covid-19? Significance. 09 April 2020. © 2020 The Royal Statistical Society.

Tarak Shah (2020). How many people are infected with Covid-19? Significance. 09 April 2020. © 2020 The Royal Statistical Society.


The Day We Fight Back

Today, February 11, is the day of national protests against the National Security Administration. The critical threat is mass surveillance. In the words of The Day We Fight Back, “Together we will push back against powers that seek to observe, collect, and analyze our every digital action. Together, we will make it clear that such behavior is not compatible with democratic governance. Together, if we persist, we will win this fight.” (more…)

Reflections: Richard Savage’s Vision Fulfilled

In 1984, as a fresh PhD, I heard Richard Savage give his presidential address at the Joint Statistical Meetings in Philadelphia. He called it "Hard/Soft Problems" and made a big pitch for statisticians to get involved in human rights data analysis. It was inspirational, and I was immediately sold. I started working with the American Statistical Association's Committee on Scientific Freedom and Human Rights (now chaired by HRDAG's own Megan Price). Over time, a growing set of statisticians became involved, initially in letter-writing campaigns to help dissident statisticians (and other quantitative academics—economists seemed to have a particular ...

Analysis of Homicide Patterns in Colombia

Last week Forensis, the Colombian National Institute of Forensic Medicine’s flagship publication, published the first of our analyses of homicide patterns in Colombia. Authored by HRDAG executive director Patrick Ball and UN colleague Michael Reed Hurtado, “Cuentas y mediciones de la criminalidad y de la violencia” (pages 529-545) explores, as the title suggests, the quality of “truth” contained within crime registries. Citing the problem of partial data, missing data, and inherent design bias, Patrick and Michael write that no register, official or unofficial, can present a true reflection of what has really happened. This publication...

Police Violence in Puerto Rico: Flooded with Data

Kilómetro Cero is making a comparison of police killings in Puerto Rico and police killings in the non-territorial United States, and HRDAG is helping to organize the data.

Celebrating Ten Years of Data from the AHPN

Ten years ago, in July 2005, human rights officers stumbled upon a nondescript warehouse in a commercial zone of Guatemala City and changed history. They had discovered an archive–its existence kept secret–belonging to the Guatemalan National Police, whose officers committed human rights atrocities on behalf of the government during the civil war. Inside the building was the bureaucratic detritus typical of a large government agency: 80 million pages detailing shifts worked, tasks assigned, assignments fulfilled, workers’ whereabouts, and who was supervising whom. The documents, which were found stacked on dirty floors, shoved into bags, ...

verdata: An R package for analyzing data from the Truth Commission in Colombia

Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.

Maria Gargiulo, María Julia Durán, Paula Andrea Amado, and Patrick Ball (2024). verdata: An R package for analyzing data from the Truth Commission in Colombia. The Journal of Open Source Software. 6 January, 2024. 9(93), 5844, https://doi.org/10.21105/joss.05844. Creative Commons Attribution 4.0 International License.


Syria’s celebrations muted by evidence of torture in Assad’s notorious prisons

The Human Rights Data Analysis Group, an independent scientific human rights organization based in San Francisco, has counted at least 17,723 people killed in Syrian custody from 2011 to 2015 — around 300 every week — almost certainly a vast undercount, it says.


Identifiers of Detained Children Have Implications for Data Security and Estimation

Identifiers being sequential could make possible estimations of the population of detained children.

Learning Day by Day: Quantitative Research at the AHPN

Working at the Historic Archive of the National Police (AHPN) of Guatemala, there are many skills I learned on the job. My many years of work on the team that studies the recovered documents have been like a custom-made course in how to do quantitative research. The Archive documents I study are the result of 36 years of creation during civil war (1960 to 1996). Many of these documents are simply administrative—but we are able to use them to understand patterns that occurred during the conflict, to get a sense of what mattered to the National Police and what didn’t. Our quantitative research shows us the Police behavior in broad strokes. ...

Frequently Asked Questions

Multiple Systems Estimation What is MSE?  What do you mean by statistical inference?  What is an overlap, and how do we know when lists overlap?   How does MSE find the total number of violations?  How was MSE originally developed?  How does the Benetech Human Rights Program use MSE?    1. What is MSE? A: Multiple Systems Estimation, or MSE, is a family of techniques for statistical inference. MSE uses the overlaps between several incomplete lists of human rights violations to determine the total number of violations. Return to Top 2. What do you mean by statistical inference? A: ...

Press Release, Timor-Leste, February 2006

SILICON VALLEY GROUP USES TECHNOLOGY TO HELP THE TRUTH COMMISSION ANSWER DISPUTED QUESTIONS ABOUT MASSIVE POLITICAL VIOLENCE IN TIMOR-LESTE Palo Alto, CA, February 9, 2006 – The Benetech® Initiative today released a statistical report detailing widespread and systematic violations in Timor-Leste during the period 1974-1999. Benetech's statistical analysis establishes that at least 102,800 (+/- 11,000) Timorese died as a result of the conflict. Approximately 18,600 (+/- 1000) Timorese were killed or disappeared, while the remainder died due to hunger and illness in excess of what would be expected due to peacetime mortality. The magnitude of deaths ...

Reflections: A Simple Plan

I got an email from my superheroic PhD adviser in June 2006: Would I be interested in relocating to Palo Alto for six months in order to work with Patrick Ball at the Human Rights Data Analysis Group? (She'd gotten a grant and would cover my stipend.) Since I'd spent the last several months in New Haven wrestling ineffectually with giant, brain-melting methodological problems, I said yes immediately. The plan with my adviser was simple: I'd digitize the ancient, multiply-photocopied pages of data from the United Nations Truth Commission for El Salvador, combine them with two other datasets, match across all the records, and produce reliable ...

Outreach at Toronto TamilFest for Counting the Dead

Michelle spent a weekend in Toronto, Canada, reaching out to the community at TamilFest, where she and a colleague invited people to sit down and talk.

Reflections: Minding the Gap

How might we learn what we don’t know? HRDAG associate Christine Grillo hits the wayback machine and recalls her first exposure to People Against Bad Things, ideas about bias and correlation versus causation, and truth.

Our work has been used by truth commissions, international criminal tribunals, and non-governmental human rights organizations. We have worked with partners on projects on five continents.

Donate