649 results for search: %EC%84%9C%EA%B5%AC%EB%A7%88%EC%B4%88%EC%9D%98%EB%B0%A4%CE%B8macho2%EF%BC%8C%EF%BC%A30%EF%BD%8D%20%EC%84%9C%EA%B5%AC%ED%82%A4%EC%8A%A4%EB%B0%A9%20%EC%84%9C%EA%B5%AC%ED%95%B8%ED%94%8C%20%EC%84%9C%EA%B5%AC%ED%9C%B4%EA%B2%8C%ED%85%94%20%EC%84%9C%EA%B5%AC%ED%95%B8%ED%94%8C


String matching for governorate information in unstructured text

code{white-space: pre;} pre:not([class]) { background-color: white; } h1 { font-size: 34px; } h1.title { font-size: 38px; } h2 { font-size: 30px; } h3 { font-size: 24px; } h4 { font-size: 18px; } h5 { font-size: 16px; } h6 { font-size: 12px; } .table th:not([align]) { text-align: left; } .main-container { max-width: 940px; margin-left: auto; margin-right: auto; } code { color: inherit; background-color: rgba(0, 0, 0, 0.04); } img { max-width:100%; height: auto; } .tabbed-pane { padding-top: 12px; } .html-widget { margin-bottom: 20px; } button.code-foldin...

Human Rights and the Decentralized Web

Our partners were eager to learn and talk about emerging decentralized technology.

FAQs on Predictive Policing and Bias

Last month Significance magazine published an article on the topic of predictive policing and police bias, which I co-authored with William Isaac. Since then, we've published a blogpost about it and fielded a few recurring questions. Here they are, along with our responses. Do your findings still apply given that PredPol uses crime reports rather than arrests as training data? Because this article was meant for an audience that is not necessarily well-versed in criminal justice data and we were under a strict word limit, we simplified language in describing the data. The data we used is a version of the Oakland Police Department’s crime report...

Timor-Leste Op-Ed

Defending Human Rights Data And The Possibility of Justice In East Timor By Patrick Ball and Romesh Silva On June 5th, armed gangs broke into the offices of the Commission for Reception, Truth and Reconciliation (CAVR) in Dili, East Timor and stole their motorbikes. Many human rights workers wondered whether the mobs would soon return to loot the irreplaceable paper records used by the CAVR to compile a definitive report on human rights abuses during the Indonesian occupation of East Timor from 1975-1999. The release of this report was preempted by the recent violence in Dili. But in the midst of the chaos, Australian military forces stepped in to ...

The task is a quantum of workflow

This post describes how we organize our work over ten years, twenty analysts, dozens of countries, and hundreds of projects: we start with a task. A task is a single chunk of work, a quantum of workflow. Each task is self-contained and self-documenting; I'll talk about these ideas at length below. We try to keep each task as small as possible, which makes it easy to understand what the task is doing, and how to test whether the results are correct. In the example I'll describe here, I'm going to describe work from our Syria database matching project, which includes about 100 tasks. I'll start with the first thing we do with files we receive ...

Timor-Leste

During the violence in Timor-Leste in June 2006, armed gangs broke into the offices of the Commission for Reception, Truth and Reconciliation (CAVR) in Dili and stole their motorbikes. The Human Rights Data Analysis Group, then at Benetech®, and other human rights observers wondered whether the mobs would soon return to loot the irreplaceable paper records used by the CAVR to compile their definitive report entitled "Chega!" The Benetech Initiative contributed to the CAVR findings and released a separate statistical report (PDF) establishing that at least 102,800 (+/- 11,000) Timorese died as a result of human rights violations in Timor-Leste ...

Scraping for Pattern: Protecting Immigrant Rights in Washington State

With HRDAG's help, the University of Washington Center for Human Rights team has been able to analyze the scraped text and search for key words such as “jail” in order to gain insight into where immigration arrests are being made.

How Review of Police Data Verified Neglect of Missing Black Women

Sloppy recordkeeping by Chicago police has compromised missing persons cases. HRDAG is working with Pulitzer Prize-winning Invisible Institute to shed light on these stories.

Open Source Summit 2018

On October 23, 2018, Patrick Ball keynoted at the Open Source Summit in Edinburgh, Scotland.

.outter-wrapper.feature { background: #15795b; } .outter-wrapper.feature hr { border-width: 0; height: 30px; } .outter-wrapper.feature h4 { /* height: 30px; */ border-width: 0; } .wrapper { padding: 20px 0; } .branding-headline { width: 100%; font-size: 40px; font-weight: 600; padding-bottom: 20px; color: #15795b; line-height: 43.2px; } .border-line { border-bottom: 1px solid #000; margin: 20px 0; } .hed-dek-illo { margin: 20px 0; } .illo { width: 100%; min-height: 200px; } .illo img { margin: 0; } .blog-pages { display: flex; } .blog-post { flex: 0 0 ...

Data on Kosovo Killings

The data on killings in Kosovo are in four files. All of the files are comma-delimited ASCII. The fields in each file are described below. If you use these data on Kosovo killings, please cite them with the following citation, as well as this note: “These are convenience sample data, and as such they are not a statistically representative sample of events in this conflict.  These data do not support conclusions about patterns, trends, or other substantive comparisons (such as over time, space, ethnicity, age, etc.).” Patrick Ball, Wendy Betts, Fritz Scheuren, Jana Dudukovich, and Jana Asher. (2002). AAAS/ABA-CEELI/Human Rights Data ...

Multiple Systems Estimation: Stratification and Estimation

<< Previous post, MSE: The Matching Process Q10. What is stratification? Q11. [In depth] How do HRDAG analysts approach stratification, and why is it important? Q12. How does MSE find the total number of violations? Q13. [In depth] What are the assumptions of two-system MSE (capture-recapture)? Why are they not necessary with three or more systems? Q14. What statistical model(s) does HRDAG typically use to calculate MSE estimates? (more…)

The Statistics of Mortality Due to Conflict in Peru

A key point is that human rights data collection prior to the TRC largely ignored violence by the Shining Path.

Rionegro

Text in English El uso de información de cementerios en la búsqueda de los desaparecidos: lecciones de un estudio piloto en Rionegro, Antioquia, Colombia Entre mayo y julio de 2009, investigadores del Grupo de Análisis de Datos de Derechos Humanos de Benetech (HRDAG por su sigla en inglés), condujeron un estudio piloto que examinó los patrones de la información sobre los cadáveres sin identificar en el cementerio de Rionegro, un municipio en el departamento de Antioquia, Colombia. El estudio se realizó en apoyo a los actuales esfuerzos de la organización socia de HRDAG, EQUITAS (Equipo Colombiano Interdisciplinario de Trabajo Forense y ...

State Coordinated Violence in Chad under Hissene Habre: A Statistical Analysis of Reported Prison Mortality in Chad’s DDS Prisons and Command Responsibility of Hissene Habre, 1982-1990.

Romesh Silva, Jeff Klingner, and Scott Weikart. “State Coordinated Violence in Chad under Hissene Habre: A Statistical Analysis of Reported Prison Mortality in Chad’s DDS Prisons and Command Responsibility of Hissene Habre, 1982-1990.” A Report by Benetech’s Human Rights Data Analysis Group to Human Rights Watch and the Chadian Association of Victims of Political Repression and Crimes. 29 January 2010. (Available in French) © 2010 Benetech. Creative Commons BY-NC-SA.


IRR: Agreement Among Coders is Key

For years I have been engaged in a quantitative study at Guatemala’s Historic Archive of the National Police, or AHPN. (See the blogposts below.) In this study coders collect data on sheets of paper according to criteria established and explained in manuals. But when collecting data, there’s always room for human error—this is why the validity of the study hinges on verifying that coders use the correct criteria. It is important to mention that the mainstay of coding is the use of a controlled vocabulary. A controlled vocabulary gives analysts a framework, or frame of reference, when converting qualitative information into categories ...

14 Questions about Counting Casualties in Syria

In early 2012, HRDAG was commissioned by the UN Office of the High Commissioner for Human Rights (OHCHR) to do an enumeration project, essentially a count of all of the reported casualties in the Syrian conflict. HRDAG has published two analyses so far, the first in January 2013, and the second in June 2013. In this post, HRDAG scientists Anita Gohdes, Megan Price, and Patrick Ball answer questions about that project. So, how many people have been killed in the Syrian conflict? This is a complicated question. As of our last report, in June 2013, we know that there have been at least 93,000 reported, identifiable conflict-related casualties. The ...

Innocence Discovery Lab – Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents

Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab - Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents." The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.

Ayyub Ibrahim, Huy Dao, and Tarak Shah (2024). “Innocence Discovery Lab – Harnessing Large Language Models to Surface Data Buried in Wrongful Conviction Case Documents.” The Wrongful Conviction Law Review 5 (1):103-25. https://doi.org/10.29173/wclawr112. 31 May, 2024. Copyright (c) 2024 Ayyub Ibrahim, Huy Dao, Tarak Shah. Creative Commons Attribution 4.0 International License.


BJS Report on Arrest-Related Deaths: True Number Likely Much Greater

(This post is co-authored by Patrick Ball and Kristian Lum.) Today the Bureau of Justice Statistics (BJS) released a report on their effort to document “all deaths that occur during the process of arrest in the United States.” The analysis estimates that the Arrest-Related Deaths (ARD) program covers only 34-49% of these deaths. A parallel program by the FBI (the Supplementary Homicide Reports, SHR) is estimated to cover approximately the same proportion of deaths. Even taking into consideration both programs, 28% of all police homicides remain unreported. In order to estimate the total number of homicides that appear on neither the ARD or ...

The Use of Unstructured Data to Study Police Use of Force

Tarak Shah, Cristian Allen, Ayyub Ibrahim, Harlan Kefalas, and Bavo Stevens (2024). The Use of Unstructured Data to Study Police Use of Force. 5 December, 2024. CHANCE, 37(4), 18–23. https://doi.org/10.1080/09332480.2024.2434437

Tarak Shah, Cristian Allen, Ayyub Ibrahim, Harlan Kefalas, and Bavo Stevens (2024). The Use of Unstructured Data to Study Police Use of Force. 5 December, 2024. CHANCE37(4), 18–23. https://doi.org/10.1080/09332480.2024.2434437


Our work has been used by truth commissions, international criminal tribunals, and non-governmental human rights organizations. We have worked with partners on projects on five continents.

Donate