Algorithmic tools like PredPol were supposed to reduce bias. But HRDAG has found that racial bias is baked into the data used to train the tools.
HRDAG is currently evaluating the quality and completeness of the Kosovo Memory Book of the Humanitarian Law Center (HLC) in Belgrade, Serbia. The objective of the Kosovo Memory Book (KMB) is to commemorate every single person who fell victim to armed conflict in Kosovo from 1998 to 2000, either through death or disappearance.
While building and reviewing their database, one of the things that HLC has to do is “record linkage,” a process also known as “matching.” Matching determines whether two records refer to the same person (“a match”) or to different people (“a non-match”). Matching helps to identify whether two existing records refer ...
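As a rough illustration of what a match/non-match decision can look like, here is a minimal sketch in Python. It is not HLC's or HRDAG's actual procedure: the field names (`name`, `date`), the invented records, and the 0.85 similarity threshold are all assumptions chosen for the example, and real record linkage typically compares many more fields.

```python
from difflib import SequenceMatcher


def name_similarity(a, b):
    """Return a similarity score between 0 and 1 for two name strings."""
    return SequenceMatcher(None, a.lower().strip(), b.lower().strip()).ratio()


def is_match(rec_a, rec_b, threshold=0.85):
    """Classify a pair of records as a match or a non-match.

    Assumed rule for this sketch: the dates must be identical and the
    names must be at least `threshold` similar. The threshold is a
    hypothetical value, not one taken from the source.
    """
    same_date = rec_a["date"] == rec_b["date"]
    return same_date and name_similarity(rec_a["name"], rec_b["name"]) >= threshold


# Invented records for illustration only.
r1 = {"name": "Jon Smith", "date": "1999-04-12"}
r2 = {"name": "John Smith", "date": "1999-04-12"}   # spelling variant of r1
r3 = {"name": "Maria Lopez", "date": "1999-04-12"}  # different person, same date
r4 = {"name": "Jon Smith", "date": "1999-05-30"}    # same name, different date

for other in (r2, r3, r4):
    label = "match" if is_match(r1, other) else "non-match"
    print(r1["name"], "vs", other["name"], other["date"], "->", label)
```

A fixed rule like this is easy to read but brittle; it is shown only to make the idea of pairwise comparison concrete.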
If we could glean key missing information from those fields, we would be able to use more records.
Romesh Silva and Patrick Ball. “The Demography of Conflict-Related Mortality in Timor-Leste (1974-1999): Empirical Quantitative Measurement of Civilian Killings, Disappearances & Famine-Related Deaths” In Statistical Methods for Human Rights, J. Asher, D. Banks and F. Scheuren, eds., Springer (New York) (2007)
Herb led and mentored a generation of statisticians working in human rights.
Today, February 11, is the day of national protests against the National Security Agency.
The critical threat is mass surveillance. In the words of The Day We Fight Back, “Together we will push back against powers that seek to observe, collect, and analyze our every digital action. Together, we will make it clear that such behavior is not compatible with democratic governance. Together, if we persist, we will win this fight.”
HRDAG has sampled and analyzed documents at Guatemala's AHPN and has testified against war criminals based on that analysis.
The datasets contributed by 30+ organizations do a wonderful job of tallying the violence that was observed—but they don’t account for the violence that nobody witnessed or documented.
Valentina Rozo Ángel has worked with HRDAG and the Colombian Truth Commission to acknowledge victims of the 50-year conflict who are not visible or easily counted.
I spent the two weeks over Easter working with Patrick and Megan in San Francisco, trying to figure out how best to estimate the number of casualties the Syrian civil war has claimed in the past two years. In January, HRDAG published a report on the number of fully identified casualties reported in the Syrian Arab Republic between March 2011 and November 2012. The number of de-duplicated records of killings for this period was 59,648, a number that is likely to be an undercount since we know that many incidents of lethal violence in conflict go unreported, and that the unreported cases are not missing at random.
Because the identifiers are sequential, it may be possible to estimate the population of detained children.
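One classical way to reason from sequential identifiers to a population size is the so-called German tank estimator. The sketch below is an illustration under strong assumptions, and not necessarily the method used in the analysis mentioned above: it assumes identifiers are assigned 1, 2, 3, ... without gaps, and that the observed identifiers are a uniform random sample drawn without replacement. The sample values are invented.

```python
def estimate_population(observed_ids):
    """Estimate the total count N from a sample of sequential identifiers.

    Uses the classical estimator m + m/k - 1, where m is the largest
    identifier observed and k is the number of distinct identifiers.
    Assumes (for this sketch) that identifiers run 1..N with no gaps
    and that the sample is uniformly random.
    """
    ids = set(observed_ids)
    if not ids:
        raise ValueError("need at least one observed identifier")
    m = max(ids)
    k = len(ids)
    return m + m / k - 1


# Invented sample of observed identifiers.
sample = [19, 40, 42, 60]
print(estimate_population(sample))
```

If identifiers are skipped, reused, or issued in separate blocks, these assumptions fail and the estimate can be badly off.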
There may have been more undocumented World War II-era Korean "comfort women" than known.
Q8. What do you mean by "overlap," and why are overlaps important?
MSE estimates the total number of violations by comparing the size of the overlap(s) between lists of human rights violations to the sizes of the lists themselves. By "overlap," we mean the set of incidents, such as deaths, that appear on more than one list of human rights violations. Accurately and efficiently identifying overlaps between ...
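To make the role of the overlap concrete, here is a minimal two-list sketch in Python using the Lincoln-Petersen estimator, the simplest capture-recapture calculation. It is a simplification for illustration only: MSE in practice typically uses three or more lists and models dependence between them, whereas this version assumes the two lists are independent and that records have already been matched. The list contents and the function name are invented.

```python
def two_list_estimate(list_a, list_b):
    """Estimate the total number of incidents from two matched lists.

    Lincoln-Petersen estimator: N_hat = (size of A * size of B) / overlap,
    where the overlap is the number of incidents that appear on both lists.
    Assumes the lists are independent samples of the same population.
    """
    a, b = set(list_a), set(list_b)
    overlap = len(a & b)
    if overlap == 0:
        raise ValueError("no overlap between lists; the estimate is undefined")
    return len(a) * len(b) / overlap


# Invented incident identifiers, already de-duplicated and matched.
list_a = {"v01", "v02", "v03", "v04", "v05", "v06"}
list_b = {"v04", "v05", "v06", "v07", "v08"}

documented = len(list_a | list_b)             # incidents seen on at least one list
estimate = two_list_estimate(list_a, list_b)  # estimated total, seen or not
print(documented, estimate)
```

A large overlap relative to the list sizes suggests the lists together cover most incidents; a small overlap suggests many incidents appear on neither list.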
Issues surrounding policing in the United States are at the forefront of our national attention. Among these is the use of “predictive policing,” which is the application of statistical or machine learning models to police data, with the goal of predicting where or by whom crime will be committed in the future. Today Significance magazine published an article on this topic that I co-authored with William Isaac. Significance has kindly made this article open access (free!) for all of October. In the article we demonstrate the mechanism by which the use of predictive policing software may amplify the biases that already pervade our criminal ...
During the violence in Timor-Leste in June 2006, armed gangs broke into the offices of the Commission for Reception, Truth and Reconciliation (CAVR) in Dili and stole their motorbikes.
The Human Rights Data Analysis Group, then at Benetech®, and other human rights observers wondered whether the mobs would soon return to loot the irreplaceable paper records used by the CAVR to compile their definitive report entitled "Chega!"
The Benetech Initiative contributed to the CAVR findings and released a separate statistical report (PDF) establishing that at least 102,800 (+/- 11,000) Timorese died as a result of human rights violations in Timor-Leste ...