276 results for search: Кто привлекает Водолея больше в insta---batmanapollo/feed/inter-rater-reliability
Data coding and inter-rater reliability (IRR)
Data coding is the process of converting unstructured information, such as a narrative testimony, into discrete facts such as names and roles of actors (victims, witnesses, perpetrators) in crimes, as well as the date and place of act. Data coding must not discard or distort information. When more than one person is identifying, classifying and counting the elements reported in a qualitative source, the results of what they find may differ slightly based on each individual's interpretation and care in doing the coding. These differences can be measured by measuring IRR (inter-rater reliability). We give the same source document to several coders and ...
New Research Shows Community Engagement Improves the Validity and Reliability of Artificial Intelligence
AI is transforming the way scientists analyze complex data.
The Art and Science of Coding AHPN Documents
The coding, from my perspective, is the heart of the project. I say this, because the coding team has the responsibility of selecting documents according to the random sample, recording the documents’ contents, and applying the criteria to convert that content into an entry in a quantitative database. Not to mention the fact that this team has the privilege of being in direct contact with the documents.
At present, because of advanced organizational processes, not everyone has a chance to hold an original document in their hands. The quantitative study had many advantages in this regard; since we started work in parallel with the archival ...
Liberia
In July 2009, The Human Rights Data Analysis Group (HRDAG) concluded a three-year project with the Liberian Truth and Reconciliation Commission to help clarify Liberia's violent history and hold perpetrators of human rights abuses accountable for their actions. (This work was conducted by HRDAG while with Benetech.)
In the course of this work, HRDAG analyzed more than 17,000 victim and witness statements collected by the Liberian Truth and Reconciliation Commission and compiled the data into a report entitled "Descriptive Statistics From Statements to the Liberian Truth and Reconciliation Commission." The report is included as an annex to the final ...
Podcast: Dr. Megan Price Explores Fact Finding in a Failed State
How do scientists and statisticians stand up to authoritarianism, especially when it happens in their home countries?
Core Concepts
Inaccurate statistics can damage the credibility of human rights claims—and that's why we strive to ensure that statistics about human rights violations are generated with as much rigor and are as scientifically accurate as possible.
But, what are the pitfalls leading to inaccuracy—when, where, and how do data become compromised? How are patterns biased by having only partial data? And what are the best scientific methods for collecting, managing, processing and analyzing data?
Here are the data pitfalls that HRDAG has identified, as well as some of our approaches for meeting these challenges. We believe that human rights researchers must take ...
IRR: Agreement Among Coders is Key
For years I have been engaged in a quantitative study at Guatemala’s Historic Archive of the National Police, or AHPN. (See the blogposts below.) In this study coders collect data on sheets of paper according to criteria established and explained in manuals. But when collecting data, there’s always room for human error—this is why the validity of the study hinges on verifying that coders use the correct criteria.
It is important to mention that the mainstay of coding is the use of a controlled vocabulary. A controlled vocabulary gives analysts a framework, or frame of reference, when converting qualitative information into categories ...
All of the ways we remember: How data scientists hold memory with and for survivors
Those most vulnerable to state violence are already marginalized and undercounted, their experiences ignored or minimized in official sources. To avoid perpetuating these harms in our analyses, we have to find ways to incorporate unofficial data sources and all of the ways we remember.
Millions of Pages of Police Use-of-Force Files Available through New Searchable Database
A new, public database will bring more oversight to police abuses in California—and may serve as a model for police accountability for other states across the country.
HRDAG was part of a coalition behind the recently-launched Police Records Access Project. The new searchable database includes ...
Reflections: Pivotal Moments in Freetown
The summer of 2002 in Washington, DC, was steamy and hot, which is how I remember my introduction to HRDAG. I had begun working with them, while they were still at AAAS, in the late spring, learning all about their core concepts: duplicate reporting and MSE, controlled vocabularies, inter-rater reliability, data models and more. The days were long, with a second shift more often than not running late into the evening. In addition to all the learning, I also helped with matching for the Chad project – that is, identifying multiple records of the same violation – back when matching was done by hand. But it was not long after I arrived in Washington ...
Reflections: A Love Letter to HRDAG
On the anniversary of the Universal Declaration of Human Rights, HRDAG executive director Megan Price tells us why she loves her work, and why she feels hopeful about the future.
FAQ about the JEP-CEV-HRDAG data integration and statistical estimation project
1. Is there a single source of information about the victims of the armed conflict in Colombia?
No. Colombia has an extensive documentation process for victims of the armed conflict. Hundreds of institutions, victims' organizations, and civil society organizations have focused their efforts on recording this information. However, each entity or organization develops their documentation process with its own limitations related to technical, logistical, social, and missionary capacities. No entity or organization is able to document the complete universe of victims. This is because it is impossible for them to reach every part of the country, ...
Quantitative Research at the AHPN Guatemala
In early 2006 I joined the Historical Archive of the National Police (Archivo Histórico de la Policía Nacional, or AHPN) without knowing the impact it would have on my future. I started with cleaning, organizing and classifying documents—and learning, with other colleagues, what a historical archive is and how it works.
By April of that year, parallel to these learning processes, I was selected along with 20 other people to begin work on the challenging Quantitative Research project. I started as a "coder," transferring key content from documents into a database. (more…)
Multiple Systems Estimation: Collection, Cleaning and Canonicalization of Data
<< Previous post: MSE: The Basics
Q3. What are the steps in an MSE analysis?
Q4. What does data collection look like in the human rights context? What kind of data do you collect?
Q5. [In depth] Do you include unnamed or anonymous victims in the matching process?
Q6. What do you mean by "cleaning" and "canonicalization?"
Q7. [In depth] What are some of the challenges of canonicalization? (more…)
Celebrating Ten Years of Data from the AHPN
Ten years ago, in July 2005, human rights officers stumbled upon a nondescript warehouse in a commercial zone of Guatemala City and changed history. They had discovered an archive–its existence kept secret–belonging to the Guatemalan National Police, whose officers committed human rights atrocities on behalf of the government during the civil war.
Inside the building was the bureaucratic detritus typical of a large government agency: 80 million pages detailing shifts worked, tasks assigned, assignments fulfilled, workers’ whereabouts, and who was supervising whom. The documents, which were found stacked on dirty floors, shoved into bags, ...
How Pretrial Risk Assessment Tools Perpetuate Unfairness
Tools like Compas allegedly help judges predict future criminal activities and eliminate bias. HRDAG and partners showed how the tools recycle bias.
String matching for governorate information in unstructured text
code{white-space: pre;}
pre:not([class]) {
background-color: white;
}
h1 {
font-size: 34px;
}
h1.title {
font-size: 38px;
}
h2 {
font-size: 30px;
}
h3 {
font-size: 24px;
}
h4 {
font-size: 18px;
}
h5 {
font-size: 16px;
}
h6 {
font-size: 12px;
}
.table th:not([align]) {
text-align: left;
}
.main-container {
max-width: 940px;
margin-left: auto;
margin-right: auto;
}
code {
color: inherit;
background-color: rgba(0, 0, 0, 0.04);
}
img {
max-width:100%;
height: auto;
}
.tabbed-pane {
padding-top: 12px;
}
.html-widget {
margin-bottom: 20px;
}
button.code-foldin...
Multiple Systems Estimation: The Matching Process
<<Previous post: Collection, Cleaning, and Canonicalization of Data
Q8. What do you mean by "overlap," and why are overlaps important?
Q9. [In depth] Why is automated matching so important, and what process do you use to match records?
Q8. What do you mean by "overlap," and why are overlaps important?
MSE estimates the total number of violations by comparing the size of the overlap(s) between lists of human rights violations to the sizes of the lists themselves. By "overlap," we mean the set of incidents, such as deaths, that appear on more than one list of human rights violations. Accurately and efficiently identifying overlaps between ...
Seeking the Truth with Documentation
The need to establish the truth around events is central to goals of transitional justice, particularly securing accountability, establishing legitimate and effective justice mechanisms, and laying the foundations for a peaceful society.
Documentation to provide verifiable and widely accepted accounts of such events is a critical component of establishing this truth, or the multiple truths that may exist for a population. It is difficult to overstate the important role of documentation in transitional justice efforts. If some of what follows sounds familiar, echoing points of previous posts, it is no coincidence. Documentation, in a word, is the ...
Lessons at HRDAG: Making More Syrian Records Usable
If we could glean key missing information from those fields, we would be able to use more records.
