Datasets

Over the last few years, we’ve tried to make the data organized in our projects publicly accessible. We have encouraged our partners to publish the data at the completion of the project. We continue to believe it is important to offer access to the data used in our projects for the sake of transparency as well as to encourage further research and analysis. However, we are increasingly concerned about how raw data are used. Data collected by what we can observe is what statisticians call a convenience sample, which is subject to selection bias.

We’re keeping these datasets available for researchers who want to use them for simulation or estimation projects that use statistical models to correct for biases. We strongly caution researchers not to use the raw data as measures of violence. (Note: several of the Kosovo datasets are estimates and can be used as measures; read the data dictionaries carefully.) Please contact us at info @ hrdag.org if you have any questions or need assistance.

You are welcome to use these datasets for your research. If you publish with them, however, we ask that you include the following text:

These are convenience sample data, and as such they are not a statistically representative sample of events in the referenced conflict.  These data do not support conclusions about patterns, trends, or other substantive comparisons (such as over time, space, ethnicity, age, etc.).

We would also very much appreciate a citation.

Statistical data

Colombia – Casanare (en español)

Guatemala – CIIDH datasets

Kosovo – Killings, Migrations, and More

Liberia – Truth and Reconciliation Commission

Sierra Leone – TRC Data and Statistical Appendix



  • > HRDAG

    The Human Rights Data Analysis Group is a non-profit, non-partisan organization that applies rigorous science to the analysis of human rights violations around the world.
  • > Recent Stories


    Patrick Ball Honored with Degree at Claremont Graduate University

    When Data Doesn’t Tell the Whole Story

    Focus on Good Science, not Scientists

    HRDAG Offers New R Package – dga

    How many police homicides in the US? A reconsideration

    HRDAG Retreat 2015

    BJS Report on Arrest-Related Deaths: True Number Likely Much Greater

    The Great Lessons in Research at the Archive

    Evaluation of the Kosovo Memory Book

    Yezidi Activists Teach HRDAG about Human Rights – updated


    Archives


    x

    You are welcome to use these datasets for your research. If you publish with them, however, we ask that you include the following text: "These are convenience sample data, and as such they are not a statistically representative sample of events in this conflict.  These data do not support conclusions about patterns, trends, or other substantive comparisons (such as over time, space, ethnicity, age, etc.)."

    For reference and further information please see this blogpost about raw data and this blogpost about convenience samples. In addition, we recommend you read the following: Dorofeev, S. and P. Grant (2006). Statistics for Real-Life Sample Surveys. Cambridge University Press; and van Belle, Gerald (2002). Statistical Rules of Thumb. Wiley.

    If you use these data, please cite them with the following reference: