Tech Notes
Using Machine Learning to Help Human Rights Investigators Sift Massive Datasets
How we built a model to search hundreds of thousands of text messages from the perpetrators of a human rights crime.
The task is a quantum of workflow
This post describes how we have organized our work across ten years, twenty analysts, dozens of countries, and hundreds of projects: we start with a task. A task is a single chunk of work, a quantum of workflow. Each task is self-contained and self-documenting; I'll talk about these ideas at length below. We try to keep each task as small as possible, which makes it easy to understand what the task does and to test whether its results are correct.
The example I'll describe here comes from our Syria database matching project, which ...


