Why It Took So Long To Update the U.N.-Sponsored Syria Death Count
In this story, Carl Bialik of FiveThirtyEight interviews HRDAG executive director Patrick Ball about the process of de-duplication, integration of databases, and machine-learning in the recent enumeration of reported casualties in Syria.
New reports of old deaths come in all the time, Ball said, making it tough to maintain a database. The duplicate-removal process means “it’s a lot like redoing the whole project each time,” he said.