October 17, 2006

silhouette3.JPG From the desk of Jane Galt:

Legitimate questions on the Lancet survey

The good thing about the study is that people are having some very interesting discussions of the data. There are some good objections to it, and it makes interesting reading.

John Quiggin examines airstrikes, comparing them to the number of deaths by airstrike in the survey. His comments explode.

  • One commenter says that Mr Quiggins' estimate of the deadliness of airstrikes is way overblown:
    In the recent Israel-Lebanon War, “The IAF flew some 15,500 sorties, including some 10,000 fighter sorties, and attacked a total of around 7,000 targets.” Since Israel killed about 1,000 people by all means, I conclude that on average it takes 10 air attacks to kill 1 person, and that John Quiggin’s figures are inflated by a factor of 100.

    On the other hand, if the strikes are happening at night, people won't really know whether they got hit by an insurgent mortar or a US plane. Still, it does give pause. Can any of my many militarily knowlegeable people comment?

  • Others point out that while the numbers of deaths in the two studies (Lancet I and Lancet II) agree, the causes of death do not; the second study says that there's almost no excess mortality from non-violent causes. This is definitely disturbing. It's hard to confuse a stroke with getting blown up. Many of the commenters on Crooked Timber purport to believe that this does not matter. But the authors are claiming corroboration of Lancet I via Lancet II, when in fact the samples say something very, very different. Think of it as a survey of "excess violent deaths" in Iraq right after the invasion: Lancet I discovered approximately 60,000, while Lancet II discovered double that number, and no excess non-violent deaths.

    This is not agreement. Deaths from various causes like heart attack and stroke may vary randomly, because old people who die tend to have multiple problems; if one doesn't kill you, the other will. Also, the cause of death isn't always clear--did you die of the cancer, or the stroke it caused?--so there will be random variance just because doctors put deaths in different baskets. This is not true of "deaths by heart attack" and "deaths by gunshot", which strike different populations and thus are at best, weakly correlated, less so over longer periods of time. One expects random variance between, say, mortar and rpg deaths, but if a survey can't distinguish between "violently killed" and "natural causes", you've got a problem. To me this suggests a couple of things: either people's memories are failing them more than even critics think, or there is a substantial difference in the two samples, which is worrying.

  • The Iraqi researchers broke randomness. At first blush, this seems most likely to have surpressed the body count downwards, since they probably tried to go to less dangerous areas. On the other hand, if they *really* broke randomness--by, say, asking the locals who wanted to talk to them--it could have gone the other way, since presumably the households with high body counts would swarm to the researchers with their stories. This would also explain the sudden decline in excess mortality from non-violent causes, since people are less likely to blame Grandpa's heart attack on the Americans.
  • Interesting tidbit: one commenter alleges that if they hadn't thrown out the Fallujah sample from Lancet I, the first study would have shown something like 40% of the population of Fallujah dying in the attack. The problem with throwing out obviously erroneous outliers like this is, of course, that they may be pointing to a problem with your sample. They also may not, mind you; this is not some dispositive refutation, just an interesting item.
  • At least some people report that Roberts et. al. will not release raw data, even scrubbed of identifying information. The same people defending the study (rightly) found this sort of behaviour very suspicious when John Lott did it; that's certainly why I no longer trust anything the man says. But if there are people who have succeeded in getting the raw data, please tell me. Also let me know if it truly is standard practice in public health never to share your deidentified data with anyone except collaborators. If that is true, can you explain why? That seems totally unscientific to me; no one would ever be able to replicate anyone's results or check sampling problems.
  • Let me preface this by saying that I am completely, 100%, not in any way implying that Roberts et al. made the data up. But a commenter in John Quiggins' post asks "how could anyone tell if they just made it up?" My sense is that it's actually pretty easy to tell things like that; researchers get sloppy and tend to repeat patterns that you can pick up. That's how Steven Levitt caught teachers cheating on standardized assessment tests of their charges in Chicago. This is just a hypothetical, mind you, not any sort of accusation at all. I have absolutely no reason to believe they are making anything up, and if any one cites me in support of the proposition that they are, I will sue them. Or at least be very, very angry.

    A more interesting question is: could you tell if the Iraqi surveyors made stuff up? I find it much easier to imagine a survey team deciding that discretion was the better part of valor and faking entries rather than risk their lives. Hey, back when I was into political action, I saw people make up signatures on petitions so they could go get high. Again, I have no reason to think that this happened; I'm just curious whether it would be easier or harder to pick out researchers fudging the data.

  • A knowlegable commenter with extensive survey research experience says this:

    . . . I’m also very worried about the fieldwork itself. I believe the reported refusal rate was 0.8% (I can’t find this in the report itself, so feel free to correct me). This is simply not believable. I have never conducted a survey with anything like a refusal rate that low, and before anyone talks about cultural differences, there are many non-cultural reasons for people to refuse to participate. If my survey was in a war-zone, I would expect refusal rates to be higher than normal.

    . . . I understand that each of the clusters (40 interviews) were conducted in one day (again, I don’t have a direct cite). The paper itself is ambiguous as to how many interviewers were deployed at each cluster, and whether they worked in pairs or solo. Assuming it means 4 interviewers each interviewing 10 people, that’s a very impressive work-rate in the context of the difficulties. See Appendix B of the report where they report being held up for hours at roadblocks and the extreme suspicion of initial respondents, requiring lengthy explanations. Apparently these “lengthy explanations” migrated quickly from household to household.

    If the interviewers worked in pairs (as one would have assumed – one male and one female) then the interview rates are unbelievable.)

    . . . The report states that in 92% of when it was requested, a death certificate was produced. I also find this difficult to believe, although cultural practices might come into play.

    I don’t attribute improper motives to the John Hopkins team. As I understand they remained in Jordan and relied on their intrepid field team. I find it difficult to believe that field team did what they say they did.

    Now that it is pointed out, I also find the response/not home rates rather unbelievable. Admittedly, phone surveys are easier to opt out of, Iraqis probably have more stay-at-home Moms, and Iraqis almost certainly don't go out much these days. But if Iraq really is that violent, people will be fleeing, which means they should have knocked on more doors where no one answered. As one blogger asked:


    Don’t heads of households and their spouses in urban Iraq have jobs? Don't they go out to meet friends? Do they never visit relatives in other neighbourhoods or towns? Do they not engage in any activities outside their homes? Are they never in the middle of a family meal and don’t want to be interrupted by unknown visitors asking intrusive personal questions? Never out shopping for groceries or passing the time of day at a local coffee shop or dropping off the family car at the mechanic’s? Do they just stay around the house all day every day? In short, do those folks living in urban Iraq have any semblance of normal lives?

    I realise that armed conflict would impel most people to huddle in their homes behind locked doors (in which case they would be unlikely to open the door to strangers), but that possibility doesn’t enter into it because the locations selected for interview were altered if they appeared unsafe.

    And 92% produced death certificates? Maybe households in poorer countries keep that stuff in a special place forever, where they can instantly put their hands on it . . . but I certainly wouldn't want to bet that my family of pack rats could immediately produce a death certificate when asked for any of my dead relatives, much less all of them. And the average household had six people, yet the interviews supposedly took only 15 minutes. Have you ever sat around with a family of Arabs--or for that matter any sort of family--talking about recent family history?

    "Hussein moved to Basra in November 2003."

    "No, Ali, it was 2002. It was a month after Aunt Maryam died. He used her suitcases."

    "Aunt Maryam died in 2001. Remember? She choked on a date at Noor's wedding, and at the funeral Grandma forgot where she was and tried to do the wedding dance." (To interviewer) "My grandmother hasn't been the same since Uncle Samir was taken by Saddam's police."

    "So you're saying Hussein moved to Basra in 2001? That's ridiculous! He was still in school. I have his diploma right here. June 18th, 2002." (To interviewer, fondly) "His math teacher said he was the best student she ever had."

    "I'm not saying he moved in 2001, I'm saying he didn't go a month after Aunt Maryam died."

    "You don't have to shout, Karim." (To interviewer) "We're all so tense with the new curfew rule. Karim hasn't been to the cafe in four days. Anyway, it couldn't have been 2003, because he drove down with Cousin Mohammed, and Cousin Mohammed fled to Amman before the bombing started. His mother should die of shame, he's such a coward."

    "Don't start that again. He's been very nice about sending money."

    "He's been very nice about sending money because he's in Amman sitting in cafes instead of home taking care of his mother where he ought to be."

    Fifteen minutes my Aunt Fanny.

  • Given the description of the data collection methods--places where the researchers can walk to at least 40 houses in one day--the clusters selected are almost certainly fairly urban. Since that's where the violence is concentrated, that would bias the sample; something like 1/4-1/3 of Iraqis live in rural areas
  • A commenter at Marginal Revolution points out that if the violence really was as bad as the Lancet studies say, there should be massive refugee flows outward, but the UNHCR numbers do not show this.
  • At Blogcritics, David Nalle makes what seems to me like a good point, though he's rather more pugnacious than I think is wonted:

    With a population of 27 million people in Iraq and 50 data clusters, they allocate one cluster per 540,000 people. That's fine. The problem is that they then distribute the data clusters based on the governorates, which are political divisions and do not have evenly distributed populations. This means that when a governorate's population is not evenly divisible by 540,000 they round up or down to determine how many clusters to locate there.

    This creates an instant problem because the areas where they round down the number of clusters will be underrepresented and the areas where they round up will be overrepresented in the final numbers. As it works out, the most overrepresented governorates are two of the most violent and most populous, Diyala (+28%) and Anbar (+36%) and the most underrepresented are some of the most peaceful, Wassit (-45%), Qadissiya (-41%) and Tameem (-37%). The pattern is similar but less dramatic with the other governorates, plus two entire governorates in the more peaceful regions failed to return results at all. Only about 3 of the total of 20 regions are at all accurately represented. This leads to a cumulative effect of more violent areas likely being overrepresented by a rough figure of at least 20%.

    The math is off, but the point is valid: very violent areas were probably oversampled. Since their deaths (calculated as deaths per thousand people per year) were 5 or more times higher than those in the low-violence areas, and probably around twice that in the medium-violence areas, that would tend to skew things.

  • Posted by Jane Galt at October 17, 2006 10:14 AM | TrackBack | Technorati inbound links"); ?>
Comments
But a commenter in John Quiggins' post asks "how could anyone tell if they just made it up?" My sense is that it's actually pretty easy to tell things like that; researchers get sloppy and tend to repeat patterns that you can pick up.

Wouldn't you need access to the data to check this?

Posted by: err on October 17, 2006 8:00 PM

How can you take blog critic David Nalle seriously?

He's an ex history professor with no claimed statistical knowledge. How can we take any of his simple math seriously? He purports to reduce the Hopkins estimate by some ballpark 30% and then 20%, presumably because simple proportions is his level of math competence.

He claims that "low violence" Governates were underrepresented, but he doesn't attempt to quantify this in any meaningful way. They may have located clusters by Governate, but they weighted the cluster results by the population in that governate. There was likely very little underpresentation of low violence governates using that method, and certainly not on the level of 30%.

Further Nalle forgets to mention that the study deleted the clusters from two governates and assumed no excess deaths in those two This would tend to understate the increase in death rates. So he's basically cherry picking supposed problems.

As to death certificates in some extended family. The unit of analysis was people living in the household, not some long lost cousin. You would know who in your house had died in the last 4 years. More speculation by people who haven't even read the study?

As to the high response rate. Both the Iraq UN living conditions survey and other third world surveys show response rates of 98-99% as noted in the comments at Crooked Timber.


Posted by: chew2 on October 18, 2006 12:25 AM

The UNLC did return visits, which considerably ups your response rate. The Lancet study did not.

I was talking about people in the household. If I called my mother right now, living in my grandparents house, could she lay her hands on my grandfather's death certificate within minutes? Maybe it's in the attic. Or the desk. Or did Aunt Annie take it? Did we ever get it back from the funeral home? I don't remember where I put it, I was so upset. And Grandma's getting forgetful . . . as I say, for legal or cultural or relative poverty reasons, Iraqis might have all their death certificates right where they can lay their hands on them at all times (remember, the interviewers didn't come back, and appear to have averaged somewhere between 15-30 minutes per home, not allowing for lunch breaks, bathroom trips, or three hours spent arguing with the militiamen at the roadblock.) But in America, I would find such a high percentage of death certificates--92% of those who were asked for thme--pretty unlikely.

Posted by: Jane Galt on October 18, 2006 12:33 AM

Megan,

I don't believe the UNLC did return visits to those who refused. They did return visits to reconfirm info and to redo the infant mortality estimates because they appeared too low.

As to households, I take your point. But the ease of retrieving an important document may be a function of how much stuff you have. I will posit that Iraqi households have much less stuff than we do. If you only have one box in the house where you keep things then it has to be there.

You suggest: "I find it much easier to imagine a survey team deciding that discretion was the better part of valor and faking entries rather than risk their lives."

Can you imagine some dedicated doctors wanting to find out, in their war torn land, how many of their countrymen are dying, and working diligently and risking their lives to find out?

Maybe not. But then we come from a country that conciously says it does not do body counts whether of combatants or non-combants. A country that actively prevented, at various times, Iraqi medical authorities from releasing mortality figures to the press and other investigators.

And if you imagine that these Iraqi doctors are cutting corners, why not all the poorly paid Iraqi bureaucrats, police, and other functionaries who are supposedly the source of the offically reported deaths and those reported in the media.

Posted by: chew2 on October 18, 2006 1:41 AM

If I were an Iraqi hired to do this survey, I would say to myself, "Forget randomness, I'm not going to get myself killed wandering into a strange neighborhood. I'll get on the phone to the head men in various neighborhoods until I find some who want me to come visit. And they'll put the word out so the people who want to tell their stories will come see me, bringing their death certificates, which will speed things up so I can be home before dark, when the killing starts. Of course, I won't tell the American professors how I'm violating their rules of randomness. But they aren't going to check up on me because for an American to leave the Green Zone is to die."

I have no proof that that's what they did. That's just what I would have done if I were there.

I suspect the net effect would be to overstate the number of deaths, but it could go the other way.

Posted by: Steve Sailer on October 18, 2006 2:44 AM

Steve,

Your little slanderous fantasy is nothing more than that. It is also an idiotic plan, much more likely to get your lying ass killed (you just alerted the local death squad that you were in town doing work for Westerners? good job!) than if you had just followed the "get in, get out, don't go back" plan of the actual methods.

At least in this version of your oft repeated post, you reveal that you are indeed a lying scumbag (since this fantasy of betrayal is what you yourself would do).

Are you aware that the leader for the survey team was one of the authors of the paper? Presumably he was aware both that the methods mattered and that they were being followed.

But hey, slander away!

Posted by: Charles S on October 18, 2006 6:49 AM

Charles: About "the 'get in, get out, don't go back' plan of the actual methods", you forgot the part where they were supposed to interview 40 households. That's going to give the "local death squad" time to learn about your presence, drink some coffee, collect their friends, kill, roast, and eat a sheep, and then go down and kill you, if they're interested in doing that.

Which means that the second-safest plan is to pre-arrange your visit with someone who might be able to talk the insurgents into leaving you alone - or welcoming you, with 40 "heads of households" lined up and ready to slander the Americans. The safest plan, of course, is to not go at all and say you did.

Posted by: markm on October 18, 2006 9:16 AM

Here's an examination of the Hopkins study by a critical stats reviewer, who deals with some of the criticisms raised above, in particular the claim that urban areas were oversampled.

The Science of Counting the Dead
October 17, 2006
Rebecca Goldin Ph.D.
A recent study published in the Lancet claims that over 650,000 “excess” deaths have occurred in Iraq since the invasion in March, 2003. STATS look at how scientists figure these numbers out, how their methods compare to other counts, and whether criticism of the numbers is justified.

http://www.stats.org/stories/the_science_ct_dead_oct17_06.htm

This appears to be a great site for stats analysis of issues in the news.

Posted by: chew2 on October 18, 2006 10:44 AM

Jane ,

It seems very high response rates are quite common in Iraq, some even 100%.

http://www.ukpollingreport.co.uk/blog/archives/884

So, what could have gone wrong? The more excitable fringes of the US blogosphere have come out with some interesting stuff. Let’s look at criticisms that don’t hold water first. Firstly, the turnout is unbelievably high. The report suggests that over 98% of people contacted agreed to be interviewed. For anyone involved in market research in this country the figure just sounds stupid. Phone polls here tend to get a response rate of something like 1 in 6. However, the truth is that - incredibly - response rates this high are the norm in Iraq. Earlier this year Johnny Heald of ORB gave a paper at the ESOMAR conference about his company’s experience of polling in Iraq - they’ve done over 150 polls since the invasion, and get response rates in the region of 95%. In November 2003 they did a poll that got a response rate of 100%. That isn’t rounding up. They contacted 1067 people, and 1067 agreed to be interviewed.


More here:
http://crookedtimber.org/2006/10/18/floating-the-fraud-balloon/#more-5250

Posted by: GT on October 18, 2006 10:51 AM

Megan,

Regarding the argument that a greater proportion of the excess deaths in Hopkins I were non-violent than in Hopkins II.

The sample size in Hopkins I was too small to permit any statistically significant claim about the breakdown of excess deaths between non-violent and violent. While there was an increase in non-violent deaths in Hopkins I, the study could not show any statistically significant increase in the non-violent death rate in the post war period. Similarly, in Hopkins II while there was a smaller increase in non-violent deaths, the study could not show an increase in the non-violent death rate. There is no statistical contradiction between Hopkins I and Hopkins II regarding non-violent deaths.


Claims that, in Hopkins I, 40,000 of the excess deaths were non-violent and 60,000 were violent, were probably simple extrapolations from the death data, but were not statistically significant statements. All you could say in Hopkins I was that there was a statistically significant increase in the *overall* death rate, i.e. that this couldn't happen by chance. In contrast you couldn't tell whether the increase in non-violent deaths or even violent deaths wasn't just by chance because the sample sizes were too small.

So trying to compare the number of non-violent deaths in Hopkins I to those in Hopkins II doesn't tell you much, since the estimate of non-violent deaths in Hopkins I is too imprecise due to the small sample size.

What you can do is compare the estimated overall death rate in Hopkins I to the overall death rate in Hopkins II for the same time period, which is what the researchers did and found that the estimates were in the same ball park.

Posted by: chew2 on October 18, 2006 11:37 AM

Jane, you say David Nalle's math is off but that his point is still valid. But does it matter? Are you saying that, um, de-oversampling these areas would significantly reduce the lower bound of the confidence interval? Are we talking about, say, 200,000 to 600,000 excess deaths? If so, Nalle's point seems less valid than plain old beside the point.

Posted by: Dave on October 18, 2006 2:01 PM

"As one blogger asked:

Don’t heads of households and their spouses in urban Iraq have jobs? "

Apparently not.

"Local officials and NGOs put the unemployment rate countrywide to be more than 60 percent. In particularly troubled areas such as Anbar Governorate, this rate could be much higher."

Posted by: chew2 on October 18, 2006 2:44 PM

Chew2... nice self refutation!

40% employment... that's still lots of people not home. Lots and Lots of people.

But the CI is all we need. Since it's so huge, especially as compared to the lower bound, the survey is useless, no matter the merits or demerits of its methodology. Get me a prceise survey and we'll talk. But you can't, so you gave us this POS so you had a splashy number to talk about.

With that, I'll say that based on internet sampling of commenters, 7 out of 10 democrats have IQs of 20, with a +- of 60%.

Posted by: Hey on October 19, 2006 12:40 AM

I have a more basic question about Lancet's conclusions.

From Page 3:
Researchers interviewed 12,801 people in 1,849 households and recorded 629 deaths in the past 54 months. Of these, 82 (13%) occurred prior to the invasion, and 547 (87%) occurred after the invasion. Of the post-invasion deaths, the raw data says that 247 (46%) were non-violent (e.g., heart disease, cancer) and 300 (55%) were violent (e.g., gunfire, explosive device, car bomb).

Now, if the 629 deaths, extrapolated across all of Iraq, comes to an estimated 654,965, then the number of post invasion, violent deaths should be 654,965 x 87% x 55%, which is 313,400 (as opposed to the 601,027 that is mentioned in the oft-quoted report summary). Either I'm missing something big here, or something is drastically wrong.

Thoughts?

Posted by: Brian Greenberg on October 19, 2006 3:40 AM

Yep, you're missing something. The 655,000 are "excess deaths", meaning that there are apparently 655,000 people more dead than if mortality had stayed at pre-war levels. The 547 extrapolate to something like 1,100,000 (26,000,000/12,800 * 547) death, while under normal circumstances only 400,000 or so were expected.

Posted by: Marius on October 19, 2006 5:58 PM

Hey
"40% employment... that's still lots of people not home. Lots and Lots of people."

Arabic culture, the vast majority of wives don't work and stay home most of the day.

Brian

I answered your question on another thread. The 247 non-violent deaths in the sample were not excess deaths. They were about what you would have expected, based on the pre-invasion death rate.

Posted by: chew2 on October 20, 2006 1:30 AM
Post a comment