Skip Navigation

This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Blakely, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blakely, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

International Journal of Epidemiology 2002;31:166-167
© International Epidemiological Association 2002


Theory and Methods

Commentary: Estimating direct and indirect effects—fallible in theory, but in the real world?

Tony Blakely

Department of Public Health, Wellington School of Medicine, University of Otago, PO Box 7343, Wellington, New Zealand. E-mail: tblakely{at}wnmeds.ac.nz

A classic finding of the Whitehall Study was that only a third of the association of occupational grade (a socioeconomic ranking of occupations within the British civil service) with coronary heart disease mortality was ‘explained’ after adjusting for known cardiovascular risk factors.1 In current terminology, a third of the association of socioeconomic position with coronary heart disease mortality was estimated as ‘indirect’ via known risk factors, and two-thirds (that unexplained) was estimated as ‘direct’. This direct effect, it is assumed (e.g. ref. 2), represents the mediating or indirect effects of other factors (e.g. unmeasured and/or unknown dietary and lifestyle behaviours, psychosocial factors). For example, subsequent research on the Whitehall Study has suggested that workplace characteristics of control and demand explained much of the remaining two-thirds3—although this analysis has been criticized for conflating measures of socioeconomic position.4,5 The point here, though, is that this original finding from the Whitehall Study is just one example of a widespread (almost universal) practice within epidemiology that aims to describe and quantify causal pathways by controlling for possible mediating variables using standard epidemiological methods.

Enter Cole and Hernán6 who, in this edition of the International Journal of Epidemiology, build on prior work of Robins and Greenland (1992)7 and Poole and Kaufman (2000)8 to demonstrate that such ‘standard’ epidemiological practice may be misleading. In brief, using counterfactual models and causal graphs they demonstrate that if an unknown variable (e.g. genotype) confounds the association of the mediating variable with the outcome, then stratifying the exposure-disease association by the mediating variable may not accurately partition the total effect into its direct and indirect components. Previous methodological work demonstrating this fallibility has used completely hypothetical examples. The Cole and Hernán example starts with actual data on randomized aspirin (exposure) and subsequent myocardial infarction (MI; outcome). However, the distribution of the potential mediating variable (platelet aggregation) and the genotype confounder remain constructed. Their data distribution is consistent with a causal and protective effect of aspirin on MI (relative risk of 0.6) being entirely due to platelet aggregation, yet when they stratify the aspirin-MI association by platelet aggregation the relative risk is unchanged. According to standard epidemiological expectation the stratified relative risk should have been 1.0.

What happened? To help understand Cole and Hernán's example, I have rearranged the data to determine RRUM|E—the relative risk of the confounder -> high platelet aggregation association, stratified by the randomized aspirin exposure (Table 1Go). It is striking that 95% of those people ‘exposed’ to the confounder U within the unexposed (E = 0; no aspirin on a randomized intervention) had high platelet aggregation compared to 50% among the exposed (E = 1). The difference in the percentages among those not ‘exposed’ to the confounder U is also striking: 50% compared to 5%. Thus, the confounder U is strongly associated with the intermediary variable M.


View this table:
[in this window]
[in a new window]
 
Table 1 RRUM|E (confounder -> high platelet aggregation, given randomization by aspirin exposure) based on Table 3 of Cole and Hernán6
 
There is also a strong association of the confounder U with D (Table 2Go). The crude RRUD is 0.23, and stratified by the exposure is either 0.20 or 0.28. But what is more, the presence of the confounder (U = 1) is now protective against disease, despite also being a cause of high platelet aggregation (M = 1) which is positively associated with disease.


View this table:
[in this window]
[in a new window]
 
Table 2 RRUD|E (confounder -> disease, given randomization by aspirin exposure) based on Table 3 of Cole and Hernán6
 
To summarize, the Cole and Hernán example is equivalent to saying that:

  • aspirin use (E = 1) is strongly protective against MI (D = 1)
  • in truth, the association with aspirin use is purely mediated by low platelet aggregation (M = 1)
  • but as observed, RRED|M was no different from the crude RRED because:
    • actually, some unmeasured confounder was both:
      • strongly associated with high platelet aggregation (which in turn is positively associated with an increased risk of MI)
      • yet, in crude analyses this unmeasured confounding factor was strongly and negatively associated a decreased risk of MI.


This type of situation described by Cole and Hernán is extreme—purposely so for illustrative purposes—but nevertheless it is possible. For example, smoking is associated with MI, but smoking may also result in a decreased body mass index that (of itself) is associated with a decreased risk of MI. The key issues are in what direction and with what magnitude does this bias tend to occur in the real world. Regarding direction, we would usually expect the crude association of an unmeasured confounder with an outcome to at least be in the same direction as that mediated by some pathway variable. For example, a ‘bad’ lifestyle (U) is associated with the constellation of high blood pressure, smoking, and being overweight. Consequently, controlling for just blood pressure (M) in the association between, say, socioeconomic status (E) and coronary heart disease (D) will probably overestimate the indirect effect via blood pressure alone as blood pressure is positively correlated with these other risk factors. (This overestimation of indirect effects is opposite to Cole and Hernán's example where the indirect effect was [completely] underestimated.)

At least as important as the source of error described by Cole and Hernán is measurement error of potential mediating variables that will usually cause indirect effects to be underestimated. For example, it is highly probably that if multiple (including across the life-course) and more accurate measurements of the known cardiovascular disease risk factors had been available in the Whitehall study,1 more than a third of the association of socioeconomic position with coronary heart disease would have been explained (e.g. ref. 9).

There is an urgent need for further methodological research that determines the likely magnitude and direction of bias in the estimation of direct and indirect effects, both by the use of real data sets and sensitivity analyses that determine if less extreme hypothetical examples than that presented by Cole and Hernán still produce noteworthy bias. (Ideally, this methodological research should also incorporate bias from measurement error.) In the meantime, it would be foolish to ignore the alarming findings of researchers such as Cole and Hernán that seriously question the validity of a common epidemiological practice, but neither should we abandon (yet) our standard methods of estimating direct and indirect effects. Accordingly, the three recommendations issued by Cole and Hernán are sensible: plan to collect information on potential confounders of the mediator-outcome association, include these potential confounders in the analysis, and clearly state the implicit assumptions of the standard method used to measure direct and indirect effects.

Acknowledgments

Tony Blakely is funded by the Health Research Council of New Zealand. Useful comments on drafts of this paper were received from Charlie Poole, Stephen Cole and Miguel Hernán.

References

1 Marmot MG, Shipley M, Rose G. Inequalities in death—specific explanations of a general pattern? Lancet 1984;ii:1003–06.

2 Victora C, Huttly S, Fuchs S, Olinto M. The role of conceptual frameworks in epidemiological analysis: a hierarchical approach. Int J Epidemiol 1997;26:224–27.[Abstract/Free Full Text]

3 Marmot MG, Bosma H, Hemingway H, Brunner E, Stansfeld S. Contribution of job control and other risk factors to social variations in coronary heart disease incidence. Lancet 1997;350:235–39.[CrossRef][Web of Science][Medline]

4 White M. Contribution of job control to social gradient in coronary disease [letter]. Lancet 1997;350:1404–05.[Web of Science][Medline]

5 Lynch J, Kaplan G. Socioeconomic position. In: Berkman L, Kawachi I (eds). Social Epidemiology. New York: Oxford University Press, 2000, pp.13–35.

6 Cole SR, Hernán MA. Fallibility in estimating direct effects. Int J Epidemiol 2002;31:163–65.[Abstract/Free Full Text]

7 Robins J, Greenland S. Identifiability and exchangeability for direct and indirect effects. Epidemiology 1992;3:143–55.[Web of Science][Medline]

8 Poole C, Kaufman J. What does standard adjustment for downstream mediators tell us about social effect pathways? [Conference poster, Society for Epidemiologic Research, June 15–17 2000, Seattle.] Am J Epidemiol 2000;151(11): Abstract 208.[Free Full Text]

9 Lynch J, Kaplan G, Cohen R, Tuomilehto J, Salonen J. Do cardiovascular risk factors explain the relation between socioeconomic status, risk of all-cause mortality, cardiovascular mortality, and acute myocardial infarction? Am J Epidemiol 1996;144:934–42.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Arch Intern MedHome page
A. H. Auchincloss, A. V. D. Roux, M. S. Mujahid, M. Shen, A. G. Bertoni, and M. R. Carnethon
Neighborhood Resources for Physical Activity and Healthy Foods and Incidence of Type 2 Diabetes Mellitus: The Multi-Ethnic Study of Atherosclerosis
Arch Intern Med, October 12, 2009; 169(18): 1698 - 1704.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
V. Skalicka, F. van Lenthe, C. Bambra, S. Krokstad, and J. Mackenbach
Material, psychosocial, behavioural and biomedical factors in the explanation of relative socio-economic inequalities in mortality: evidence from the HUNT study
Int. J. Epidemiol., October 1, 2009; 38(5): 1272 - 1284.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
M. P Fox, D. R Brooks, L. Kuhn, G. Aldrovandi, M. Sinkala, C. Kankasa, R. Horsburgh, and D. M Thea
Role of breastfeeding cessation in mediating the relationship between maternal HIV disease stage and increased child mortality among HIV-exposed uninfected children
Int. J. Epidemiol., April 1, 2009; 38(2): 569 - 576.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
M. Tobias, T. Blakely, D. Matheson, K. Rasanathan, and J. Atkinson
Changing trends in indigenous inequalities in mortality: lessons from New Zealand
Int. J. Epidemiol., March 30, 2009; (2009) dyp156v1.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
S. Wacholder, N. Chatterjee, and N. Caporaso
Intermediacy and Gene-Environment Interaction: The Example of CHRNA5-A3 Region, Smoking, Nicotine Dependence, and Lung Cancer
J Natl Cancer Inst, November 5, 2008; 100(21): 1488 - 1491.
[Full Text] [PDF]


Home page
J. Epidemiol. Community HealthHome page
F I. Gunasekara, K Carter, and T Blakely
Glossary for econometrics and epidemiology
J Epidemiol Community Health, October 1, 2008; 62(10): 858 - 861.
[Abstract] [Full Text] [PDF]


Home page
J. Epidemiol. Community HealthHome page
N L Fleischer and A V D. Roux
Using directed acyclic graphs to guide analyses of neighbourhood health effects: an introduction
J Epidemiol Community Health, September 1, 2008; 62(9): 842 - 846.
[Abstract] [Full Text] [PDF]


Home page
Am J EpidemiolHome page
A. H. Auchincloss, A. V. Diez Roux, D. G. Brown, E. S. O'Meara, and T. E. Raghunathan
Association of Insulin Resistance with Distance to Wealthy Areas: The Multi-Ethnic Study of Atherosclerosis
Am. J. Epidemiol., February 15, 2007; 165(4): 389 - 397.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
T. Blakely and N. Wilson
The contribution of smoking to inequalities in mortality by education varies over time and by sex: two national cohort studies, 1981-84 and 1996-99
Int. J. Epidemiol., October 1, 2005; 34(5): 1054 - 1062.
[Abstract] [Full Text] [PDF]


Home page
Int J EpidemiolHome page
T. Blakely, I. Kawachi, J. Atkinson, and J. Fawcett
Income and mortality: the shape of the association and confounding New Zealand Census-Mortality Study, 1981-1999
Int. J. Epidemiol., August 1, 2004; 33(4): 874 - 883.
[Abstract] [Full Text] [PDF]


Home page
Epidemiol RevHome page
A. V. Diez Roux
The Study of Group-Level Factors in Epidemiology: Rethinking Variables, Study Designs, and Analytical Approaches
Epidemiol. Rev., July 1, 2004; 26(1): 104 - 111.
[Full Text] [PDF]


Home page
J. Epidemiol. Community HealthHome page
N Wilson, G Thomson, M Tobias, and T Blakely
How much downside? Quantifying the relative harm from tobacco taxation
J Epidemiol Community Health, June 1, 2004; 58(6): 451 - 454.
[Abstract] [Full Text] [PDF]


Home page
J. Epidemiol. Community HealthHome page
G Turrell, T Blakely, C Patterson, and B Oldenburg
A multilevel analysis of socioeconomic (small area) differences in household food purchasing behaviour
J Epidemiol Community Health, March 1, 2004; 58(3): 208 - 215.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Blakely, T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Blakely, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?