Skip Navigation


IJE Advance Access originally published online on December 3, 2008
International Journal of Epidemiology 2009 38(1):3-6; doi:10.1093/ije/dyn253
This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
38/1/3    most recent
dyn253v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rutter, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rutter, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published by Oxford University Press on behalf of the International Epidemiological Association © The Author 2008; all rights reserved.

Editorial

Epidemiological methods to tackle causal questions

Michael Rutter

MRC SGDP Centre, PO 80, Institute of Psychiatry, King's College London, De Crespigny Park, Denmark Hill, London, SE5 8AF, UK. E-mail: Camilla.azis{at}iop.kcl.ac.uk

Keywords Confounders, Growth Curves, Propensity Scores, Direct Acyclic Graphs, Natural Experiments, Experimental Medicine

Accepted 4 September 2008

It is a dull day when there is not at least one media report of a claim that research has identified some new environmental cause of disease. Such claimed causes concern a wide range of supposed hazards including medical interventions such as the measles-mumps-rubella (MMR) vaccine, the thimerasol (mercury) preservative in other vaccines, dietary factors of many different kinds (coffee, alcohol, food additives, etc.), prenatal stresses or the intra-uterine exposure to the effects of maternal smoking or ingestion of alcohol, use of mobile phones, and living near radiation sources—to mention just a few examples. It is problematic, however, that many of these claims are not confirmed by other research and some are even reversed. Unsurprisingly both professionals and the lay public have developed a scepticism about claims on environmental causes of disease. Because much of the evidence derives from epidemiological studies of one kind or another, epidemiological science itself has come under scrutiny both within and outside the profession. Some have argued that only laboratory experiments and randomized controlled trials (RCT) can provide acceptable evidence on causation. That cannot be a solution, however, because so many of the putative environmental causes are ones that cannot be manipulated in humans—for a mixture of both ethical and practical reasons.

Because the identification of environmental causes is of such importance, and because the solution of the causal inference problem has no obvious single, simple solution, the Academy of Medical Sciences set up a working party (made up of clinical scientists, epidemiologists, statisticians, policy leaders and scientific media specialists) to answer the key questions of ‘how should we decide what to believe and when to take action?’ The report1 emphasized the need to focus on individual components in the causal process and not on some misleading abstract notion of a single basic cause. It also notes the many reasons why an observed association or correlation might not reflect an environmentally mediated causal effect. Such reasons include genetic mediation of the causal risk effect, social selection (allocation bias), and reverse causation to mention but three out of a much longer list of possibilities. It was also noted that the main problem in moving from an observed association to a causal inference did not lie in the effect of known, measured confounders, but rather in the effect of unknown, unmeasured confounders.

Statistical innovations

In recent years, there have been important statistical developments that go beyond the traditional multivariate regression techniques that seek to ‘control’ for confounders. Four approaches may be used as exemplars of the main strategies that are available. First, growth curve trajectories have been used to classify individuals according to their propensity or liability for some outcome of interest. Their value for the testing of postulated causal inferences is that they can be used to determine whether the occurrence of some particular environmental circumstance significantly alters the identified trajectory. Thus, Nagin et al.2 described such an application to test whether an individual's participation in a delinquent peer group increased the likelihood of antisocial behaviour. As they pointed out, one advantage of this technique is that it also identifies whether any causal effect mainly applies to those with a high propensity for antisocial behaviour, or to those with a low propensity. The possible ‘confound’ being dealt with here concerns the overall liability to the disease/disorder outcome.

A rather different approach is provided by propensity scores.3,4 These aim to deal with social selection or allocation biases. The scores reflect the conditional probability of being exposed to the postulated causal agent, given the relevant background factors. One important value of propensity scores is that they identify levels of exposure that are so different in the groups to be compared that the key cells with minimal overlap should be omitted.5 A further use, however, is provided by the statistical techniques of inverse probability of treatment weighting (IPTW) devised by Robins.6 Expressed simply, the technique in effect creates two groups matched on their likelihood of risk exposure. Sampson et al.7 for example, used the approach to good effect to test whether marriage was associated with a within-individual reduction in antisocial behaviour.

The third approach is exemplified by direct acyclic graphs—so-called DAGs.8 Robins9 has argued that the operation of confounders should always be considered in terms of background knowledge of how they were likely to operate. The graphs are designed to make the probable pathways explicit, indicating the assumptions involved and how they may be tested. These then lead on to the appropriate statistical modelling to rule in, or rule out, the various postulated and tested alternatives. As always, much necessarily depends on the reality of the models and the quality of the background information available.

The fourth approach involves the concept of mediation.10,11 The objective here is different in that it is not concerned with identifying an environmental cause as such, but rather with testing whether an observed association between a risk factor and disorder outcome is mediated by a postulated mechanism involving some third variable. The rationale is that if the association between the independent variable ‘A’ and the outcome ‘C’ is mediated by the postulated variable ‘B’, the association between ‘A’ and ‘C’ should drop to zero when ‘B’ is included in the model.12,13

Natural experiments

Advances in design features to test causal inferences are illustrated by the use of natural experiments14—meaning design elements that provide an approximation to experimental conditions—to provide leverage on the causal inference.15 Essentially, the designs ‘pull apart’ variables that ordinarily go together and, by so doing, enable key direct contrasts between alternative hypotheses without the necessity for statistical manipulations that have to rely on some form of modelling to determine what the hypothetical findings would have been if the pattern of variables (and the associations) had been different.16

Natural experiments can be considered in terms of five broad groups that differ in terms of what they seek to achieve. First, there are genetically sensitive designs that focus particularly on the need to differentiate between genetic and environmental mediation of risk effects—a need that arises as a result of gene-environment correlations. Multivariate twin designs do this by including both across-twin and across-trait analyses.17 Discordant twin pairs do so by contrasting disease/disorder outcomes, within pairs. Those who did and did not experience the relevant postnatal causal factors.18 Assisted conception allows the comparison of outcomes following the risk experience (such as prenatal exposure to the effects of maternal smoking) according to whether or not the mother and child are genetically related.19 With egg donation or surrogacy, for example, they are not. Children of Twins (COT) designs do so by capitalizing on the fact that the offspring of monozygotic twins are social cousins but genetic half-siblings.20,21 Adoption/fostering designs separate genetic and environmental mediation by contrasting rearing by biological and by nonbiological parents.22 Migration strategies provide yet another relevant contrast by utilizing the fact that when people migrate they bring their genes with them but may radically alter their lifestyle or be exposed to very different environmental circumstances.23

A second group of genetic and related strategies uses sampling, not to control for genetic mediation, but rather to avoid a biased or confounded association between the putative environmental cause and the disease outcome. Thus, Mendelian Randomization (MR) uses molecular genetic polymorphisms as an instrumental variable that has a strong influence on the independent risk variable being studied but no direct association with the disease outcome other than through the environmental pathway leading from the independent risk variable.24 It was first introduced to deal with the possibility of reverse causation. Early onset of puberty follows the same instrumental variable rationale, albeit one not due to an identified gene. Adoption designs may be informative when adoption involves a move from a severely depriving environment to a good rearing environment after adoption. Within-individual change related to the rapid and radical change in rearing circumstances constitute the natural experiment provided that certain other conditions are met.25

The third group of natural experiments have the main aim of avoiding selection bias by focusing on the very special situation when, unusually, the environmental hazard is experienced by the whole population. The Dutch famine in World War II provides one such example.26 The removal of the MMR vaccine in Japan but not the rest of the world provides a different example.27

The final group consists of regression discontinuity design (RD).28,29 These capitalize on a major selection bias, provided that it is under strict control. The key defining feature is that allocation for some planned intervention is according to a strict predetermined cut-off, rather than by randomization. The intervention effect is shown by a difference between slopes rather than between means (as in an RCT). Provided that certain key criteria are met, the RD design provides an unbiased estimate of a causal effect.

Contrasting the case studies with relatively strong causal claims with those giving rise to probably misleading causal claims showed that the successes shared several common features.1 First, they either concerned a very large effect (as with smoking and lung cancer) or they applied to rare and unusual outcomes with distinctive features (as with the foetal alcohol syndrome). Second, detailed careful attention was paid to alternative noncausal explanations of how to test for their possible role. Third, all made use of multiple research designs, including actual experiments. Fourth, the causal inference was tested in multiple populations that differed in their characteristics. Fifth, animal models and human experimental studies contributed invaluable findings on biological processes, which complemented those of the rigourous high quality epidemiological studies.

I conclude that it is possible to use epidemiological methods to tackle causal questions but only if key criteria are met. The findings also indicate the falsity of the dichotomy between basic and applied research, with the need for an iterative interplay between the two and always for a seeking of ways using multiple research strategies in a hypothesis-testing style that typifies experimental medicine.30

Conflict of interest: None declared.

References

1 Academy of Medical Sciences. Identifying the environmental causes of disease: How should we decide what to believe and when to take action? (2007) London: Academy of Medical Sciences.

2 Nagin DS, Barker T, Lacourse E, Tremblay RE. The Interrelationship of temporally distinct risk markers and the transition from childhood physical aggression to adolescent violent delinquency. In: Applied Data Techniques for Turning Points Research (2008) New York: Routledge. 17–36. Ch. 2.

3 D’Agostino RB. Propensity score methods for bias reduction in the comparison of a treatment to a nonrandomized control group. Stats Med (1998) 17:2265–81.[CrossRef][Web of Science][Medline]

4 Mortimer KM, Neugebauer R, van der Laan M, Tager IB. An application of model-fitting procedures for marginal structural models. Am J Epidemiol (2005) 162:382–88.[Abstract/Free Full Text]

5 Melhuish E, Belsky J, Leyland AH, Barnes J, the Nat Eval of Sure Start Research team. Effects of fully-established Sure start Local Programmes on 3-year-old children and their families living in England: a quasi-experimental observational study. The Lancet (2008) 8:1641–47.

6 Robins JM, Hernán MA, Brumback B. Marginal structural models and causal inference in epidemiology. Int J Epidemiol (2000) 11:550–60.[CrossRef]

7 Sampson RJ, Laub JH, Wimer C. Does marriage reduce crime? A counterfactual approach to within-individual causal effects. Criminology (2006) 44:465–508.[CrossRef][Web of Science]

8 Greenland S, Pearl J, Robins JM. Causal diagrams for epidemiologic research. Int J Epidemiol (1999) 10:37–48.

9 Robins J. A graphical approach to the identification and estimation of causal parameters in mortality studies with sustained exposure periods. J Chronic Dis (1987) 40(Suppl 2):139S–161S.[Medline]

10 Baron RM, Kenny DA. The moderator-mediator variable distinction in social psychological research: Conceptual, strategic, and statistical considerations. J Pers Soc Psy (1986) 51:1173–82.[CrossRef]

11 Kraemer HC, Stice E, Kazdin A, Offord D, Kupfer D. How do risk factors work together? Mediators, moderators, and independent, overlapping, and proxy risk factors. Am J Psychiat (2001) 158:848–56.[Abstract/Free Full Text]

12 Sonuga-Barke EJS, Beckett C, Kreppner J, et al. Is sub-nutrition necessary and/or sufficient for later life problems in children who have experienced severe early institutional deprivation? Dev Med Child Neurol (2008) 50:664–71.[Web of Science][Medline]

13 Thorpe K, Rutter M, Greenwood R. Twins as a natural experiment to study the causes of mild language delay: II: family interaction risk factors. J Child Psych Psychiatry (2003) 44:342–55.[CrossRef]

14 Shadish WR, Cook TD, Campbell DT. Experimental and Quasi-experimental Designs for Generalized Causal Inference (2002) Boston, MA/New York: Houghton Mifflin Company.

15 Rutter M. Proceeding from observed correlation to causal inference: the use of natural experiments. Persp Psych Sci (2007) 2:377–95.[CrossRef]

16 Rutter M, Pickles A, Murray R, Eaves L. Testing hypotheses on specific environmental causal effects on behavior. Psychol Bull (2001) 127:291–324.[CrossRef][Web of Science][Medline]

17 Jaffee SR, Caspi A, Moffitt TE, Polo-Thomas M, Price TS, Taylor A. The limits of child effects: Evidence for genetically mediated child effects on corporal punishment but not on physical maltreatment. Dev Psychol (2004) 40:1047–58.[CrossRef][Web of Science][Medline]

18 Caspi A, Moffitt TE, Morgan J, et al. Maternal expressed emotion predicts children's antisocial behavior problems: using MZ-twin differences to identify environmental effects on behavioral development. Dev Psychol (2007) 40:149–61.

19 Thapar A, Harold G, Rice F, et al. Do intrauterine or genetic influences explain the foetal origins of chronic disease? A novel experimental method for disentangling effects. BMC Med Res Methodol (2007) 7:25.[CrossRef][Medline]

20 D’Onofrio B, Turkheimer E, Eaves L, et al. The role of the children of twins design in elucidating causal relations between parent characteristics and child outcomes. J Child Psychol Psychiatry (2003) 44:1130–44.[CrossRef][Web of Science][Medline]

21 Silberg JL, Meyer J, Eaves LJ. Analysing the contributions of genes and parent–child interaction to childhood behavioural and emotional problems: a model for the children of twins. Psychol Med (2004) 34:347–56.[CrossRef][Web of Science][Medline]

22 Duyme M, Dumaret A-C, Tomkiewicz S. How we can boost IQs of ‘dull children’? A late adoption study. Proc Nat Acad Sci (1999) 96:8790–94.[Abstract/Free Full Text]

23 Marmot MG, Syme SL. Acculturation and coronary heart disease in Japanese-Americans. Am J Epidemiol (1976) 104:225–47.[Abstract/Free Full Text]

24 Davey Smith G, Ebrahim S. Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int J Epidemiol (2003) 32:1–22.[Abstract/Free Full Text]

25 Rutter M, Beckett C, Castle J, et al. Effects of profound early institutional deprivation: an overview of findings from a UK longitudinal study of Romanian adoptees. Eur J Dev Psychol (2007) 4:332–50.[CrossRef]

26 Stein ZA, Susser M, Saenger G, Marolla F. Famine and human development: The Dutch hunger winter of 1944–1945 (1975) New York: Oxford University Press.

27 Honda H, Shimizu Y, Rutter M. No effect of MMR withdrawal on the incidence of autism: a total population study. J Child Psychol Psychiatry (2005) 46:572–79.[CrossRef][Web of Science][Medline]

28 Thistlewaite DL, Campbell DT. Regression-discontinuity analysis: An alternative to the ex-post facto experiment. J Ed Psychol (1960) 51:309–17.[CrossRef]

29 Cahan S, Cohen N. Age versus schooling effects on intelligence development. Child Dev (1989) 60:1239–49.[CrossRef][Web of Science][Medline]

30 Rutter M, Plomin R. Pathways from science findings to health benefits. Psychol Med (2008) 38:1–14.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Int J EpidemiolHome page
M.-J. Brion
Commentary: Assessing the impact of breastfeeding on child health: where conventional methods alone fall short for reliably establishing causal inference
Int. J. Epidemiol., February 1, 2010; 39(1): 306 - 307.
[Full Text] [PDF]


Home page
Int J EpidemiolHome page
B. Maughan
Unravelling prenatal influences: the case of smoking in pregnancy
Int. J. Epidemiol., June 1, 2009; 38(3): 619 - 621.
[Full Text] [PDF]


Home page
Int J EpidemiolHome page
G. D. Smith
Alcohol and health: bad for you, good for you, or somewhere in between?
Int. J. Epidemiol., February 1, 2009; 38(1): 1 - 2.
[Full Text] [PDF]


This Article
Right arrow Extract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
38/1/3    most recent
dyn253v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Rutter, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Rutter, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?