Skip Navigation


IJE Advance Access originally published online on December 3, 2007
International Journal of Epidemiology 2008 37(1):106-112; doi:10.1093/ije/dym240
This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
37/1/106    most recent
dym240v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Cliff, A. D
Right arrow Articles by Smallman-Raynor, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cliff, A. D
Right arrow Articles by Smallman-Raynor, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published by Oxford University Press on behalf of the International Epidemiological Association © The Author 2007; all rights reserved.

An exploratory method for estimating the changing speed of epidemic waves from historical data

Andrew D Cliff1, Peter Haggett2,* and Matthew Smallman-Raynor3

1Department of Geography, University of Cambridge, Cambridge CB2 3EN, UK.
2Institute for Advanced Studies, University of Bristol, BS8 1UR, Bristol, UK.
3School of Geography, University of Nottingham, Nottingham NG7 2RD, UK.

*Corresponding author. Institute for Advanced Studies, University of Bristol, Royal Fort House, Bristol BS8 1UR, UK. E-mail: p.haggett{at}bristol.ac.uk


    Abstract
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
Background: Historical data are necessary to establish long-term trends in disease incidence but pose analytical problems since their accuracy and reliability may be poorly specified.

Methods: A robust measure of the spatial velocity, R0A, of epidemic waves from space-time series is proposed using binary data. The method was applied to the historical records of influenza morbidity for the island of Iceland over a 61-year period of influenza seasons from 1915–16 to 1975–76.

Results: The onset of influenza waves tended to speed up over the period studied and the three pandemic waves associated with viral shifts in influenza A [Spanish influenza H1N1 (1918–19), Asian influenza H2N2 (1957–58) and Hong Kong influenza H3N2 (1968–69)] spread more rapidly around the island and struck earlier in the influenza season than did inter-pandemic waves, even when the latter were equally intensive as measured by total number of cases and case incidence.

Discussion: The potential for using R0A in a real-time context is explored using French influenza data.

Conclusions: The new measure of wave velocity appears to be applicable to those historical time series where breakdown into regional or local areas is available. The study is being extended to (i) other countries where similar influenza time series are available and (ii) to other diseases within Iceland.


Keywords Time series, exploratory data analysis, epidemic velocity, influenza, pandemics, Iceland

Accepted 24 October 2007


    Background
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
In epidemiology, long-term morbidity series are of interest where there is a need to establish historical trends in disease incidence. But the longer the series, the less likely is the sensitivity and specificity of disease reporting to have remained constant over time. Over the long term, morbidity recording is likely to be biased with the degree of bias often unknown and non-constant. Even over the short term, it is not clear whether during a period of high-disease incidence (e.g. the peak of an influenza pandemic), cases are (i) over reported because of assumptions of the prevailing cause or (ii) under reported because reporting physicians are overwhelmed by more pressing clinical demands on time. Historical evidence points in both directions.

Epidemiology is not alone in trying to establish trends from data of uncertain accuracy; economists, in particular, have been plagued by trying to establish economic trends from historically variable sources.1 Among the many methods proposed to overcome this problem, the work of John Tukey has been particularly important in establishing robust methods [what he termed exploratory data analysis (EDA)] to work alongside classical statistical models.2

In this article we propose, in the Tukey tradition, a robust method for exploring the speed of epidemic waves from long-term morbidity data. The spatial velocity of epidemic waves across human populations has attracted considerable theoretical interest3–5 and is a matter of practical public-health concern.6,7 The faster a wave of infection strikes a susceptible population, the less is the time available for implementing protective responses (e.g. isolation and vaccination). Velocity is thus particularly critical in the case of new diseases (e.g. SARS) or old diseases with changing characteristics (e.g. pandemic outbreaks of influenza where genetic shifts in the causative A virus mean that an infection is introduced into a population with little or no resistance to the new strain8,9). Here, we explore the second case.


    Methods
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
Sources of influenza morbidity data
Unusually among western countries, Iceland has since 1895 required direct notification of influenza cases by physicians.10 This concern for data collection stems from the island's early history that was marked by disastrous externally introduced epidemics, including the 1843 influenza outbreak that, although lasting only 2 months, doubled the expected death rate for the year.11 Annual totals and other summary data have been published in that country's annual public health reports (Heilbrigðisskyrslur) since that date with, for influenza, national monthly time series available from 1913 (Figure 1a). Physicians’ commentaries on the local course of the disease are still kept in manuscript form in the National Archives in Reykjavík. Of exceptional interest is the 61-year period spanning the middle of the 20th century from 1915 (Figure 1b). For these years, monthly data are broken down to a local level for some 50 medical districts (Figure 1c). This allows mapping of ways in which the disease spread around the island (Figure 1d), along with the calculation of general estimates of its changing spatial and temporal velocity over the period.


Figure 1
View larger version (50K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1 Reported influenza morbidity in Iceland. (a) Monthly records for total reported influenza cases from January 1913 to December 1976. (b) Monthly record of the number of Icelandic medical districts reporting one or more cases of influenza, January 1915 to December 1976. (c) Map of boundaries of Icelandic medical districts for the year 1945. (d) Cumulative total of reported monthly influenza cases for each Icelandic medical district in the period January 1915 to December 1976. Circles are drawn proportional to the cumulative number of cases but note the special position of the capital city, Reykjavík. All influenza data are based on Heilbrigðisskyrslur (Public Health in Iceland)

 
The intricate spatial infrastructure of medical districts allows the local spread of the disease to be monitored across the island. This spatial network is, however, unstable over time as district boundaries have been modified to reflect changes in population and medical provision. Major changes to the 1907 medical district boundaries, current at the beginning of our study period, occurred in 1932, 1945 and 1955. We have therefore developed a standardized set of districts, based upon the 1945 configuration, which preserves spatial continuity over time (Figure 1c).

Methods for estimating velocity
The velocity of an epidemic wave through a population can be measured in different ways.12,13 Here we propose a robust method that uses (i) the binary (presence/absence) of a reported disease rather than the actual number of reported cases, (ii) the spatial extent of the outbreak in terms of infected districts and (iii) the time taken in months from the start of an outbreak for a disease to reach each district.14

We term the month in which an influenza case is first recorded in a given district the leading edge (LE) of the outbreak in that district, and the last month of record as the following edge (FE). Standard statistical analysis of the distribution of the two edges enables us to define a time-weighted arithmetic mean, Formula and Formula for each edge. In this study, the first month of the influenza season (September) is coded as t = 1. The subsequent months of the season are then coded serially as t = 2, t = 3, ... t = T, where T is the number of monthly periods from the beginning to the end of the season (i.e. T = 12 in our yearly cycle). For the LE, the equation is


Formula 1

(1)
where N is the total number of districts (50 in Iceland), nt is the number of districts whose LE occurred in month t and N = {sum} nt. The time-weighted mean is a useful measure of the velocity of the wave in terms of average time to district infection. A similar equation can be written for FE, and higher order moments can also be specified. To allow comparison between diseases with different wave characteristics, we convert these time-weighted means to a velocity ratio, V, (0 ≤ V ≤ 1),


Formula 2

(2)
where D is the duration of the wave (12 months in the case of our influenza seasons). A similar equation can be written for VFE.

The basic reproduction number (or rate or ratio), R0, is one of the most useful parameters used to characterize mathematically infectious disease processes. R0 is defined as the ratio between an infection rate (β) and a recovery rate ({gamma}):


Formula 3

(3)

In terms of cases, R0 is the average number of secondary infections produced when one infected individual is introduced into a host virgin population. Methods for estimating the basic reproduction number for infectious diseases are given by Dietz,15 and an example of their use appears in the work by Watts.16

If we consider the two-phase shifts from susceptible to infective status (S -> I) and from infective to recovered status (I -> R) when an infectious disease arrives in an area, this raises the prospect of defining a spatial version of R0. In spatial terms, A, the spatial reproduction number, R0A, is the average number of secondary districts produced from one infected district in a virgin area. In a given study area, the integral SA (the proportion of the study area at risk of infection) is given by:


Formula 4

(4)
while the proportion of the area that is infected (the infected areas integral) is


Formula 5

(5)

The recovered areas integral, RA, is


Formula 6

(6)

All three integrals are dimensionless numbers with values in the range (0, 1).

The integral SA has parallels to β in that a small value indicates a very rapid spread while the integral RA has parallels to {gamma} in that a small value indicates a very rapid recovery. Since both terms are inversely related to their power, we suggest that their complement might be substituted in estimating a spatial version of R0, namely


Formula 7

(7)

Such a spatial reproduction number would measure the propensity of an infected district to spawn other infected districts in later time periods. In effect it provides an indicator of the tendency of an infected district to produce secondaries. Values over unity imply a tendency to spread and calibrate the velocity of such spread (the larger the value, the greater the rate of spread).


    Results
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
Over the whole 61 years studied, Iceland's doctors reported 530 276 cases of influenza, half of them from Reykjavík and the immediate surrounding areas (Figure 1d). Although reported cases are likely to be under estimates, the broad shape of outbreaks in both space and time is readily discernible. The distribution throughout the year shows clear peaks in March–April with the low periods in August–September, a pattern typical of many northern latitude countries. For Iceland we have thus used, as our temporal unit, an influenza season running from September 1 to August 31 of the following year.

For Iceland as a whole, the time series of both edges is shown in Figure 2a. The equations used to estimate their values are given in the ‘Methods’ section of the article [equation (1)]. Despite marked year to year variation, the average trend shown by the linear regression line for the LE is distinctly upward implying that waves have speeded up over time, i.e. influenza waves moved around the island faster at the end than at the beginning of the study period. In contrast the position of the FE when influenza incidence ceased in any influenza season has remained essentially unchanged. This implies that the duration of reported influenza incidence grew slowly longer, from around 2.5 months in 1915–16 to nearly 4.0 months in 1975–76 (Figure 2b).


Figure 2
View larger version (33K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2 Velocity of epidemic waves in Iceland for the influenza seasons 1915–16 to 1975–76. (a) Velocities of the LE (solid lines) and FE (pecked lines) are shown as velocity ratios, VLE and VFE, where V is in the range, 0 ≤ V ≤ 1. The larger the value of V, the faster is the edge. (b) The widening time gap between the two edges illustrated in (a) is confirmed in this graph that plots the average duration of each district's epidemic wave in months. In both (a) and (b) the trend lines were determined by OLS linear regression

 
Three of the 61 seasons studied were associated with pandemics of influenza A (the Spanish, Asian and Hong Kong pandemics). Figure 3 uses data on the spatial extent of influenza in each season to plot the position of the pandemic front with reference to the three seasons which immediately preceded or followed it. In the Spanish and Asian pandemics the front stands out clearly but in the third (Figure 3c) the Hong Kong front appears to spread over two seasons, supporting Viboud's17 concept that it was a ‘smoldering’ pandemic.


Figure 3
View larger version (67K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3 Pandemic seasons in the Icelandic morbidity records. (a) Profiles showing the cumulative number (scaled to unity) of medical districts reporting influenza cases by month as a measure of the spatial extent of influenza in Iceland for the three shift seasons of 1918–19, 1957–58 and 1968–69. In each case the profile for the shift season is compared with the profiles of the three preceding and three following non-shift seasons. (b) Value of the spatial version of the basic reproduction ratio, R0A, for Iceland for the influenza seasons 1915–16 to 1975–76. Peaks are associated with the three viral shift years for influenza A (1918–19, 1957–58 and 1968–69) and for three other non-shift seasons (1927–28, 1943–44 and 1972–73)

 
Although pandemic years had large numbers of influenza cases, they were not the largest recorded over the period. The 1937–38 season had the largest number of cases (21 977) and the highest monthly rate and, as Figure 1a shows, monthly case numbers in several inter-pandemic years exceeded those with pandemics. In Table 1 we divided the 61 seasons into three groups: (a) pandemic (3), (b) high-intensity inter-pandemic (24), with case rates greater than the lowest pandemic season and (c) low-intensity inter-pandemic (34), with case rates lower than the lowest pandemic season. The average velocity of the LEs for the three groups is equal to (a) 2.83 months, (b) 5.53 months and (c) 6.03 months, respectively. This suggests that pandemic seasons have higher velocities than inter-pandemic years, and that this higher velocity is maintained even compared with inter-pandemic influenza seasons of similar intensity levels as pandemic seasons. Values for R0A were calculated for each influenza season (Figure 3d) and for the three categories of influenza season, (a), (b) and (c), defined above. The same differentials were observed (Table 1).


View this table:
[in this window]
[in a new window]

 
Table 1 Characteristics of Iceland's 61 influenza waves, 1915–16 to 1975–76

 

    Discussion
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
The method described may be used with surveillance data in a real-time situation and, since it may be used to compare epidemic speeds from one country to another, may have an early warning potential. The parameters of the model given in equations (1)–(7)GoGoGoGoGoGo may be calculated for each month from the start of an outbreak and progressively re-calculated in each successive month (as fresh data becomes available) through to the end of the outbreak. An historical example of this method is shown in Figure 4. Here monthly influenza data from France over the 103 years from 1887–88 to 1998–99 were studied, the 5 shift seasons identified and these compared with the 2 years that preceded and followed them. Figure 4 shows the spatial reproductive numbers (R0A) measured in real time from the start of each influenza season. The contrasting behaviour of R0A in shift and non-shift years suggest there may be a capability for the method to pick up unusual wave characteristics early in the course of an outbreak and alert other yet unaffected areas. Influenza and monthly data have been used here for ease of illustration; in principle the methods proposed could be used with a wide range of infectious diseases and for shorter (i.e. weekly or daily) reporting periods.


Figure 4
View larger version (17K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 4 Extension of the method to French influenza records for the 103-year period, 1887–88 to 1998–99. Contrasts in the behaviour of R0A for shift seasons in contrast to non-shift seasons are shown when the parameter is measured in real-time (re-computing R0A month by month as new data arrive from the start of the influenza season)

 

    Conclusions
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
The new measure of wave velocity applied to a 61-year run of influenza morbidity records shows that for Iceland (i) the onset of waves tended to speed up over the period 1914–75 and (ii) waves in three viral shift (pandemic) seasons spread faster than did other equally large waves in non-shift (inter-pandemic) seasons. In principle, the proposed measure of wave velocity should be applicable to other epidemiological time series where breakdown into regional or local areas is available. The study is being extended to (i) other countries where similar influenza time series are available (notably France) and (ii) to other diseases within Iceland. If our findings are confirmed elsewhere, this may have wider implications for public health measures. It would suggest that any new influenza pandemic in the 21st century, whether emerging from avian influenza or other sources, is likely to appear earlier than normal in the influenza season and to spread spatially more rapidly than normal. Both conclusions underscore the role of surveillance and early virus watch identification.


    Acknowledgements
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
This work on Icelandic records has been supported by a History of Medicine grant from the Wellcome Trust.

Conflicts of interest: None declared.


KEY MESSAGES

  • Although the accuracy and reliability of past epidemiological records are often poorly specified, their use in providing context for current outbreaks makes the search for robust methods for their analysis of value.
  • A robust measure of the spatial velocity, R0A, of epidemic waves from space-time series is proposed using binary data.
  • Application to historical records of influenza (Iceland, 1915–76) suggests increases in wave speed over the period studied and distinctive contrasts between pandemic and inter-pandemic periods.
  • The potential of the method for real-time applications is discussed.

 


    References
 Top
 Abstract
 Background
 Methods
 Results
 Discussion
 Conclusions
 Acknowledgements
 References
 
1 Morgenstern O. On the Accuracy of Economic Observations. (1963) Princeton: Princeton University Press.

2 Hoaglin DC, Mosteller F, Tukey JW, eds. Understanding Robust and Exploratory Data Analysis. (1983) New York: Wiley.

3 Abramson G, Kenkre VM, Yates TL, Parmenter RR. Travelling waves of infections in the Hantavirus epidemics. Bull Math Biol (2003) 65:519–34.[CrossRef][Web of Science][Medline]

4 Grenfell BT, Bjørnstad ON, Kappey J. Travelling waves and spatial hierarchies in measles epidemics. Nature (2001) 414:716–23.[CrossRef][Medline]

5 Zhao X-Q, Wang W. Fisher waves in an epidemic model. Discret Contin Dyn S (2004) 4:1117–28.[CrossRef]

6 Meltzer MI, Cox NJ, Fukada K. The impact of pandemic influenza in the United States: priorities for intervention. Emerg Infect Dis (1999) 5:659–71.[Web of Science][Medline]

7 Fedson DS. Pandemic influenza and global vaccine supply. Clin Infect Dis (2003) 36:1552–61.[CrossRef][Web of Science][Medline]

8 Hay AJ, Gregory V, Douglas AR, Lin YP. The evolution of human influenza viruses. Philos Trans R Soc Lond B Biol Sci (2001) 356:1861–70.[Abstract/Free Full Text]

9 Patterson KD. Pandemic Influenza, 1700–1900. (1986) Totowa, NJ: Rowman & Littlefield.

10 Cliff AD, Haggett P, Ord JK. Spatial Aspects of Influenza Epidemics. (1986) London: Pion.

11 Schleisner PA. Vital statistics of Iceland. Q J Stat Soc Lond (1851) 14:1–10.[CrossRef]

12 Cliff AD, Haggett P. Methods for the measurement of epidemic velocity from time-series data. Int J Epidemiol (1981) 11:82–89.[CrossRef][Web of Science]

13 Trevelyan B, Smallman-Raynor M, Cliff AD. The spatial dynamics of poliomyelitis in the United States: emergence to vaccine-induced retreat, 1910–1971. Ann Assoc Am Geogr (2005) 95:269–93.[CrossRef][Web of Science][Medline]

14 Cliff AD, Haggett P. A swash-backwash model of the single epidemic wave. J Geogr Syst (2006) 8:227–52.[CrossRef][Medline]

15 Dietz K. The estimation of the basic reproduction number for infectious diseases. Stat Methods Med Res (1993) 2:23–41.[Abstract/Free Full Text]

16 Watts DS, Muhamad R, Medina DC, Dodds PS. Multiscale, resurgent epidemics in a hierarchical metapopulation model. Proc Natl Acad Sci USA (2005) 102:1157–62.[Abstract/Free Full Text]

17 Viboud C, Grais RF, Lafont BA, Miller MA, Simonsen L. Multinational impact of the 1968 Hong Kong influenza pandemic: evidence for a smoldering pandemic. J Infect Dis (2005) 192:233–48.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
37/1/106    most recent
dym240v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Cliff, A. D
Right arrow Articles by Smallman-Raynor, M.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cliff, A. D
Right arrow Articles by Smallman-Raynor, M.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?