↓ Skip to main content

A framework for evaluating epidemic forecasts

Overview of attention for article published in BMC Infectious Diseases, May 2017
Altmetric Badge

Citations

dimensions_citation
46 Dimensions

Readers on

mendeley
97 Mendeley
citeulike
1 CiteULike
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
A framework for evaluating epidemic forecasts
Published in
BMC Infectious Diseases, May 2017
DOI 10.1186/s12879-017-2365-1
Pubmed ID
Authors

Farzaneh Sadat Tabataba, Prithwish Chakraborty, Naren Ramakrishnan, Srinivasan Venkatramanan, Jiangzhuo Chen, Bryan Lewis, Madhav Marathe

Abstract

Over the past few decades, numerous forecasting methods have been proposed in the field of epidemic forecasting. Such methods can be classified into different categories such as deterministic vs. probabilistic, comparative methods vs. generative methods, and so on. In some of the more popular comparative methods, researchers compare observed epidemiological data from the early stages of an outbreak with the output of proposed models to forecast the future trend and prevalence of the pandemic. A significant problem in this area is the lack of standard well-defined evaluation measures to select the best algorithm among different ones, as well as for selecting the best possible configuration for a particular algorithm. In this paper we present an evaluation framework which allows for combining different features, error measures, and ranking schema to evaluate forecasts. We describe the various epidemic features (Epi-features) included to characterize the output of forecasting methods and provide suitable error measures that could be used to evaluate the accuracy of the methods with respect to these Epi-features. We focus on long-term predictions rather than short-term forecasting and demonstrate the utility of the framework by evaluating six forecasting methods for predicting influenza in the United States. Our results demonstrate that different error measures lead to different rankings even for a single Epi-feature. Further, our experimental analyses show that no single method dominates the rest in predicting all Epi-features when evaluated across error measures. As an alternative, we provide various Consensus Ranking schema that summarize individual rankings, thus accounting for different error measures. Since each Epi-feature presents a different aspect of the epidemic, multiple methods need to be combined to provide a comprehensive forecast. Thus we call for a more nuanced approach while evaluating epidemic forecasts and we believe that a comprehensive evaluation framework, as presented in this paper, will add value to the computational epidemiology community.

Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 97 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 97 100%

Demographic breakdown

Readers by professional status Count As %
Student > Ph. D. Student 18 19%
Researcher 16 16%
Student > Master 14 14%
Student > Bachelor 7 7%
Other 5 5%
Other 12 12%
Unknown 25 26%
Readers by discipline Count As %
Computer Science 17 18%
Agricultural and Biological Sciences 10 10%
Medicine and Dentistry 8 8%
Mathematics 6 6%
Engineering 5 5%
Other 20 21%
Unknown 31 32%