↓ Skip to main content

Data-driven methods for imputing national-level incidence in global burden of disease studies

Overview of attention for article published in Bulletin of the World Health Organization, February 2015
Altmetric Badge

Mentioned by

policy
2 policy sources
twitter
2 X users

Citations

dimensions_citation
17 Dimensions

Readers on

mendeley
39 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
Data-driven methods for imputing national-level incidence in global burden of disease studies
Published in
Bulletin of the World Health Organization, February 2015
DOI 10.2471/blt.14.139972
Pubmed ID
Authors

Scott A McDonald, Brecht Devleesschauwer, Niko Speybroeck, Niel Hens, Nicolas Praet, Paul R Torgerson, Arie H Havelaar, Felicia Wu, Marlène Tremblay, Ermias W Amene, Dörte Döpfer

Abstract

To develop transparent and reproducible methods for imputing missing data on disease incidence at national-level for the year 2005. We compared several models for imputing missing country-level incidence rates for two foodborne diseases - congenital toxoplasmosis and aflatoxin-related hepatocellular carcinoma. Missing values were assumed to be missing at random. Predictor variables were selected using least absolute shrinkage and selection operator regression. We compared the predictive performance of naive extrapolation approaches and Bayesian random and mixed-effects regression models. Leave-one-out cross-validation was used to evaluate model accuracy. The predictive accuracy of the Bayesian mixed-effects models was significantly better than that of the naive extrapolation method for one of the two disease models. However, Bayesian mixed-effects models produced wider prediction intervals for both data sets. Several approaches are available for imputing missing data at national level. Strengths of a hierarchical regression approach for this type of task are the ability to derive estimates from other similar countries, transparency, computational efficiency and ease of interpretation. The inclusion of informative covariates may improve model performance, but results should be appraised carefully.

X Demographics

X Demographics

The data shown below were collected from the profiles of 2 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 39 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Unknown 39 100%

Demographic breakdown

Readers by professional status Count As %
Professor 1 3%
Student > Ph. D. Student 1 3%
Professor > Associate Professor 1 3%
Unknown 36 92%
Readers by discipline Count As %
Agricultural and Biological Sciences 1 3%
Economics, Econometrics and Finance 1 3%
Engineering 1 3%
Unknown 36 92%