↓ Skip to main content

A Likelihood-Based Approach to Identifying Contaminated Food Products Using Sales Data: Performance and Challenges

Overview of attention for article published in PLoS Computational Biology, July 2014
Altmetric Badge

About this Attention Score

  • In the top 5% of all research outputs scored by Altmetric
  • High Attention Score compared to outputs of the same age (95th percentile)
  • High Attention Score compared to outputs of the same age and source (91st percentile)

Mentioned by

news
2 news outlets
blogs
2 blogs
twitter
9 X users
peer_reviews
1 peer review site

Citations

dimensions_citation
21 Dimensions

Readers on

mendeley
51 Mendeley
You are seeing a free-to-access but limited selection of the activity Altmetric has collected about this research output. Click here to find out more.
Title
A Likelihood-Based Approach to Identifying Contaminated Food Products Using Sales Data: Performance and Challenges
Published in
PLoS Computational Biology, July 2014
DOI 10.1371/journal.pcbi.1003692
Pubmed ID
Authors

James Kaufman, Justin Lessler, April Harry, Stefan Edlund, Kun Hu, Judith Douglas, Christian Thoens, Bernd Appel, Annemarie Käsbohrer, Matthias Filter

Abstract

Foodborne disease outbreaks of recent years demonstrate that due to increasingly interconnected supply chains these type of crisis situations have the potential to affect thousands of people, leading to significant healthcare costs, loss of revenue for food companies, and--in the worst cases--death. When a disease outbreak is detected, identifying the contaminated food quickly is vital to minimize suffering and limit economic losses. Here we present a likelihood-based approach that has the potential to accelerate the time needed to identify possibly contaminated food products, which is based on exploitation of food products sales data and the distribution of foodborne illness case reports. Using a real world food sales data set and artificially generated outbreak scenarios, we show that this method performs very well for contamination scenarios originating from a single "guilty" food product. As it is neither always possible nor necessary to identify the single offending product, the method has been extended such that it can be used as a binary classifier. With this extension it is possible to generate a set of potentially "guilty" products that contains the real outbreak source with very high accuracy. Furthermore we explore the patterns of food distributions that lead to "hard-to-identify" foods, the possibility of identifying these food groups a priori, and the extent to which the likelihood-based method can be used to quantify uncertainty. We find that high spatial correlation of sales data between products may be a useful indicator for "hard-to-identify" products.

X Demographics

X Demographics

The data shown below were collected from the profiles of 9 X users who shared this research output. Click here to find out more about how the information was compiled.
Mendeley readers

Mendeley readers

The data shown below were compiled from readership statistics for 51 Mendeley readers of this research output. Click here to see the associated Mendeley record.

Geographical breakdown

Country Count As %
Finland 1 2%
United States 1 2%
Australia 1 2%
Unknown 48 94%

Demographic breakdown

Readers by professional status Count As %
Researcher 15 29%
Student > Master 9 18%
Student > Bachelor 5 10%
Student > Ph. D. Student 4 8%
Professor 3 6%
Other 6 12%
Unknown 9 18%
Readers by discipline Count As %
Agricultural and Biological Sciences 11 22%
Medicine and Dentistry 6 12%
Business, Management and Accounting 3 6%
Mathematics 3 6%
Computer Science 2 4%
Other 15 29%
Unknown 11 22%
Attention Score in Context

Attention Score in Context

This research output has an Altmetric Attention Score of 35. This is our high-level measure of the quality and quantity of online attention that it has received. This Attention Score, as well as the ranking and number of research outputs shown below, was calculated when the research output was last mentioned on 31 March 2017.
All research outputs
#1,138,401
of 25,371,288 outputs
Outputs from PLoS Computational Biology
#924
of 8,958 outputs
Outputs of similar age
#11,026
of 242,252 outputs
Outputs of similar age from PLoS Computational Biology
#13
of 162 outputs
Altmetric has tracked 25,371,288 research outputs across all sources so far. Compared to these this one has done particularly well and is in the 95th percentile: it's in the top 5% of all research outputs ever tracked by Altmetric.
So far Altmetric has tracked 8,958 research outputs from this source. They typically receive a lot more attention than average, with a mean Attention Score of 20.4. This one has done well, scoring higher than 89% of its peers.
Older research outputs will score higher simply because they've had more time to accumulate mentions. To account for age we can compare this Altmetric Attention Score to the 242,252 tracked outputs that were published within six weeks on either side of this one in any source. This one has done particularly well, scoring higher than 95% of its contemporaries.
We're also able to compare this research output to 162 others from the same source and published within six weeks on either side of this one. This one has done particularly well, scoring higher than 91% of its contemporaries.