IBM uses grocery scanner data to speed foodborne illness investigations
Method successfully applied to an E. coli illness outbreak in Norway.
August 16, 2016
Foodborne illnesses are a major public health concern, affecting more than one in six Americans each year, according to the Centers for Disease Control and Prevention (CDC). During a foodborne illness outbreak, rapidly identifying the contaminated food source is vital to minimizing illness, loss and the impact on society.
IBM Research–Almaden (IBM) recently announced that its scientists have found that comparing retail scanner data from grocery stores with maps of confirmed cases of foodborne illness can speed early investigations. The researchers demonstrated that, with as few as 10 medical examination reports of foodborne illness, they can narrow an investigation to 12 suspected food products in just a few hours.
In the study, the researchers created a data analytics methodology to review spatio-temporal data, including geographic location and possible time of consumption, for hundreds of grocery product categories. The researchers also analyzed each product for its shelf life, geographic location of consumption and likelihood of harboring a particular pathogen and then mapped the information to the known location of illness outbreaks. The system then ranked all grocery products by likelihood of contamination in a list from which public health officials could test the top 12 suspected foods for contamination and alert the public accordingly.
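The ranking step can be illustrated with a minimal sketch. This is not IBM's published method; it assumes a simple likelihood model in which a product's regional sales share stands in for the chance that a case linked to that product appears in a given region. The function name `rank_products`, the `pathogen_prior` weights, and the sample figures are all hypothetical.

```python
import math


def rank_products(sales_by_product, cases_by_region, pathogen_prior=None,
                  top_n=12, smoothing=1e-6):
    """Rank products by how well their regional sales explain case locations.

    sales_by_product: {product: {region: units_sold}} (hypothetical scanner data)
    cases_by_region:  {region: confirmed_case_count}
    pathogen_prior:   optional {product: weight in (0, 1]} expressing how
                      plausible it is that the product harbors the pathogen
    Returns up to top_n (product, score) pairs, highest score first.
    """
    pathogen_prior = pathogen_prior or {}

    # Every region seen in either data set, so smoothing is applied uniformly.
    regions = set(cases_by_region)
    for sales in sales_by_product.values():
        regions.update(sales)

    scores = {}
    for product, sales in sales_by_product.items():
        total = sum(sales.values()) + smoothing * len(regions)
        # Log-likelihood of the observed cases if this product were the source.
        score = math.log(pathogen_prior.get(product, 1.0))
        for region, n_cases in cases_by_region.items():
            share = (sales.get(region, 0) + smoothing) / total
            score += n_cases * math.log(share)
        scores[product] = score

    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    return ranked[:top_n]


# Hypothetical example: 10 cases, 9 of them in region "A".
sales = {
    "sausage": {"A": 90, "B": 10},
    "milk": {"A": 50, "B": 50},
    "sprouts": {"A": 5, "B": 95},
}
cases = {"A": 9, "B": 1}
shortlist = rank_products(sales, cases)
```

In this toy data set the product whose sales are concentrated where the cases occurred rises to the top of the list. A real system would also need to account for shelf life and the likely time of consumption, which this sketch leaves out.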
A traditional investigation can take anywhere from weeks to months, and the timing can significantly influence the economic and health impact of a disease outbreak. The typical process employs interviews and questionnaires to trace the contamination source.
In 2011, it took more than 60 days to identify the source of an outbreak of Escherichia coli in Europe, which turned out to be imported fenugreek seeds. By the time the investigation was completed, all of the sprouts produced from the seeds had been consumed. Nearly 4,000 people became ill in 16 countries, and more than 50 people died before public health officials could pinpoint the source, according to the European Food Safety Authority.
“When there’s an outbreak of foodborne illness, the biggest challenge facing public health officials is the speed at which they can identify the contaminated food source and alert the public,” said Kun Hu, public health research scientist at IBM Research–Almaden in San Jose, Calif. “While traditional methods like interviews and surveys are still necessary, analyzing big data from retail grocery scanners can significantly narrow down the list of contaminants in hours for further lab testing. Our study shows that big data and analytics can profoundly reduce investigation time and human error and have a huge impact on public health.”
Already, the method in this study has been applied to an actual E. coli illness outbreak in Norway. With just 17 confirmed cases of infection, public health officials were able to use this methodology to analyze grocery scanner data related to more than 2,600 possible food products and create a short list of 10 possible contaminants. Further lab analysis pinpointed the source of contamination down to the batch and lot numbers of the specific product: sausage.
The study, “From Farm to Fork: How Spatial-Temporal Data Can Accelerate Foodborne Illness Investigation in a Global Food Supply Chain,” was published in the Association for Computing Machinery’s SIGSPATIAL journal.