Estimating spatial patterning of dietary behaviors using grocery transaction data
AbstractObjectiveTo demonstrate a method for estimating neighborhood foodselection with secondary use of digital marketing data; grocerytransaction records and retail business registry.IntroductionUnhealthy diet is becoming the most important preventablecause of chronic disease burden (1). Dietary patterns vary acrossneighborhoods as a function of policy, marketing, social support,economy, and the commercial food environment (2). Assessmentof community-specific response to these socio-ecological factorsis critical for the development and evaluation policy interventionsand identification of nutrition inequality. Mass administration ofdietary surveys is impractical and prohibitory expensive, and surveystypically fail to address variation of food selection at high geographicresolution. Marketing companies such as the Nielsen cooperationcontinuously collect and centralize scanned grocery transactionrecords from a geographically representative sample of retail foodoutlets to guide product promotions. These data can be harnessed todevelop a model for the demand of specific foods using store andneighborhood attributes, providing a rich and detailed picture of the“foodscape” in an urban environment. In this study, we generated aspatial profile of food selection from estimated sales in food outletsin the Census Metropolitan Area (CMA) of Montreal, Canada,using regular carbonated soft drinks (i.e. non-diet soda) as an initialexample.MethodsFrom the Nielsen cooperation, we obtained weekly grocerytransaction data generated by a sample of 86 grocery stores and 42pharmacies in the Montreal CMA in 2012. Extracted store-specificsoda sales were standardized to a single serving size (240ml) andaveraged across 52 weeks, resulting in 128 data points. Using linearregression, natural log-transformed soda sales were modelled as afunction of store type (grocery vs. pharmacies), chain identificationcode and socio-demographic attributes of store neighborhood, whichare median family income, proportion of individuals who receivedpost-secondary diplomas, and population density as measured by the2011 Canadian Household Survey. Selection of the predictors andfirst-order interaction terms was guided by the minimization of themean squared error using 10-fold cross-validation. The final modelwas applied to all operating chain grocery stores and pharmacies in2012 (n=980) recorded in a comprehensive and commonly availablebusiness establishment database. The resulting predicted store-specific weekly average soda sales was spatially interpolated toprovide a graphical representation of the soda sales (representing anunhealthy foodscape) across the Montreal CMA.ResultsFigure 2 demonstrates the spatial distribution of the predicted sodasales in the Montreal CMA.ConclusionsThe current lack of neighborhood-level dietary surveillanceimpedes effective public health actions aimed at encouraging healthyfood selection and subsequent reduction of chronic illness. Ourmethod leverages existing grocery transaction data and store locationinformation to address the gap in population monitoring of nutritionstatus and urban foodscapes. Future applications of our methodologyto other store types (e.g. convenience stores) and food productsacross multiple time points (e.g. mouths and years) will permit acomprehensive, timely and automated assessment of dietary trends,identification of neighborhoods in special dietary needs, developmentof tailored community health promotions, and the measurement ofneighbourhood-specific response to nutrition policies and unhealthyfood advertising.
How to Cite
Mamiya, H., Moodie, E., & Buckeridge, D. (2017). Estimating spatial patterning of dietary behaviors using grocery transaction data. Online Journal of Public Health Informatics, 9(1). https://doi.org/10.5210/ojphi.v9i1.7715
Non-Infectious Disease Surveillance Use Cases