Neighborhood attributes have been shown to influence health, but advances in neighborhood research has been constrained by the lack of neighborhood data for many geographical areas and few neighborhood studies examine features of nonmetropolitan locations. We leveraged a massive source of Google Street View (GSV) images and computer vision to automatically characterize national neighborhood built environments. Using road network data and Google Street View API, from December 15, 2017-May 14, 2018 we retrieved over 16 million GSV images of street intersections across the United States. Computer vision was applied to label each image. We implemented regression models to estimate associations between built environments and county health outcomes , controlling for county-level demographics, economics, and population density . At the county level, greater presence of highways was related to lower chronic diseases and premature mortality . Areas characterized by street view images as ‘rural’ (having limited infrastructure) had higher obesity, diabetes , fair/poor self-rated health, premature mortality, physical distress, physical inactivity and teen birth rates but lower rates of excessive drinking. Analyses at the census tract level for 500 cities revealed similar adverse associations as was seen at the county level for neighborhood indicators of less urban development. Possible mechanisms include the greater abundance of services and facilities found in more developed areas with roads, enabling access to places and resources for promoting health. GSV images represents an underutilized resource for building national data on neighborhoods and examining the influence of built environments on community health outcomes across the United States.
Located in
MPRC People
/
Quynh Nguyen, Ph.D., M.S.P.H.
/
Quynh Nguyen Publications
Obesity is a global epidemic affecting millions. Implementation of interventions to curb obesity rates requires timely surveillance. In this study, we estimated sex-specific obesity prevalence using social media, search queries, demographics and built environment variables. We collected 3,817,125 and 1,382,284 geolocated tweets on food and exercise respectively, from Twitter’s streaming API from April 2015 to March 2016. We also obtained searches related to physical activity and diet from Google Search Trends for the same time period. Next, we inferred the gender of Twitter users using machine learning methods and applied mixed-effects state-level linear regression models to estimate obesity prevalence. We observed differences in discussions of physical activity and foods, with males reporting higher intensity physical activities and lower caloric foods across 40 and 48 states, respectively. In addition, counties with the highest percentage of exercise and food tweets had lower male and female obesity prevalence. Lastly, our models separately captured overall male and female spatial trends in obesity prevalence. The average correlation between actual and estimated obesity prevalence was 0.797(95% CI, 0.796, 0.798) and 0.830 (95% CI, 0.830, 0.831) for males and females, respectively. Social media can provide timely community-level data on health information seeking and changes in behaviors, sentiments and norms. Social media data can also be combined with other data types such as, demographics, built environment variables, diet and physical activity indicators from other digital sources (e.g., mobile applications and wearables) to monitor health behaviors at different geographic scales, and to supplement delayed estimates from traditional surveillance systems.
Located in
MPRC People
/
Quynh Nguyen, Ph.D., M.S.P.H.
/
Quynh Nguyen Publications