Can Google Street View Data Improve Public Health?

Big data and artificial intelligence are transforming how we think about health, from detecting diseases and spotting patterns to predicting outcomes and speeding up response times.

In a new study analyzing two million Google Street View images from New York City streets, a team of New York University researchers evaluated the utility of this digital data in informing public health decision-making. Their findings, published in the Proceedings of the National Academy of Sciences (PNAS), show how relying on street view images alone may lead to inaccuracies and misguided interventions, but combining it with other knowledge grows its potential.

"There's a lot of excitement around leveraging new data sources to gain a holistic view of health, including bringing in machine learning and data science methods to extract new insights," said Rumi Chunara, associate professor of biostatistics at NYU School of Global Public Health, associate professor of computer science at NYU Tandon School of Engineering, and the study’s senior author.

"Our study highlights the potential of digital data sources such as street view images in enhancing public health research, while also pointing out the limitations of data and the complex dynamics between the environment, individual behavior, and health outcomes," said Miao Zhang, a PhD student at NYU Tandon School of Engineering and the study’s first author.

A street-level view of health

In recent years, researchers have begun using street view images to connect an area’s environment and infrastructure with outcomes such as mental health, infectious diseases, or obesity - a task that would be challenging to measure by hand.

“We know that a city’s built environment can shape our health, whether it's the availability of sidewalks and greenspaces for walking, or groceries stores carrying healthy foods,” said Chunara. “Some studies show that the availability of sidewalks correlates with lower obesity rates - but is that the whole story?”

"Our motivation for this study was to dive deeper into these associations to see if there are potential factors driving them," said Zhang.

Chunara, Zhang, and their colleagues analyzed more than two million Google Street View images of every New York City street, using artificial intelligence to assess the availability of sidewalks and crosswalks in the images. They then compared this information with localized data on obesity, diabetes, and physical activity from the Centers for Disease Control and Prevention to see if the built environment predicted health outcomes.

The researchers found that neighborhoods with more crosswalks had lower rates of obesity and diabetes. However, no significant link was found between sidewalks and health outcomes, in contrast to earlier research.

"This may be because a lot of the sidewalks in New York City are in places that people don't use - along a highway, on a bridge, or in a tunnel - so sidewalk density may not reflect neighborhood walkability as accurately as crosswalks," said Zhang.

They also surfaced issues with the accuracy of the AI-generated labels for the street view images, cautioning that they may not match the "ground truth" and be a reliable measure on their own. When comparing existing data on New York City's sidewalk availability with the labeled street view images, they found that many were incorrectly labeled as having or not having sidewalks, which may have been due to cars or shade obscuring them in photos.

If you build it, will they come?

While crosswalks were linked to lower rates of obesity and diabetes, the researchers applied a public health lens to determine what could explain this association. Their analyses of the CDC data revealed that physical activity - not just crosswalks, as measured in street view images - accounted for the decrease in obesity and diabetes. In one test, they found that increasing physical activity could result in a four times larger decrease in obesity and 17 times greater decrease in diabetes than could be achieved through installing more crosswalks.

"We saw that physical activity delivers the benefits of crosswalks, so it’s important to take such mechanisms into account, especially when they act on different levels like the built environment versus individuals," said Zhang.

Based on their findings, the researchers conclude that public health decision-making shouldn’t rely on new data sources alone, but must also consider domain knowledge. When analyzing street view images, incorporating computer science knowledge - for instance, how image processing techniques can improve accuracy or how to correct for bias in algorithms - and public health knowledge - what drives the associations between built environment and health outcomes - are critical. Layering this expertise over big data can inform how programs are designed and implemented to improve public health.

In this case, building more sidewalks and crosswalks would be less effective at improving health outcomes than the same increase in physical activity, such as through local exercise classes for the community.

"While growing amounts of digital data can be useful in informing decision-making, our results show that simply using associations from new data sources may not lead to the most useful interventions or best allocation of resources," added Chunara. "A more nuanced approach using big data in conjunction with expertise is needed to make the best use of this new data."

Zhang M, Rahman S, Mhasawade V, Chunara R.
Utilizing big data without domain knowledge impacts public health decision-making.
Proc Natl Acad Sci U S A. 2024 Sep 24;121(39):e2402387121. doi: 10.1073/pnas.2402387121

Most Popular Now

500 Patient Images per Second Shared thr…

The image exchange portal, widely known in the NHS as the IEP, is now being used to share as many as 500 images each second - including x-rays, CT, MRI...

Is Your Marketing Effective for an NHS C…

How can you make sure you get the right message across to an NHS chief information officer, or chief nursing information officer? Replay this webinar with Professor Natasha Phillips, former...

We could Soon Use AI to Detect Brain Tum…

A new paper in Biology Methods and Protocols, published by Oxford University Press, shows that scientists can train artificial intelligence (AI) models to distinguish brain tumors from healthy tissue. AI...

Welcome Evo, Generative AI for the Genom…

Brian Hie runs the Laboratory of Evolutionary Design at Stanford, where he works at the crossroads of artificial intelligence and biology. Not long ago, Hie pondered a provocative question: If...

Telehealth Significantly Boosts Treatmen…

New research reveals a dramatic improvement in diagnosing and curing people living with hepatitis C in rural communities using both telemedicine and support from peers with lived experience in drug...

AI can Predict Study Results Better than…

Large language models, a type of AI that analyses text, can predict the results of proposed neuroscience studies more accurately than human experts, finds a new study led by UCL...

Using AI to Treat Infections more Accura…

New research from the Centres for Antimicrobial Optimisation Network (CAMO-Net) at the University of Liverpool has shown that using artificial intelligence (AI) can improve how we treat urinary tract infections...

Research Study Shows the Cost-Effectiven…

Earlier research showed that primary care clinicians using AI-ECG tools identified more unknown cases of a weak heart pump, also called low ejection fraction, than without AI. New study findings...

New Guidance for Ensuring AI Safety in C…

As artificial intelligence (AI) becomes more prevalent in health care, organizations and clinicians must take steps to ensure its safe implementation and use in real-world clinical settings, according to an...

Remote Telemedicine Tool Found Highly Ac…

Collecting images of suspicious-looking skin growths and sending them off-site for specialists to analyze is as accurate in identifying skin cancers as having a dermatologist examine them in person, a...

Philips Aims to Advance Cardiac MRI Tech…

Royal Philips (NYSE: PHG, AEX: PHIA) and Mayo Clinic announced a research collaboration aimed at advancing MRI for cardiac applications. Through this investigation, Philips and Mayo Clinic will look to...

Deep Learning Model Accurately Diagnoses…

Using just one inhalation lung CT scan, a deep learning model can accurately diagnose and stage chronic obstructive pulmonary disease (COPD), according to a study published today in Radiology: Cardiothoracic...