Can Google Street View Data Improve Public Health?

Big data and artificial intelligence are transforming how we think about health, from detecting diseases and spotting patterns to predicting outcomes and speeding up response times.

In a new study analyzing two million Google Street View images from New York City streets, a team of New York University researchers evaluated the utility of this digital data in informing public health decision-making. Their findings, published in the Proceedings of the National Academy of Sciences (PNAS), show how relying on street view images alone may lead to inaccuracies and misguided interventions, but combining it with other knowledge grows its potential.

"There's a lot of excitement around leveraging new data sources to gain a holistic view of health, including bringing in machine learning and data science methods to extract new insights," said Rumi Chunara, associate professor of biostatistics at NYU School of Global Public Health, associate professor of computer science at NYU Tandon School of Engineering, and the study’s senior author.

"Our study highlights the potential of digital data sources such as street view images in enhancing public health research, while also pointing out the limitations of data and the complex dynamics between the environment, individual behavior, and health outcomes," said Miao Zhang, a PhD student at NYU Tandon School of Engineering and the study’s first author.

A street-level view of health

In recent years, researchers have begun using street view images to connect an area’s environment and infrastructure with outcomes such as mental health, infectious diseases, or obesity - a task that would be challenging to measure by hand.

“We know that a city’s built environment can shape our health, whether it's the availability of sidewalks and greenspaces for walking, or groceries stores carrying healthy foods,” said Chunara. “Some studies show that the availability of sidewalks correlates with lower obesity rates - but is that the whole story?”

"Our motivation for this study was to dive deeper into these associations to see if there are potential factors driving them," said Zhang.

Chunara, Zhang, and their colleagues analyzed more than two million Google Street View images of every New York City street, using artificial intelligence to assess the availability of sidewalks and crosswalks in the images. They then compared this information with localized data on obesity, diabetes, and physical activity from the Centers for Disease Control and Prevention to see if the built environment predicted health outcomes.

The researchers found that neighborhoods with more crosswalks had lower rates of obesity and diabetes. However, no significant link was found between sidewalks and health outcomes, in contrast to earlier research.

"This may be because a lot of the sidewalks in New York City are in places that people don't use - along a highway, on a bridge, or in a tunnel - so sidewalk density may not reflect neighborhood walkability as accurately as crosswalks," said Zhang.

They also surfaced issues with the accuracy of the AI-generated labels for the street view images, cautioning that they may not match the "ground truth" and be a reliable measure on their own. When comparing existing data on New York City's sidewalk availability with the labeled street view images, they found that many were incorrectly labeled as having or not having sidewalks, which may have been due to cars or shade obscuring them in photos.

If you build it, will they come?

While crosswalks were linked to lower rates of obesity and diabetes, the researchers applied a public health lens to determine what could explain this association. Their analyses of the CDC data revealed that physical activity - not just crosswalks, as measured in street view images - accounted for the decrease in obesity and diabetes. In one test, they found that increasing physical activity could result in a four times larger decrease in obesity and 17 times greater decrease in diabetes than could be achieved through installing more crosswalks.

"We saw that physical activity delivers the benefits of crosswalks, so it’s important to take such mechanisms into account, especially when they act on different levels like the built environment versus individuals," said Zhang.

Based on their findings, the researchers conclude that public health decision-making shouldn’t rely on new data sources alone, but must also consider domain knowledge. When analyzing street view images, incorporating computer science knowledge - for instance, how image processing techniques can improve accuracy or how to correct for bias in algorithms - and public health knowledge - what drives the associations between built environment and health outcomes - are critical. Layering this expertise over big data can inform how programs are designed and implemented to improve public health.

In this case, building more sidewalks and crosswalks would be less effective at improving health outcomes than the same increase in physical activity, such as through local exercise classes for the community.

"While growing amounts of digital data can be useful in informing decision-making, our results show that simply using associations from new data sources may not lead to the most useful interventions or best allocation of resources," added Chunara. "A more nuanced approach using big data in conjunction with expertise is needed to make the best use of this new data."

Zhang M, Rahman S, Mhasawade V, Chunara R.
Utilizing big data without domain knowledge impacts public health decision-making.
Proc Natl Acad Sci U S A. 2024 Sep 24;121(39):e2402387121. doi: 10.1073/pnas.2402387121

Most Popular Now

European Artificial Intelligence Act Com…

The European Artificial Intelligence Act (AI Act), the world's first comprehensive regulation on artificial intelligence, enters into force. The AI Act is designed to ensure that AI developed and used...

Patient Safety must be Central to the De…

An EPR system brings together different patient information in one place, making it easier to access for healthcare professionals. This information can include patients' own notes, test results, observations by...

ChatGPT Shows Promise in Answering Patie…

The groundbreaking ChatGPT chatbot shows potential as a time-saving tool for responding to patient questions sent to the urologist's office, suggests a study in the September issue of Urology Practice®...

Survey: Most Americans Comfortable with …

Artificial intelligence (AI) is all around us - from smart home devices to entertainment and social media algorithms. But is AI okay in healthcare? A new national survey commissioned by...

AI can Help Rule out Abnormal Pathology …

A commercial artificial intelligence (AI) tool used off-label was effective at excluding pathology and had equal or lower rates of critical misses on chest X-ray than radiologists, according to a...

AI Spots Cancer and Viral Infections at …

Researchers at the Centre for Genomic Regulation (CRG), the University of the Basque Country (UPV/EHU), Donostia International Physics Center (DIPC) and the Fundación Biofisica Bizkaia (FBB, located in Biofisika Institute)...

Video Gaming Improves Mental Well-Being

A pioneering study titled "Causal effect of video gaming on mental well-being in Japan 2020-2022," published in Nature Human Behaviour, has conducted the most comprehensive investigation to date on the...

New Diabetes Research Links Blood Glucos…

As part of its ongoing exploration of vocal biomarkers and the role they can play in enhancing health outcomes, Klick Labs published a new study in Scientific Reports - confirming...

Machine learning helps identify rheumato…

A machine-learning tool created by Weill Cornell Medicine and Hospital for Special Surgery (HSS) investigators can help distinguish subtypes of rheumatoid arthritis (RA), which may help scientists find ways to...

New AI Software could Make Diagnosing De…

Although Alzheimer's is the most common cause of dementia - a catchall term for cognitive deficits that impact daily living, like the loss of memory or language - it's not...

A New AI Tool for Cancer

Scientists at Harvard Medical School have designed a versatile, ChatGPT-like AI model capable of performing an array of diagnostic tasks across multiple forms of cancers. The new AI system, described Sept...

Vision-Based ChatGPT Shows Deficits Inte…

Researchers evaluating the performance of ChatGPT-4 Vision found that the model performed well on text-based radiology exam questions but struggled to answer image-related questions accurately. The study's results were published...