Researchers Develop Highly Accurate Modeling Tool to Predict COVID-19 Risk

As new coronavirus variants emerge and quickly spread around the globe, both the public and policymakers face a quandary: how to maintain a semblance of normality while also minimizing infections. Digital contact tracing apps offered promise, but their adoption rate has been low, due in part to privacy concerns.

At USC, researchers are advocating for a new approach to predict the chance of infection from COVID-19: combining anonymized cellphone location data with mobility patterns - broad patterns of how people move from place to place.

To produce "risk scores" for specific locations and times, the team used a large dataset of anonymous, real-world location signals from cell phones across the US in 2019 and 2020. The system shows a 50% improvement in accuracy compared to current systems, said the researchers.

"Our results show that it is possible to predict and target specific areas that are high-risk, as opposed to putting all businesses under one umbrella. Such risk-targeted policies can be significantly more effective, both for controlling COVID-19 and economically," said lead author Sepanta Zeighami, a computer science Ph.D. student advised by Professor Cyrus Shahabi.

"It's also unlikely that COVID-19 will be the last pandemic in human history, so if we want to avoid the chaos of 2020 and the tragic losses while keeping daily life as unaffected as possible when the next pandemic happens, we need such data-driven approaches."

To address privacy concerns, the mobility data comes in an aggregated format, allowing the researchers to see patterns without identifying individual users. The data is not being used for contact tracing, for identifying infected individuals, or for tracking where they are going, said the researchers.

"Our approach relies on anonymized aggregate data," said Shahabi, study co-author and Helen N. and Emmett H. Jones Professor in Engineering and Professor of Computer Science, Electrical and Computer Engineering, and Spatial Sciences. "It is the same as traffic data, where an individual’s information is not revealed, but the aggregate data will help you to make a decision on whether to use a certain freeway at a certain time."

The paper will appear in the ACM Transactions on Spatial Algorithms and Systems.

Data-driven approaches

According to the researchers, existing risk score tools do not provide enough detailed information about infection rates at specific places, or they make unrealistic assumptions about how populations mix.

"The risk of infection varies a lot based on the location, and having a single policy, for instance, at a county level, ignores how some areas are riskier than others," said Zeighami.

So, using real-world mobility data and existing knowledge about the spread of COVID-19, the team created a simulator to generate realistic infection patterns. In the simulation, some “agents” are initially infected and spread the disease as they move around.
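As a rough illustration of the agent-based idea described above, the following is a minimal sketch and not the researchers' simulator. It assumes agents pick a location uniformly at random at each step (a stand-in for real mobility data) and that a susceptible agent sharing a location with an infected agent becomes infected with a fixed probability. The function name `simulate` and all parameters are hypothetical.

```python
import random


def simulate(num_agents, num_locations, num_steps, transmit_prob,
             initial_infected, seed=0):
    """Toy agent-based spread; returns (infected flags, new infections per location)."""
    rng = random.Random(seed)
    # Assumption: the first `initial_infected` agents start out infected.
    infected = [i < initial_infected for i in range(num_agents)]
    infections_per_location = [0] * num_locations

    for _ in range(num_steps):
        # Assumption: uniform random movement stands in for real mobility data.
        positions = [rng.randrange(num_locations) for _ in range(num_agents)]
        # Locations hosting at least one infected agent during this step.
        hot = {positions[i] for i in range(num_agents) if infected[i]}

        # Collect new infections first so they only spread from the next step on.
        newly = [
            i for i in range(num_agents)
            if not infected[i]
            and positions[i] in hot
            and rng.random() < transmit_prob
        ]
        for i in newly:
            infected[i] = True
            infections_per_location[positions[i]] += 1

    return infected, infections_per_location


if __name__ == "__main__":
    flags, per_location = simulate(200, 10, 20, 0.1, 5, seed=1)
    print("total infected:", sum(flags))
    print("new infections by location:", per_location)
```

The per-location counts produced by a simulation of this kind are what a risk-score model could then be tested against.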

Then, the researchers created a Hawkes process-based model, which assigns risk scores based on location density and mobility patterns at a given time and place. Using the simulator, the researchers tested whether the model could accurately predict the number of infections at different locations. The risk scores proved to be a reliable metric for tracking infections in cities across the US, including San Francisco, New York, Chicago and Los Angeles.
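For readers unfamiliar with Hawkes processes, here is a minimal sketch of the textbook univariate form with an exponential kernel, in which each past event temporarily raises the rate of future events. It is not the model from the paper, which also incorporates location density and mobility. The names `hawkes_intensity`, `mu`, `alpha`, and `beta` are illustrative assumptions.

```python
import math


def hawkes_intensity(t, event_times, mu, alpha, beta):
    """Intensity at time t: baseline mu plus decaying boosts from earlier events.

    lambda(t) = mu + sum over events s < t of alpha * exp(-beta * (t - s))
    """
    return mu + sum(
        alpha * math.exp(-beta * (t - s))
        for s in event_times
        if s < t  # only events strictly before t contribute
    )


if __name__ == "__main__":
    # Hypothetical infection times (in days) observed at one location.
    past_infections = [0.0, 0.5, 0.9]
    print(hawkes_intensity(1.0, past_infections, mu=0.2, alpha=0.8, beta=1.5))
```

In this form, a recent cluster of events pushes the intensity well above the baseline `mu`, and the effect fades at a rate set by `beta`. That self-exciting behavior is what makes the process a natural fit for modeling contagion.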

The researchers found, predictably, that popular destinations in a city are riskier. But they also found that incorporating mobility (how people move from place to place), rather than relying only on the popularity of an area, improved infection prediction. This, said the researchers, underscores the importance of bringing together mobility patterns and infection spread prediction models to generate risk scores.

There are two key ways the system could be used in the real world, said the researchers. The more straightforward case is to make neighborhood-level policy decisions: for instance, bars in Santa Monica, CA, should close today due to high risk in that neighborhood.

For more targeted locations, such as a specific concert stadium event, the system would crunch the mobility data from similar concerts in the past to learn how the infection risk changes in the area following this type of event. Then, using the researchers’ model and current mobility data across LA, the system could make predictions and assign risk scores.

Going forward, the team plans to develop user-specific, yet still privacy-preserving, risk scores, and to include long-term forecasting capabilities for several weeks into the future.

"The very high resolution of this mobility data, as well as our scalable approach, will enable us to estimate risk scores at a very fine-grain spatial and temporal resolution, for example, a specific restaurant at dinner time, or a shopping mall at lunchtime," said Shahabi.

"As an individual, you may want to avoid areas deemed high-risk, and policymakers could warn the public to avoid an area known to be a potential hotspot of infection. The scores can also be used for closure or reduced capacity decisions. Instead of making these decisions at the county level, public health experts can make those decisions at city, neighborhood or zip code levels."

Sirisha Rambhatla, Sepanta Zeighami, Kameron Shahabi, Cyrus Shahabi, Yan Liu.
Toward Accurate Spatiotemporal COVID-19 Risk Scores Using High-Resolution Real-World Mobility Data.
ACM Transactions on Spatial Algorithms and Systems, Volume 8, Issue 2, 2022. doi: https://doi.org/10.1145/3481044
