New Medical AI Tool Identifies more Cases of Long COVID from Patient Health Records

Investigators at Mass General Brigham have developed an AI-based tool to sift through electronic health records to help clinicians identify cases of long COVID, an often mysterious condition that can encompass a litany of enduring symptoms, including fatigue, chronic cough, and brain fog after infection from SARS-CoV-2. The results, which are published in the journal Med, could identify more people who should be receiving care for this potentially debilitating condition. The number of cases they identified also suggests that the prevalence of long COVID could be greatly underrecognized.

"Our AI tool could turn a foggy diagnostic process into something sharp and focused, giving clinicians the power to make sense of a challenging condition," said senior author Hossein Estiri, PhD, head of AI Research at the Center for AI and Biomedical Informatics of the Learning Healthcare System (CAIBILS) at Mass General Brigham and an associate professor of Medicine at Harvard Medical School. "With this work, we may finally be able to see long COVID for what it truly is - and more importantly, how to treat it."

Long COVID, also known as Post-Acute Sequelae of SARS-CoV-2 infection (PASC), includes a wide range of symptoms. For the purposes of their study, Estiri and colleagues defined it as a diagnosis of exclusion that is also infection associated. That means the diagnosis could not be explained in the patient's unique medical record and it also had to associate with a COVID infection. In addition, the diagnosis needed to have persisted for 2 months or longer in a 12-month follow up window.

The algorithm used in the AI tool was developed by drawing de-identified patient data from the clinical records of nearly 300,000 patients across 14 hospitals and 20 community health centers in the Mass General Brigham system. Rather than having to rely on a single diagnosis code, the AI utilizes a novel method developed by Estiri and colleagues called "precision phenotyping" that sifts through individual records to identify symptoms and conditions linked to COVID-19 and to track symptoms over time in order to differentiate them from other illnesses. For example, the algorithm can detect if shortness of breath may be the result of pre-existing conditions like heart failure or asthma rather than a long COVID. Only when every other possibility was exhausted would the tool flag the patient as having long COVID.

"Physicians are often faced with having to wade through a tangled web of symptoms and medical histories, unsure of which threads to pull, while balancing busy caseloads. Having a tool powered by AI that can methodically do it for them could be a game-changer," said Alaleh Azhir, MD, the co-lead author who is an internal medicine resident at Brigham Women's Hospital, a founding member of the Mass General Brigham healthcare system.

The patient-centered diagnoses provided by this new method may also help alleviate biases built into current diagnostics for long COVID, according to the researchers, who note that patients diagnosed with the official ICD-10 diagnostic code for long COVID trend towards those with easier access to healthcare. While other diagnostic studies have suggested that approximately 7% of the population suffers from long COVID, this new approach reveals a much higher estimate - 22.8%. The authors stated that this figure aligns more closely with national trends and paints a more realistic picture of the pandemic’s long-term toll.

The researchers determined their tool was about 3 percent more accurate than what ICD-10 codes capture, while being less biased. Specifically, their study demonstrated that the individuals they identified as having long COVID mirror the broader demographic makeup of Massachusetts, unlike long COVID algorithms that rely on a single diagnostic code or individual clinical encounters, skewing results toward certain populations such as those with more access to care. "This broader scope ensures that marginalized communities, often sidelined in clinical studies, are no longer invisible," said Estiri.

Limitations of the study and AI tool include that health record data used in the algorithm to account for long COVID symptoms may be less complete than what is captured by physicians in post-visit clinical notes. Another limitation was the algorithm did not capture possible worsening of a prior condition, which may have been a long COVID symptom. For example, if a patient had COPD and prior episodes of it worsened before they developed COVID-19, the algorithm might have removed them even if their persisting symptoms were a long COVID indicator. Declines in the amount of COVID-19 testing in recent years also makes it difficult to identify when a patient may have first gotten COVID-19. The study was also limited to patients in Massachusetts.

Future studies may explore the algorithm in cohorts of patients with specific conditions, like COPD or diabetes. The researchers also plan to release this algorithm publicly on open access where physicians and healthcare systems globally can use it in their patient populations.

In addition to opening the door to better clinical care, this work may lay the foundation for future research into the genetic and biochemical factors behind long COVID's various subtypes. "Questions about the true burden of long COVID - questions that have thus far remained elusive - now seem more within reach," said Estiri.

Azhir A, Hügel J, Tian J, Cheng J, Bassett IV, Bell DS, Bernstam EV, Farhat MR, Henderson DW, Lau ES, Morris M, Semenov YR, Triant VA, Visweswaran S, Strasser ZH, Klann JG, Murphy SN, Estiri H.
Precision phenotyping for curating research cohorts of patients with unexplained post-acute sequelae of COVID-19.
Med. 2024 Nov 2:S2666-6340(24)00407-0. doi: 10.1016/j.medj.2024.10.009

Most Popular Now

Research Shows AI Technology Improves Pa…

Existing research indicates that the accuracy of a Parkinson's disease diagnosis hovers between 55% and 78% in the first five years of assessment. That's partly because Parkinson's sibling movement disorders...

AI in Healthcare: How do We Get from Hyp…

The Highland Marketing advisory board met to consider the government's enthusiasm for AI. To date, healthcare has mostly experimented with decision support tools, and their impact on the NHS and...

Who's to Blame When AI Makes a Medi…

Assistive artificial intelligence technologies hold significant promise for transforming health care by aiding physicians in diagnosing, managing, and treating patients. However, the current trend of assistive AI implementation could actually...

First Therapy Chatbot Trial Shows AI can…

Dartmouth researchers conducted the first clinical trial of a therapy chatbot powered by generative AI and found that the software resulted in significant improvements in participants' symptoms, according to results...

DMEA sparks: The Future of Digital Healt…

8 - 10 April 2025, Berlin, Germany. Digitalization is considered one of the key strategies for addressing the shortage of skilled workers - but the digital health sector also needs qualified...

DeepSeek: The "Watson" to Doct…

DeepSeek is an artificial intelligence (AI) platform built on deep learning and natural language processing (NLP) technologies. Its core products include the DeepSeek-R1 and DeepSeek-V3 models. Leveraging an efficient Mixture...

Stepping Hill Hospital Announced as SPAR…

Stepping Hill Hospital, part of Stockport NHS Foundation Trust, has replaced its bedside units with state-of-the art devices running a full range of information, engagement, communications and productivity apps, to...

DMEA 2025: Digital Health Worldwide in B…

8 - 10 April 2025, Berlin, Germany. From the AI Act, to the potential of the European Health Data Space, to the power of patient data in Scandinavia - DMEA 2025...